Query lcl|NC_020883.1_cdsid_YP_007678082.1 [gene=K203_gp56] [protein=hypothetical protein] [protein_id=YP_007678082.1] [location=24592..26361] Match_columns 589 No_of_seqs 58 out of 60 Neff 5.3 Searched_HMMs 1612 Date Thu Nov 7 17:00:07 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_56 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_56_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:98883 Length: 517 100.0 8.4E-78 5.2E-81 443.1 37.0 468 1-567 3-517 (517) 2 protein:vir:79703 Length: 505 100.0 2.4E-74 1.5E-77 424.2 38.1 460 1-546 3-505 (505) 3 protein:vir:4782 Length: 522 # 100.0 1.3E-73 8E-77 420.2 37.3 464 1-567 3-522 (522) 4 protein:vir:1587 Length: 508 # 100.0 2.3E-73 1.4E-76 418.9 37.7 460 1-570 3-508 (508) 5 protein:vir:78907 Length: 518 100.0 3.8E-73 2.4E-76 417.6 36.8 471 1-559 1-518 (518) 6 protein:vir:3028 Length: 500 # 100.0 2.3E-69 1.4E-72 396.9 36.1 461 1-566 3-500 (500) 7 protein:vir:9815 Length: 500 # 100.0 2.3E-69 1.4E-72 396.9 36.1 461 1-566 3-500 (500) 8 protein:vir:80959 Length: 499 100.0 5.5E-67 3.4E-70 383.9 36.1 461 1-570 1-499 (499) 9 protein:vir:38 Length: 496 # N 100.0 1.8E-66 1.1E-69 381.1 35.5 462 1-570 1-496 (496) 10 protein:vir:106639 Length: 481 100.0 1.1E-45 7.1E-49 267.0 32.5 446 1-569 29-481 (481) 11 protein:vir:96494 Length: 501 100.0 2.6E-44 1.6E-47 259.5 34.1 456 1-583 38-501 (501) 12 protein:vir:105292 Length: 478 100.0 1.5E-44 9.2E-48 260.9 31.1 447 1-577 1-478 (478) 13 protein:vir:96839 Length: 474 100.0 3.1E-44 1.9E-47 259.1 32.1 444 1-571 1-474 (474) 14 protein:vir:99781 Length: 511 100.0 5.6E-44 3.5E-47 257.7 33.0 463 1-579 28-511 (511) 15 protein:vir:96179 Length: 468 100.0 4.4E-44 2.7E-47 258.3 30.4 442 1-574 1-468 (468) 16 protein:vir:4898 Length: 502 # 100.0 2E-43 1.2E-46 254.7 33.8 458 1-583 34-502 (502) 17 protein:vir:78805 Length: 511 100.0 1.8E-43 1.1E-46 254.9 32.4 459 1-579 28-511 (511) 18 protein:vir:96366 Length: 511 100.0 1.8E-43 1.1E-46 254.9 32.4 459 1-579 28-511 (511) 19 protein:vir:96240 Length: 511 100.0 3.7E-43 2.3E-46 253.2 33.7 459 1-579 28-511 (511) 20 protein:vir:94546 Length: 506 100.0 3.8E-43 2.3E-46 253.2 31.4 462 1-582 14-506 (506) 21 protein:vir:79043 Length: 479 100.0 5.4E-43 3.4E-46 252.3 31.9 456 1-567 11-479 (479) 22 protein:vir:105461 Length: 470 100.0 6.1E-43 3.8E-46 252.0 31.5 459 1-572 1-470 (470) 23 protein:vir:95113 Length: 474 100.0 1E-42 6.3E-46 250.8 32.3 447 1-574 1-474 (474) 24 protein:vir:106571 Length: 499 100.0 9.2E-43 5.7E-46 251.1 32.0 452 1-589 15-498 (499) 25 protein:vir:5961 Length: 503 # 100.0 2.8E-42 1.7E-45 248.4 34.6 469 1-585 1-503 (503) 26 protein:vir:2732 Length: 501 # 100.0 3E-42 1.9E-45 248.2 34.4 456 1-586 38-501 (501) 27 protein:vir:95806 Length: 440 100.0 2.5E-42 1.5E-45 248.7 33.0 431 13-572 1-440 (440) 28 protein:vir:103951 Length: 511 100.0 2.8E-42 1.8E-45 248.4 32.6 461 1-579 28-511 (511) 29 protein:vir:9306 Length: 511 # 100.0 4.1E-42 2.6E-45 247.5 33.1 463 1-579 28-511 (511) 30 protein:vir:107112 Length: 478 100.0 7.5E-42 4.7E-45 246.0 33.4 452 1-569 1-478 (478) 31 protein:vir:97171 Length: 512 100.0 8.1E-42 5E-45 245.9 31.9 463 1-579 35-512 (512) 32 protein:vir:3964 Length: 453 # 100.0 4.7E-42 2.9E-45 247.2 30.3 433 1-579 9-453 (453) 33 protein:vir:9922 Length: 489 # 100.0 1.7E-41 1E-44 244.1 32.7 450 1-573 8-489 (489) 34 protein:vir:94498 Length: 474 100.0 2.4E-41 1.5E-44 243.2 33.5 446 1-579 1-474 (474) 35 protein:vir:97447 Length: 474 100.0 2.4E-41 1.5E-44 243.2 33.5 446 1-579 1-474 (474) 36 protein:vir:96266 Length: 474 100.0 2.3E-41 1.4E-44 243.4 32.1 447 1-579 1-474 (474) 37 protein:vir:95899 Length: 474 100.0 2.3E-41 1.4E-44 243.4 32.1 447 1-579 1-474 (474) 38 protein:vir:1236 Length: 483 # 100.0 4.9E-41 3E-44 241.6 33.4 448 1-579 28-483 (483) 39 protein:vir:93747 Length: 472 100.0 4.9E-41 3.1E-44 241.6 31.8 448 1-579 15-472 (472) 40 protein:vir:3609 Length: 452 # 100.0 3.8E-41 2.3E-44 242.2 30.3 434 1-579 8-452 (452) 41 protein:vir:99522 Length: 470 100.0 8.2E-41 5.1E-44 240.4 32.0 436 1-578 1-470 (470) 42 protein:vir:102950 Length: 471 100.0 8.5E-41 5.3E-44 240.3 31.7 452 2-574 1-471 (471) 43 protein:vir:97336 Length: 492 100.0 2E-39 1.3E-42 232.7 31.6 448 1-579 37-492 (492) 44 protein:vir:105889 Length: 474 100.0 2E-39 1.2E-42 232.8 31.5 453 1-579 7-474 (474) 45 protein:vir:94101 Length: 474 100.0 2E-39 1.2E-42 232.8 31.5 453 1-579 7-474 (474) 46 protein:vir:9871 Length: 429 # 100.0 1.2E-39 7.6E-43 233.9 30.4 424 1-576 1-429 (429) 47 protein:vir:733 Length: 453 # 100.0 1E-39 6.2E-43 234.4 29.7 437 1-566 8-453 (453) 48 protein:vir:78083 Length: 537 100.0 1.9E-39 1.2E-42 232.9 30.6 499 1-589 8-533 (537) 49 protein:vir:94805 Length: 492 100.0 5.3E-39 3.3E-42 230.4 30.0 447 1-579 35-492 (492) 50 protein:vir:102330 Length: 451 100.0 1E-38 6.5E-42 228.8 29.1 428 9-555 1-451 (451) 51 protein:vir:104082 Length: 485 100.0 1.6E-37 1E-40 222.3 30.0 460 1-588 8-485 (485) 52 protein:vir:2427 Length: 485 # 100.0 3.9E-37 2.4E-40 220.2 31.0 459 2-588 1-485 (485) 53 protein:vir:2341 Length: 488 # 100.0 4.8E-37 3E-40 219.7 28.7 466 1-576 6-488 (488) 54 protein:vir:7768 Length: 484 # 100.0 2.7E-36 1.7E-39 215.6 28.8 460 1-589 1-484 (484) 55 protein:vir:80680 Length: 441 100.0 1.1E-35 6.9E-39 212.2 31.1 432 2-562 1-441 (441) 56 protein:vir:4223 Length: 486 # 100.0 6.4E-36 4E-39 213.5 28.4 457 1-587 7-486 (486) 57 protein:vir:78227 Length: 480 100.0 1.3E-34 8.2E-38 206.3 29.7 464 3-582 1-480 (480) 58 protein:vir:99916 Length: 504 100.0 7.4E-34 4.6E-37 202.2 30.5 467 1-585 17-504 (504) 59 protein:vir:78537 Length: 480 100.0 1.6E-33 9.6E-37 200.5 31.5 459 3-589 1-475 (480) 60 protein:vir:99072 Length: 479 100.0 3.3E-34 2E-37 204.2 27.5 452 1-589 1-473 (479) 61 protein:vir:101494 Length: 527 100.0 1.5E-33 9.1E-37 200.6 30.1 475 1-572 1-527 (527) 62 protein:vir:102239 Length: 527 100.0 1.6E-33 1E-36 200.4 30.1 475 1-572 1-527 (527) 63 protein:vir:7987 Length: 456 # 100.0 5.5E-33 3.4E-36 197.5 29.5 443 1-563 1-456 (456) 64 protein:vir:102602 Length: 456 100.0 2.4E-32 1.5E-35 194.0 30.2 441 1-569 1-456 (456) 65 protein:vir:105819 Length: 456 100.0 2.4E-32 1.5E-35 194.0 30.2 441 1-569 1-456 (456) 66 protein:vir:2500 Length: 501 # 100.0 4.2E-32 2.6E-35 192.6 30.9 470 1-587 15-501 (501) 67 protein:vir:7430 Length: 563 # 100.0 2.5E-30 1.5E-33 182.9 29.5 493 1-587 14-563 (563) 68 protein:vir:98444 Length: 434 100.0 6.4E-30 4E-33 180.6 31.2 419 57-575 1-434 (434) 69 protein:vir:9568 Length: 410 # 99.9 3.7E-28 2.3E-31 171.0 31.1 400 14-545 1-410 (410) 70 protein:vir:9751 Length: 422 # 99.9 6.5E-28 4E-31 169.6 28.5 409 1-538 5-422 (422) 71 protein:vir:94742 Length: 409 99.9 1.8E-27 1.1E-30 167.3 29.2 397 1-523 5-409 (409) 72 protein:vir:8184 Length: 474 # 99.9 4.5E-26 2.8E-29 159.6 31.3 445 1-564 1-474 (474) 73 protein:vir:1634 Length: 409 # 99.9 6E-26 3.7E-29 158.9 28.8 396 2-523 1-409 (409) 74 protein:vir:93630 Length: 776 99.2 6.7E-11 4.2E-14 76.4 24.1 512 1-589 1-670 (776) 75 protein:vir:80040 Length: 461 99.0 2.6E-09 1.6E-12 67.6 24.2 425 1-575 11-461 (461) 76 protein:vir:94956 Length: 452 98.8 3.5E-08 2.2E-11 61.5 24.3 431 1-570 1-452 (452) 77 protein:vir:8846 Length: 705 # 98.8 4.8E-09 3E-12 66.2 19.1 495 1-589 10-619 (705) 78 protein:vir:80165 Length: 651 98.6 1.6E-07 9.8E-11 57.9 26.1 502 1-589 15-622 (651) 79 protein:vir:108295 Length: 711 98.6 1.9E-07 1.2E-10 57.5 24.4 519 1-589 1-637 (711) 80 protein:vir:97265 Length: 513 98.6 2.6E-07 1.6E-10 56.7 31.5 457 9-577 1-513 (513) 81 protein:vir:107742 Length: 537 98.1 3.7E-06 2.3E-09 50.4 24.1 445 1-589 56-533 (537) 82 protein:vir:94049 Length: 532 98.0 6.8E-06 4.2E-09 48.9 21.7 447 1-589 35-518 (532) 83 protein:vir:95449 Length: 584 98.0 3E-07 1.9E-10 56.3 11.7 480 1-589 11-579 (584) 84 protein:vir:79538 Length: 502 97.9 1.3E-05 8.1E-09 47.4 34.4 440 1-580 3-502 (502) 85 protein:vir:104338 Length: 422 97.8 6.8E-06 4.2E-09 48.9 15.7 409 14-576 1-422 (422) 86 protein:vir:107662 Length: 427 97.7 2.9E-05 1.8E-08 45.5 21.5 410 14-583 1-427 (427) 87 protein:vir:79647 Length: 435 97.7 3.1E-05 1.9E-08 45.3 18.9 424 1-579 1-435 (435) 88 protein:vir:95821 Length: 763 97.5 5.2E-05 3.2E-08 44.1 25.2 468 1-589 15-634 (763) 89 protein:vir:95149 Length: 501 97.5 5.4E-05 3.3E-08 44.0 32.1 457 1-570 1-501 (501) 90 protein:vir:96068 Length: 765 97.4 6.9E-05 4.3E-08 43.4 18.9 461 1-589 43-567 (765) 91 protein:vir:96738 Length: 505 97.4 6.9E-05 4.3E-08 43.4 31.4 450 1-577 14-505 (505) 92 protein:vir:80453 Length: 535 97.3 8.9E-05 5.5E-08 42.8 31.4 462 1-576 32-535 (535) 93 protein:vir:95542 Length: 548 97.2 0.00015 9.2E-08 41.6 28.8 469 7-589 1-524 (548) 94 protein:vir:6382 Length: 553 # 97.0 0.0002 1.3E-07 40.8 31.6 479 1-586 1-553 (553) 95 protein:vir:5249 Length: 437 # 96.8 0.00035 2.2E-07 39.5 24.8 419 3-580 1-437 (437) 96 protein:vir:77597 Length: 725 96.8 0.00036 2.3E-07 39.4 23.9 503 1-589 1-609 (725) 97 protein:vir:10321 Length: 495 96.6 0.00046 2.9E-07 38.9 25.8 450 2-579 1-495 (495) 98 protein:vir:104437 Length: 714 96.5 0.00053 3.3E-07 38.6 26.5 505 1-589 1-645 (714) 99 protein:vir:100920 Length: 725 96.3 0.00078 4.8E-07 37.6 25.9 505 1-589 1-609 (725) 100 protein:vir:389 Length: 530 # 96.2 0.00083 5.2E-07 37.5 32.1 458 1-576 27-530 (530) 101 protein:vir:78393 Length: 489 96.1 0.001 6.5E-07 36.9 30.5 450 1-572 1-489 (489) 102 protein:vir:105429 Length: 708 96.0 0.0011 7E-07 36.7 22.7 495 1-589 1-632 (708) 103 protein:vir:817 Length: 714 # 96.0 0.0011 7E-07 36.7 27.5 497 1-589 1-645 (714) 104 protein:vir:3296 Length: 714 # 96.0 0.0011 7E-07 36.7 27.5 497 1-589 1-645 (714) 105 protein:vir:10117 Length: 714 96.0 0.0011 7E-07 36.7 27.5 497 1-589 1-645 (714) 106 protein:vir:9950 Length: 714 # 96.0 0.0011 7E-07 36.7 27.5 497 1-589 1-645 (714) 107 protein:vir:2764 Length: 714 # 96.0 0.0011 7E-07 36.7 27.5 497 1-589 1-645 (714) 108 protein:vir:9263 Length: 725 # 95.9 0.0013 8.4E-07 36.3 26.1 502 1-589 1-609 (725) 109 protein:vir:172 Length: 708 # 95.6 0.0018 1.1E-06 35.6 24.2 505 1-589 1-621 (708) 110 protein:vir:96783 Length: 488 95.4 0.0021 1.3E-06 35.3 28.2 438 1-538 1-488 (488) 111 protein:vir:99563 Length: 862 94.9 0.0031 1.9E-06 34.3 25.0 438 1-589 91-568 (862) 112 protein:vir:105619 Length: 772 94.7 0.0036 2.3E-06 34.0 26.1 493 7-589 1-652 (772) 113 protein:vir:3420 Length: 533 # 94.6 0.004 2.5E-06 33.8 32.5 454 1-576 35-533 (533) 114 protein:vir:103219 Length: 201 93.9 0.0061 3.8E-06 32.7 13.3 199 296-576 1-201 (201) 115 protein:vir:94599 Length: 641 93.1 0.0087 5.4E-06 31.9 20.0 484 1-589 20-594 (641) 116 protein:vir:7407 Length: 392 # 92.8 0.0098 6.1E-06 31.6 25.7 372 67-574 1-392 (392) 117 protein:vir:6240 Length: 457 # 92.8 0.01 6.3E-06 31.5 23.5 443 21-587 1-457 (457) 118 protein:vir:80644 Length: 551 92.5 0.011 6.9E-06 31.3 21.6 469 14-589 1-529 (551) 119 protein:vir:95014 Length: 491 92.0 0.013 8.2E-06 30.9 26.5 449 1-571 1-491 (491) 120 protein:vir:102080 Length: 429 91.3 0.016 1E-05 30.4 19.3 408 21-571 1-429 (429) 121 protein:vir:93610 Length: 454 91.1 0.017 1.1E-05 30.2 23.4 430 1-589 1-452 (454) 122 protein:vir:105520 Length: 706 91.0 0.018 1.1E-05 30.1 24.3 506 1-589 1-618 (706) 123 protein:vir:3989 Length: 392 # 90.1 0.023 1.4E-05 29.6 25.8 358 82-574 1-392 (392) 124 protein:vir:1023 Length: 392 # 90.1 0.023 1.4E-05 29.6 25.8 358 82-574 1-392 (392) 125 protein:vir:1326 Length: 457 # 86.9 0.042 2.6E-05 28.1 22.0 443 7-587 1-457 (457) 126 protein:vir:106716 Length: 698 86.0 0.048 3E-05 27.8 17.5 466 1-589 54-548 (698) 127 protein:vir:63755 Length: 547 85.6 0.051 3.2E-05 27.6 22.0 463 18-589 1-525 (547) 128 protein:vir:3153 Length: 467 # 84.7 0.059 3.6E-05 27.3 24.8 440 49-589 1-466 (467) 129 protein:vir:101541 Length: 694 84.6 0.059 3.7E-05 27.3 17.8 453 1-589 90-547 (694) 130 protein:vir:94709 Length: 522 80.3 0.097 6E-05 26.2 20.7 488 1-569 1-522 (522) 131 protein:vir:97060 Length: 432 79.1 0.11 6.7E-05 25.9 24.6 383 78-583 1-432 (432) 132 protein:vir:78589 Length: 695 78.8 0.11 6.9E-05 25.8 17.4 453 1-589 91-548 (695) 133 protein:vir:3648 Length: 695 # 76.6 0.13 8.3E-05 25.4 17.3 453 1-589 91-548 (695) 134 protein:vir:10362 Length: 432 75.7 0.14 8.9E-05 25.2 24.5 382 75-583 1-432 (432) 135 protein:vir:98853 Length: 219 74.3 0.16 9.9E-05 24.9 15.1 214 198-489 1-219 (219) 136 protein:vir:4952 Length: 386 # 71.6 0.19 0.00012 24.5 25.6 377 21-573 1-386 (386) 137 protein:vir:3843 Length: 397 # 68.7 0.23 0.00015 24.0 25.1 382 33-582 1-397 (397) 138 protein:vir:4854 Length: 386 # 63.8 0.31 0.00019 23.4 23.5 377 21-573 1-386 (386) 139 protein:vir:4995 Length: 384 # 58.4 0.41 0.00026 22.7 18.8 375 21-558 1-384 (384) 140 protein:vir:107605 Length: 432 56.2 0.46 0.00029 22.4 20.9 405 18-581 1-432 (432) 141 protein:vir:105002 Length: 432 56.2 0.46 0.00029 22.4 20.9 405 18-581 1-432 (432) 142 protein:vir:102855 Length: 432 56.2 0.46 0.00029 22.4 20.9 405 18-581 1-432 (432) 143 protein:vir:78161 Length: 355 54.4 0.5 0.00031 22.2 20.1 327 191-589 1-348 (355) 144 protein:vir:103860 Length: 528 54.0 0.51 0.00032 22.2 19.2 425 1-589 32-475 (528) 145 protein:vir:4454 Length: 414 # 52.1 0.56 0.00035 22.0 24.4 401 21-575 1-414 (414) 146 protein:vir:1538 Length: 535 # 51.0 0.59 0.00037 21.8 22.2 494 1-589 1-531 (535) 147 protein:vir:7321 Length: 556 # 45.6 0.76 0.00047 21.2 25.9 490 1-565 1-556 (556) 148 protein:vir:99452 Length: 651 41.7 0.91 0.00057 20.8 23.6 433 1-589 77-544 (651) 149 protein:vir:4337 Length: 434 # 39.7 1 0.00063 20.6 19.3 393 83-577 1-434 (434) 150 protein:vir:106999 Length: 564 39.0 1 0.00065 20.5 21.9 470 1-589 20-554 (564) 151 protein:vir:4194 Length: 540 # 37.9 1.1 0.00068 20.4 22.3 434 1-589 1-486 (540) 152 protein:vir:99312 Length: 563 36.9 1.1 0.00071 20.3 22.0 473 1-589 1-537 (563) 153 protein:vir:95599 Length: 563 36.9 1.1 0.00071 20.3 22.0 473 1-589 1-537 (563) 154 protein:vir:8418 Length: 409 # 35.5 1.2 0.00076 20.1 24.2 390 29-581 1-409 (409) 155 protein:vir:3361 Length: 535 # 32.9 1.4 0.00087 19.8 22.0 477 1-589 1-531 (535) 156 protein:vir:100150 Length: 437 30.9 1.5 0.00096 19.6 20.3 425 10-580 1-437 (437) 157 protein:vir:105064 Length: 421 30.1 1.6 0.00099 19.5 24.5 407 1-585 1-421 (421) 158 protein:vir:99232 Length: 526 28.5 1.7 0.0011 19.3 18.7 427 1-589 32-473 (526) 159 protein:vir:3139 Length: 599 # 27.3 1.9 0.0012 19.1 23.0 491 1-568 15-599 (599) 160 protein:vir:8883 Length: 543 # 26.6 1.9 0.0012 19.0 23.7 496 1-576 1-543 (543) 161 protein:vir:80796 Length: 574 26.4 1.9 0.0012 19.0 21.1 475 6-589 1-541 (574) 162 protein:vir:3520 Length: 720 # 25.4 2.1 0.0013 18.9 23.5 491 1-589 1-619 (720) 163 protein:vir:81152 Length: 411 25.0 2.1 0.0013 18.8 20.9 395 7-572 1-411 (411) 164 protein:vir:102727 Length: 945 21.2 2.6 0.0016 18.3 23.4 460 1-589 7-542 (945) 165 protein:vir:100691 Length: 535 20.4 2.8 0.0017 18.1 25.9 441 1-587 52-535 (535) No 1 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=8.4e-78 Score=443.14 Aligned_cols=468 Identities=16% Similarity=0.168 Sum_probs=339.9 Q ss_pred CccceeccchhHH--------HHh------------hcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee Q lcl|NC_020883. 1 MIDWTVRGWTDKT--------TKN------------VHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR 60 (589) Q Consensus 1 ~~~~~~~~~~~~~--------~~~------------~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~ 60 (589) ||+| +++|--++ +++ ...-|.+||.+|+|+|.++++++..-.. T Consensus 3 ~~~~-ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~---------------- 65 (517) T protein:vir:98 3 VIQR-IKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKI---------------- 65 (517) T ss_pred hHHH-HHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccc---------------- Confidence 6665 45554332 222 2345778999999999999877422111 Q ss_pred ecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH Q lcl|NC_020883. 61 ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT 140 (589) Q Consensus 61 ~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~ 140 (589) ...-++-+|+++.||++-|+||-+...+|+- +++.+.....++. ...+|+++.++ T Consensus 66 -~~~~~~sl~~~~~i~~~~A~Ll~~e~~~i~v---~d~~~~~~~~~~~---------------------~~~~e~l~~i~ 120 (517) T protein:vir:98 66 -QERDYMTLNLRKLSADVLSGLVFNEQCEVYV---SDAKDEEKKDNSF---------------------KTAHEFIQHVF 120 (517) T ss_pred -cccceeecCcHHHHHHHhhhhhcCCcceEEe---cccccccccccch---------------------hHHHHHHHHHH Confidence 1223667899999999999999444333333 1111111111111 12377999999 Q ss_pred hhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceeccc-ccCc---ceeEEEeec---CCCccceEEEEEee Q lcl|NC_020883. 141 KNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPH-DDEK---GADLAYYID---HGQYGQFLHIYRER 213 (589) Q Consensus 141 kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~-~d~~---~~div~~~e---~~~~~~~l~~~~~~ 213 (589) ++++|+..+..+++++++.||+++|||||+++++|+|++||||||. .+.. .|.|+|... ..+..+|+++++|. T Consensus 121 ~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~ 200 (517) T protein:vir:98 121 QHNKFIKNLSDYLEPTFALGGLTVRPYVDNGEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHE 200 (517) T ss_pred HhccHHHHHHHHHHHHhhhCCEEEEEEEeCCeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEe Confidence 9999999999999999999999999999999999999999999994 3222 467777543 33455799999998 Q ss_pred ecc-----ccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEe- Q lcl|NC_020883. 214 VEK-----DGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYW- 287 (589) Q Consensus 214 ~~~-----~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyv- 287 (589) ..+ .+|+|.|.+|+.. .....|..+++.+..+++ +...+.||+++|+++|+ T Consensus 201 ~~~~~~~~~~y~I~n~ly~s~-------~~~~lG~~v~L~~~~e~l----------------~~~~~~~g~~~Plf~y~~ 257 (517) T protein:vir:98 201 WEKTEEGESLYVITNELYKSD-------NEGEIGKRIPLEELYEGM----------------QEKTYIQGLSRPLFNYLK 257 (517) T ss_pred cCceeccCCcEEEEEEEEecC-------CCccccccccccccccCC----------------CcceeECCCCcceEEEec Confidence 764 4689999999641 112246556555443322 22457799999999997 Q ss_pred ---cCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccc-------ccccccccccccccc Q lcl|NC_020883. 288 ---ANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLL-------NIAYERDGHSAKEAS 357 (589) Q Consensus 288 ---PN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~-------g~~~d~dge~~~~~~ 357 (589) ||+....+|||+|+|+++.++||+||++|+++++++ +.|+.||+||++||++.. +..+|.+.+.+ T Consensus 258 ~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~-~~g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y---- 332 (517) T protein:vir:98 258 PSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEI-KMGQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVY---- 332 (517) T ss_pred CCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHH-HhCCcceecChhhhccccCCCCcccCCCCCccccee---- Confidence 566777899999999999999999999999999999 569999999999996531 11122222221 Q ss_pred ccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhh Q lcl|NC_020883. 358 MMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLT 437 (589) Q Consensus 358 ~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~ 437 (589) ..+..+.++..++.+|++||+++|++.++.++++|...+++|+++||+. +.|.++| ++.+.+..+ T Consensus 333 -----------~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~-~~~~kTA---TEi~s~~~~ 397 (517) T protein:vir:98 333 -----------KSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFD-GRSMKTA---TEIVSENDL 397 (517) T ss_pred -----------eeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccc-ccccccH---HHHHHHHHH Confidence 2222334445678899999999999999999999999999999999974 4555533 444555556 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC---ccc-CcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHH Q lcl|NC_020883. 438 TILKSRRLQKEYIDFLKELYESCLWLLNDQD---SSI-RIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVR 513 (589) Q Consensus 438 ~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~---~~~-~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr 513 (589) +.+++++++..+..+|++++++++++++.++ ..+ ...++.|.|+|+++.|+. +.+++...++++|+||++++|+ T Consensus 398 ~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~--~~~~~~~~~v~aG~ms~~~~i~ 475 (517) T protein:vir:98 398 TYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRS--ALLRFYGQAKTFGFIPTVEAIQ 475 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHH--HHHHHHHHHHhcCCCCHHHHHH Confidence 6677888999999999999999999887642 222 334678999999998855 4466777788999999999999 Q ss_pred HhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCC Q lcl|NC_020883. 514 RMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEG 567 (589) Q Consensus 514 ~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg 567 (589) ++|+ |+++++++||+||++|++.++|++.. .++.++.+.||. T Consensus 476 ~~~g-~~eeeA~~e~~~i~~E~~~~~~~~~~-----------~~~~~~~~gd~e 517 (517) T protein:vir:98 476 RIFK-VPKKTAEQWLEEIRKDQIELDPVTIS-----------QRAQKRMFGDEE 517 (517) T ss_pred HhCC-CChHHHHHHHHHHHHhccccCCCCcc-----------ccccCCCCCCCC Confidence 9996 89999999999999999876554332 233333333321 No 2 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=2.4e-74 Score=424.21 Aligned_cols=460 Identities=15% Similarity=0.129 Sum_probs=333.9 Q ss_pred Cccce---eccchh-----HHHHhh-------c-----chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee Q lcl|NC_020883. 1 MIDWT---VRGWTD-----KTTKNV-------H-----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR 60 (589) Q Consensus 1 ~~~~~---~~~~~~-----~~~~~~-------~-----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~ 60 (589) ||++- +|.|-- +.+++. . .-|.+||.+|+|+|..|..++.. +.+. T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~-------------~~~~-- 67 (505) T protein:vir:79 3 FWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSY-------------GDTQ-- 67 (505) T ss_pred hHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccC-------------CCcc-- Confidence 55542 334322 222221 1 11356899999999998776421 1111 Q ss_pred ecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH Q lcl|NC_020883. 61 ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT 140 (589) Q Consensus 61 ~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~ 140 (589) ...++.+|+++.||++.|+||-+...+|+. + + ...++++++++ T Consensus 68 --~~~~~slnl~~~i~~~~A~ll~~e~~~i~~-------------~----------d------------~~~~e~l~~i~ 110 (505) T protein:vir:79 68 --KHELQSVNVTKLASAKLASLIFNEQCQVTV-------------S----------D------------ETANDFLDDVF 110 (505) T ss_pred --ccceeecchHHHHHHHHHhhhcCCCceeec-------------C----------C------------hHHHHHHHHHH Confidence 335788899999999999999333222222 0 0 12377999999 Q ss_pred hhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecc--cccCcceeEEEeecCC-----CccceEEEEEee Q lcl|NC_020883. 141 KNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFP--HDDEKGADLAYYIDHG-----QYGQFLHIYRER 213 (589) Q Consensus 141 kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P--~~d~~~~div~~~e~~-----~~~~~l~~~~~~ 213 (589) ++++|+..+..+++++.+.||++++||||+++++|.|++|||||| ++.++-..+++..+.. +...|+++++|. T Consensus 111 ~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~ 190 (505) T protein:vir:79 111 QQNDFYTTFEEKLEEWIALGSGCVRPYVDSGKIKLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQ 190 (505) T ss_pred HhccHHHHHHHHHHHHhhcCCeEEEEEEeCCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEE Confidence 999999999999999999999999999999999999999999999 4677765666655433 233689999998 Q ss_pred eccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEe----cC Q lcl|NC_020883. 214 VEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYW----AN 289 (589) Q Consensus 214 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyv----PN 289 (589) +....+.|.|.+|++.. ....|..+++.+. ++ ++++.+ +..+||+++|+++|+ || T Consensus 191 ~~~~~~~I~n~ly~~~~-------~~~lG~~v~l~~~----~~---------~~~l~~-~~~~~g~~~p~f~~~~~~~~N 249 (505) T protein:vir:79 191 WDHGDYVITNELYRSEA-------AETVGINVPLNSL----EQ---------YEGLEP-QVKITGLKHPLFAFYRNKGAN 249 (505) T ss_pred ecCceEEEEEEEEecCC-------CCccCcccchhhc----cc---------ccccCc-ceeecCCCcceEEEecCCccc Confidence 88889999999996411 1112433332221 11 122333 445699999999998 45 Q ss_pred CCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccc Q lcl|NC_020883. 290 NETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDME 369 (589) Q Consensus 290 ~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dle 369 (589) +....+|||+|||+++.+++|+||++||++++.+ +.|+.||+||++||++......+......+ .+..+...+. T Consensus 250 ~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~-~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~-----~fd~~~~~y~ 323 (505) T protein:vir:79 250 NKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEV-KKGQRRLIVPAEWLKTGSSYGGQASETHPP-----MFDPDETVYQ 323 (505) T ss_pred ccccCCccCCchhhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcccCCCCccccccccc-----CCCccceeee Confidence 6778899999999999999999999999999999 569999999999998753222111111111 1111222222 Q ss_pred ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020883. 370 ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEY 449 (589) Q Consensus 370 v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~ 449 (589) .+..++.+..++.+|+++|+++|++.++.++++|+..+++|+.+||+. +.|.+ |+++.+.+..++.++++.++..+ T Consensus 324 ~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~-~~~~~---TAtei~s~~~~l~~t~~~~~~~~ 399 (505) T protein:vir:79 324 AMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTS-PSGIQ---TATEVVTNNSQTYQTRSSYITQV 399 (505) T ss_pred eccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCC-ccccc---hHHHHHHHHhHHHHHHHHHHHHH Confidence 233345556788899999999999999999999999999999999973 34443 55666666677778888999999 Q ss_pred HHHHHHHHHHHHHHHhhcCc----------ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCC Q lcl|NC_020883. 450 IDFLKELYESCLWLLNDQDS----------SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDA 519 (589) Q Consensus 450 ~~aLk~li~~~l~L~~~~~~----------~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw 519 (589) ..+|+++++++++++..++. .+...++.|.|+|+++.|+.+ .++....++++|++|.++++++ ||.| T Consensus 400 ~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~--~~~~~~~~v~~Gi~s~e~~l~~-~~~~ 476 (505) T protein:vir:79 400 EKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQES--KRAADLQAVQAQVMPKKQFLMR-NYGL 476 (505) T ss_pred HHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHH--HHHHHHHHHHcCCCCHHHHHHh-cCCC Confidence 99999999999999876432 122346789999999988654 3556667788899999998876 6889 Q ss_pred CHHHHHHHHHHHHhhcccccccc--cccc Q lcl|NC_020883. 520 SEDWIQEEIARIEEEQAGSDTSS--LMGI 546 (589) Q Consensus 520 ~dE~v~eEv~RI~~E~a~~~p~~--~g~~ 546 (589) +|++|++|++||++|++.++|.. +|++ T Consensus 477 ~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 477 DEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred ChHHHHHHHHHHHHhccccCCCchhccCC Confidence 99999999999999998765443 2222 No 3 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=1.3e-73 Score=420.18 Aligned_cols=464 Identities=15% Similarity=0.182 Sum_probs=331.5 Q ss_pred CccceeccchhHHH-----Hhh-----c----------chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee Q lcl|NC_020883. 1 MIDWTVRGWTDKTT-----KNV-----H----------GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR 60 (589) Q Consensus 1 ~~~~~~~~~~~~~~-----~~~-----~----------~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~ 60 (589) ||+| +++|--|.. |++ | .-|.++|++|+|+|.++..++.. ++.. T Consensus 3 ~~~~-~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~----~~~~----------- 66 (522) T protein:vir:47 3 LFQK-VKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTD----GDIK----------- 66 (522) T ss_pred hHHH-HHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccC----cchh----------- Confidence 5554 444544433 211 2 33567789999999998766421 1111 Q ss_pred ecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH Q lcl|NC_020883. 61 ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT 140 (589) Q Consensus 61 ~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~ 140 (589) ..-++-+|+++.||++.|+|| +|.-++... ++ +..++++++++ T Consensus 67 --~~~~~slnl~~~i~~~~A~lv---~~e~~~i~v--------------------~d------------~~~~~~l~~~l 109 (522) T protein:vir:47 67 --SRPMNHLPIARTASKKIASLV---YNEQATITT--------------------KN------------EILQKFLDDML 109 (522) T ss_pred --cccceecchHHHHHHHHhhhh---cCCcceeec--------------------CC------------hHHHHHHHHHH Confidence 223777899999999999999 443332111 00 22467899999 Q ss_pred hhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceeccc-ccC---cceeEEEeecCCCcc---ceEEEEEee Q lcl|NC_020883. 141 KNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPH-DDE---KGADLAYYIDHGQYG---QFLHIYRER 213 (589) Q Consensus 141 kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~-~d~---~~~div~~~e~~~~~---~~l~~~~~~ 213 (589) ++++|+..+..+++++.+.||+++||||++++++|.|++||+|||. .+. ..|.+++..-.++++ +|..++.|. T Consensus 110 ~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he 189 (522) T protein:vir:47 110 TNDRFNKNFERYLESCLALGGLAMRPYIDGDKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHE 189 (522) T ss_pred hhcchHHHHHHHHHHhhccCCEEEEEEEcCCceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEee Confidence 9999999999999999999999999999999999999999999994 332 357677766554333 344455554 Q ss_pred e------------ccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCc Q lcl|NC_020883. 214 V------------EKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNR 281 (589) Q Consensus 214 ~------------~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~ 281 (589) + ...+++|.|.+|++. .....|..+++.+..+ ++++.+ ..+.||+++ T Consensus 190 ~~~~~~~~~~~~~~~~~~~I~n~ly~~~-------~~~~lG~~v~l~~~~e-------------~~~l~~-~~~~~~~~~ 248 (522) T protein:vir:47 190 WVTADGQETGSTNDKKYYRITNELYRSD-------VNDVLGQRVNLSELDK-------------YKNLEP-VTVFENLSR 248 (522) T ss_pred ecccccccccccccCCceEEEEEEeecC-------CCcccCcccccccccc-------------ccCCCC-ceEeCCCCc Confidence 2 344788999998641 1112355554443322 233333 457799999 Q ss_pred ceEEEe----cCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccc---------cccc Q lcl|NC_020883. 282 PFISYW----ANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNI---------AYER 348 (589) Q Consensus 282 plvvyv----PN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~---------~~d~ 348 (589) |+++|+ ||+....+|||+|+|+++.++||+||++||++++.+ +.|+.||+||++||++.... .++. T Consensus 249 Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~-~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~ 327 (522) T protein:vir:47 249 PLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEV-RMGQRRVIVPEHLTQRQYQRPDGTIDFRPRFDV 327 (522) T ss_pred ceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHH-HhccceeecchHHhccCCCCCCcccccccccCc Confidence 999996 677888999999999999999999999999999999 57999999999999874211 1111 Q ss_pred cccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHH Q lcl|NC_020883. 349 DGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSG 428 (589) Q Consensus 349 dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg 428 (589) +.+.+. .+...+.++..++.+|+++|+++|++.+..+++.+...+++|+++||+. +++.+ |+ T Consensus 328 ~~~~f~--------------~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~-~~~~k---TA 389 (522) T protein:vir:47 328 EQNVYM--------------QIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFD-GQGMK---TA 389 (522) T ss_pred ccceEe--------------ecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCcc-ccccc---cH Confidence 221111 1122234445688899999999999999999999999999999999973 34444 45 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC----cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccc Q lcl|NC_020883. 429 VAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD----SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQ 504 (589) Q Consensus 429 ~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~----~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~ 504 (589) ++.+.+..++.+++++++..+..+|++++++++++++.++ ......++.|.|+|+++.|+. +.+++...++++| T Consensus 390 tEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~--~~~~~~~~~v~aG 467 (522) T protein:vir:47 390 TEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRH--AELDYWAKMVAAG 467 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHH--HHHHHHHHHHhcC Confidence 5666677777788899999999999999999999987643 223345678999999998854 4466777778899 Q ss_pred hhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCC Q lcl|NC_020883. 505 GQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEG 567 (589) Q Consensus 505 ~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg 567 (589) +||.++++++++ .|+++++++|++||++|++.++|...+.. + ++ .++.....+|| T Consensus 468 ~~s~e~~i~~~~-g~~eeea~~el~ri~~E~~~~~~~~~~~~----~--~~-~~~~~~~d~~~ 522 (522) T protein:vir:47 468 FSTKKRAIGKTL-NISGVEAEKELNAINSELLPMNDAELAIY----G--MH-DQNEEKADDKG 522 (522) T ss_pred CCCHHHHHHhcC-CCChHHHHHHHHHHHHhhccCCCCCCCCC----C--CC-CcccccCCCCC Confidence 999999999875 49999999999999999987655322211 1 11 11111122222 No 4 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=2.3e-73 Score=418.86 Aligned_cols=460 Identities=17% Similarity=0.190 Sum_probs=324.1 Q ss_pred CccceeccchhH---------HHHhhcc------------hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCccee Q lcl|NC_020883. 1 MIDWTVRGWTDK---------TTKNVHG------------DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTA 59 (589) Q Consensus 1 ~~~~~~~~~~~~---------~~~~~~~------------~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~ 59 (589) ||++. ..|.-| .+++.-+ -|.+++++|+|+|..+-.|+ .++.+. T Consensus 3 ~~~~~-k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~-------------~~~~~~- 67 (508) T protein:vir:15 3 LIQRI-KDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQA-------------SDGIKK- 67 (508) T ss_pred hHHHH-HHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCccccccc-------------CCCCcc- Confidence 55542 233222 2222211 25566799999997664442 111111 Q ss_pred eecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHH Q lcl|NC_020883. 60 RETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQI 139 (589) Q Consensus 60 ~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v 139 (589) ...++.+|+++.||++.|+||-+...+++- .+ .+..+++++++ T Consensus 68 ---~~~~~sln~~~~i~~~~A~lv~~e~~~i~v-----------------------~~-----------~~~~~e~l~~i 110 (508) T protein:vir:15 68 ---KRLKNTINMAKTAARRIASVVFNEKAEIHV-----------------------KD-----------NNEADKFLNDV 110 (508) T ss_pred ---ccceeecchHHHHHHHHHhhhhCCCceEEe-----------------------CC-----------chHHHHHHHHH Confidence 235788899999999999999333222221 00 01235689999 Q ss_pred HhhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecc--cccCcceeEEEeec--CC---CccceEEEEEe Q lcl|NC_020883. 140 TKNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFP--HDDEKGADLAYYID--HG---QYGQFLHIYRE 212 (589) Q Consensus 140 ~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P--~~d~~~~div~~~e--~~---~~~~~l~~~~~ 212 (589) +++++|+..+..+++++++.||++++||||+++++|.|++|||||| .+.++-.++|+..+ .+ +..+|.++++| T Consensus 111 l~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h 190 (508) T protein:vir:15 111 LEDNDFKNKFEEALEKGVALGGFAMRPYIDGNHIKIAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFH 190 (508) T ss_pred HHhccHHHHHHHHHHHHhhcCceEEEEEEeCCeeEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEE Confidence 9999999999999999999999999999999999999999999999 46666444444333 22 34468999887 Q ss_pred e-eccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEe---- Q lcl|NC_020883. 213 R-VEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYW---- 287 (589) Q Consensus 213 ~-~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyv---- 287 (589) . .+..+++|.|.+|++.. ....|..+++.+- ++ ++++.+ +..+||+++|+++|+ T Consensus 191 ~~~~~~~~~I~n~ly~~~~-------~~~lG~~v~l~~~----~e---------~~~l~~-~~~~~g~~~p~f~y~~~~~ 249 (508) T protein:vir:15 191 QWQDNGSYQITNELYKSDS-------PDIVGNQVPLSTL----PV---------YKELAP-QVTISGLQRPLFAYFKTPG 249 (508) T ss_pred EEecCcceEEEEEEEecCC-------chhcCcccchhhc----cc---------ccCCCc-ceEecCCCcceeEEecCCc Confidence 5 46668999999996411 1122443333221 11 222333 456799999999997 Q ss_pred cCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccc Q lcl|NC_020883. 288 ANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRD 367 (589) Q Consensus 288 PN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~d 367 (589) ||+....+|||+|||+++.+++|+||++||+++++| ++|++||+||++||+.. .++.. .+..+.+- T Consensus 250 ~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d------~~~~~-------~~~~~~~~ 315 (508) T protein:vir:15 250 ANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFD------DEHKP-------TFDTEQNV 315 (508) T ss_pred cccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCC------CCCcc-------ccCCCCee Confidence 567788999999999999999999999999999999 79999999999999743 11111 01111111 Q ss_pred cc-ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_020883. 368 ME-ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ 446 (589) Q Consensus 368 le-v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R 446 (589) +. +...++.|..++.+|++||+++|++.++.+++.++..+++|+++||+. ++|.+ |+++.+.+..++.+++..++ T Consensus 316 ~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~-~~~~~---TAtei~s~~~~~~~t~~~~~ 391 (508) T protein:vir:15 316 YVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYS-NDGVK---TATEVVSNNSMTYQTRSSYL 391 (508) T ss_pred EEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccc-cCccc---cHHHHHHHHHHHHHHHHHHH Confidence 11 222345566788899999999999999999999999999999999974 34444 45566656666667778888 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCcc------------cCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHH Q lcl|NC_020883. 447 KEYIDFLKELYESCLWLLNDQDSS------------IRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRR 514 (589) Q Consensus 447 ~~~~~aLk~li~~~l~L~~~~~~~------------~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~ 514 (589) ..|..+|++++++++++....+.. ....++.|.|+|+++.|+. +.++....++++|++|.++++++ T Consensus 392 ~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~--~~~~~~~~~v~aGi~s~e~~i~~ 469 (508) T protein:vir:15 392 TMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKD--KQLEEDAKVLAIGALSKQTFLQR 469 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHH--HHHHHHHHHHhcCCCCHHHHHHh Confidence 888899999999999887754211 1123578999999998855 34556666778899999999977 Q ss_pred hCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCC Q lcl|NC_020883. 515 MNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTE 570 (589) Q Consensus 515 Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ 570 (589) ||.|++|++++|++||++|++..++ .++. -.|.+.++.| T Consensus 470 -~~g~~deea~~el~ri~~E~~~~~~--~~~~--------------~~~~~g~~ge 508 (508) T protein:vir:15 470 -NYGMTDEQAAEELAKIQSEAPTDTF--EGGR--------------SAILNGGDGE 508 (508) T ss_pred -cCCCChHHHHHHHHHHHHhccccCc--cccc--------------cccCCCCCCC Confidence 6889999999999999999875322 1211 1133333333 No 5 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=3.8e-73 Score=417.62 Aligned_cols=471 Identities=13% Similarity=0.100 Sum_probs=329.1 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCcc----ccCHHHHHHHhhc-------cccceeccCcceeeecCcceEEE Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHE----LLFPRAKRLIEEG-------DAVGRFLDSSQTARETQTPYVIF 69 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~----~~f~ra~~~~~~~-------~~~~~~~~~~~~~~~~~~~y~~~ 69 (589) |==|++ +| .+=+-.|.|++. ++.++.+++.... -|.+++-+.-+ .+.+....|-. T Consensus 1 ~~~~~~-------~~------~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~-~~~~~~~~~~~ 66 (518) T protein:vir:78 1 MGVWSV-------MT------RFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGY-VPTVHDKLMNS 66 (518) T ss_pred Ccchhh-------HH------HHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCC-CCccccccccC Confidence 333321 11 122345666655 5666665554322 11222222211 23445567889 Q ss_pred EcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccc Q lcl|NC_020883. 70 NLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRH 149 (589) Q Consensus 70 n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~ 149 (589) |+++.||++.|+||-+.-.+|+-+-. +..+++. + +++++.|+++++|+..+ T Consensus 67 ~l~~~i~~~~A~ll~~e~~~i~v~~~--------------------------~~~d~e~--~-~~~l~~il~~n~f~~~~ 117 (518) T protein:vir:78 67 GTGNEIVVVAAEYISGKPLSIDVTGV--------------------------NGSKDEN--L-TKQLKEALRIDNFDSKS 117 (518) T ss_pred ChHHHHHHHHHHhhcCCCceEEecCc--------------------------cccCcHH--H-HHHHHHHHHhccHHHHH Confidence 99999999999999333333221000 0000110 1 67899999999999999 Q ss_pred hhhHHHHHHcCceeEEEEEecCceeEEEecCceeccc-ccCcceeEEEeecCC---CccceEEEEEeee--------ccc Q lcl|NC_020883. 150 WSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPH-DDEKGADLAYYIDHG---QYGQFLHIYRERV--------EKD 217 (589) Q Consensus 150 ~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~-~d~~~~div~~~e~~---~~~~~l~~~~~~~--------~~~ 217 (589) ..+++++++.||+++||||++++++|+|++||||||. .+++-..++|..+-. +...|.++++|.. ... T Consensus 118 ~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~ 197 (518) T protein:vir:78 118 VKIVELAGGSGVSAVKINILNGRPSISVHSSSQFWIDFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLS 197 (518) T ss_pred HHHHHHhhccCceEEEEEEECCeeEEEEEcCCeeEEEeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeeccc Confidence 9999999999999999999999999999999999995 677777788866532 2334777877753 234 Q ss_pred cceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecC----CCCC Q lcl|NC_020883. 218 GLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWAN----NETF 293 (589) Q Consensus 218 ~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN----~~~~ 293 (589) .++|.|.+|+. +.|+.+..... +..+.+.......+..+...+.||.+.|+++|+|| +... T Consensus 198 ~~~I~n~ly~~-----------~~~~~v~~~~~----~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~ 262 (518) T protein:vir:78 198 GGFVTYSVIKI-----------DGDKTTPISAE----RLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYP 262 (518) T ss_pred ceeEEEEEeee-----------cCccccccccc----ccccccccccccccCccceeeccCCccceEEeecccccccccc Confidence 56777777752 22332222211 11111111112223345567889999999999875 4556 Q ss_pred CCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccc-------ccccccccccccccccccccc Q lcl|NC_020883. 294 MNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIA-------YERDGHSAKEASMMTPRIDHR 366 (589) Q Consensus 294 ~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~-------~d~dge~~~~~~~~~~~~d~~ 366 (589) ++|||+|||+++.++||+||++||+++++| ++|++||+||++||++..... ++.+.+.+.. T Consensus 263 ~splG~S~~~~~~~~id~lD~~~s~~~~e~-~~g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~----------- 330 (518) T protein:vir:78 263 HLNLGESDLSQCTNYLFAVDYFFTVYMREG-EKTKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQ----------- 330 (518) T ss_pred CCCcCcchHhhhhHHHHHHHHHHHHHHHHH-HhCCceeeechhHhccCCCCCCCccccccCCCCceEEE----------- Confidence 788999999999999999999999999999 569999999999997642111 1112121111 Q ss_pred ccccccccccc----CccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHH Q lcl|NC_020883. 367 DMEITTFDENG----RSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKS 442 (589) Q Consensus 367 dlev~~~de~g----~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv 442 (589) +....+.| ..++.+|++||+++|++.++.+++.++..+++|+.+||+. ++. .|+++.+++..++.+++ T Consensus 331 ---i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~--~~~---~TATei~s~~~~~~~t~ 402 (518) T protein:vir:78 331 ---FKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLG--NRE---VKATEIWSLQDATVRKI 402 (518) T ss_pred ---ecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcc--ccc---ccHHHHHHHHHHHHHHH Confidence 11111111 2367789999999999999999999999999999999863 222 36777777778888899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCcc------cCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC Q lcl|NC_020883. 443 RRLQKEYIDFLKELYESCLWLLNDQDSS------IRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN 516 (589) Q Consensus 443 ~~~R~~~~~aLk~li~~~l~L~~~~~~~------~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh 516 (589) ++++..+..+|++++++++++.+.+++. ....++.|.|+|+++.|+. +.+++++.++++|+||++++|+++| T Consensus 403 ~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~--~~~~~~~~~v~aGimS~e~~i~~~~ 480 (518) T protein:vir:78 403 EKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLN--ELSSTLNNMNSALAMSVEEKVKLIH 480 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHH--HHHHHHHHHHhcCCCCHHHHHHHhC Confidence 9999999999999999999888765432 2234688999999998866 4588888899999999999999999 Q ss_pred CCCCHHHHHHHHHHHHhhccccc---cccccccccccccccCcccC Q lcl|NC_020883. 517 PDASEDWIQEEIARIEEEQAGSD---TSSLMGINQTFEQMNDNRDE 559 (589) Q Consensus 517 pdw~dE~v~eEv~RI~~E~a~~~---p~~~g~~~~~l~~~~~~~~~ 559 (589) |+|+||++++|++||++|++.++ |..+++. ++++. T Consensus 481 ~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~--------~~~~g 518 (518) T protein:vir:78 481 PKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGM--------ETKGG 518 (518) T ss_pred CCCCHHHHHHHHHHHHHHhcccCCCCCccccCC--------CCCCC Confidence 99999999999999999998653 3344433 22222 No 6 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=2.3e-69 Score=396.92 Aligned_cols=461 Identities=17% Similarity=0.173 Sum_probs=324.5 Q ss_pred Cccceeccchh--------HHHHhh------------cchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee Q lcl|NC_020883. 1 MIDWTVRGWTD--------KTTKNV------------HGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR 60 (589) Q Consensus 1 ~~~~~~~~~~~--------~~~~~~------------~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~ 60 (589) |++ .+++|-- |.++++ +.-|.+++++|+|+|.++..++.. + + T Consensus 3 ~~~-~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~-------------~----~ 64 (500) T protein:vir:30 3 VIQ-KIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTD-------------G----E 64 (500) T ss_pred hHH-HHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCC-------------C----C Confidence 333 1223322 222222 234666779999999888655311 1 1 Q ss_pred ecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH Q lcl|NC_020883. 61 ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT 140 (589) Q Consensus 61 ~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~ 140 (589) .....++.+|+++.||++.|+||-+...+|+. + .+..++++++++ T Consensus 65 ~~~~~~~slnl~~~i~~~~A~lv~~e~~~i~~---~--------------------------------d~~~~~~l~~il 109 (500) T protein:vir:30 65 TKKRDLNHLPIARTAAKKIASLVFNEQAEIKV---D--------------------------------DDAANEFISETL 109 (500) T ss_pred cccCceeecchHHHHHHHHhhhhcCCcceEec---C--------------------------------ChHHHHHHHHHH Confidence 11234778899999999999999333322222 0 012467899999 Q ss_pred hhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceeccc--ccCc--ceeEEEeec---CCCccceEEEEEee Q lcl|NC_020883. 141 KNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPH--DDEK--GADLAYYID---HGQYGQFLHIYRER 213 (589) Q Consensus 141 kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~--~d~~--~~div~~~e---~~~~~~~l~~~~~~ 213 (589) ++++|+..+..+++++.+.||++++|||++++++|.|++||||||. +..+ .+.+++... .++..+|+|+++|. T Consensus 110 ~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~ 189 (500) T protein:vir:30 110 KNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHE 189 (500) T ss_pred hhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEE Confidence 9999999999999999999999999999999999999999999994 3233 466665433 44566799998875 Q ss_pred -eccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEe----c Q lcl|NC_020883. 214 -VEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYW----A 288 (589) Q Consensus 214 -~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyv----P 288 (589) ...++++|+|.+|++. .....|..+++.+..++ + +.+...||+++|+++|+ | T Consensus 190 ~~~~~~~~I~n~ly~~~-------~~~~lG~~v~l~~~~~~---------------l-~~~~~~~~~~~p~f~~~~~~~~ 246 (500) T protein:vir:30 190 WQSSDDYVISNELYRSD-------DKAKVGSRVPLSEVYKD---------------L-KDEAKVTDVTRPIFTYLKTPGM 246 (500) T ss_pred EeCCceeEEEEEEEecc-------cccccCcccccccccCC---------------c-CcceEeccCCCccEEEecCCcc Confidence 4667899999999641 11133555554433222 1 22456799999999995 6 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDM 368 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dl 368 (589) |+....+|||+|+|+++.++||+||++||++++++ +.|++||+||++||+...... +|+..+. ..+..+.+-+ T Consensus 247 N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~-~~g~~~i~v~~~~l~~~~~~~---~g~~~~~---~~~d~~~~~~ 319 (500) T protein:vir:30 247 NNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEV-KMGQRRVAVPESLTALTVRTT---DGDVVPR---PRFESDQNVY 319 (500) T ss_pred ccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHH-HhCcceeeechHHhcccCCCC---CccccCC---cccCCCcceE Confidence 78889999999999999999999999999999999 569999999999997541111 1111110 0000111111 Q ss_pred c-ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHH Q lcl|NC_020883. 369 E-ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQK 447 (589) Q Consensus 369 e-v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~ 447 (589) . +...++.+..++.+|++||+++|++.++.+++++...+++|+.+||+. ++|.+ |+++.+.+..++.++++.++. T Consensus 320 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~-~~g~~---TAtei~s~~~~~~~t~~~~~~ 395 (500) T protein:vir:30 320 IRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFD-GKSMK---TATEIVSENSDTYQMRNSIVA 395 (500) T ss_pred EEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccC-cCccc---cHHHHHHHHHHHHHHHHHHHH Confidence 1 122234445688899999999999999999999999999999999974 34444 455666666667777888999 Q ss_pred HHHHHHHHHHHHHHHHHhhcC---ccc-CcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHH Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQD---SSI-RIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDW 523 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~~---~~~-~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~ 523 (589) .+..+|++++++++++.+.++ ... ...++.|.|+|+++.|+. +.+++...++++|+||.+++++++++ |++++ T Consensus 396 ~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~--~~~~~~~~~v~aGi~s~~~~i~~~~g-~~eee 472 (500) T protein:vir:30 396 LVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRD--AELDYWIKVVNAGFGTREMAIQKVLN-VTEEK 472 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHH--HHHHHHHHHHHcCCCCHHHHHHhcCC-CCHHH Confidence 999999999999999887542 222 223567999999998855 44666777788899999999999875 99999 Q ss_pred HHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCC Q lcl|NC_020883. 524 IQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEE 566 (589) Q Consensus 524 v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~de 566 (589) +++|++||++|+.... ..+... +.+.=| T Consensus 473 a~~~l~~i~~E~~~~~-~~~~~~--------------~~~~g~ 500 (500) T protein:vir:30 473 AQEIAAEINTGIVDEI-NQQRTD--------------THLYGE 500 (500) T ss_pred HHHHHHHHHHhccccC-CCCCcc--------------ccccCC Confidence 9999999999975321 111111 111111 No 7 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=2.3e-69 Score=396.92 Aligned_cols=461 Identities=17% Similarity=0.173 Sum_probs=324.5 Q ss_pred Cccceeccchh--------HHHHhh------------cchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee Q lcl|NC_020883. 1 MIDWTVRGWTD--------KTTKNV------------HGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR 60 (589) Q Consensus 1 ~~~~~~~~~~~--------~~~~~~------------~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~ 60 (589) |++ .+++|-- |.++++ +.-|.+++++|+|+|.++..++.. + + T Consensus 3 ~~~-~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~-------------~----~ 64 (500) T protein:vir:98 3 VIQ-KIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTD-------------G----E 64 (500) T ss_pred hHH-HHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCC-------------C----C Confidence 333 1223322 222222 234666779999999888655311 1 1 Q ss_pred ecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH Q lcl|NC_020883. 61 ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT 140 (589) Q Consensus 61 ~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~ 140 (589) .....++.+|+++.||++.|+||-+...+|+. + .+..++++++++ T Consensus 65 ~~~~~~~slnl~~~i~~~~A~lv~~e~~~i~~---~--------------------------------d~~~~~~l~~il 109 (500) T protein:vir:98 65 TKKRDLNHLPIARTAAKKIASLVFNEQAEIKV---D--------------------------------DDAANEFISETL 109 (500) T ss_pred cccCceeecchHHHHHHHHhhhhcCCcceEec---C--------------------------------ChHHHHHHHHHH Confidence 11234778899999999999999333322222 0 012467899999 Q ss_pred hhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceeccc--ccCc--ceeEEEeec---CCCccceEEEEEee Q lcl|NC_020883. 141 KNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPH--DDEK--GADLAYYID---HGQYGQFLHIYRER 213 (589) Q Consensus 141 kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~--~d~~--~~div~~~e---~~~~~~~l~~~~~~ 213 (589) ++++|+..+..+++++.+.||++++|||++++++|.|++||||||. +..+ .+.+++... .++..+|+|+++|. T Consensus 110 ~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~ 189 (500) T protein:vir:98 110 KNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHE 189 (500) T ss_pred hhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEE Confidence 9999999999999999999999999999999999999999999994 3233 466665433 44566799998875 Q ss_pred -eccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEe----c Q lcl|NC_020883. 214 -VEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYW----A 288 (589) Q Consensus 214 -~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyv----P 288 (589) ...++++|+|.+|++. .....|..+++.+..++ + +.+...||+++|+++|+ | T Consensus 190 ~~~~~~~~I~n~ly~~~-------~~~~lG~~v~l~~~~~~---------------l-~~~~~~~~~~~p~f~~~~~~~~ 246 (500) T protein:vir:98 190 WQSSDDYVISNELYRSD-------DKAKVGSRVPLSEVYKD---------------L-KDEAKVTDVTRPIFTYLKTPGM 246 (500) T ss_pred EeCCceeEEEEEEEecc-------cccccCcccccccccCC---------------c-CcceEeccCCCccEEEecCCcc Confidence 4667899999999641 11133555554433222 1 22456799999999995 6 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDM 368 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dl 368 (589) |+....+|||+|+|+++.++||+||++||++++++ +.|++||+||++||+...... +|+..+. ..+..+.+-+ T Consensus 247 N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~-~~g~~~i~v~~~~l~~~~~~~---~g~~~~~---~~~d~~~~~~ 319 (500) T protein:vir:98 247 NNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEV-KMGQRRVAVPESLTALTVRTT---DGDVVPR---PRFESDQNVY 319 (500) T ss_pred ccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHH-HhCcceeeechHHhcccCCCC---CccccCC---cccCCCcceE Confidence 78889999999999999999999999999999999 569999999999997541111 1111110 0000111111 Q ss_pred c-ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHH Q lcl|NC_020883. 369 E-ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQK 447 (589) Q Consensus 369 e-v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~ 447 (589) . +...++.+..++.+|++||+++|++.++.+++++...+++|+.+||+. ++|.+ |+++.+.+..++.++++.++. T Consensus 320 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~-~~g~~---TAtei~s~~~~~~~t~~~~~~ 395 (500) T protein:vir:98 320 IRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFD-GKSMK---TATEIVSENSDTYQMRNSIVA 395 (500) T ss_pred EEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccC-cCccc---cHHHHHHHHHHHHHHHHHHHH Confidence 1 122234445688899999999999999999999999999999999974 34444 455666666667777888999 Q ss_pred HHHHHHHHHHHHHHHHHhhcC---ccc-CcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHH Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQD---SSI-RIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDW 523 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~~---~~~-~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~ 523 (589) .+..+|++++++++++.+.++ ... ...++.|.|+|+++.|+. +.+++...++++|+||.+++++++++ |++++ T Consensus 396 ~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~--~~~~~~~~~v~aGi~s~~~~i~~~~g-~~eee 472 (500) T protein:vir:98 396 LVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRD--AELDYWIKVVNAGFGTREMAIQKVLN-VTEEK 472 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHH--HHHHHHHHHHHcCCCCHHHHHHhcCC-CCHHH Confidence 999999999999999887542 222 223567999999998855 44666777788899999999999875 99999 Q ss_pred HHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCC Q lcl|NC_020883. 524 IQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEE 566 (589) Q Consensus 524 v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~de 566 (589) +++|++||++|+.... ..+... +.+.=| T Consensus 473 a~~~l~~i~~E~~~~~-~~~~~~--------------~~~~g~ 500 (500) T protein:vir:98 473 AQEIAAEINTGIVDEI-NQQRTD--------------THLYGE 500 (500) T ss_pred HHHHHHHHHHhccccC-CCCCcc--------------ccccCC Confidence 9999999999975321 111111 111111 No 8 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=5.5e-67 Score=383.86 Aligned_cols=461 Identities=16% Similarity=0.197 Sum_probs=327.6 Q ss_pred Ccc----ceeccchhH-----HHHhh--c----------chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCccee Q lcl|NC_020883. 1 MID----WTVRGWTDK-----TTKNV--H----------GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTA 59 (589) Q Consensus 1 ~~~----~~~~~~~~~-----~~~~~--~----------~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~ 59 (589) |+| | ||+|--| .+++. | .-|.++|.+|+|+|..+..|...... . T Consensus 1 m~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~-----------~--- 65 (499) T protein:vir:80 1 MINQIIAG-VKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNG-----------N--- 65 (499) T ss_pred ChhHHHHH-HHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCC-----------C--- Confidence 666 4 4555433 34433 3 23678889999999988877432110 0 Q ss_pred eecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHH Q lcl|NC_020883. 60 RETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQI 139 (589) Q Consensus 60 ~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v 139 (589) ....-+|.+|+++.||+..|.+| +|+-.+.-. + + +..+|+++.+ T Consensus 66 -~~~~~~~s~n~~~~iv~~~a~~l---~~ep~~i~~--------------------~--------d----~~~~e~l~~~ 109 (499) T protein:vir:80 66 -PVNRRQLSMNLPKVTAKYMSKLL---FNEKVKINI--------------------D--------D----ETAEEFVLNV 109 (499) T ss_pred -ccccceeecchHHHHHHHHHHhh---hCCcceEee--------------------C--------C----HHHHHHHHHH Confidence 11234688999999999999999 554322111 0 0 1236789999 Q ss_pred HhhccccccchhhHHHHHHcCceeEEEEEe-cCceeEEEecCceecc--cccCcceeEEEeecCCCcc-ceEEEEEeeec Q lcl|NC_020883. 140 TKNSKLERRHWSNIVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFP--HDDEKGADLAYYIDHGQYG-QFLHIYRERVE 215 (589) Q Consensus 140 ~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P--~~d~~~~div~~~e~~~~~-~~l~~~~~~~~ 215 (589) +++++|+..+...++++.+.|+++++||+| ++.++|.|++|+|+|| .+.++-..++|......++ .|.++++|... T Consensus 110 ~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~ 189 (499) T protein:vir:80 110 LKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWK 189 (499) T ss_pred HhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEec Confidence 999999999999999999999999999999 5569999999999999 3656655566655544333 56777666543 Q ss_pred c---ccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEec---- Q lcl|NC_020883. 216 K---DGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWA---- 288 (589) Q Consensus 216 ~---~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvP---- 288 (589) . ..|+|+|.+|+.. .....|..+++.+- +...+.....+|+.+|+++|++ T Consensus 190 ~~~~~~y~I~n~~~~~~-------~~~~lG~~v~l~~~----------------~~~~~~~~~~~~~~~p~f~~~~~~~~ 246 (499) T protein:vir:80 190 GEKEEVYTVTTELYQSD-------DPNELGGKVSLKLL----------------FNDIEPVVPLPSLTRPTFIYIKPNIA 246 (499) T ss_pred ccceeeEEEEEEEEecc-------CccccCcccchhhh----------------ccCcCCceeecCCCccceEeecCCcc Confidence 3 3688888888631 11123444433221 1222334566899999999974 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDM 368 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dl 368 (589) |+....+|||+|||+++.+++|+||++||++++.++ .|+.||+||++||+...+. +|+..+.+ ..+...+ T Consensus 247 N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~-~~~~~i~v~~~~l~~~~~~----~g~~~~~~-----~~~~~~~ 316 (499) T protein:vir:80 247 NNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFK-LGKKKVLVPSSFVKTAVNL----DGSTTQYF-----DSTDEAF 316 (499) T ss_pred ccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHH-hcccceecchhhhhccCCC----CCCcccCC-----Cccccee Confidence 566789999999999999999999999999999995 5999999999999865322 12111110 0111111 Q ss_pred c--ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_020883. 369 E--ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ 446 (589) Q Consensus 369 e--v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R 446 (589) . .+..++++..++.+|+++|+++|++.++.++++++..+++|+++||+. ++|.+ |+++.+++..++.+++..++ T Consensus 317 ~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~-~~g~~---TAtei~s~~~~l~~~~~~~~ 392 (499) T protein:vir:80 317 FLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFD-ENGLK---TATEVVSEKSETYQTKNSHS 392 (499) T ss_pred eEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCC-cccch---hHHHHHHHHHHHHHHHHHHH Confidence 1 122334455677889999999999999999999999999999999974 33433 56777777777777788888 Q ss_pred HHHHHHHHHHHHHHHHHHhhcC----cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHH Q lcl|NC_020883. 447 KEYIDFLKELYESCLWLLNDQD----SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASED 522 (589) Q Consensus 447 ~~~~~aLk~li~~~l~L~~~~~----~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE 522 (589) ..|..+|++++++++++.+.++ ......++.|.|+|++|.|+++ .+++...++++|++|.+|+++++ |..+++ T Consensus 393 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~--~~~~~~~~~~~Gi~S~et~l~~~-~~~~d~ 469 (499) T protein:vir:80 393 QLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDT--TINRYTTAKNQGMIPLKIALQRA-WNITEA 469 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHH--HHHHHHHHHHcCCCCHHHHHhhc-CCCChH Confidence 8888999999999998877543 2344567889999999988654 46677778888999999998775 678899 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCC Q lcl|NC_020883. 523 WIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTE 570 (589) Q Consensus 523 ~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ 570 (589) ++++|++||++|++.+.|.+-.++ -.|++| T Consensus 470 ea~~el~~i~~E~~~~~~~~d~~g------------------~~ge~e 499 (499) T protein:vir:80 470 EADEWAEMLAKEKQAEIPNNDMTG------------------IFGEEE 499 (499) T ss_pred HHHHHHHHHHHHhhcCCCCCCccc------------------cCCCCC Confidence 999999999999986543221111 111111 No 9 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=1.8e-66 Score=381.09 Aligned_cols=462 Identities=17% Similarity=0.203 Sum_probs=330.8 Q ss_pred Cccce---eccc-----hhHHHHhh--cc----------hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee Q lcl|NC_020883. 1 MIDWT---VRGW-----TDKTTKNV--HG----------DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR 60 (589) Q Consensus 1 ~~~~~---~~~~-----~~~~~~~~--~~----------~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~ 60 (589) |++=- ||+| ..+.++.+ |- -|.+++.+|+|+|..+..|.... .+... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~-----------~~~~~-- 67 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEH-----------NGNPV-- 67 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhcc-----------CCCcc-- Confidence 54421 3333 33444444 32 36677899999998776553211 11111 Q ss_pred ecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH Q lcl|NC_020883. 61 ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT 140 (589) Q Consensus 61 ~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~ 140 (589) .+-+|.+|+++.|++..|.+| +|+-.+.-. + + +..+|+++.++ T Consensus 68 --~~~~~~~n~~k~i~~~~a~~l---~~~p~~i~~----------~------------------d----~~~~e~l~~~~ 110 (496) T protein:vir:38 68 --NRRQLSMNLPKVTAKYMSKLL---FNEKVKINI----------D------------------D----KAAEEFVLNVL 110 (496) T ss_pred --ccceeecchHHHHHHHHhhhh---hCCcceEee----------C------------------C----hHHHHHHHHHH Confidence 234688899999999999999 555444111 0 0 12356899999 Q ss_pred hhccccccchhhHHHHHHcCceeEEEEEe-cCceeEEEecCceeccc--ccCcceeEEEeecCCC-ccceEEEEEeeecc Q lcl|NC_020883. 141 KNSKLERRHWSNIVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFPH--DDEKGADLAYYIDHGQ-YGQFLHIYRERVEK 216 (589) Q Consensus 141 kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P~--~d~~~~div~~~e~~~-~~~~l~~~~~~~~~ 216 (589) ++++|+..+...+.++.+.|+++++||+| +++++|.|++|+|+||. +.++-..+||...... ...|+++++|.+.. T Consensus 111 ~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~ 190 (496) T protein:vir:38 111 KTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQG 190 (496) T ss_pred hccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeC Confidence 99999999999999999999999999999 56799999999999994 3333344566555333 34688999998888 Q ss_pred ccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEe----cCCCC Q lcl|NC_020883. 217 DGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYW----ANNET 292 (589) Q Consensus 217 ~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyv----PN~~~ 292 (589) ..+.|.|.+|+... ....|..+++.+..+ ..+....++|+++|+|+|+ ||+.. T Consensus 191 ~~~~I~~~~y~~~~-------~~~~g~~v~~~~~~~----------------~~~~~~~~~~~~~~~f~~~~~~~~N~~~ 247 (496) T protein:vir:38 191 DVYTVTTELYQSDD-------PNELGTKVSLTLLFD----------------DIEPVVPLPDFTRPTFIYIKPNIANNKN 247 (496) T ss_pred ceEEEEEEEEecCC-------ccccCcccccccccc----------------ccccceeecCCCcceEEEecCCcccccc Confidence 89999999996411 112244444433322 1223456789999999997 46678 Q ss_pred CCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccc-- Q lcl|NC_020883. 293 FMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEI-- 370 (589) Q Consensus 293 ~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev-- 370 (589) ..+|+|+|||+++.+++|+||++||++++.++ .|++||+||++||+...... |+..+. +..+...+.+ T Consensus 248 ~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~----g~~~~~-----~~~~~~~~~~~~ 317 (496) T protein:vir:38 248 LTSPLGISVYANALDTLKTLDLMFDSYYQEFK-LGKKKVLVPSSFVKTAVNLD----GSTTQY-----FDSTDEAFFLYQ 317 (496) T ss_pred cCCcCCCchHhhHHHHHHHHHHHHHHHHHHHh-hcccceecchHHhhccCCCC----CccccC-----CCCccceEEEee Confidence 89999999999999999999999999999995 59999999999998653221 111110 0011111111 Q ss_pred cccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_020883. 371 TTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYI 450 (589) Q Consensus 371 ~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~ 450 (589) ....+++..++.+++++|+++|++.++.++++++..+++|+++||+. ++|. .|+++.+.+...+.+++..++..+. T Consensus 318 ~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~-~~g~---~tAtei~~~~~~l~~~~~~~~~~~~ 393 (496) T protein:vir:38 318 GDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFD-ENGL---KTATEVVSEKSETYQTKNSHSQLIE 393 (496) T ss_pred cCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCC-cccc---chHHHHHHHHHHHHHHHHHHHHHHH Confidence 11233445577788999999999999999999999999999999974 3333 3667777777888888999999999 Q ss_pred HHHHHHHHHHHHHHhhc----CcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHH Q lcl|NC_020883. 451 DFLKELYESCLWLLNDQ----DSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQE 526 (589) Q Consensus 451 ~aLk~li~~~l~L~~~~----~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~e 526 (589) .+|++++++++++.+.+ +......++.|.|+|++|.|+.+. +++...++++|++|.+|++++ ||.++++++++ T Consensus 394 ~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~--~~~~~~~~~~GiiS~et~l~~-~~~~~d~ea~~ 470 (496) T protein:vir:38 394 QGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTT--INRYTNAKNQGMIPLKIALQR-AWNITEAEADE 470 (496) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHH--HHHHHHHHhcCCCCHHHHHHh-cCCCChHHHHH Confidence 99999999998887643 233445568899999999886654 556666778899999999876 58899999999 Q ss_pred HHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCC Q lcl|NC_020883. 527 EIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTE 570 (589) Q Consensus 527 Ev~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ 570 (589) |++||++|++++.|.+-+++ . .|++| T Consensus 471 el~ri~~E~~~~~~~~d~~~------~------------~~~~e 496 (496) T protein:vir:38 471 WAEMLAKEKQAEMPNNDMNG------I------------FGEEE 496 (496) T ss_pred HHHHHHHhhhccCccccccC------C------------CCCCC Confidence 99999999987655332221 0 01111 No 10 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1.1e-45 Score=266.97 Aligned_cols=446 Identities=14% Similarity=0.069 Sum_probs=287.5 Q ss_pred CccceeccchhHHHHhh----cchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNV----HGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIA 76 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~ 76 (589) |++... -.+.|+.. +.-|.++...|+|+|.+++.+-.+....... .+-.|..|+++.|+ T Consensus 29 ~~~~~~---i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~--------------~~~ki~~n~~~~iv 91 (481) T protein:vir:10 29 LLKEEN---LRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDK--------------ADHRAVHNYAKYVS 91 (481) T ss_pred hcCHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCcccccccccc--------------ccceeecchHHHHH Confidence 444321 22333332 2345666689999999988763332221111 12358899999999 Q ss_pred ccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH Q lcl|NC_020883. 77 EIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH 156 (589) Q Consensus 77 ~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~ 156 (589) +..+.++ +|+=.+.... +.. -++.+..+.+.++|...+......+ T Consensus 92 d~~~~~l---~g~~~~~~~~----------------------------d~~----~~~~l~~~~~~n~~~~~~~~~~~~~ 136 (481) T protein:vir:10 92 RFIVGYL---TGNPITITHQ----------------------------DNQ----TNDKIIELNDLNDADEVNSDLALNL 136 (481) T ss_pred HHHHhhh---ccCCceEecC----------------------------Chh----HHHHHHHHHHhcChhHHHHHHHHHH Confidence 9999988 4433221110 001 1346778889999999999999999 Q ss_pred HHcCceeEEEEEecCc-eeEEEecCceecccccCcceeEEEeecC-CCccceEEEEEeeeccccceeehhhhccccccch Q lcl|NC_020883. 157 QVDGGIVAAPVIDELG-PRIVFKARDVYFPHDDEKGADLAYYIDH-GQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGD 234 (589) Q Consensus 157 ~v~Gg~~~~~~~~~~~-~~i~f~~~d~~~P~~d~~~~div~~~e~-~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~ 234 (589) .+-|.+...+|++.++ ++|.+.+|++.||. |...- .+---++++|+......+....-.+|- +.. T Consensus 137 ~~~G~~~~~~~~d~dg~~~i~~~~p~~~~~v---------~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~----~~~ 203 (481) T protein:vir:10 137 SIYGRAYEIVYRDFEDRDTFKVLDPKSTFVV---------YDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYT----TDK 203 (481) T ss_pred HhcCeEEEEEEeCCCCeEEEEEEcccceEEE---------EcCCCCCceEEEEEEEEEeeCCCceEEEEEEEe----cCe Confidence 9999999999998544 99999999999994 22110 011112333333222221111112221 111 Q ss_pred hheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHH Q lcl|NC_020883. 235 VKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINW 314 (589) Q Consensus 235 ~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~ 314 (589) +++....+..+..++ ..+.++.+..|++++|+ .+|+|+|+++.+++|++|. T Consensus 204 i~~~~~~~~~~~~~~------------------------~~~~~~g~vPvv~~~n~-----~~g~~~~~~v~~lida~~~ 254 (481) T protein:vir:10 204 IYYIEIKGGTYHRVE------------------------EVEHYYNDVPIIEYLND-----QFKQGDFENVIALIDLYDS 254 (481) T ss_pred EEEEEecCCceeecc------------------------cccccCCceeEEEeecC-----CCCCCchhhHHHHHHHHHH Confidence 112222232222111 12234455558888884 4799999999999999999 Q ss_pred HHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHH Q lcl|NC_020883. 315 TITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMD 394 (589) Q Consensus 315 t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~ 394 (589) ++|+.+..++.++.|.+.++...... ++++......+....+... .....+.+..+++++|++..+.... T Consensus 255 ~~s~~~~~~~~~~~~~~~~~g~~~~~------~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~l~~~~~~~~~~~ 324 (481) T protein:vir:10 255 AQSDTANYMTDLNDAMLAIIGNVDLD------SEDAKAFRDANMIHLEPGT----NANGSEGKAEVKYVYKQYDVAGVEA 324 (481) T ss_pred HHHHHHHHHHHhcCceeEeecCcCCC------ccchhhhhhccceeccccc----cccCCCCCcceeEEeecCCHHHHHH Confidence 99999999999999999886432221 1222222211111111100 1112334456789999999999999 Q ss_pred HHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CcccCc Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQ-DSSIRI 473 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~-~~~~~~ 473 (589) +++.|.+.|+..+++|..+|+..+ ++.||+|++..+..+..|+.+++..|.++|++++++++.+.+.. ...... T Consensus 325 ~~~~l~~~i~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 399 (481) T protein:vir:10 325 YKKRLQNDIHKYTNTPDLNDEQFS-----GVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNY 399 (481) T ss_pred HHHHHHHHHHHHhCCccccccccc-----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc Confidence 999999999999999988887532 24599999999999999999999999999999998877666543 333455 Q ss_pred ccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccc Q lcl|NC_020883. 474 EEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQM 553 (589) Q Consensus 474 e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~ 553 (589) ..+.|.|++.+|.++++. |+++..+. +++|.+|+++++ |..++ +++|++||++|+....+.....+ T Consensus 400 ~~i~v~f~~~~~~~~~~~--a~~~~kl~--g~is~et~~~~l-~~i~d--~~~E~~ri~~E~~~~~~~~~~~~------- 465 (481) T protein:vir:10 400 AELTITFTPNLPKSMMES--INAFNALS--GGVSESTRLSLL-DFIDN--PKEELEKMQEEEAQREKQADKRG------- 465 (481) T ss_pred ceeeEEeCCCCCcCHHHH--HHHHHHHh--ccCChHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHhhhhhcc------- Confidence 567899999999886654 66776654 589999999876 65554 56899999998876544333222 Q ss_pred cCcccCCCCCCCCCCC Q lcl|NC_020883. 554 NDNRDEDGNIIEEGDT 569 (589) Q Consensus 554 ~~~~~~~~~p~deg~~ 569 (589) +....+++.+.|+|++ T Consensus 466 ~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 466 YGEAFENHLNVDDSNG 481 (481) T ss_pred CCccCCCCCCCCCCCC Confidence 2222333444455555 No 11 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=2.6e-44 Score=259.51 Aligned_cols=456 Identities=13% Similarity=0.070 Sum_probs=286.1 Q ss_pred CccceeccchhHHHHhh----cchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNV----HGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIA 76 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~ 76 (589) +-+|+ |-.+.|... +--|.++++.|+|+|+.++.|... ..+...+-.|+.|+++.|+ T Consensus 38 ~~~~~---~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~----------------~~~~~~~~ri~~n~~k~Iv 98 (501) T protein:vir:96 38 VNNWE---LLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR----------------KDNEMADKRAVHNYGRMIS 98 (501) T ss_pred CChHH---HHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCcccc----------------CccccccceeecchHHHHH Confidence 23332 223333322 123677778999999888877211 0011123469999999999 Q ss_pred ccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH Q lcl|NC_020883. 77 EIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH 156 (589) Q Consensus 77 ~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~ 156 (589) +..+.++ +|+=.+.... . ...+.. -+++++++.+.++|...+.....++ T Consensus 99 d~~~~yl---~g~p~~~~~~------------------~------~~~~~~----~~~~l~~~~~~n~~~~~~~~~~~~~ 147 (501) T protein:vir:96 99 KFKTGYL---AGNPIRVEYD------------------D------NDDNSQ----NDDAIKRIGRINDLDSLNRTLIRDL 147 (501) T ss_pred HHHhhhh---cccCeeEeeC------------------C------ccchhH----HHHHHHHHHHhcCHHHHHHHHHHHH Confidence 9999888 5442221110 0 001111 1568899999999999999999999 Q ss_pred HHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhccccccch Q lcl|NC_020883. 157 QVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGD 234 (589) Q Consensus 157 ~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~ 234 (589) .+-|.....+|++. +.++|.+.+|.+.||. |... .++-.-+|++|+........ ....+| .+.. T Consensus 148 ~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v---------~d~~~~~~~~~~v~~~~~~~~~~~~-~~~~vy----t~~~ 213 (501) T protein:vir:96 148 SQTGRAYEVIYRSEYDETRIKRLSPLETFVI---------YDNSLEDNSIAAVRYYNRGTLQSAK-DVVEIY----TDEH 213 (501) T ss_pred hhcCeEEEEEEEcCCCceEEEEEccceeEEE---------EcCCCCCceEEEEEEEEeecCCCcE-EEEEEE----cCCc Confidence 99999999999984 4599999999999993 3221 11112233444332221111 111122 0011 Q ss_pred hheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHH Q lcl|NC_020883. 235 VKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINW 314 (589) Q Consensus 235 ~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~ 314 (589) ++.....+.. .+......+..++.|++++|+ ++|+|||+++.+++|++|. T Consensus 214 i~~~~~~~~~-------------------------~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~v~~liDa~d~ 263 (501) T protein:vir:96 214 IYTLDASDDF-------------------------NEISVTTHAFGTVPITEYLNN-----IDGIGDYETELYLIDLYDS 263 (501) T ss_pred EEEEeeCCCc-------------------------eeccccccCCCccceEEecCC-----ccCCCchhhhHHHHHHHHH Confidence 1111111110 011122345566668888884 5799999999999999999 Q ss_pred HHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHH Q lcl|NC_020883. 315 TITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMD 394 (589) Q Consensus 315 t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~ 394 (589) ++|+.+..++.++.|.+.+.-..++.. +..+......++.. ............+..+++++|++..+.... T Consensus 264 ~~s~~~~~~~~~~~~~l~i~G~~~~~~-----~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 334 (501) T protein:vir:96 264 AESDTANHMSDMADAILAIYGDLALPK-----GMQASDMKRTRLMQ----LKPPKSADGKEGTVKAEYLTKSYDVSGAEA 334 (501) T ss_pred HHHHHHHHHHHhcCceeeeecccccCc-----ccchhhhhhcCeee----ecccccccccccCcceeeEeccCCHHHHHH Confidence 999999999999999887643222110 11111111111110 000111222344556788999999999999 Q ss_pred HHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CcccC Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQ--DSSIR 472 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~--~~~~~ 472 (589) +++.|.+.|+..+++|..+++..+ ++.||+|++..+..+..|+..++..|..+|++++++++.+.+.. +.... T Consensus 335 ~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d 409 (501) T protein:vir:96 335 YKTRLNRDIHIFTNTPDMSDTNFS-----GNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFD 409 (501) T ss_pred HHHHHHHHHHHHhCCcccCccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 999999999999999988887432 34599999999999999999999999999999998887766543 23344 Q ss_pred cccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc Q lcl|NC_020883. 473 IEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ 552 (589) Q Consensus 473 ~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~ 552 (589) ...+.|.|.+.+|.++++. |+++..+. +++|.+|+++++ |.+++ +++|++||++|+..+++.+.......... T Consensus 410 ~~~i~i~f~~~~p~n~~e~--ad~~~kl~--g~iS~et~~~~l-~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 482 (501) T protein:vir:96 410 ESLLKITFTPNLPKSLNEQ--VSILTGLG--GQVSQETALSLS-GLVES--PNEELDKINKEMSEIDFKGYSNDFNEHVG 482 (501) T ss_pred cccceEEeCCCCCcCHHHH--HHHHHHHh--ccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHhhccccccchhhccc Confidence 5567899999999886655 66776664 589999999976 65665 56899999999876544333222111111 Q ss_pred ccCcccCCCCCCCCCCCCCCCCcchhhhhhc Q lcl|NC_020883. 553 MNDNRDEDGNIIEEGDTEEEPSAEENEEIEK 583 (589) Q Consensus 553 ~~~~~~~~~~p~deg~~~eep~~~~~e~~~~ 583 (589) . +++++.|..++|.|+-+. T Consensus 483 ~------------~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 483 K------------YTDEVKETHTDDFEREYE 501 (501) T ss_pred c------------cCCcCCCCCCCccccccC Confidence 1 112222222222222222 No 12 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=1.5e-44 Score=260.86 Aligned_cols=447 Identities=14% Similarity=0.129 Sum_probs=269.9 Q ss_pred Cccce---------------------eccchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCc Q lcl|NC_020883. 1 MIDWT---------------------VRGWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~~~~---------------------~~~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~ 56 (589) |+|-. ..-|-.+.|+..- ..|.+.++.|+|+|. +..|..+...+++.. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~-i~~~~~~~~~~~~~~------- 72 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPD-ILDAPPKRDVNGDYD------- 72 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-hhccccccccccccc------- Confidence 65531 1122233333211 235555789999995 554532221111111 Q ss_pred ceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHH Q lcl|NC_020883. 57 QTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEII 136 (589) Q Consensus 57 ~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i 136 (589) +...+-.|+.|+++.|++..+.++ +|+=.+.-. +.+ -.++++ T Consensus 73 ---~~~~~~ki~~n~~~~ivd~~~~~l---~g~~~~~~~----------~~d----------------------~~~~~l 114 (478) T protein:vir:10 73 ---ETKPDWRMYTNYHQNLVDQKVAYA---VANPVTFGV----------DND----------------------KALKQI 114 (478) T ss_pred ---cccccceeccchHHHHHHHHHhhh---ccCCeeeec----------CCh----------------------HHHHHH Confidence 122344799999999999999998 554333111 000 123456 Q ss_pred HHHHhhccccccchhhHHHHHHcCceeEEEEEe-cCceeEEEecCceeccc-ccCc-c--eeEE-EeecCCCccceEEEE Q lcl|NC_020883. 137 EQITKNSKLERRHWSNIVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFPH-DDEK-G--ADLA-YYIDHGQYGQFLHIY 210 (589) Q Consensus 137 ~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P~-~d~~-~--~div-~~~e~~~~~~~l~~~ 210 (589) .+++. ++|.........++.+-|..+..+|++ ++.+++.+.++++.||. ++.. + ...+ |....+ ..++.+| T Consensus 115 ~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~--~~~~~~y 191 (478) T protein:vir:10 115 QHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG--AERVEYW 191 (478) T ss_pred HHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecC--ceEEEEE Confidence 66665 467777777888899999999999998 45599999999999994 3322 1 1111 111111 1112222 Q ss_pred Eeeeccccceeehhhhccccccchhheeeccccccc-ccccccccchhhhhhcccCCccccccccccCCCCcceEEEecC Q lcl|NC_020883. 211 RERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVT-NVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWAN 289 (589) Q Consensus 211 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~-~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN 289 (589) +.. .+. ++....+..+. .... .-++...........+..+..|++++| T Consensus 192 ~~~------~i~-------------~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~vPvv~~~n 240 (478) T protein:vir:10 192 TKD------DVT-------------YYELKEGQLIPDFYRS------------DDHIQPHYYQGNKLMSWGRVPFIPFKN 240 (478) T ss_pred eCC------eEE-------------EEEEcCCeeecccccc------------ccccccceecccccccCCccceEEecc Confidence 110 000 01111111000 0000 001111111122335566666888888 Q ss_pred CCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccc Q lcl|NC_020883. 290 NETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDME 369 (589) Q Consensus 290 ~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dle 369 (589) +++|+|||+++.+++|++|.++|+.+..++.+++|.+.+. ++ ..+..++.. ..+. ...-+. T Consensus 241 -----~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~-g~-------~~~~~~~~~--~~~~----~~~~~~ 301 (478) T protein:vir:10 241 -----NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK-GY-------EGEDMKDFM--HNLK----YYKAIS 301 (478) T ss_pred -----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee-cC-------Cccccchhh--hhhh----hcceEE Confidence 4689999999999999999999999999999999976542 21 111111110 0000 111122 Q ss_pred ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020883. 370 ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEY 449 (589) Q Consensus 370 v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~ 449 (589) + ..+.|..+++++|++..+....+++.|.+.|+..+++|..+++.. +++.||+|++++++.+..|+.+++..| T Consensus 302 ~--~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-----~~n~Sg~Al~~~~~~l~~k~~~~~~~~ 374 (478) T protein:vir:10 302 V--AGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-----GNSPSGIALKFMYSNLDLKANKLKNKT 374 (478) T ss_pred e--cCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccc-----ccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 123456688999999999999999999999999999998777542 235699999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHH Q lcl|NC_020883. 450 IDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIA 529 (589) Q Consensus 450 ~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~ 529 (589) ..+|++++++++.+. +..+....+.|.|.+.+|.++++. |++++.+ ++++|.+|+++++ |.+++ +++|++ T Consensus 375 ~~~l~~~~~li~~~~---g~~~~~~~i~i~f~~~~p~d~~e~--a~~~~kl--~g~iS~et~~~~l-~~v~D--~~~E~~ 444 (478) T protein:vir:10 375 LTALQELLQYIIDFY---RLDVKVQDIEITFNFNVMVNELEN--SQIAMNS--TGLLSKETILSNH-AWVED--PVAEME 444 (478) T ss_pred HHHHHHHHHHHHHHh---CCCcccccceEEecCCCCCCHHHH--HHHHHHH--hCCCChHHHHHhC-CCCCC--HHHHHH Confidence 999999988776554 334556678899999999886654 7777665 5689999999876 76665 568999 Q ss_pred HHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcch Q lcl|NC_020883. 530 RIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEE 577 (589) Q Consensus 530 RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~ 577 (589) ||++|+...... +... ..+ +.++.+.+++-..+| T Consensus 445 ri~~E~~~~~~~-~~~~---~~~----------~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 445 RIEQENIELNQQ-LPDI---EEG----------LNGEQQRQSENNQPE 478 (478) T ss_pred HHHHHHHHHHhh-cccc---ccc----------cCCCCCCCCCCCCCC Confidence 999887542111 1100 000 111111111111111 No 13 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=3.1e-44 Score=259.09 Aligned_cols=444 Identities=15% Similarity=0.170 Sum_probs=271.2 Q ss_pred Ccc--ce-------------------eccchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCc Q lcl|NC_020883. 1 MID--WT-------------------VRGWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~~--~~-------------------~~~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~ 56 (589) |++ |- ..-|-.+.|+..- ..|.++++.|+|+| +|..|......++... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~-~i~~~~~~~~~~~~~~------- 72 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDP-DVLRLAPKLDNKGEID------- 72 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCC-cchhccchhccccccc------- Confidence 322 21 1122234444322 34555557899998 7877754443322211 Q ss_pred ceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHH Q lcl|NC_020883. 57 QTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEII 136 (589) Q Consensus 57 ~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i 136 (589) +..-+-.|+.|+++.|++..+.++ +|+=.+.... +. -.++.+ T Consensus 73 ---~~~~~~ki~~n~~~~Ivd~~~~~l---~g~p~~~~~~----------------------------d~----~~~~~l 114 (474) T protein:vir:96 73 ---PLKPDWRMFTNYHQNLVDQKVAYA---VANPVTFSSD----------------------------DD----KSLKTI 114 (474) T ss_pred ---ccccchhcccchHHHHHHhhhhhh---cccCceeecC----------------------------ch----HHHHHH Confidence 111234688999999999999998 5543332110 00 113345 Q ss_pred HHHHhhccccccchhhHHHHHHcCceeEEEEEec-CceeEEEecCceeccc-ccCc-c---eeEEEeecCCCccceEEEE Q lcl|NC_020883. 137 EQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPH-DDEK-G---ADLAYYIDHGQYGQFLHIY 210 (589) Q Consensus 137 ~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~-~d~~-~---~div~~~e~~~~~~~l~~~ 210 (589) .+++.| ++.........++.+-|..+..+|++. +.++|.+.+|++.||. ++.. + +.+-|.... ...++.+| T Consensus 115 ~~~~~n-~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~--~~~~~~~y 191 (474) T protein:vir:96 115 QEVLNH-KWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLD--GAERVEYW 191 (474) T ss_pred HHHHhc-CHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--CceEEEEE Confidence 566554 555556666778899999999999984 4599999999999994 2221 1 111111111 11122222 Q ss_pred EeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC Q lcl|NC_020883. 211 RERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN 290 (589) Q Consensus 211 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~ 290 (589) ... .+.+ | ....+.......+ .. .+++..........+..+..|++++|+ T Consensus 192 t~~------~v~~--~-----------~~~~~~~~~~~~~--~~---------~~~~~~~~~~~~~~~~g~iPvv~~~nn 241 (474) T protein:vir:96 192 TDS------DVTY--Y-----------EYQDGILIPDYYH--GE---------EHIQSHYYVGNKRVSWGRVPFIPFKNN 241 (474) T ss_pred eCC------eEEE--E-----------EecCCceeecccc--cc---------ccccccccccccccCCCceeEEEeccC Confidence 111 0000 0 0011111100000 00 011112222234456777778999984 Q ss_pred CCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccc Q lcl|NC_020883. 291 ETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEI 370 (589) Q Consensus 291 ~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev 370 (589) ++|+|||+++.+++|++|.++|+.++.++.++.|.+.+.- ...+..++... +.....+ T Consensus 242 -----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g--------~~~~~~~~~~~---------~~~~~~~ 299 (474) T protein:vir:96 242 -----PQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKG--------YEGQDLDEFMR---------NLKYYKA 299 (474) T ss_pred -----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeec--------CCcccccchhh---------hhhcCce Confidence 5799999999999999999999999999999999776532 11111111111 1111223 Q ss_pred cccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_020883. 371 TTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYI 450 (589) Q Consensus 371 ~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~ 450 (589) +..+++|..+++++|++..+....+++.|.++|+..+++|..+|+.. +++.||+|+++++..+..|+.+++..|. T Consensus 300 i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-----~~n~Sg~Al~~~~~~l~~k~~~k~~~~~ 374 (474) T protein:vir:96 300 INVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-----GNSPSGIALKFMYSNLDLKANKLKNKTL 374 (474) T ss_pred EEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-----ccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33344556688999999999999999999999999999998877532 2346999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHH Q lcl|NC_020883. 451 DFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIAR 530 (589) Q Consensus 451 ~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~R 530 (589) ++|++++++++.+. +.......+.|.|.+.+|.++++. |++ +..++++|.+|+++++ |.+++ +++|++| T Consensus 375 ~~l~~~~~~i~~~~---~~~~~~~~i~i~f~~~~p~~~~e~--~~~---~~~ag~iS~et~~~~~-~~v~d--~~~E~~r 443 (474) T protein:vir:96 375 TALQELLQYIIDFY---KLNIKVQDVEITFNFNVMVNELEQ--SQI---GVQSQYLSKETVVTNH-PWVDD--PVAELER 443 (474) T ss_pred HHHHHHHHHHHHHh---CCCcccceeeEEeccCCCcCHHHH--HHH---HHhcCCCchHHHHHhC-CCCCC--HHHHHHH Confidence 99999998876554 334455667899999999986654 444 3457899999999865 77765 5689999 Q ss_pred HHhhccccccccccccccccccccCcccCCCCCCCCCCCCC Q lcl|NC_020883. 531 IEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEE 571 (589) Q Consensus 531 I~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~e 571 (589) |++|+.... ..+. +.. .++++...|.+.+.+ T Consensus 444 i~~E~~e~~-~~~~----~~~-----~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 444 IEQDNIDFN-KQLP----PLE-----GDANGRAQDNESETN 474 (474) T ss_pred HHHHHHHHH-hccc----ccc-----cccccccCCCcccCC Confidence 998876421 1111 000 011111111111111 No 14 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=5.6e-44 Score=257.69 Aligned_cols=463 Identities=14% Similarity=0.061 Sum_probs=276.9 Q ss_pred Ccccee--ccc--hhHHHHhh---cc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MIDWTV--RGW--TDKTTKNV---HG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~~~~~--~~~--~~~~~~~~---~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) ...|.. .-+ +-+.++.+ |- .|.++++.|+|+|..+..+....-. .. -+-.|+ T Consensus 28 ~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~------------~~----~~~ki~ 91 (511) T protein:vir:99 28 VYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEE------------YM----ADNRVA 91 (511) T ss_pred ccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCccccc------------cc----Ccceee Confidence 233321 111 11222222 21 3566778999999877665321111 00 122589 Q ss_pred EEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccccc Q lcl|NC_020883. 69 FNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERR 148 (589) Q Consensus 69 ~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~ 148 (589) +|+++.|++..+.++ +|+=.+.-. +. .+. ++++..+.+.++|... T Consensus 92 ~n~~k~Iv~~~~~yl---~g~p~~~~~----------~d------------------~~~----~~~l~~~~~~n~~~~~ 136 (511) T protein:vir:99 92 HDYASYISDFINGYF---LGNPIQYQD----------DD------------------KDV----LEAIEAFNDLNDVESH 136 (511) T ss_pred cchHHHHHHHHHhhh---cccCceeec----------Cc------------------hHH----HHHHHHHHhhcCHhHH Confidence 999999999999988 554333110 00 001 4588899999999999 Q ss_pred chhhHHHHHHcCceeEEEEEe-cCceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeecc--ccceeeh- Q lcl|NC_020883. 149 HWSNIVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEK--DGLRTTN- 223 (589) Q Consensus 149 ~~~~l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~--~~~~~~~- 223 (589) +......+.+-|..+..+|++ ++.+++...+|.+.||. |... ..+---++++|+..... ..-.+.+ T Consensus 137 ~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~v---------yd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~ 207 (511) T protein:vir:99 137 NRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVI---------YDNTIERNSIAGVRYLRTKPIDKTDEDEVFTV 207 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEE---------EcCCCCCceEEEEEEEEeeecccCccceEEEE Confidence 999999999999999999998 45599999999999993 3211 00101122222211100 0000000 Q ss_pred hhhccccccchhhee-ecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch Q lcl|NC_020883. 224 MLYPVVKAKGDVKKE-IKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL 302 (589) Q Consensus 224 ~~y~~~~~~~~~~~~-~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~ 302 (589) .+|- +..+++. ..++..+. ....+......+.....|++++|+ +.|+||| T Consensus 208 ~vyt----~~~i~~~~~~~~~~~~--------------------~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~ 258 (511) T protein:vir:99 208 DLFT----SHGVYRYLTSRTNGLK--------------------LTPRENGFESHSFERMPITEFSNN-----ERRKGDY 258 (511) T ss_pred EEEe----CCcEEEEEecCCcccc--------------------ccccccccccCCCCccceEEecCC-----CCCCCch Confidence 0110 0000000 01110000 000111223345566678888884 4799999 Q ss_pred hhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccce Q lcl|NC_020883. 303 DNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEI 382 (589) Q Consensus 303 ~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~ 382 (589) +++.+++|++|.++|+.+..++.+++|.+.+.-...... .........++................+.+..+.+ T Consensus 259 e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 332 (511) T protein:vir:99 259 EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDP------VEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGY 332 (511) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCc------hhhcccccccceecccccccccccccCCCCcceeE Confidence 999999999999999999999999999776532111110 11111111111111000000011112344566889 Q ss_pred eeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 383 HQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLW 462 (589) Q Consensus 383 iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~ 462 (589) ++|+...+....+++.|.+.|+..+++|..+++..+ ++.||+|+++.++.+..|+.+++..|.++|++++++++. T Consensus 333 l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-----gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~ 407 (511) T protein:vir:99 333 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999998876432 245999999999999999999999999999999988876 Q ss_pred HHhhcC---cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccc Q lcl|NC_020883. 463 LLNDQD---SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSD 539 (589) Q Consensus 463 L~~~~~---~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~ 539 (589) +....+ .......+.|.|.+.+|.+.++. ++++..+. +++|.||+++++ |.+++ +++|++||++|+..+. T Consensus 408 ~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~--~~~~~kl~--GiiS~et~l~~l-~~v~D--~~~E~~ri~~E~~~~~ 480 (511) T protein:vir:99 408 ILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEE--LKAYIDSG--GKISQTTLMSLF-SFFQD--PELEVKKIEEDEKESI 480 (511) T ss_pred HHHhcCCcccccccccceEEeCCCCCcCHHHH--HHHHHHHh--ccCCHHHHHHhC-CCCCC--HHHHHHHHHHHHHHHH Confidence 654332 22334456899999999887655 66766654 689999999876 76665 6799999999987543 Q ss_pred cccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 540 TSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 540 p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) +...........+. ++.++ +++++..+++.| T Consensus 481 ~~~~~~~~~~~~~~-~~~~~--------~~~~~~~~d~~e 511 (511) T protein:vir:99 481 KKAQKNMYQDPRNI-NDDEQ--------DDSTKDSIDKKE 511 (511) T ss_pred HHHhhcccccCCCC-CCCCC--------CCCCcCcccccC Confidence 33332221111111 11111 111111111111 No 15 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=4.4e-44 Score=258.26 Aligned_cols=442 Identities=16% Similarity=0.185 Sum_probs=270.2 Q ss_pred Cccc-----------eec------cchhHHHHhh---c----chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCc Q lcl|NC_020883. 1 MIDW-----------TVR------GWTDKTTKNV---H----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~~~-----------~~~------~~~~~~~~~~---~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~ 56 (589) ||+- .|. ..+.+.|+.+ | ..|.++++.|+|+|.-++.+..... +++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~-~~~~~------- 72 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNV-KGEID------- 72 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccc-ccccc------- Confidence 5442 222 2233333322 2 3466666999999955554422211 11110 Q ss_pred ceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHH Q lcl|NC_020883. 57 QTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEII 136 (589) Q Consensus 57 ~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i 136 (589) .+ ..+--|+.|+++.|++..+.++ +|+=.+.-. + + +-.++++ T Consensus 73 ~~---~~~~ki~~n~~~~Iv~~~~~~l---~g~p~~~~~----------~------------------d----~~~~~~l 114 (468) T protein:vir:96 73 PF---KPDWRMYTNYHQNLVDQKVAYA---VANPVTYGT----------E------------------D----EKSLKTI 114 (468) T ss_pred cc---ccccccccchHHHHHHHHHhhh---ccCCceecc----------C------------------C----hHHHHHH Confidence 01 1123588999999999999999 554333111 0 0 1124467 Q ss_pred HHHHhhccccccchhhHHHHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeecCCCc-cceEEEEEeee Q lcl|NC_020883. 137 EQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQY-GQFLHIYRERV 214 (589) Q Consensus 137 ~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~-~~~l~~~~~~~ 214 (589) .++++| +|.........++.+-|..+..+|++. +.++|.+.+|++.||. |......+ --+++.|.... T Consensus 115 ~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v---------~~~~~~~~~~~~ir~~~~~~ 184 (468) T protein:vir:96 115 QEVLNH-KWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPAEQAIPI---------WTNKERDELKAFIRLYELDG 184 (468) T ss_pred HHHHhc-CHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccceEEE---------EcCCCCCceEEEEEEEEecC Confidence 777764 566666777788999999999999984 4599999999999993 32221111 11222221110 Q ss_pred ccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCC Q lcl|NC_020883. 215 EKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFM 294 (589) Q Consensus 215 ~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~ 294 (589) ... . .+|-...+. ++....+.... ..... .-++.........+.+..+..|++++|+ T Consensus 185 -~~~--~--~~~~~~~~~---~~~~~~~~~~~--~~~~~---------~~~~~~~~~~~~~~~~~~~iPvv~~~n~---- 241 (468) T protein:vir:96 185 -GER--V--EYWTANDVT---FYELKDGQLIP--DYYQG---------EEHVQAHYYVGNKSMSWNRVPFIPFKNN---- 241 (468) T ss_pred -ceE--E--EEEeCCeEE---EEEEcCCceee--ccccc---------ccccccceeeccccccCCcccEEEecCC---- Confidence 000 0 111100000 00111111000 00000 0001111112223456677778888884 Q ss_pred CcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 295 NPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFD 374 (589) Q Consensus 295 ~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~d 374 (589) +.|+|||+++.+++|+||.++|+.++.++.++.|.+.++-..++. .++... .+. ....+.+. . T Consensus 242 -~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~--------~~~~~~--~~~----~~~~i~~~--~ 304 (468) T protein:vir:96 242 -PQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGED--------LEEFMY--NLK----YYKAINVD--G 304 (468) T ss_pred -CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc--------cchhhh--hhh----cCceEEec--C Confidence 579999999999999999999999999999999988765332221 111100 000 11112222 2 Q ss_pred cccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 375 ENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLK 454 (589) Q Consensus 375 e~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk 454 (589) +.+..+++++|++..+.+..+++.|.++|+..+++|..+++.. +++.||+|+++.++.+..|+..++..|.++|+ T Consensus 305 d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-----~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~ 379 (468) T protein:vir:96 305 DGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKF-----GNSPSGIALKFMYSNLDLKANKLKNKTLTALQ 379 (468) T ss_pred CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3345578999999999999999999999999999998766532 23569999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhh Q lcl|NC_020883. 455 ELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEE 534 (589) Q Consensus 455 ~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E 534 (589) +++++++.+. +..+....+.|.|.+.+|.++++. |+++ ..++++|.+|+++++ |.+++ +++|++||++| T Consensus 380 ~~~~li~~~~---g~~~d~~~i~i~f~~~~p~d~~e~--a~~~---~~~g~iS~et~i~~l-~~v~D--~~~E~~ri~~E 448 (468) T protein:vir:96 380 ELLQYIIDFY---KLSIKVQDVEITFNFNVMVNELEQ--SQIG---VNSQYLSKETVVTNH-PWVDD--PVAEMERIDQE 448 (468) T ss_pred HHHHHHHHHh---CCCcccceeeEEecCCCCcCHHHH--HHHH---HhcCCCchHHHHHhC-CCCCC--HHHHHHHHHHH Confidence 9998876554 334556677899999999886654 5554 456899999999765 76765 67999999999 Q ss_pred ccccccccccccccccccccCcccCCCCCCCCCCCCCCCC Q lcl|NC_020883. 535 QAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPS 574 (589) Q Consensus 535 ~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~ 574 (589) +.+..... + ++. |.+.++|. T Consensus 449 ~~~~~~~~-----~---~~~------------~~~~~~~~ 468 (468) T protein:vir:96 449 ELALPSIE-----E---GLN------------GKENNEPT 468 (468) T ss_pred HHHHHHHh-----h---ccC------------CCCCCCCC Confidence 86532110 0 111 12222333 No 16 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=2e-43 Score=254.66 Aligned_cols=458 Identities=14% Similarity=0.085 Sum_probs=284.3 Q ss_pred Cccceecc--chhHHHHhhcc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcch Q lcl|NC_020883. 1 MIDWTVRG--WTDKTTKNVHG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPK 73 (589) Q Consensus 1 ~~~~~~~~--~~~~~~~~~~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~ 73 (589) .=+++.-. |-.+.|.. |- -|.++++.|+|+|++++++.... -+...+-.|+.|+++ T Consensus 34 ~~~~~~~~~~~i~~~i~~-h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~----------------~~~~~~~ki~~n~~k 96 (502) T protein:vir:48 34 LEELMVNNWELLKNFINH-HKLRQAPRIQELLDYARGENHDVLKSGRRK----------------DNEMADKRAVHNYGR 96 (502) T ss_pred hhhhccccHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcccccccccc----------------ccccccceeecchHH Confidence 11111111 12233332 32 35566788999999888762110 001123368999999 Q ss_pred hhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhH Q lcl|NC_020883. 74 VIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNI 153 (589) Q Consensus 74 ~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l 153 (589) .|++..+.++ +|+=.+.-..+ + ..+ ..-+++++.+.+.++|...+.... T Consensus 97 ~Ivd~~~~yl---~g~p~~~~~~d--------~----------------~~~----~~~~~~l~~~~~~N~~~~~~~~~~ 145 (502) T protein:vir:48 97 MISKFKTGYL---AGNPIRVEYDD--------N----------------EDN----SQNDDAIKRIGRINDIDTHNRNLI 145 (502) T ss_pred HHHHHHhhhh---cccCeeEecCC--------c----------------cch----hHHHHHHHHHHhhcCHhHHHHHHH Confidence 9999999888 44322211100 0 000 112567899999999999999999 Q ss_pred HHHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhccccc Q lcl|NC_020883. 154 VQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVKA 231 (589) Q Consensus 154 ~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~ 231 (589) ..+.+-|.+...+|++. +.++|.+.+|.+.|| +|... ..+---++|+|.......+.. .-.+|- T Consensus 146 ~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~---------vydd~~~~~~~~~ir~~~~~~~~~~~~-~~~iyt---- 211 (502) T protein:vir:48 146 RDLSQTGRAYEVIYRSEYDETRIKRLSPLETFV---------IYDNSLEDNSIAAVRYYNRGTLQNAKD-VVEIYT---- 211 (502) T ss_pred HHHhhcCeEEEEEEeCCCCceEEEEEcccceEE---------EEcCCCCCceEEEEEEEEEeecCCcEE-EEEEEe---- Confidence 99999999999999984 569999999999998 34321 111112344443332222211 111231 Q ss_pred cchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHH Q lcl|NC_020883. 232 KGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDE 311 (589) Q Consensus 232 ~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~De 311 (589) +..+++....|. +. .......+..+..|++++|+ +.|+|||+++.+++|+ T Consensus 212 ~~~i~~~~~~~~-~~------------------------~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~v~~liDa 261 (502) T protein:vir:48 212 NQHIYTLDASDS-FN------------------------EISVTPHAFGTVPITEFLNN-----ADGIGDYETELYLIDL 261 (502) T ss_pred CCeEEEEEeCCc-ee------------------------eccceecCCCccceEEecCC-----CCCCCchhhhHHHHHH Confidence 111111111121 10 11112234455557777874 5799999999999999 Q ss_pred HHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHH Q lcl|NC_020883. 312 INWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIG 391 (589) Q Consensus 312 Ld~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirvee 391 (589) +|.++|+.+..++.++.|.+.++-..... .+..+......++.... +.........+..+++++|++..+. T Consensus 262 ~d~~~S~~~~~~~~~~~~~lv~~g~~~~~-----~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~d~~~l~~~~~~~~ 332 (502) T protein:vir:48 262 YDSAESDTANHMSDMADAILAIYGDLALP-----QGMQASDMKRTRLMQLK----PPKSADGKEGTVKAEYLTKSYDVSG 332 (502) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCcccc-----cccchhhhhhcceeecc----ccccccccccCcceeEeeecCCHHH Confidence 99999999999999999988764322211 11112111122211110 0011112334566789999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-- Q lcl|NC_020883. 392 DMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS-- 469 (589) Q Consensus 392 h~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~-- 469 (589) +..+++.|.+.|+..+++|..+++... +..||+|+++.++.+..|+..++..|..+|++++++++.+.+..+. T Consensus 333 ~~~~~~~L~~~I~~~s~~p~~~~~~~~-----~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 407 (502) T protein:vir:48 333 AEAYKTRLNKDIHVFTNTPDMSDNHFS-----GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFK 407 (502) T ss_pred HHHHHHHHHHHHHHHhCCCCcCccccc-----cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 999999999999999999988876432 2359999999999999999999999999999999888776654332 Q ss_pred ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020883. 470 SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQT 549 (589) Q Consensus 470 ~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~ 549 (589) ......+.|.|.+.+|.+.++. |+++..+ ++++|.+|+++++ |..++ +++|++||++|+...+....... T Consensus 408 ~~d~~~i~i~f~~~~p~d~~e~--a~~~~kl--~g~iS~et~l~~l-~~v~D--~~~E~~ri~~E~~~~~~~~~~~~--- 477 (502) T protein:vir:48 408 DFDESRLKITFTPNLPKSLYEQ--VSILNDL--GGQVSQETALSLS-GLVEN--PTEELDKINEESSKIDFKGYPSY--- 477 (502) T ss_pred ccccccceEEeCCCCCcCHHHH--HHHHHHH--hccCcHHHHHHhC-CCCCC--HHHHHHHHHHHHHhhhhhccccc--- Confidence 3445567899999999886655 6677665 4689999999887 65554 46999999988765322222211 Q ss_pred cccccCcccCCCCCCCCCCCCCCCCcchhhhhhc Q lcl|NC_020883. 550 FEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEK 583 (589) Q Consensus 550 l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~ 583 (589) .++. ..+.+++.+|-+++|.|+.-. T Consensus 478 ----~~~~-----~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 478 ----FYDN-----VGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred ----cccc-----ccccCCCccCCCCcCcCCCCC Confidence 1110 112222223333333333222 No 17 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=1.8e-43 Score=254.87 Aligned_cols=459 Identities=14% Similarity=0.061 Sum_probs=274.4 Q ss_pred Ccccee----ccchhHHHHhh---cc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MIDWTV----RGWTDKTTKNV---HG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~~~~~----~~~~~~~~~~~---~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) ...|.. .-++-+.|+.+ |- .|.++++.|+|+|..+..+....- +.--+-.|+ T Consensus 28 ~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~----------------~~~~~~ki~ 91 (511) T protein:vir:78 28 VYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE----------------EYMADNRVA 91 (511) T ss_pred cccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccc----------------cccCcceee Confidence 223321 01112222222 21 355667899999977655422110 111223688 Q ss_pred EEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccccc Q lcl|NC_020883. 69 FNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERR 148 (589) Q Consensus 69 ~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~ 148 (589) .|+++.|++..+.++ +|+=.+.-. +. .+ -++++..+.+.++|... T Consensus 92 ~n~~k~Iv~~~~~yl---~g~p~~~~~----------~d------------------~~----~~~~l~~~~~~n~~~~~ 136 (511) T protein:vir:78 92 HDYASYISDFINGYF---LGNPIQYQD----------DD------------------KD----VLEAIEAFNDLNDVESH 136 (511) T ss_pred cchHHHHHHHHhhhh---cccCceeec----------Cc------------------hH----HHHHHHHHHhhcChhHH Confidence 999999999999988 443222100 00 00 14578899999999999 Q ss_pred chhhHHHHHHcCceeEEEEEec-CceeEEEecCceeccc-ccCc----ceeEEEe-ecCC---CccceEEEEEeeecccc Q lcl|NC_020883. 149 HWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPH-DDEK----GADLAYY-IDHG---QYGQFLHIYRERVEKDG 218 (589) Q Consensus 149 ~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~-~d~~----~~div~~-~e~~---~~~~~l~~~~~~~~~~~ 218 (589) .......+.+-|.....+|++. +.+++.+.+|.+.||. ++.. -+.+-|. .... ..+...|++. T Consensus 137 ~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~v------- 209 (511) T protein:vir:78 137 NRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDL------- 209 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEE------- Confidence 9999999999999999999985 5599999999999993 2211 1222221 1110 1111111110 Q ss_pred ceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCccc Q lcl|NC_020883. 219 LRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYG 298 (589) Q Consensus 219 ~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG 298 (589) -+.+.+|+ +...++...... .........+...+.|++++|+ ++| T Consensus 210 -yt~~~i~~---------~~~~~~~~~~~~--------------------~~~~~~~~~~~g~vPvv~~~n~-----~~g 254 (511) T protein:vir:78 210 -FTSHGVYR---------YLTNRTNGLKLT--------------------PRENSFESHSFERMPITEFSNN-----ERR 254 (511) T ss_pred -EeCCcEEE---------EEecCCCccccc--------------------ccccccccCcCcccceEEecCC-----CCC Confidence 01111110 111111100000 0011122345555668888885 479 Q ss_pred CcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC Q lcl|NC_020883. 299 ISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR 378 (589) Q Consensus 299 ~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~ 378 (589) +|||+++.+++|++|.++|+.+..++.+++|.+.+.-..............+.......+..... .......+. T Consensus 255 ~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~ 328 (511) T protein:vir:78 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDA------EGRETEGSV 328 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceecc------ccccCCCCc Confidence 99999999999999999999999999999997765322221110000000110000000000000 011224456 Q ss_pred ccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 379 SMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYE 458 (589) Q Consensus 379 ~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~ 458 (589) .+.+++|+...+....+++.|.+.|+..|++|..+++... ++.||+|+++.++.+..|+..++..|..+|+++++ T Consensus 329 ~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-----~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~ 403 (511) T protein:vir:78 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK 403 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6788999999999999999999999999999998887532 24599999999999999999999999999999998 Q ss_pred HHHHHHhhcCc---ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhc Q lcl|NC_020883. 459 SCLWLLNDQDS---SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQ 535 (589) Q Consensus 459 ~~l~L~~~~~~---~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~ 535 (589) ++..+....+. ......+.|.|.+.+|.++++. ++++..+. |++|.||++.++ |..++ +++|++||++|+ T Consensus 404 li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~--~d~~~kl~--G~iS~et~l~~l-~~v~d--~~~El~ri~~E~ 476 (511) T protein:vir:78 404 LLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEE--LKAYIDSG--GKISQTTLMSLF-SFFQD--PELEVKKIEEDE 476 (511) T ss_pred HHHHHHHhcCCCccccccccceEEeCCCCCcCHHHH--HHHHHHHh--ccCChHHHHHhC-CCCCC--HHHHHHHHHHHH Confidence 87766543221 2334456899999999886655 66776664 689999999876 75554 679999999997 Q ss_pred cccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 536 AGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 536 a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ..+.+...... .. ++++.-.++.+++.++.+.|.| T Consensus 477 ~~~~~~~~~~~---~~------~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 477 KESIKKAQKGI---YK------DPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHhhcc---cc------CCCCCCCCCCCCCccCcccccC Confidence 65433332211 00 0000011111222222222222 No 18 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=1.8e-43 Score=254.87 Aligned_cols=459 Identities=14% Similarity=0.061 Sum_probs=274.4 Q ss_pred Ccccee----ccchhHHHHhh---cc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MIDWTV----RGWTDKTTKNV---HG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~~~~~----~~~~~~~~~~~---~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) ...|.. .-++-+.|+.+ |- .|.++++.|+|+|..+..+....- +.--+-.|+ T Consensus 28 ~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~----------------~~~~~~ki~ 91 (511) T protein:vir:96 28 VYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE----------------EYMADNRVA 91 (511) T ss_pred cccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccc----------------cccCcceee Confidence 223321 01112222222 21 355667899999977655422110 111223688 Q ss_pred EEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccccc Q lcl|NC_020883. 69 FNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERR 148 (589) Q Consensus 69 ~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~ 148 (589) .|+++.|++..+.++ +|+=.+.-. +. .+ -++++..+.+.++|... T Consensus 92 ~n~~k~Iv~~~~~yl---~g~p~~~~~----------~d------------------~~----~~~~l~~~~~~n~~~~~ 136 (511) T protein:vir:96 92 HDYASYISDFINGYF---LGNPIQYQD----------DD------------------KD----VLEAIEAFNDLNDVESH 136 (511) T ss_pred cchHHHHHHHHhhhh---cccCceeec----------Cc------------------hH----HHHHHHHHHhhcChhHH Confidence 999999999999988 443222100 00 00 14578899999999999 Q ss_pred chhhHHHHHHcCceeEEEEEec-CceeEEEecCceeccc-ccCc----ceeEEEe-ecCC---CccceEEEEEeeecccc Q lcl|NC_020883. 149 HWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPH-DDEK----GADLAYY-IDHG---QYGQFLHIYRERVEKDG 218 (589) Q Consensus 149 ~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~-~d~~----~~div~~-~e~~---~~~~~l~~~~~~~~~~~ 218 (589) .......+.+-|.....+|++. +.+++.+.+|.+.||. ++.. -+.+-|. .... ..+...|++. T Consensus 137 ~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~v------- 209 (511) T protein:vir:96 137 NRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDL------- 209 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEE------- Confidence 9999999999999999999985 5599999999999993 2211 1222221 1110 1111111110 Q ss_pred ceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCccc Q lcl|NC_020883. 219 LRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYG 298 (589) Q Consensus 219 ~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG 298 (589) -+.+.+|+ +...++...... .........+...+.|++++|+ ++| T Consensus 210 -yt~~~i~~---------~~~~~~~~~~~~--------------------~~~~~~~~~~~g~vPvv~~~n~-----~~g 254 (511) T protein:vir:96 210 -FTSHGVYR---------YLTNRTNGLKLT--------------------PRENSFESHSFERMPITEFSNN-----ERR 254 (511) T ss_pred -EeCCcEEE---------EEecCCCccccc--------------------ccccccccCcCcccceEEecCC-----CCC Confidence 01111110 111111100000 0011122345555668888885 479 Q ss_pred CcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC Q lcl|NC_020883. 299 ISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR 378 (589) Q Consensus 299 ~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~ 378 (589) +|||+++.+++|++|.++|+.+..++.+++|.+.+.-..............+.......+..... .......+. T Consensus 255 ~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~ 328 (511) T protein:vir:96 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDA------EGRETEGSV 328 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceecc------ccccCCCCc Confidence 99999999999999999999999999999997765322221110000000110000000000000 011224456 Q ss_pred ccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 379 SMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYE 458 (589) Q Consensus 379 ~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~ 458 (589) .+.+++|+...+....+++.|.+.|+..|++|..+++... ++.||+|+++.++.+..|+..++..|..+|+++++ T Consensus 329 ~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-----~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~ 403 (511) T protein:vir:96 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK 403 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6788999999999999999999999999999998887532 24599999999999999999999999999999998 Q ss_pred HHHHHHhhcCc---ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhc Q lcl|NC_020883. 459 SCLWLLNDQDS---SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQ 535 (589) Q Consensus 459 ~~l~L~~~~~~---~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~ 535 (589) ++..+....+. ......+.|.|.+.+|.++++. ++++..+. |++|.||++.++ |..++ +++|++||++|+ T Consensus 404 li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~--~d~~~kl~--G~iS~et~l~~l-~~v~d--~~~El~ri~~E~ 476 (511) T protein:vir:96 404 LLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEE--LKAYIDSG--GKISQTTLMSLF-SFFQD--PELEVKKIEEDE 476 (511) T ss_pred HHHHHHHhcCCCccccccccceEEeCCCCCcCHHHH--HHHHHHHh--ccCChHHHHHhC-CCCCC--HHHHHHHHHHHH Confidence 87766543221 2334456899999999886655 66776664 689999999876 75554 679999999997 Q ss_pred cccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 536 AGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 536 a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ..+.+...... .. ++++.-.++.+++.++.+.|.| T Consensus 477 ~~~~~~~~~~~---~~------~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 477 KESIKKAQKGI---YK------DPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHhhcc---cc------CCCCCCCCCCCCCccCcccccC Confidence 65433332211 00 0000011111222222222222 No 19 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=3.7e-43 Score=253.19 Aligned_cols=459 Identities=14% Similarity=0.067 Sum_probs=276.6 Q ss_pred Ccccee--ccc--hhHHHHhh---cc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MIDWTV--RGW--TDKTTKNV---HG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~~~~~--~~~--~~~~~~~~---~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) ...|.. .-+ +-+.++.+ |- .|.++++.|+|+|..+..+.... .+.--+..|+ T Consensus 28 ~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~----------------~~~~~~~ki~ 91 (511) T protein:vir:96 28 VYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRK----------------EEYMADNRVA 91 (511) T ss_pred ccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCc----------------ccccCcceee Confidence 223321 111 11223322 32 25556788999998776542110 0111223689 Q ss_pred EEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccccc Q lcl|NC_020883. 69 FNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERR 148 (589) Q Consensus 69 ~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~ 148 (589) .|+++.|++..+.++ +|+=.+.-. + +.+ -++++..+.+.++|... T Consensus 92 ~n~~k~Iv~~~~~yl---~g~p~~~~~----------~------------------~~~----~~~~l~~~~~~n~~~~~ 136 (511) T protein:vir:96 92 HDYASYISDFINGYF---LGNPIQYQD----------D------------------DKD----VLEAIEAFNDLNDVESH 136 (511) T ss_pred cchHHHHHHHHHhhh---ccCCceeec----------C------------------chH----HHHHHHHHHhhcCHHHH Confidence 999999999999888 443332111 0 001 14578999999999999 Q ss_pred chhhHHHHHHcCceeEEEEEec-CceeEEEecCceeccc-ccCc----ceeEEEeec-CCC---ccceEEEEEeeecccc Q lcl|NC_020883. 149 HWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPH-DDEK----GADLAYYID-HGQ---YGQFLHIYRERVEKDG 218 (589) Q Consensus 149 ~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~-~d~~----~~div~~~e-~~~---~~~~l~~~~~~~~~~~ 218 (589) +.....++.+-|..+..+|++. +.+++.+.+|.+.||. ++.. -+.+-|... ... .+...|+. T Consensus 137 ~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~-------- 208 (511) T protein:vir:96 137 NRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVD-------- 208 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEE-------- Confidence 9999999999999999999985 5599999999999993 2211 121211111 100 01111111 Q ss_pred ceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCccc Q lcl|NC_020883. 219 LRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYG 298 (589) Q Consensus 219 ~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG 298 (589) +.+.+.+|+ +....+..... ...+......+.....|++++|+ ..| T Consensus 209 iyt~~~i~~---------~~~~~~~~~~~--------------------~~~~~~~~~~~~~~vPvv~~~nn-----~~g 254 (511) T protein:vir:96 209 LFTSHGVYR---------YLTSRTNGLKL--------------------TPRENGFESHSFERMPITEFSNN-----ERR 254 (511) T ss_pred EEeCCcEEE---------EEecCCCcccc--------------------cccccccccccCCceeeEEecCC-----CCC Confidence 001111110 00011110000 00111223355666678888884 479 Q ss_pred CcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC Q lcl|NC_020883. 299 ISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR 378 (589) Q Consensus 299 ~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~ 378 (589) +|||+++.+++|++|.++|+.+..++.+++|.+.+.-.......... ..+..++................+.+. T Consensus 255 ~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (511) T protein:vir:96 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVR------KQKEANVLFLEPTVYADSEGRETEGSV 328 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhc------ccccccceecccccccccccccCCCCc Confidence 99999999999999999999999999999997765322221111111 111111110000000000111234456 Q ss_pred ccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 379 SMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYE 458 (589) Q Consensus 379 ~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~ 458 (589) .+.+++|+...+....+++.|.+.|+..+++|..+++... ++.||+|+++.++.+..|+.+++..|.++|+++++ T Consensus 329 ~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~ 403 (511) T protein:vir:96 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK 403 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6789999999999999999999999999999998887532 24599999999999999999999999999999998 Q ss_pred HHHHHHhhcC---cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhc Q lcl|NC_020883. 459 SCLWLLNDQD---SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQ 535 (589) Q Consensus 459 ~~l~L~~~~~---~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~ 535 (589) ++..+.+..+ .......+.|.|.+.+|.+.++. ++++..+ +|++|.||+++++ |..++ +++|++||++|+ T Consensus 404 li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~--~~~~~kl--~G~iS~et~l~~l-~~v~D--~~~E~~ri~~E~ 476 (511) T protein:vir:96 404 LLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEE--LKAYIDS--GGKISQTTLMSLF-SFFQD--PELEVKKIEEDE 476 (511) T ss_pred HHHHHHHhhcCcccccccccceEEeCCCCCCCHHHH--HHHHHHH--hccCChHHHHHhC-CCCCC--HHHHHHHHHHHH Confidence 8776554322 12344567899999999987655 5666554 5789999999766 65554 578999999997 Q ss_pred cccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 536 AGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 536 a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ..+.+...... . .++++...++.++++++..++.| T Consensus 477 ~~~~~~~~~~~----~-----~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 477 KESIKKAQKGI----Y-----KDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHhhcc----c-----cCCCCCCCCCCCCcccccccccC Confidence 65433322211 0 00111111222222233333222 No 20 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=3.8e-43 Score=253.17 Aligned_cols=462 Identities=11% Similarity=0.065 Sum_probs=279.7 Q ss_pred CccceeccchhHHHHhh---c-c-h---hhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNV---H-G-D---YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP 72 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~---~-~-~---~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~ 72 (589) .+.=-...+|.+.|..+ | . . |.++++.|+|+|.+++.|......++ ..+-.|++|++ T Consensus 14 ~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~---------------~~~~ki~~n~~ 78 (506) T protein:vir:94 14 IYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDG---------------KADHRATHSFA 78 (506) T ss_pred ecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccc---------------CCcceeecchH Confidence 11111123444444333 1 1 2 45566889999999998843322211 12346899999 Q ss_pred hhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhh Q lcl|NC_020883. 73 KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSN 152 (589) Q Consensus 73 ~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~ 152 (589) +.|++..+.++ +|+=.+.-.. . +--+++++.+.+.++|....... T Consensus 79 ~~Iv~~~~~~l---~G~p~~~~~~----------d----------------------~~~~~~l~~~~~~N~~~~~~~~~ 123 (506) T protein:vir:94 79 KYIADFQTSYS---VGNPINVKLP----------D----------------------DGSNSGFDTFNKANDVDAENYDL 123 (506) T ss_pred HHHHHHhhhhh---cccCceeecC----------c----------------------chHHHHHHHHHhccCHhHHHHHH Confidence 99999999998 5543331110 0 00145788899999999999999 Q ss_pred HHHHHHcCceeEEEEEec-CceeEEEecCceeccc-ccCcceeEEEeecCCCccceEEEEEeeecc-cc---ceeehhhh Q lcl|NC_020883. 153 IVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPH-DDEKGADLAYYIDHGQYGQFLHIYRERVEK-DG---LRTTNMLY 226 (589) Q Consensus 153 l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~-~d~~~~div~~~e~~~~~~~l~~~~~~~~~-~~---~~~~~~~y 226 (589) ..++.+-|.....+|+++ +.++|.+.++.+.||. +++..-...+ ++++|+..... .+ +.....+| T Consensus 124 ~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v~dd~~~~~~~~---------~v~~~~~~~~~~~~~~~~~~~~~~y 194 (506) T protein:vir:94 124 FLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVIYSTDVDPKPIM---------AVRYHQIELVDDNQVSTINYVPETW 194 (506) T ss_pred HHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEecCCCCCceEE---------EEEEEeeeeccCCceeEEEEEEEEE Confidence 999999999999999985 4599999999999994 3322100010 11222111100 00 00000111 Q ss_pred ccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhh Q lcl|NC_020883. 227 PVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLE 306 (589) Q Consensus 227 ~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie 306 (589) -. ...+. ..+.. .+. ........++.+..|++++|+. .|.|||+++. T Consensus 195 t~----~~~~~--~~~~~-------------------~~~---~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~~~ 241 (506) T protein:vir:94 195 TA----DTYTL--YNPTP-------------------IMG---KMQVDTTKPITTFPVVEFKNSN-----FRLGDFENVL 241 (506) T ss_pred eC----ceEEE--ecccc-------------------Ccc---ceeccccccCCccceEEecCCC-----CCCCchhhhH Confidence 00 00000 00000 000 0001123455566688888853 6999999999 Q ss_pred HHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccc-----ccccccccccccccccc--cccc---ccccc-----cc Q lcl|NC_020883. 307 SKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLN-----IAYERDGHSAKEASMMT--PRID---HRDME-----IT 371 (589) Q Consensus 307 ~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g-----~~~d~dge~~~~~~~~~--~~~d---~~dle-----v~ 371 (589) +++|++|.++|+.+..++.+++|.+.+--.......+ .....+.+..+..+... ...+ ..-+. .. T Consensus 242 ~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (506) T protein:vir:94 242 PLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTV 321 (506) T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccc Confidence 9999999999999999999999876542111110000 00000000000000000 0000 00000 01 Q ss_pred ccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_020883. 372 TFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYID 451 (589) Q Consensus 372 ~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~ 451 (589) ...+.+..++|++|+...+.+..+++.|.+.|+..+++|..+++... +..||+|+++++..+..|+.+++..|.+ T Consensus 322 ~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Aik~~~~~l~~k~~~k~~~~~~ 396 (506) T protein:vir:94 322 NGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFA-----SNSSGVAMQYKVLGTVELASTKRRMFER 396 (506) T ss_pred cCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12234556889999999999999999999999999999987665322 2459999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcC--cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHH Q lcl|NC_020883. 452 FLKELYESCLWLLNDQD--SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIA 529 (589) Q Consensus 452 aLk~li~~~l~L~~~~~--~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~ 529 (589) +|++++++++.+.+..+ ..+....+.|.|.+.+|.++++. |++++.+ +|++|.+|++.++ |.+++ +++|++ T Consensus 397 ~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~--a~~~~kl--~g~iS~et~~~~l-p~v~d--~~~E~~ 469 (506) T protein:vir:94 397 GLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQ--IKALVQA--GATLPQKYLYQQL-PGVTN--PQDIVD 469 (506) T ss_pred HHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHH--HHHHHHH--hccCChHHHHHhC-CCCCC--HHHHHH Confidence 99999999888776433 23455567899999999886655 6777765 4689999999876 76776 458999 Q ss_pred HHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhh Q lcl|NC_020883. 530 RIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIE 582 (589) Q Consensus 530 RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~ 582 (589) ||++|+..+.+...... .++.++ ..++.+++.+|++- T Consensus 470 ri~~E~~~~~~~~~~~~-------~~~~~~---------~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 470 MMKEQSANGDYSFDQNG-------VISNDG---------QTNTTATQTDEEVR 506 (506) T ss_pred HHHHHHHHHhhcchhhc-------CCCccc---------CccccccccccCCC Confidence 99999875443321111 111111 11122233333333 No 21 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=5.4e-43 Score=252.29 Aligned_cols=456 Identities=16% Similarity=0.132 Sum_probs=275.8 Q ss_pred Cccceecc----chhHHHHhh---c--chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEc Q lcl|NC_020883. 1 MIDWTVRG----WTDKTTKNV---H--GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNL 71 (589) Q Consensus 1 ~~~~~~~~----~~~~~~~~~---~--~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~ 71 (589) +|+-..++ |.-+.++.+ | ..|.+++++|+|+|.-++.+.......+.... +..-+..|+.|+ T Consensus 11 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~---------~~~~~~ki~~~~ 81 (479) T protein:vir:79 11 LIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDD---------FTKVNNKAINNY 81 (479) T ss_pred eEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccc---------cccCcceeecch Confidence 22222222 222334333 3 23667779999999776655322221111111 112344799999 Q ss_pred chhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchh Q lcl|NC_020883. 72 PKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWS 151 (589) Q Consensus 72 ~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~ 151 (589) ++.|++..+.++ +|+=.+.-. +. +..+++++.+.+ ++|...+.. T Consensus 82 ~~~Ivd~~~~~l---~g~p~~~~~----------~~----------------------~~~~~~~~~~~~-n~~~~~~~~ 125 (479) T protein:vir:79 82 HKLLVDQKVGYS---VGNPIVFNA----------DD----------------------DNLTKLLNDLLG-EEFDDTITE 125 (479) T ss_pred HHHHHHHHHhhh---hcCCceecc----------CC----------------------HHHHHHHHHHHh-cCHHHHHHH Confidence 999999999998 444333111 00 011335555555 478888888 Q ss_pred hHHHHHHcCceeEEEEEecC-ceeEEEecCceeccc-ccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccc Q lcl|NC_020883. 152 NIVQHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPH-DDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVV 229 (589) Q Consensus 152 ~l~~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~-~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~ 229 (589) ....+.+-|.++..+|++.+ .+++.+.++.+.||. ++... ++-..++++|+...........-.+|-.. T Consensus 126 ~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~---------~~~~~~ir~y~~~~~~~~~~~~~e~y~~~ 196 (479) T protein:vir:79 126 LYLNASNKGVEWLHPYINRKGEFKYVIIPAEEAIPIWDSKRQ---------RELVAFIRFYYIEDIDGNKIKRVEYYTEN 196 (479) T ss_pred HHHHHHhcCeEEEEEEeCCCCceEEEEEccceeEEEEeCCCC---------CceEEEEEEEEEeecCCceEEEEEEEeCC Confidence 88999999999999999854 499999999999994 22221 11112233333222111111111122111 Q ss_pred cccchhheeecccccccccccccccchhhhhhcccCCccc-cccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHH Q lcl|NC_020883. 230 KAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDD-RPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESK 308 (589) Q Consensus 230 ~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~-~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l 308 (589) .+.+ +....+..+....+.. .....+++.. ........+.....|++++|+ .+|+|||+++.++ T Consensus 197 ~i~~---~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~g~sd~~~v~~l 261 (479) T protein:vir:79 197 DITY---FIERGNSFIQEFLYDE-------YGKMTDIQEGHFRINNKEQGWGKVPFIPFKNN-----EKCVSDLTFYKSL 261 (479) T ss_pred cEEE---EEecCCcccccccccc-------cccccccccccccccccccCCCcccEEEecCC-----CCCCcchhhhHHH Confidence 1111 1111121111111100 1111111111 111233456666668888885 4799999999999 Q ss_pred HHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeeccc Q lcl|NC_020883. 309 QDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDIS 388 (589) Q Consensus 309 ~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dir 388 (589) +|++|.++|+.++.++.+++|.+.++- ...+.+++... .+. ...++..++ +..+++++|++. T Consensus 262 iDa~d~~~S~~~~~~~~~~~~~~v~~g--------~~~~~~~~~~~--~~~-------~~~~i~~~~-~~~~~~l~~~~~ 323 (479) T protein:vir:79 262 IDIYDNNISTLADNLDEIQEVIYVLKE--------YPGTSLQEFID--NIR-------YYKSIKVDG-GGGVDKLEINIP 323 (479) T ss_pred HHHHHHHHHHHHHHHHHhhCceeeeec--------CCccccccchh--hhh-------hccceecCC-CCcceEEeccCC Confidence 999999999999999999999876542 11111111111 000 111222222 345789999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-c Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND-Q 467 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~-~ 467 (589) .+....+++.|.+.|+..+++|..+++. . +..||+|++..+..+..|+..++..|.++|++++++++.+.+. . T Consensus 324 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~---gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 397 (479) T protein:vir:79 324 VEAKKELLDRLEKNIIIFGQGVNPESQN---T---GDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISG 397 (479) T ss_pred HHHHHHHHHHHHHHHHHHhCcccccccc---c---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 9999999999999999999999876652 1 2459999999999999999999999999999999888766554 3 Q ss_pred CcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020883. 468 DSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGIN 547 (589) Q Consensus 468 ~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~ 547 (589) +..+....+.|.|++.+|.++++. |+++..+. |++|.||+++++ |.+++ +++|++||++|+....... .. T Consensus 398 ~~~~~~~~i~i~f~~~~p~~~~~~--a~~~~kl~--g~iS~et~l~~l-~~v~d--~~~E~~ri~~E~~~~~~~~-~~-- 467 (479) T protein:vir:79 398 NKSYDYKTVQITFNHSMIINEAEK--IDMAAKST--GIVSDETIVSNH-PWVED--VNDELERLKKQEDTQKEYD-DL-- 467 (479) T ss_pred CCccccccceEEeCCCCCcCHHHH--HHHHHHHh--ccCcHHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHH-hc-- Confidence 445666778899999999886655 66776654 689999999876 76665 5789999999976421110 00 Q ss_pred cccccccCcccCCCCCCCCC Q lcl|NC_020883. 548 QTFEQMNDNRDEDGNIIEEG 567 (589) Q Consensus 548 ~~l~~~~~~~~~~~~p~deg 567 (589) +. +..++ -.||. T Consensus 468 --~~---~~~~~---~~~e~ 479 (479) T protein:vir:79 468 --IP---NNQDG---VIDET 479 (479) T ss_pred --cC---cccCC---CcCcC Confidence 00 00000 11111 No 22 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=6.1e-43 Score=252.01 Aligned_cols=459 Identities=14% Similarity=0.070 Sum_probs=272.2 Q ss_pred CccceeccchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAE 77 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~ 77 (589) |.-=++...-++.+...- ..|.++++.|+|+|.-++.+-......++ +........+..|+.|+.+.|++ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~-------~~~~~~~~~~~ki~~n~~k~Iv~ 73 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKE-------GKKDPLRSADNRIPSNFYQLLVD 73 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhccccc-------ccccccccCCcccccchHHHHHH Confidence 333333344444443322 34556678999999544332111111000 00000112355799999999999 Q ss_pred cchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHH Q lcl|NC_020883. 78 IPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQ 157 (589) Q Consensus 78 ~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~ 157 (589) ..+.++ +|+=.+... +.+ .. ++.+.+++.+ +|...+......+. T Consensus 74 ~~~~yl---~G~p~~~~~----------~d~------------------~~----~~~l~~~~~~-~~~~~~~~l~~~~~ 117 (470) T protein:vir:10 74 QEAGYV---ASVFPDIDV----------GKD------------------AD----NKKIIDVLGD-DRALTLNGLLVDSS 117 (470) T ss_pred hhhhhe---eccceeeec----------Cch------------------HH----HHHHHHHHhh-hHHHHHHHHHHHHh Confidence 999999 554333211 000 00 1233444443 34445555667788 Q ss_pred HcCceeEEEEEecCc-eeEEEecCceeccc-ccCc----ceeEEEee-cCCCccceEEEEEeeeccccceeehhhhcccc Q lcl|NC_020883. 158 VDGGIVAAPVIDELG-PRIVFKARDVYFPH-DDEK----GADLAYYI-DHGQYGQFLHIYRERVEKDGLRTTNMLYPVVK 230 (589) Q Consensus 158 v~Gg~~~~~~~~~~~-~~i~f~~~d~~~P~-~d~~----~~div~~~-e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~ 230 (589) +-|.....+|++.++ +++...+|.+.||. ++.. -+.|-|.. ......++++.+ .+|-... T Consensus 118 ~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~-------------e~yt~~~ 184 (470) T protein:vir:10 118 NAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVH-------------EYWTDKE 184 (470) T ss_pred hcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEE-------------EEEcCCc Confidence 889999999999654 99999999999994 2221 12222221 111222222221 1121000 Q ss_pred ccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHH Q lcl|NC_020883. 231 AKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQD 310 (589) Q Consensus 231 ~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~D 310 (589) +. ++....+.......+..... .... -.....+......|..+..|++++|+. .|.|||+++.+++| T Consensus 185 ~~---~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~v~~liD 251 (470) T protein:vir:10 185 AQ---FFRTNATDSTVIEPYNIITS----YDLS-AGYETGQSNTLKHNFGRVPFIEFSKNK-----YRLPELNKYKGLID 251 (470) T ss_pred EE---EEEeecCcceeccccccccc----cccc-cccccccccccccCCCeeeEEEeecCC-----CCCCchhHHHHHHH Confidence 00 01111111011000000000 0000 011122334455677777788889863 79999999999999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccc-cccccCccceeeecccH Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITT-FDENGRSMEIHQIDISK 389 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~-~de~g~~~~~iq~Dirv 389 (589) ++|.++|+.+..++.|++|.+.++...++.. ++.... + .....+.+.. .+..+..+++++|++.. T Consensus 252 a~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~--------~~~~~~--~----~~~~~i~~~~~~~~~~~~~~~lt~~~~~ 317 (470) T protein:vir:10 252 AYDDIYNGFINDLDDVQTVILVLTNYGGADL--------HQFMND--L----RKYKSIKINNTGNGDNSGVDKLQIDIPV 317 (470) T ss_pred HHHHHHHHHHHHHHHhcCcceeeecCCcccc--------chhhhh--h----hhcCeEeccCCCCCcCceeEEEeecCCh Confidence 9999999999999999999998865433321 111000 0 0011111211 12345668899999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc Q lcl|NC_020883. 390 IGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS 469 (589) Q Consensus 390 eeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~ 469 (589) +....+++.|.+.||..+++|..+++. . ++.||+|++++++.+..|+.+++..|.++|++++++++.+.+.. T Consensus 318 ~~~~~~~~~L~~~I~~~s~~p~~~~~~-----~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~-- 389 (470) T protein:vir:10 318 EARDDALKITRKNIFLFGQGIDPANFE-----S-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-- 389 (470) T ss_pred HHHHHHHHHHHHHHHHHhCCCCCCccc-----c-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-- Confidence 999999999999999999999766542 1 24699999999999999999999999999999998887655432 Q ss_pred ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020883. 470 SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQT 549 (589) Q Consensus 470 ~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~ 549 (589) ......+.|.|.+.+|.+++|+ |++++.+ ++++|.||+++++ |.+++ +++|++||++|+....+.. T Consensus 390 ~~d~~~i~i~f~~~~p~d~~e~--~~~~~~~--~g~iS~et~l~~~-p~v~D--~~~E~eri~~E~~e~~~~~------- 455 (470) T protein:vir:10 390 DADKRHISQHWTRTKVEDSLTK--AQIVSTV--ANYSSKEAVAKAN-PIVDD--WQQELKDLAKDKEENDPYS------- 455 (470) T ss_pred CcccceeeEEeccCCCCCHHHH--HHHHHHH--hccCcHHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHhh------- Confidence 2344566899999999987765 6777775 4689999999775 86665 6799999999987543311 Q ss_pred cccccCcccCCCCCCCCCCCCCC Q lcl|NC_020883. 550 FEQMNDNRDEDGNIIEEGDTEEE 572 (589) Q Consensus 550 l~~~~~~~~~~~~p~deg~~~ee 572 (589) .++ .+-.+. |.++|| T Consensus 456 -~~~-----~~~~~~--~~dde~ 470 (470) T protein:vir:10 456 -NQA-----DELNGK--GVNDEQ 470 (470) T ss_pred -ccc-----cccCCC--CCCCCC Confidence 111 100111 111122 No 23 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=1e-42 Score=250.82 Aligned_cols=447 Identities=14% Similarity=0.137 Sum_probs=265.4 Q ss_pred Cccceeccch------------------hHHHHhh----cc---hhhhhhhhhcCCccccCHHHHHHHhhccccceeccC Q lcl|NC_020883. 1 MIDWTVRGWT------------------DKTTKNV----HG---DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDS 55 (589) Q Consensus 1 ~~~~~~~~~~------------------~~~~~~~----~~---~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~ 55 (589) |..-.-+-|| .+.|+.+ -. .|.++++.|.|+| ++..|.......++.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~-~i~~r~~~~~~~~~~~------ 73 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDN-DIVKQMKKVDVYGNID------ 73 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccC-chhccccccccccccc------ Confidence 3322222232 1222222 11 2445558899999 5666632222111111 Q ss_pred cceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhH Q lcl|NC_020883. 56 SQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEI 135 (589) Q Consensus 56 ~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~ 135 (589) +...+-.|+.|+++.|++..+.++ +|+=.+... + +. -.+++ T Consensus 74 ----~~~~~~ki~~n~~~~Ivd~~~~~l---~g~p~~~~~----------~------------------d~----~~~~~ 114 (474) T protein:vir:95 74 ----YDKPDWRITTNFHQNLVDQKVSYV---ASKPVTYSC----------E------------------DE----SVLKI 114 (474) T ss_pred ----cccccceeccchHHHHHHHHHhhh---ccCCceecc----------C------------------ch----HHHHH Confidence 111345789999999999999988 553333111 0 00 11345 Q ss_pred HHHHHhhccccccchhhHHHHHHcCceeEEEEEec-CceeEEEecCceeccc-ccCcceeEEEeecCCCccceEEEEEee Q lcl|NC_020883. 136 IEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPH-DDEKGADLAYYIDHGQYGQFLHIYRER 213 (589) Q Consensus 136 i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~-~d~~~~div~~~e~~~~~~~l~~~~~~ 213 (589) +..+..| +|..........+.+-|.+...+|+++ +.++|.+.++++.||. ++.. .++--.+++.|+.. T Consensus 115 l~~~~~n-~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~---------~~~~~~~i~~~~~~ 184 (474) T protein:vir:95 115 IHDVLDT-RWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKE---------REELKSFIRYYKFN 184 (474) T ss_pred HHHHHhc-cHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEcccceEEEEcCCC---------CCceEEEEEEEEEc Confidence 6666554 577778888889999999999999985 4599999999999993 2221 11111112222111 Q ss_pred eccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCC Q lcl|NC_020883. 214 VEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETF 293 (589) Q Consensus 214 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~ 293 (589) .. ..+ .+|-... ..++....+. +....... ....+......+..++.|++++|+ T Consensus 185 ~~-~~~----~~y~~~~---~~~~~~~~~~-~~~~~~~~--------------~~~~~~~~~~~~~g~iPvv~~~nn--- 238 (474) T protein:vir:95 185 NE-EKV----EFWTDTT---VTYYVLENGG-LIPDYYYG--------------ANHIQSHFSNGNWGRVPFIAFKNN--- 238 (474) T ss_pred Ce-eEE----EEEeCCe---EEEEEEcCCc-cccccccC--------------cccccccccccCCCccceEeecCC--- Confidence 00 000 1110000 0001111111 00000000 000111222344556667887774 Q ss_pred CCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 294 MNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTF 373 (589) Q Consensus 294 ~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~ 373 (589) +.|.|||+++.+++|++|.++|+.++.++.|+.|.+.++-..++. ..+... . .....++.. T Consensus 239 --~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~--------~~~~~~--~-------~~~~~~i~~ 299 (474) T protein:vir:95 239 --PEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQD--------LEEFMR--G-------LKYYKAINV 299 (474) T ss_pred --CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc--------chhhhh--h-------hhccceeec Confidence 579999999999999999999999999999999988765332221 111100 0 001112222 Q ss_pred ccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_020883. 374 DENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFL 453 (589) Q Consensus 374 de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aL 453 (589) ++ +..+++++|++..+.+..+++.|.++|+..+++|..+++.. +++.||+|++++++.+..|+.+++..|..+| T Consensus 300 ~~-~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-----~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l 373 (474) T protein:vir:95 300 DG-DGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKF-----GSAPSGIALKFLYGNLDLKANKLKNKATVAI 373 (474) T ss_pred cC-CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-----cccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 34578999999999999999999999999999998776542 2345999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHh Q lcl|NC_020883. 454 KELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEE 533 (589) Q Consensus 454 k~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~ 533 (589) ++++++++.+.. ...+.....|.|++.+|.++++. |++ +..++++|.+|+++++ |.+++ +++|++||++ T Consensus 374 ~~~~~li~~~~g---~~~d~~~i~v~f~~~~p~d~~e~--a~~---~~~~g~iS~et~i~~l-~~v~d--~~~E~~ri~~ 442 (474) T protein:vir:95 374 QELIGFIIDFNN---LKMDVKDIEISFNFNRMMNDAEQ--SQI---IAQSQYLSRETLVKSS-PLVDD--YKAELERIEQ 442 (474) T ss_pred HHHHHHHHHHhC---CCcccceeeEEeccCCCcCHHHH--HHH---HHhcCCCchHHHHHhC-CCCCC--HHHHHHHHHH Confidence 999988766542 23445566899999999886654 444 3456899999999875 76665 5689999998 Q ss_pred hccccccccccccccccccccCcccCCCCCCCCCCCCCCCC Q lcl|NC_020883. 534 EQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPS 574 (589) Q Consensus 534 E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~ 574 (589) |+...... +... +.-+.+....+|.+++++|. T Consensus 443 E~~~~~~~-~~~~--------~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 443 EQMEYNKQ-LPNL--------DDGGADGAQQQERSNDKESE 474 (474) T ss_pred HHHHHHhc-cccc--------ccccCCCCcCCCCCccCCCC Confidence 87542111 1100 01111111112212222222 No 24 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=9.2e-43 Score=251.05 Aligned_cols=452 Identities=13% Similarity=0.044 Sum_probs=281.3 Q ss_pred Cccceec-cchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccc Q lcl|NC_020883. 1 MIDWTVR-GWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~p 79 (589) .+++.+- --.++- +.-+..|.++++.|+|+|.- ..|-. ..+...+-.|+.|+++.|++.. T Consensus 15 ~~~~~~i~~~i~~~-~~~~~~~~~l~~Yy~g~~~i-~~~~~-----------------~~~~~~~~ki~~n~~~~Iv~~~ 75 (499) T protein:vir:10 15 EPNIEAINYAIREL-QNRKKRLDKLSDYYNGKQEI-EKHEF-----------------DNATVEAANVMVNHAKYITDMN 75 (499) T ss_pred cCCHHHHHHHHHHH-HHHHHHHHHHHHHhccccch-hcCCc-----------------CcCCCCcceeecchHHHHHHHH Confidence 2222211 111222 22345677777999999853 33310 0112245688999999999998 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) +.++ +|+=.+... +.. +. ++.+..+.+.++|..........+.+- T Consensus 76 ~~~l---~g~p~~~~~----------~~~------------------~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~ 120 (499) T protein:vir:10 76 VGFM---TGNPVKYVA----------EKG------------------KN----IDDILEVFNQIDIHKHDIELEKDLSVF 120 (499) T ss_pred hhhh---cccCceeec----------CCh------------------hH----HHHHHHHHhhcCHhHHHHHHHHHHHhc Confidence 8888 554222111 000 11 234667888888998899999999999 Q ss_pred CceeEEEEEecCc------------------eeEEEecCceeccc-ccCc----ceeEEE-eecCCCcc---ceEEEEEe Q lcl|NC_020883. 160 GGIVAAPVIDELG------------------PRIVFKARDVYFPH-DDEK----GADLAY-YIDHGQYG---QFLHIYRE 212 (589) Q Consensus 160 Gg~~~~~~~~~~~------------------~~i~f~~~d~~~P~-~d~~----~~div~-~~e~~~~~---~~l~~~~~ 212 (589) |.....+|++.++ +++..++|.+.||. ++.. -+.|.| .......+ .|+.+|+. T Consensus 121 G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~ 200 (499) T protein:vir:10 121 GYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMP 200 (499) T ss_pred CceEEEEEecccccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeC Confidence 9999999999664 55666677666662 1111 122222 11111111 11111211 Q ss_pred eeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCC Q lcl|NC_020883. 213 RVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNET 292 (589) Q Consensus 213 ~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~ 292 (589) . .+| .....+.. ..... .........++.++.||+++|+ T Consensus 201 ~----------~i~----------~~~~~~~~-~~~~~------------------~~~~~~~~~~~g~vPvv~~~n~-- 239 (499) T protein:vir:10 201 Q----------RIV----------EYRTKTTM-EVSAN------------------DPIVYDGENLFGAVPIIEFRNN-- 239 (499) T ss_pred C----------eEE----------EEEecCCc-cccCc------------------ceecccccCCCCccceEEecCC-- Confidence 1 011 00011100 00000 0111223356666778888883 Q ss_pred CCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 293 FMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITT 372 (589) Q Consensus 293 ~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~ 372 (589) ++|.|||+++.+++|++|.++|+.+..++.++.|.+.+.-..+ +.+.+..... ...-+.... T Consensus 240 ---~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~--------~~~~~~~~~~-------~~~~~~~~~ 301 (499) T protein:vir:10 240 ---EERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGL--------GDDKDDIQRL-------KRGAIEAPP 301 (499) T ss_pred ---CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc--------ccccchhhhh-------hhcceeccC Confidence 5799999999999999999999999999999999888752221 2221111110 000111121 Q ss_pred cccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_020883. 373 FDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDF 452 (589) Q Consensus 373 ~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~a 452 (589) .+.+..+++++|++..+.+..+++.|.++|+..+++|..+++.. +++.||+|+++++..+..|+..++..|..+ T Consensus 302 -~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-----~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 375 (499) T protein:vir:10 302 -REEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKF-----MGNVSGEAMKFKLFGLENLLSIKQRYFFDG 375 (499) T ss_pred -CCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhh-----cccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23445688999999999999999999999999999998877643 124599999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHH Q lcl|NC_020883. 453 LKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIE 532 (589) Q Consensus 453 Lk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~ 532 (589) |+++++++..+.+..+.......+.|.|++.+|.++++. |++++.+ ++++|.||+++++ |.+++ +++|++||+ T Consensus 376 l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~--~~~~~kl--~g~iS~et~~~~l-~~v~d--~~~E~~ri~ 448 (499) T protein:vir:10 376 LRRRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDV--VNNVKNA--DGIIPRKYTYSWL-PDVDN--PQDVIDEMN 448 (499) T ss_pred HHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHH--HHHHHHH--hccCChHHHHHhC-CCCCC--HHHHHHHHH Confidence 999999998887766656666778899999999886655 6777776 4689999999875 76665 578999998 Q ss_pred hhcccccc---ccccccccccccccCcccCCCCCCCCC-CCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 533 EEQAGSDT---SSLMGINQTFEQMNDNRDEDGNIIEEG-DTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 533 ~E~a~~~p---~~~g~~~~~l~~~~~~~~~~~~p~deg-~~~eep~~~~~e~~~~~~~~~~ 589 (589) +|+..... .++++.++. ....+++ ++++++.++..-+..+++--=| T Consensus 449 ~E~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (499) T protein:vir:10 449 QQDAETIKKNQEALRGQDPD-----------RLELEDKQDDSSENDKEAGSNHNQSHRTRA 498 (499) T ss_pred HHHHHHHHHHHhhhccCCCC-----------CCCCCCCCcccCCCCCCCccccccCCCCCC Confidence 88754321 222222111 1111212 2333333334445555555555 No 25 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=2.8e-42 Score=248.42 Aligned_cols=469 Identities=14% Similarity=0.072 Sum_probs=276.0 Q ss_pred Ccc---c-----------------eeccchhHHHHhh---c--chhhhhhhhhcCCccccCHHHHHHHhhccccceeccC Q lcl|NC_020883. 1 MID---W-----------------TVRGWTDKTTKNV---H--GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDS 55 (589) Q Consensus 1 ~~~---~-----------------~~~~~~~~~~~~~---~--~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~ 55 (589) |.| | -.-.|+-..|+.+ | ..|.+..+.|+|+| ++..|-..........- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~----- 74 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCEN-DIEKKRRTYYDAAGQQL----- 74 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhcccc-chhhccchhcccccccc----- Confidence 000 0 0011222233333 2 22444446699998 67666433222111100 Q ss_pred cceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhH Q lcl|NC_020883. 56 SQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEI 135 (589) Q Consensus 56 ~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~ 135 (589) .-+..-+..|+.|+++.|++..+.++ +|+=.+.-. + +.. .+++ T Consensus 75 --~~~~~~~~ri~~n~~~~ivd~~~~yl---~g~~~~~~~----------~------------------d~~----~~~~ 117 (503) T protein:vir:59 75 --VDDTKTNNRTSHAWHKLFVDQKTQYL---VGEPVTFTS----------D------------------NKT----LLEY 117 (503) T ss_pred --cccccccceeecchHHHHHHHHHhhh---hcCCeeecc----------C------------------cHH----HHHH Confidence 00112235789999999999999998 444333111 0 001 1335 Q ss_pred HHHHHhhccccccchhhHHHHHHcCceeEEEEEecC-ceeEEEecCceeccc-ccCc--c--eeEEE-eecCCCccceEE Q lcl|NC_020883. 136 IEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPH-DDEK--G--ADLAY-YIDHGQYGQFLH 208 (589) Q Consensus 136 i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~-~d~~--~--~div~-~~e~~~~~~~l~ 208 (589) ++.+.. ++|.........++.+-|..+..+|++++ .+++.+.+|.++||. ++.. . +.|-| ..+....+.+.| T Consensus 118 l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~ 196 (503) T protein:vir:59 118 VNELAD-DDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQK 196 (503) T ss_pred HHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEE Confidence 555555 46777888899999999999999999855 599999999999993 2221 1 11211 222222222222 Q ss_pred EEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEec Q lcl|NC_020883. 209 IYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWA 288 (589) Q Consensus 209 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvP 288 (589) ++ +|-.. .+++....+..+....... .....+.. .......+..++.|++++ T Consensus 197 ~e--------------vy~~~----~i~~~~~~~~~~~~~~~~~---------~~~~~~~~-~~~~~~~~~~~vPiv~~~ 248 (503) T protein:vir:59 197 AE--------------LYTDT----HVYYYEKIDGVYQMDYSYG---------ENNPRPHM-TKGGQAIGWGRVPIIPFK 248 (503) T ss_pred EE--------------EEeCC----cEEEEEEcCCccccccccc---------ccccccce-eecceeccCCccceEEec Confidence 22 22110 0011111111111100000 00000101 112233566666677777 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDM 368 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dl 368 (589) |+ .+|.|||+++.+++|++|.++|+.+..++.++.|.+.+. |...+.+.+... +.... T Consensus 249 nn-----~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~--------g~~~~~~~~~~~---------~~~~~ 306 (503) T protein:vir:59 249 NN-----EEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLK--------NYDGENPKEFTA---------NLRYH 306 (503) T ss_pred CC-----CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEee--------cCCccccchhhh---------hhhcc Confidence 74 369999999999999999999999999999999977653 211111111111 01111 Q ss_pred cccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_020883. 369 EITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKE 448 (589) Q Consensus 369 ev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~ 448 (589) .+...+++| .+.+++|+++.+....+++.|.+.|+..+++|..+++... +..||+|++.++.++..|+.+++.. T Consensus 307 ~~~~~~~~~-~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-----~~~Sg~Ai~~~~~~l~~k~~~~~~~ 380 (503) T protein:vir:59 307 SVIKVSGDG-GVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIG-----GGATGPALENLYALLDLKANMAERK 380 (503) T ss_pred cceeccCCC-cceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-----ccccHHHHHHHHHHHHHHHHHHHHH Confidence 122223333 3678999999999999999999999999999988776432 2358999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcCcc--cCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHH Q lcl|NC_020883. 449 YIDFLKELYESCLWLLNDQDSS--IRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQE 526 (589) Q Consensus 449 ~~~aLk~li~~~l~L~~~~~~~--~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~e 526 (589) |..+|++++++++.+.+..+.. .......|.|++.+|.++++. |++...++++|++|.+|++.++ |.+++ +++ T Consensus 381 ~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~--~~~~~kl~~~GiiS~et~l~~l-~~v~d--~~~ 455 (503) T protein:vir:59 381 IRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEI--VQSLVQGVTGGIMSKETAVARN-PFVQD--PEE 455 (503) T ss_pred HHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHH--HHHHHHHHhCCCCchHHHHHhC-CCCCC--HHH Confidence 9999999998877666543322 223456899999999997765 6777778888999999999875 65653 568 Q ss_pred HHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc Q lcl|NC_020883. 527 EIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG 585 (589) Q Consensus 527 Ev~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~ 585 (589) |++||++|+......... .+++. +-..++.++.+++.+.++-..-+-. T Consensus 456 E~~ri~~E~~~~~~~~~~--------~~~~~---~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 456 ELARIEEEMNQYAEMQGN--------LLDDE---GGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred HHHHHHHHHHHHHhhhcc--------ccCcc---CCCCCCCcCCCCCCcccCCCCCCcC Confidence 999999887653221111 11111 1112222233333333322211111 No 26 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=3e-42 Score=248.20 Aligned_cols=456 Identities=14% Similarity=0.087 Sum_probs=281.6 Q ss_pred CccceeccchhHHHHhhc----chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVH----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIA 76 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~ 76 (589) +.+|. |--+.|.... .-|.++.+.|+|+|+.+..+.... .+...+-.|+.|+++.|+ T Consensus 38 ~~~~~---~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~----------------~~~~~~~ki~~n~~k~Iv 98 (501) T protein:vir:27 38 VNNWE---LLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRK----------------DREMADKRAVHNYGRMIS 98 (501) T ss_pred cccHH---HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccC----------------ccccccceeccchHHHHH Confidence 33332 2234443222 235666688999988887662210 011123468999999999 Q ss_pred ccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH Q lcl|NC_020883. 77 EIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH 156 (589) Q Consensus 77 ~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~ 156 (589) +..+.++ +|+=.+... +. ....+.-++++..+.+.++|.........++ T Consensus 99 d~~~~yl---~g~p~~~~~----------~d------------------~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~ 147 (501) T protein:vir:27 99 KFKTGYL---AGNPIRVEY----------DD------------------NDNNSQNDDTIKRIGRINDIDSHNRTLIRDL 147 (501) T ss_pred HHHhhhh---cccCeeEec----------CC------------------ccchHHHHHHHHHHHHhcChhHHHHHHHHHH Confidence 9999888 443222110 00 0000111457888899999999999999999 Q ss_pred HHcCceeEEEEEecC-ceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhccccccch Q lcl|NC_020883. 157 QVDGGIVAAPVIDEL-GPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGD 234 (589) Q Consensus 157 ~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~ 234 (589) .+-|.....+|++.+ .++|.+.+|.+.||. |... ..+---++++|+......+. ....+|- +.. T Consensus 148 ~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v---------~d~~~~~~~~~~ir~~~~~~~~~~~-~~~~vyt----~~~ 213 (501) T protein:vir:27 148 SQTGRAYEVIYRNEYDETRIKRLNPLETFVI---------YDNSLEDNSIAAVRYYNRGTLQNAK-DVVEIYT----NEH 213 (501) T ss_pred hhCCeEEEEEEeCCCCceEEEEEccceeEEE---------ecCCCCCceEEEEEEEEeeecCCcE-EEEEEEe----CCe Confidence 999999999999954 499999999999993 3221 11111233444332222221 1112220 111 Q ss_pred hheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHH Q lcl|NC_020883. 235 VKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINW 314 (589) Q Consensus 235 ~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~ 314 (589) +++....|+. .+......+..++.|++++|+ ++|+|||+++.+++|++|. T Consensus 214 v~~~~~~~~~-------------------------~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~v~~liDa~d~ 263 (501) T protein:vir:27 214 IYTLDASDDF-------------------------NEISVTTHAFGTVPITEFLNN-----VDGIGDYETELYLIDLYDS 263 (501) T ss_pred EEEEEeCCce-------------------------eeccccccCCCcccEEEecCC-----CCCCCchhhhHHHHHHHHH Confidence 1111111110 011122345556668888885 5799999999999999999 Q ss_pred HHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHH Q lcl|NC_020883. 315 TITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMD 394 (589) Q Consensus 315 t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~ 394 (589) ++|+.+..++.++.|.+.+.-...+.. +..+......++.... +.........+..+++++|+...+.... T Consensus 264 ~~S~~~~~~~~~~~~~~v~~g~~~~~~-----~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 334 (501) T protein:vir:27 264 AESDTANHMSDMADAILAIYGDLALPK-----GMQASDMKRTRLMQLK----PPKSADGKEGTVKAEYLTKSYDVSGAEA 334 (501) T ss_pred HHHHHHHHHHHhcCceeeeecCccCCc-----ccchhhhhhcCceeec----ccccccCCCCCcceeeeeccCCHHHHHH Confidence 999999999999999877642222110 0000000111111000 0011112234456789999999999999 Q ss_pred HHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc--ccC Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS--SIR 472 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~--~~~ 472 (589) +++.|.+.|+..+++|..+++... ++.||+|++..+..+..|+.+++..|.++|++++++++.+.+..+. ... T Consensus 335 ~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d 409 (501) T protein:vir:27 335 YKTRLNRDIHIFTNIPDMSDTNFS-----GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD 409 (501) T ss_pred HHHHHHHHHHHHhCCcccCccccc-----cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 999999999999999987776432 2459999999999999999999999999999999988877654332 344 Q ss_pred cccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc Q lcl|NC_020883. 473 IEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ 552 (589) Q Consensus 473 ~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~ 552 (589) ...+.|.|.+.+|.++++. |+++..+ ++++|.+|++.++ |.+++ +++|++||++|+..+++....++ + T Consensus 410 ~~~i~v~f~~~~p~n~~e~--ad~~~kl--~g~iS~et~l~~l-~~v~D--~~~E~eri~~E~~e~~~~~~~~~---~-- 477 (501) T protein:vir:27 410 ESLLKITFTPNLPKSLNEQ--VSILTGL--GGQVSQETALSLS-GLVES--PNEELDKINKEVSEIDFKGYSND---F-- 477 (501) T ss_pred cccceEEeCCCCCcCHHHH--HHHHHHH--hccCcHHHHHHhC-CCCCC--HHHHHHHHHHHHHhhhHhhhcCc---c-- Confidence 5567899999999886655 6666665 4689999999876 66664 56899999999876554444332 1 Q ss_pred ccCcccCCCCCCCCCCCCCCCCcchhhhhhcccc Q lcl|NC_020883. 553 MNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGE 586 (589) Q Consensus 553 ~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~ 586 (589) ++. ..+.+++.++..++|+|... | T Consensus 478 --~~~-----~~~~~d~~~~~~~d~~e~~~---~ 501 (501) T protein:vir:27 478 --NEH-----VGKYTDEVKETHTDDFERAY---E 501 (501) T ss_pred --ccc-----cccccCCCCCCccccccccC---C Confidence 111 11111111111111111110 0 No 27 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=2.5e-42 Score=248.68 Aligned_cols=431 Identities=14% Similarity=0.055 Sum_probs=275.4 Q ss_pred HHHhhcch----hhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccccc Q lcl|NC_020883. 13 TTKNVHGD----YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIG 88 (589) Q Consensus 13 ~~~~~~~~----~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~ 88 (589) .+..+|.. |.++++.|+|+|..++.+.... .+...+-.|++|+++.|++..+.++ +| T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~----------------~~~~~~~ki~~n~~~~ivd~~~~~l---~g 61 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRL----------------DDEKADYRVRHKWGGYISSFATGYV---IG 61 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccc----------------cccCCcceeecchHHHHHHhhhhhe---ec Confidence 55555553 5556689999999887762211 1111234689999999999999988 55 Q ss_pred cccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcCceeEEEEE Q lcl|NC_020883. 89 QIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDGGIVAAPVI 168 (589) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~ 168 (589) +=.+.-..+ . .+++ ..+++..+.+.++|.........++.+-|..+..+|+ T Consensus 62 ~~~~~~~~~---------~----------------~~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~ 112 (440) T protein:vir:95 62 NPVSIGVME---------G----------------GSAD----QLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFR 112 (440) T ss_pred cCceEeeCC---------C----------------ccHH----HHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEe Confidence 433311100 0 0011 1447889999999999999999999999999999999 Q ss_pred ecCc-eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccc Q lcl|NC_020883. 169 DELG-PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTN 247 (589) Q Consensus 169 ~~~~-~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~ 247 (589) +.++ +++.+.++.+.||. |...- ..+.+...+.+...+...++ +|-...+. .+....+. T Consensus 113 d~~~~~~i~~~~p~~~~~~---------~d~~~--~~~~~~~i~~~~~~~~~~~~--vyt~~~~~---~~~~~~~~---- 172 (440) T protein:vir:95 113 DKDKVDRVVLISPLEMFVI---------RDLTV--EQNIIAAVHLPIYADKVNMT--VYTKDKVI---TYKPYSNN---- 172 (440) T ss_pred cCCCceEEEEEcccceEEE---------EcCCC--CCceEEEEEEEEecCceEEE--EEeCCeEE---EEEEecCC---- Confidence 8655 99999999999984 21110 01111111111111111111 22000000 00000000 Q ss_pred cccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhC Q lcl|NC_020883. 248 VEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNG 327 (589) Q Consensus 248 ~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~g 327 (589) ............++.+..||+++|+ +.|.||++++.+++|++|.++|+.++.++.|+ T Consensus 173 ------------------~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~ 229 (440) T protein:vir:95 173 ------------------SVRLVVDDVKKHSYNDVPVVEWWNN-----RFRMGDYESEISLIDAYDAGQSDTANYMSDLN 229 (440) T ss_pred ------------------ccceeecceeeccCceeeEEEeeCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 0001111233456667778888984 46999999999999999999999999999999 Q ss_pred CCcEEechhhhhcccccccccccc---ccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHH Q lcl|NC_020883. 328 KPRISITKEMMDTLLNIAYERDGH---SAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLML 404 (589) Q Consensus 328 kpRI~VP~~~L~t~~g~~~d~dge---~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il 404 (589) .|.+.+ .++.. ....+++ .....++.... +.........+..+++++|++..+....+++.|.+.|+ T Consensus 230 ~~~~v~-~g~~~-----~~~~~~e~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~ 299 (440) T protein:vir:95 230 DAMLLV-KGDLD-----GIKLSPEDAAKMKDANMLFLK----TGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIH 299 (440) T ss_pred cceeee-ecccc-----cCCCCccchhhhhhccceecc----cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHH Confidence 997654 22110 0111111 11111111110 00001112334567899999999999999999999999 Q ss_pred HHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cCcccCcccceeeeCCc Q lcl|NC_020883. 405 IETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND-QDSSIRIEEPNIETQDM 483 (589) Q Consensus 405 ~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~-~~~~~~~e~p~I~f~D~ 483 (589) ..+++|..+|+... ++.||+|++..+..+..|+.+++..|.++|+++++++..+... .+.......+.|.|.+. T Consensus 300 ~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~ 374 (440) T protein:vir:95 300 RFSRIPNLDDDRFN-----STSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPN 374 (440) T ss_pred HHhCCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCC Confidence 99999988887532 3459999999999999999999999999999999887766554 33445566778999999 Q ss_pred CCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCC Q lcl|NC_020883. 484 ILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNI 563 (589) Q Consensus 484 lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p 563 (589) +|.++++. |+++..+ ++++|.||++.++ |..+++ +|++||++|+..+.+.. . +. ..+ T Consensus 375 ~p~~~~~~--ad~~~kl--~g~iS~et~~~~l-~~~d~~---~E~~ri~~E~~~~~~~~--------~----~~---~~~ 431 (440) T protein:vir:95 375 IPQDVWTE--IKAYIEA--GGEISQETLMENA-SFTDYK---TEHSRILKQGGSSDLEI--------G----QI---VGD 431 (440) T ss_pred CCCCHHHH--HHHHHHH--hccCcHHHHHHhC-CCCCcH---HHHHHHHHHHHHhhhhH--------H----hh---ccC Confidence 99886655 6677665 4689999999987 656543 69999999987532211 1 11 112 Q ss_pred CCCCCCCCC Q lcl|NC_020883. 564 IEEGDTEEE 572 (589) Q Consensus 564 ~deg~~~ee 572 (589) .|.|++++| T Consensus 432 ~~~~~~~~e 440 (440) T protein:vir:95 432 ADVGQADTE 440 (440) T ss_pred CCCCCcCCC Confidence 333333333 No 28 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=2.8e-42 Score=248.37 Aligned_cols=461 Identities=13% Similarity=0.063 Sum_probs=275.2 Q ss_pred Ccccee----ccchhHHHHhh---cc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MIDWTV----RGWTDKTTKNV---HG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~~~~~----~~~~~~~~~~~---~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) ...|.. .-++-+.++.+ |. .|.++++.|+|+|..+...... . . +.--+..|+ T Consensus 28 ~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-~-----------~----~~~~~~ki~ 91 (511) T protein:vir:10 28 VYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-K-----------E----EYMADNRVA 91 (511) T ss_pred CccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-c-----------c----cccCcceee Confidence 222221 11122333333 22 3455668999999765433111 0 0 001223688 Q ss_pred EEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccccc Q lcl|NC_020883. 69 FNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERR 148 (589) Q Consensus 69 ~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~ 148 (589) .|+++.|++..+.++ +|+=.+.-. +. .+ -++++..+.+.++|... T Consensus 92 ~n~~k~Iv~~~~~yl---~g~p~~~~~----------~d------------------~~----~~~~l~~~~~~n~~~~~ 136 (511) T protein:vir:10 92 HDYASYISDFINGYF---LGNPIQYQD----------DD------------------KD----VLEAIEAFNDLNDVESH 136 (511) T ss_pred cchHHHHHHHHhhhh---cccCceeec----------Cc------------------hH----HHHHHHHHHhhcCHHHH Confidence 999999999988888 443222100 00 00 13578889999999999 Q ss_pred chhhHHHHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeecC-CCccceEEEEEeeecc-----cccee Q lcl|NC_020883. 149 HWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYIDH-GQYGQFLHIYRERVEK-----DGLRT 221 (589) Q Consensus 149 ~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e~-~~~~~~l~~~~~~~~~-----~~~~~ 221 (589) +......+.+-|..+..+|+++ +.+++.+.+|.+.||. |.+.- .+---++++|+..... ..... T Consensus 137 ~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~v---------ydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~ 207 (511) T protein:vir:10 137 NRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVI---------YDNTIERNSIAGVRYLRTKPIDKTDEDEVFTV 207 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEE---------EcCCCCCceEEEEEEEEeeecccCccceEEEE Confidence 9999999999999999999985 5599999999999983 32110 0111122222211100 00000 Q ss_pred ehhhhccccccchhheeecc-cccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCc Q lcl|NC_020883. 222 TNMLYPVVKAKGDVKKEIKK-GELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGIS 300 (589) Q Consensus 222 ~~~~y~~~~~~~~~~~~~~~-gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~S 300 (589) .+|- +..+++.... +..... ...+......+.....|++++|+ ..|.| T Consensus 208 --~iyt----~~~i~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~vPvv~f~nn-----~~g~g 256 (511) T protein:vir:10 208 --DLFT----SHGVYRYLTSRTNGLKL--------------------TPRENGFESHSFERMPITEFSNN-----ERRKG 256 (511) T ss_pred --EEEe----CCcEEEEEecCCCcccc--------------------cccccccccccCcceeEEEecCC-----CCCCC Confidence 1120 0000111111 110000 00111122345556668888884 37999 Q ss_pred chhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCcc Q lcl|NC_020883. 301 ALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSM 380 (589) Q Consensus 301 D~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~ 380 (589) ||+++.+++|++|.++|+.+..++.+++|.+.+.-..............+........... . ......+.+..+ T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~-----~-~~~~~~~~~~d~ 330 (511) T protein:vir:10 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYA-----D-SEGRETEGSVDG 330 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceeccccccc-----c-cccccCCCCcce Confidence 9999999999999999999999999999977654322211111111111111000000000 0 011123445667 Q ss_pred ceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 381 EIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESC 460 (589) Q Consensus 381 ~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~ 460 (589) .+++|++..+.+..+++.|.+.|+..+++|..+++... ++.||+|+++.++.+..|+.+++..|..+|+++++++ T Consensus 331 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li 405 (511) T protein:vir:10 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 89999999999999999999999999999998876432 2459999999999999999999999999999999887 Q ss_pred HHHHhhcC---cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccc Q lcl|NC_020883. 461 LWLLNDQD---SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAG 537 (589) Q Consensus 461 l~L~~~~~---~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~ 537 (589) ..+....+ .......+.|.|.+.+|.+.++. ++++..+. |++|.||+++++ |..++ +++|++||++|+.. T Consensus 406 ~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~--~~~~~kl~--G~iS~et~~~~l-~~v~d--~~~E~~ri~~E~~~ 478 (511) T protein:vir:10 406 ETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEE--LKAYIDSG--GKISQTTLMSLF-SFFQD--PELEVKKIEEDEKE 478 (511) T ss_pred HHHHHhhCCcccccccceeeEEeCCCCCcCHHHH--HHHHHHHh--ccCcHHHHHHhC-CCCCC--HHHHHHHHHHHHHH Confidence 66654322 12334456799999999987755 66666664 689999999887 65665 46899999999765 Q ss_pred cccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 538 SDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 538 ~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) +.+...... .. ++++...++.++++++..++.| T Consensus 479 ~~~~~~~~~---~~------~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 479 SIKKAQKGI---YK------DPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHhhhc---cc------CCCCCCCCCCCCcccCcccccC Confidence 433322211 00 0111111222233333333333 No 29 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=4.1e-42 Score=247.48 Aligned_cols=463 Identities=14% Similarity=0.050 Sum_probs=274.7 Q ss_pred Ccccee--c--cchhHHHHhh---cc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MIDWTV--R--GWTDKTTKNV---HG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~~~~~--~--~~~~~~~~~~---~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) ...|.. . -++-+.++.+ |- .|.++++.|+|+|..+.......- +..-+-.|+ T Consensus 28 ~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~----------------~~~~~~ki~ 91 (511) T protein:vir:93 28 VYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE----------------EYMADNRVA 91 (511) T ss_pred cccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcc----------------cccCcceee Confidence 333321 1 1111223332 32 255567899999977654421100 001123689 Q ss_pred EEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccccc Q lcl|NC_020883. 69 FNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERR 148 (589) Q Consensus 69 ~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~ 148 (589) .|+++.|++..+.++ +|+=.+.-. +.+ . -++++..+.+.++|... T Consensus 92 ~n~~k~Iv~~~~~yl---~g~p~~~~~----------~d~------------------~----~~~~l~~~~~~n~~~~~ 136 (511) T protein:vir:93 92 HDYASYISDFINGYF---LGNPIQYQD----------DDK------------------D----VLEVIEAFNDLNDVESH 136 (511) T ss_pred cchHHHHHHHHhhhh---cccCeeecc----------CCh------------------H----HHHHHHHHHhhcCHhHH Confidence 999999999999988 554333111 000 0 14578899999999999 Q ss_pred chhhHHHHHHcCceeEEEEEe-cCceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeecccc---ceeeh Q lcl|NC_020883. 149 HWSNIVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDG---LRTTN 223 (589) Q Consensus 149 ~~~~l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~---~~~~~ 223 (589) .......+.+-|..+..+|++ ++.+++.+.+|.+.||. |.+. ..+---++++|........ ....- T Consensus 137 ~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~~~v---------ydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~ 207 (511) T protein:vir:93 137 NRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVI---------YDNTIERNSIAGVRYLRTKPIDKTDEDEVFTV 207 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEE---------EcCCCCCceEEEEEEEEeeeccccccceEEEE Confidence 999999999999999999998 45599999999999983 2211 0010112222221110000 00000 Q ss_pred hhhccccccchhhee-ecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch Q lcl|NC_020883. 224 MLYPVVKAKGDVKKE-IKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL 302 (589) Q Consensus 224 ~~y~~~~~~~~~~~~-~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~ 302 (589) .+|-. ..+++. ..++..... ...+......++..+.|++++|+ .+|.||| T Consensus 208 ~iyt~----~~i~~~~~~~~~~~~~--------------------~~~~~~~~~~~~g~vPvv~~~nn-----~~g~gd~ 258 (511) T protein:vir:93 208 DLFTS----HGVYRYLTSRTNGLKL--------------------TPRENGFESHSFERMPITEFSNN-----ERRKGDY 258 (511) T ss_pred EEEeC----CcEEEEEecCCCcccc--------------------ccccccccccCCCccceEEecCC-----CCCCCch Confidence 11100 000000 111110000 00111122345566668888884 4799999 Q ss_pred hhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccce Q lcl|NC_020883. 303 DNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEI 382 (589) Q Consensus 303 ~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~ 382 (589) +++.+++|++|.++|+.+..++.+++|.+++.-......... ......++................+.+..+.| T Consensus 259 e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (511) T protein:vir:93 259 EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV------RKQKEANVLFLEPTVYADSEGRETEGSVDGGY 332 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhh------cccccccceecccccccccccccCCCCcceeE Confidence 999999999999999999999999999776542221111111 11111111110000000011122345667889 Q ss_pred eeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 383 HQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLW 462 (589) Q Consensus 383 iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~ 462 (589) ++|+...+....+++.|.+.|+..+++|..+++..+ ++.||+|++++++.+..|+.+++..|..+|+++++++.. T Consensus 333 l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:93 333 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999988876432 245999999999999999999999999999999988776 Q ss_pred HHhhcCc---ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccc Q lcl|NC_020883. 463 LLNDQDS---SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSD 539 (589) Q Consensus 463 L~~~~~~---~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~ 539 (589) +.+..+. ......+.|.|.+.+|.+.++. ++++..+ +|++|.||++.++ |..++ +++|++||++|+..+. T Consensus 408 ~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~--~~~~~kl--~g~iS~et~~~~l-~~v~d--~~~E~~ri~~E~~~~~ 480 (511) T protein:vir:93 408 ILKNTWSIDANKDFNTVRYVYNRNLPKSLIEE--LKAYIDS--GGKISQTTLMSLF-SFFQD--PELEVKKIEEDEKESI 480 (511) T ss_pred HHHhccCcccccccccceEEeCCCCCCCHHHH--HHHHHHH--hccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHH Confidence 5443221 2334456899999999887655 5666665 4689999999876 65654 4689999999876543 Q ss_pred cccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 540 TSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 540 p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ....... . .++++.-.++++++.+...++.| T Consensus 481 ~~~~~~~----~-----~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 481 KKAQKGI----Y-----KDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhhhc----c-----cCCCCCCCCCCCCcccccccccC Confidence 3222111 0 00101111111222222222222 No 30 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=7.5e-42 Score=246.04 Aligned_cols=452 Identities=14% Similarity=0.123 Sum_probs=269.2 Q ss_pred Cccc---------------------eeccchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCc Q lcl|NC_020883. 1 MIDW---------------------TVRGWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~~~---------------------~~~~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~ 56 (589) |+|- +..-|-.+.++..- ..|.+.++.|+|+| ++..|....-. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~-~i~~~~~~~~~----------~~ 69 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHP-DILDAPFKRDV----------NG 69 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccccchhhhc----------cc Confidence 4332 11223334443321 23555568899998 45555222111 00 Q ss_pred ceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHH Q lcl|NC_020883. 57 QTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEII 136 (589) Q Consensus 57 ~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i 136 (589) ..-+..-+-.|+.|+++.|++..+.++ +|+=.+.-. +.+ -.++++ T Consensus 70 ~~~~~~~~~ki~~n~~k~ivd~~~~yl---~g~p~~~~~----------~~~----------------------~~~~~l 114 (478) T protein:vir:10 70 DYDETKPDWRMYTNYHQNLVDQKVAYA---VANPVTFGV----------DND----------------------KALKQI 114 (478) T ss_pred ccccccccceeccchHHHHHHHHhhhh---cccCceeec----------CCh----------------------HHHHHH Confidence 111222345789999999999999888 554333211 000 013455 Q ss_pred HHHHhhccccccchhhHHHHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeee Q lcl|NC_020883. 137 EQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERV 214 (589) Q Consensus 137 ~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~ 214 (589) ..++. ++|.........++.+-|.++..+|++. +.+++.+.++.+.||. |... .++---+++.|... T Consensus 115 ~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~p~~~~~v---------~d~~~~~~~~~~ir~~~~~- 183 (478) T protein:vir:10 115 QHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPI---------WTNKERDELQAFIRVYELD- 183 (478) T ss_pred HHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEE---------EcCCCCCceEEEEEEEeee- Confidence 66654 5677777777889999999999999994 5699999999999993 3222 11111112211110 Q ss_pred ccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCC Q lcl|NC_020883. 215 EKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFM 294 (589) Q Consensus 215 ~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~ 294 (589) ..-.+ .+|-...+. ++....+....-. ....-+............+..+..|++++|+ T Consensus 184 --~~~~~--~~y~~~~i~---~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~---- 241 (478) T protein:vir:10 184 --GAERV--EYWTKDDVT---FYELKEGQLIPDF-----------YRSEDHIQPHYYQGNKLMSWGRVPFIPFKNN---- 241 (478) T ss_pred --CceEE--EEEeCCcEE---EEEecCCeeeccc-----------cccccccccceecccccccCCcceEEEeccC---- Confidence 00000 111100000 0011111100000 0000001111111223456777778888884 Q ss_pred CcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 295 NPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFD 374 (589) Q Consensus 295 ~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~d 374 (589) +.|+|||+++.+++|++|.++|+.+..++.|+.|.+.+- |...+..++.... +.. ...+.+ .. T Consensus 242 -~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~--------g~~~~~~~~~~~~--~~~----~~~~~~--~~ 304 (478) T protein:vir:10 242 -PQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK--------GYEGEDMKDFMHN--LKY----YKAISV--AG 304 (478) T ss_pred -CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeee--------cCCcccccchhhh--hhh----CceeEe--cC Confidence 579999999999999999999999999999999966542 2211222221111 010 111122 22 Q ss_pred cccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 375 ENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLK 454 (589) Q Consensus 375 e~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk 454 (589) +.|..+++++|++..+.+..+++.|.+.|+..+++|..+++.. +++.||+|+++.++.+..|+..++..|..+|+ T Consensus 305 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-----~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 379 (478) T protein:vir:10 305 ESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-----GNSPSGIALKFMYSNLDLKANKLKNKTLTALQ 379 (478) T ss_pred CCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3456688999999999999999999999999999998776542 23569999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhh Q lcl|NC_020883. 455 ELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEE 534 (589) Q Consensus 455 ~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E 534 (589) +++++++.+. +..+....+.|.|++.+|.++++. |++++.+ ++++|.+|+++++ |.+++ +++|++||++| T Consensus 380 ~~~~li~~~~---~~~~d~~~i~i~f~~~~p~~~~e~--~~~~~~~--~g~iS~et~i~~~-~~v~d--~~~E~~ri~~E 449 (478) T protein:vir:10 380 ELLQYIIDFY---RLDVRVQDIEITFNFNVMVNELEN--SQIAMNS--TGLLSKETILGNH-SWVQD--PVAEMERIEQE 449 (478) T ss_pred HHHHHHHHHh---CCCcccccceEEeCCCCCCCHHHH--HHHHHHH--hCCCChHHHHHhC-CCCCC--HHHHHHHHHHH Confidence 9988776554 334455567899999999987765 6666664 5689999999765 76655 67999999999 Q ss_pred ccccccccccccccccccccCcccCCCCCCCCCCC Q lcl|NC_020883. 535 QAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT 569 (589) Q Consensus 535 ~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~ 569 (589) +........ +.+....+ ++++.+.|++.| T Consensus 450 ~~~~~~~~~----~~~~~~~d--~~~~~~~d~~~e 478 (478) T protein:vir:10 450 NIELNQQLP----DIEEGLND--EQQRQSEDNQSE 478 (478) T ss_pred HHHHHHhcc----ccCCCCcc--cccccCcCCCCC Confidence 875322110 01111111 111111111111 No 31 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=8.1e-42 Score=245.87 Aligned_cols=463 Identities=14% Similarity=0.061 Sum_probs=274.3 Q ss_pred Cccceec-cchhHHHHhhc----chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhh Q lcl|NC_020883. 1 MIDWTVR-GWTDKTTKNVH----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVI 75 (589) Q Consensus 1 ~~~~~~~-~~~~~~~~~~~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i 75 (589) --+|.-. ...-+.|+..+ .-|.++++.|+|+|..+...... .. +.--.-.|++|+++.| T Consensus 35 e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~-~~---------------~~~~~~ki~~n~~k~I 98 (512) T protein:vir:97 35 ESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KE---------------EYMADNRVAHDYASYI 98 (512) T ss_pred hhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-cc---------------cccCcceeecchHHHH Confidence 0011100 00112222211 12556668999999665443111 00 0112236889999999 Q ss_pred hccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHH Q lcl|NC_020883. 76 AEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQ 155 (589) Q Consensus 76 ~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~ 155 (589) ++..+.++ +|+=.+.-. +. .. -++++..+.+.++|...+.....+ T Consensus 99 vd~~~~yl---~g~p~~~~~----------~d------------------~~----~~~~l~~~~~~n~~~~~~~~~~~~ 143 (512) T protein:vir:97 99 SDFINGYF---LGNPIQCQD----------DD------------------KD----VLEAIEAFNDLNDVESHNRSLGLD 143 (512) T ss_pred HHHHhhhh---cccCceecc----------CC------------------hH----HHHHHHHHHhhcCHHHHHHHHHHH Confidence 99999888 443222111 00 00 135788899999999999999999 Q ss_pred HHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeecC-CCccceEEEEEeeecccc--ceeeh-hhhcccc Q lcl|NC_020883. 156 HQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYIDH-GQYGQFLHIYRERVEKDG--LRTTN-MLYPVVK 230 (589) Q Consensus 156 ~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e~-~~~~~~l~~~~~~~~~~~--~~~~~-~~y~~~~ 230 (589) +.+-|..+..+|+++ +.+++...+|.+.||. |.... .+---++++|+......+ -.+.+ .+|-. T Consensus 144 ~~i~G~ay~~vy~ded~~~~i~~~~p~~~~~i---------yd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~-- 212 (512) T protein:vir:97 144 LSIYGKAYELMIRNQDDETRLYKSDAMSTFVI---------YDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS-- 212 (512) T ss_pred HHhcCeEEEEEEeCCCCceEEEEEcccceEEE---------EcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeC-- Confidence 999999999999984 5599999999999983 32211 111112333321110000 00000 12200 Q ss_pred ccchhheeec-ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHH Q lcl|NC_020883. 231 AKGDVKKEIK-KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQ 309 (589) Q Consensus 231 ~~~~~~~~~~-~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~ 309 (589) ..+++... .+.... +..........+.....|++++|+ .+|+|||+++.+++ T Consensus 213 --~~i~~~~~~~~~~~~--------------------~~~~~~~~~~~~~g~vPvv~~~nn-----~~~~gd~e~v~~li 265 (512) T protein:vir:97 213 --HGVYRYLTSRTNGLK--------------------LTPRENGFESHSFERMPITEFSNN-----ERRKGDYEKVITLI 265 (512) T ss_pred --CcEEEEEecCCCccc--------------------ccccccccccccCcccceEeecCC-----CCCCCchhhhHHHH Confidence 00001111 111000 000111223456666678888884 47999999999999 Q ss_pred HHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccc-cccccccccccccccCccceeeeccc Q lcl|NC_020883. 310 DEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPR-IDHRDMEITTFDENGRSMEIHQIDIS 388 (589) Q Consensus 310 DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~-~d~~dlev~~~de~g~~~~~iq~Dir 388 (589) |++|.++|+.+..++.+++|.+.+.-..+... .++...+..+..... ............+.|..+++++|+.. T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~ 339 (512) T protein:vir:97 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDP------VEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYD 339 (512) T ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCccCCc------hhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCC Confidence 99999999999999999999887643222111 111111111111110 00000111112345566889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD 468 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~ 468 (589) .+....+++.|.+.|+..+++|..+++... ++.||+|+++++..+..|+..++..|..+|+++++++..+....+ T Consensus 340 ~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-----gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~ 414 (512) T protein:vir:97 340 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTR 414 (512) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccCccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 999999999999999999999998887532 245999999999999999999999999999999988876654322 Q ss_pred c---ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020883. 469 S---SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMG 545 (589) Q Consensus 469 ~---~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~ 545 (589) . ......+.|.|.+.+|.+.++. |+++..+ ++++|.||++.++ |..++ +++|++||++|+....+..... T Consensus 415 ~~~~~~d~~~i~~~f~~~~p~~~~e~--~~~~~kl--~giiS~et~~~~l-~~v~d--~~~E~eri~~E~~~~~~~~~~~ 487 (512) T protein:vir:97 415 SIDANKDFNTVRYVYNRNLPKSLIEE--LKAYIDS--GGKISQTTLMSLF-SFFQD--PELEVKKIEEDEKESIKKAQKG 487 (512) T ss_pred CcccccccccceEEeCCCCCcCHHHH--HHHHHHH--hccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHHhhc Confidence 1 2334457899999999886655 6666665 4689999999887 65654 5689999999976543322211 Q ss_pred cccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 546 INQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 546 ~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) . .. ++++.-.++.+++.++..++.| T Consensus 488 ~---~~------~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 488 I---YK------DPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred c---cC------CCCCCCCCCCCCCccccccccC Confidence 1 00 0000001111111111111111 No 32 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=4.7e-42 Score=247.16 Aligned_cols=433 Identities=14% Similarity=0.110 Sum_probs=274.5 Q ss_pred Cc----cceeccchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcch Q lcl|NC_020883. 1 MI----DWTVRGWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPK 73 (589) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~ 73 (589) |+ .=....|-.+.|...- ..|.++++.|+|+|.-+..+. ..+...+..|++|+++ T Consensus 9 ~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~------------------~~~~~~~~ki~~n~~~ 70 (453) T protein:vir:39 9 MTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPT------------------KDLWKPDNRLTVNFTK 70 (453) T ss_pred eEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCC------------------ccccCccceeecchHH Confidence 11 0011223333333321 234555689999984322211 0111233468899999 Q ss_pred hhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhH Q lcl|NC_020883. 74 VIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNI 153 (589) Q Consensus 74 ~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l 153 (589) .|++..+.++ +|.=.+.-.. + .+ -++.+.++.+.++|........ T Consensus 71 ~ivd~~~~~l---~g~~~~~~~~---------d-------------------~~----~~~~l~~i~~~N~~~~~~~~~~ 115 (453) T protein:vir:39 71 YIVDTFTGYF---NGIPVKKSHS---------D-------------------KE----TLSKLQEFDNLNDMEDEESELA 115 (453) T ss_pred HHHHHHhhhh---cccCceeccC---------C-------------------hH----HHHHHHHHHHhcChhHHHHHHH Confidence 9999999888 4432221110 0 00 1457899999999999999999 Q ss_pred HHHHHcCceeEEEEEec-CceeEEEecCceeccc-ccCcceeEEEeecCC-Ccc--ceEEEEEeeeccccceeehhhhcc Q lcl|NC_020883. 154 VQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPH-DDEKGADLAYYIDHG-QYG--QFLHIYRERVEKDGLRTTNMLYPV 228 (589) Q Consensus 154 ~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~-~d~~~~div~~~e~~-~~~--~~l~~~~~~~~~~~~~~~~~~y~~ 228 (589) .++.+-|.+...+|++. +.++|.+.++.+.||. ++..+-.+.+..+.. ..+ .|+. +|- T Consensus 116 ~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~----------------~yt- 178 (453) T protein:vir:39 116 KMACIYGRAFELLYQNEETQTNVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGE----------------VYT- 178 (453) T ss_pred HHHhhcCeEEEEEEecCCCceEEEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEE----------------EEe- Confidence 99999999999999985 4599999999999994 222211111111100 011 1111 220 Q ss_pred ccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHH Q lcl|NC_020883. 229 VKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESK 308 (589) Q Consensus 229 ~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l 308 (589) +..+++....+..+. ....+..+..+..||+++|+ .+|+|||+.+.++ T Consensus 179 ---~~~i~~~~~~~~~~~------------------------~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~v~~l 226 (453) T protein:vir:39 179 ---KETTYALNGTMGFYN------------------------MTEQAPNPFDDLPVVEFYFN-----EERMSIFESVISL 226 (453) T ss_pred ---CCeEEEEEecCCcee------------------------eecccccCCCceeEEEecCC-----CCCCcchhhhHHH Confidence 011111111111111 11233456677778888884 4799999999999 Q ss_pred HHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeeccc Q lcl|NC_020883. 309 QDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDIS 388 (589) Q Consensus 309 ~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dir 388 (589) +|++|.++|+.+..++.|+.|.+.+.-..++. + +..-....++.. . ........+..+.+++|++. T Consensus 227 iDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~------~-~~~~~~~~~~~~--~-----~~~~~~~~~~~~~~lt~~~~ 292 (453) T protein:vir:39 227 VNAFNKAISEKANDVDYFSDQYLTFLGAAVEE------E-DLKNIRSNRVIN--Y-----YGESSEAKNVDVKFLEKPDS 292 (453) T ss_pred HHHHHHHHHHHHHHHHHhhCceeeeecCCCCc------h-hhhhhhhcceee--e-----cCCCCCCCCCceeEEeecCC Confidence 99999999999999999999988875322211 1 110011111110 0 00011223455778999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD 468 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~ 468 (589) .+....+++.|.+.|+..|++|..+++.. | ..||+|++..++.+..|+.+++..|..+|++++++++.+.+..+ T Consensus 293 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~---g---n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~ 366 (453) T protein:vir:39 293 DSQTENLLDRLTKLIFQTTMVANISDESF---G---SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS 366 (453) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccc---c---CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999997665432 1 34899999999999999999999999999999999988888776 Q ss_pred cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccc Q lcl|NC_020883. 469 SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQ 548 (589) Q Consensus 469 ~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~ 548 (589) .......+.|.|++.+|.+++++ |+++..+ ++++|.||++.++ |..++ +++|++||++|+......... T Consensus 367 ~~~~~~~i~v~f~~~~p~~~~~~--a~~~~kl--~g~is~et~l~~l-~~v~D--~~~E~~ri~~E~~~~~~~~~~---- 435 (453) T protein:vir:39 367 NKEAWKDIEYTFTRNEPKDIKEQ--AETANIL--MGITSQETALSVI-SVIPD--VQAEMEKIKKEEASTAIFDKD---- 435 (453) T ss_pred CccccccceEEeCCCCCcCHHHH--HHHHHHH--hccCChHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHHHh---- Confidence 66666777899999999886654 6777665 4689999999876 75654 579999999998754322111 Q ss_pred ccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 549 TFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 549 ~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ..+.++|.+++-|+.++ | T Consensus 436 ------------~~~~~~~~~~~~~~~~~-e 453 (453) T protein:vir:39 436 ------------KQPSEKGTDTVVPETNE-E 453 (453) T ss_pred ------------ccCCCCCCCCCCCCcCC-C Confidence 01112222221111111 1 No 33 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=1.7e-41 Score=244.13 Aligned_cols=450 Identities=12% Similarity=0.065 Sum_probs=268.3 Q ss_pred Ccccee---ccchhHHHHhhc----chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcch Q lcl|NC_020883. 1 MIDWTV---RGWTDKTTKNVH----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPK 73 (589) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~ 73 (589) .++--. ..|-.+.+...| .-|.++++.|+|+|.-++.+ ...- + . + .+-.|+.|+++ T Consensus 8 ~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~-~~~~---~-------~----~--~~~ki~~n~~~ 70 (489) T protein:vir:99 8 AIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRP-AKTD---K-------Y----A--ADNRIASDFAK 70 (489) T ss_pred eeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccc-cccc---c-------c----C--CcceeecchHH Confidence 111100 012223343333 33677779999999544332 1100 0 0 0 11258899999 Q ss_pred hhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhH Q lcl|NC_020883. 74 VIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNI 153 (589) Q Consensus 74 ~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l 153 (589) .|++..+.++ +|+=.+.-.. + .. -+++++.+.+.++|...+.... T Consensus 71 ~iv~~~~~~l---~g~~~~~~~~-----------d-----------------~~----~~~~l~~~~~~n~~~~~~~~~~ 115 (489) T protein:vir:99 71 YITVFEQGYM---LGVPVEYKNE-----------N-----------------KD----LQAAIDLMSVRNNEDYHNVKIK 115 (489) T ss_pred HHHHHHhhhh---ccCCceeecC-----------C-----------------hh----HHHHHHHHHhhcChhHHHHHHH Confidence 9999999888 5543331110 0 00 1457888899999998999999 Q ss_pred HHHHHcCceeEEEEEe-----cCceeEEEecCceeccc-ccCc----ceeEE-EeecCCCccceEEEEEeeeccccceee Q lcl|NC_020883. 154 VQHQVDGGIVAAPVID-----ELGPRIVFKARDVYFPH-DDEK----GADLA-YYIDHGQYGQFLHIYRERVEKDGLRTT 222 (589) Q Consensus 154 ~~~~v~Gg~~~~~~~~-----~~~~~i~f~~~d~~~P~-~d~~----~~div-~~~e~~~~~~~l~~~~~~~~~~~~~~~ 222 (589) .++.+-|..+..+|+. ++.++|.+.+|.++||. ++.. -+.+- |..+.+......|++. +.....+ T Consensus 116 ~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~-y~~~~i~--- 191 (489) T protein:vir:99 116 TDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKA-YTSDTIY--- 191 (489) T ss_pred HHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEE-EeCCcEE--- Confidence 9999999888888863 44589999999999994 2221 11111 1222221111111111 0000000 Q ss_pred hhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch Q lcl|NC_020883. 223 NMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL 302 (589) Q Consensus 223 ~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~ 302 (589) .|+. ...+ ............++.+..|++++|+ +.|+|+| T Consensus 192 --~~~~----------~~~~-----------------------~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~s~~ 231 (489) T protein:vir:99 192 --TYED----------YNLE-----------------------TKGMRLKDYEGHFFKGVPVNEYANN-----EERTGAY 231 (489) T ss_pred --EEEe----------cCCC-----------------------cccceecccccccCCceeEEEeecC-----CCCCCch Confidence 0100 0000 0000011123345666678888885 4799999 Q ss_pred hhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccc------ccccccccccccccccccccccccccccccc--- Q lcl|NC_020883. 303 DNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLL------NIAYERDGHSAKEASMMTPRIDHRDMEITTF--- 373 (589) Q Consensus 303 ~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~------g~~~d~dge~~~~~~~~~~~~d~~dlev~~~--- 373 (589) +++.+++|++|.++|+.+..++.++.|.+.+--..+.... ....+.++.......++ .+.-+.+... T Consensus 232 ~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 307 (489) T protein:vir:99 232 ESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFK----KAQVLILDDNPNP 307 (489) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccc----cceeeeeccccCc Confidence 9999999999999999999999999987765221111100 00000011000000000 0000001000 Q ss_pred ccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_020883. 374 DENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFL 453 (589) Q Consensus 374 de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aL 453 (589) ...+..+++++|++..+....+++.|.+.|+..+++|..+++.. +++.||+|++++++.+.+|+..++..|..+| T Consensus 308 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-----~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l 382 (489) T protein:vir:99 308 NGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKF-----SGVQSGESMKYKLMASDNYREKQERLFKKGL 382 (489) T ss_pred cccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-----cccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11123467899999999999999999999999999997665432 2345999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcCcc----cCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHH Q lcl|NC_020883. 454 KELYESCLWLLNDQDSS----IRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIA 529 (589) Q Consensus 454 k~li~~~l~L~~~~~~~----~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~ 529 (589) +++++++..+.+..+.. .......|.|++.+|.++++. ++++..++ +++|.||+++++ |.++++.+++|++ T Consensus 383 ~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~--~~~~~kl~--giis~et~~~~l-~~v~~~d~~~E~~ 457 (489) T protein:vir:99 383 MRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEI--VTAAQNLY--GIVSDQTIFEIL-NTVTGVDAEAELK 457 (489) T ss_pred HHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHH--HHHHHHHh--ccCCHHHHHHhc-CCCCchhHHHHHH Confidence 99998887766543222 223456799999999887765 56666654 689999999876 7899889999999 Q ss_pred HHHhhccccccccccccccccccccCcccCCCCCCCCCC-CCCCC Q lcl|NC_020883. 530 RIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGD-TEEEP 573 (589) Q Consensus 530 RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~-~~eep 573 (589) ||++|+.......-. +..+++.++.+ ++++| T Consensus 458 ri~~E~~~~~~~~~~-------------~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 458 RLKEEADKKQSLPEP-------------RLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHHHhccccc-------------cccCCCCCCcCCCCCCC Confidence 998887543211100 01111111111 22222 No 34 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=2.4e-41 Score=243.23 Aligned_cols=446 Identities=14% Similarity=0.137 Sum_probs=270.6 Q ss_pred Cc-----cceeccchh--------------HHHHhh---c----chhhhhhhhhcCCccccCHHHHHHHhhccccceecc Q lcl|NC_020883. 1 MI-----DWTVRGWTD--------------KTTKNV---H----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLD 54 (589) Q Consensus 1 ~~-----~~~~~~~~~--------------~~~~~~---~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~ 54 (589) || +|+.+- +- +.++.+ | ..|.+++..|+|+| ++..|-.....++. T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~------- 71 (474) T protein:vir:94 1 MFNIIRMPWDKPY-GEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDN-DIVKQMKKVDVHGN------- 71 (474) T ss_pred CcccccccCCCch-hhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-chhcccchhccccc------- Confidence 22 233111 11 222222 2 23556668999999 56556322222111 Q ss_pred CcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhh Q lcl|NC_020883. 55 SSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNE 134 (589) Q Consensus 55 ~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e 134 (589) .-+...+-.|+.|+++.|++..+.++ +|+=.+... + +.. -++ T Consensus 72 ---~~~~~~~~ki~~n~~k~Ivd~~~~~l---~g~p~~~~~----------~------------------d~~----~~~ 113 (474) T protein:vir:94 72 ---IDYDKPDWRITTNFHQNLVDQKVSYV---ASKPVTYSC----------E------------------DEN----VLK 113 (474) T ss_pred ---cccccCcceeecchHHHHHHHHHhhh---hcCCceecc----------C------------------cHH----HHH Confidence 11122355789999999999998888 553333111 0 000 134 Q ss_pred HHHHHHhhccccccchhhHHHHHHcCceeEEEEEecC-ceeEEEecCceeccc-ccCcceeEEEeecCCCccceEEEEEe Q lcl|NC_020883. 135 IIEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPH-DDEKGADLAYYIDHGQYGQFLHIYRE 212 (589) Q Consensus 135 ~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~-~d~~~~div~~~e~~~~~~~l~~~~~ 212 (589) +++.+..| +|.........++.+-|.+...+|++++ .++|.+.++++.||. ++... ++---+++.|+. T Consensus 114 ~l~~~~~n-~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~---------~~~~~~ir~~~~ 183 (474) T protein:vir:94 114 VIHDVLDT-RWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKER---------EELKSFIRYYKF 183 (474) T ss_pred HHHHHHhc-cHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCC---------CceEEEEEEEEe Confidence 56666554 6777888888999999999999999855 499999999999994 22211 111111222221 Q ss_pred eeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCC Q lcl|NC_020883. 213 RVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNET 292 (589) Q Consensus 213 ~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~ 292 (589) ... -.. .+|-...+ .++....|.... ..... . ...+......+..++.|++++|+ T Consensus 184 ~~~---~~~--~~yt~~~~---~~y~~~~~~~~~-~~~~~-------------~-~~~~~~~~~~~~g~vPvv~~~nn-- 238 (474) T protein:vir:94 184 NNE---EKV--EFWTDTTV---TYYVLENGGLIP-DYYYG-------------A-NHVQSHFSNGNWGRVPFIAFKNN-- 238 (474) T ss_pred cCe---EEE--EEEeCCeE---EEEEEcCCcccc-ccccC-------------c-CcccccccccCCCccceEEecCC-- Confidence 100 000 12210000 001111111100 00000 0 00122233456677778888885 Q ss_pred CCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 293 FMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITT 372 (589) Q Consensus 293 ~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~ 372 (589) ++|+|||+++.+++|++|.++|+.+..++.++.|.+.+.-..+ +..++... ++ ....++. T Consensus 239 ---~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~--------~~~~~~~~--~~-------~~~~~i~ 298 (474) T protein:vir:94 239 ---PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEG--------EDLEEFMR--GL-------KYYKAIN 298 (474) T ss_pred ---cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc--------ccchhhhh--hh-------hccceee Confidence 5799999999999999999999999999999999887642211 11111111 00 0111222 Q ss_pred cccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_020883. 373 FDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDF 452 (589) Q Consensus 373 ~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~a 452 (589) .+ .+..+++++|++..+.+..+++.|.+.|+..+++|..+++..+ ++.||+|+++++..+..|+.+++..|..+ T Consensus 299 ~~-~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 372 (474) T protein:vir:94 299 VD-GDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFG-----SAPSGIALKFLYGNLDLKANKLKNKATVA 372 (474) T ss_pred cc-CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-----cccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 2345889999999999999999999999999999987766432 34599999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHH Q lcl|NC_020883. 453 LKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIE 532 (589) Q Consensus 453 Lk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~ 532 (589) |+++++++..+... ..+.....|.|.+.+|.++++. |+++ ..++++|.+|+++++ |..++ +++|++||+ T Consensus 373 l~~~~~li~~~~~~---~~d~~~i~v~f~~~~p~~~~e~--a~~~---~~~g~iS~et~l~~l-~~v~D--~~~E~eri~ 441 (474) T protein:vir:94 373 IQELISFIIDFNNL---KTDVKDIEISFNFNRMMNDAEQ--SQII---AQSQYLSRETLVKSS-PLVDD--YKAELERIE 441 (474) T ss_pred HHHHHHHHHHHhCC---CcccceeeEEeccCcccCHHHH--HHHH---HHcCCCCHHHHHHhC-CCCCC--HHHHHHHHH Confidence 99999887665432 2344556799999999886544 4443 456899999999876 65655 568999999 Q ss_pred hhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 533 EEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 533 ~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) +|+...... .+.+. +. ..|.+++.+++..++.| T Consensus 442 ~E~~~~~~~-----~~~~~----~~-----~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 442 QEQMEYNKQ-----LPNLD----DG-----GADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHhh-----ccccC----CC-----CCCCcccCCCCcccccC Confidence 888542110 01111 11 11233344444444444 No 35 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=2.4e-41 Score=243.23 Aligned_cols=446 Identities=14% Similarity=0.137 Sum_probs=270.6 Q ss_pred Cc-----cceeccchh--------------HHHHhh---c----chhhhhhhhhcCCccccCHHHHHHHhhccccceecc Q lcl|NC_020883. 1 MI-----DWTVRGWTD--------------KTTKNV---H----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLD 54 (589) Q Consensus 1 ~~-----~~~~~~~~~--------------~~~~~~---~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~ 54 (589) || +|+.+- +- +.++.+ | ..|.+++..|+|+| ++..|-.....++. T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~------- 71 (474) T protein:vir:97 1 MFNIIRMPWDKPY-GEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDN-DIVKQMKKVDVHGN------- 71 (474) T ss_pred CcccccccCCCch-hhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-chhcccchhccccc------- Confidence 22 233111 11 222222 2 23556668999999 56556322222111 Q ss_pred CcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhh Q lcl|NC_020883. 55 SSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNE 134 (589) Q Consensus 55 ~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e 134 (589) .-+...+-.|+.|+++.|++..+.++ +|+=.+... + +.. -++ T Consensus 72 ---~~~~~~~~ki~~n~~k~Ivd~~~~~l---~g~p~~~~~----------~------------------d~~----~~~ 113 (474) T protein:vir:97 72 ---IDYDKPDWRITTNFHQNLVDQKVSYV---ASKPVTYSC----------E------------------DEN----VLK 113 (474) T ss_pred ---cccccCcceeecchHHHHHHHHHhhh---hcCCceecc----------C------------------cHH----HHH Confidence 11122355789999999999998888 553333111 0 000 134 Q ss_pred HHHHHHhhccccccchhhHHHHHHcCceeEEEEEecC-ceeEEEecCceeccc-ccCcceeEEEeecCCCccceEEEEEe Q lcl|NC_020883. 135 IIEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPH-DDEKGADLAYYIDHGQYGQFLHIYRE 212 (589) Q Consensus 135 ~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~-~d~~~~div~~~e~~~~~~~l~~~~~ 212 (589) +++.+..| +|.........++.+-|.+...+|++++ .++|.+.++++.||. ++... ++---+++.|+. T Consensus 114 ~l~~~~~n-~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~---------~~~~~~ir~~~~ 183 (474) T protein:vir:97 114 VIHDVLDT-RWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKER---------EELKSFIRYYKF 183 (474) T ss_pred HHHHHHhc-cHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCC---------CceEEEEEEEEe Confidence 56666554 6777888888999999999999999855 499999999999994 22211 111111222221 Q ss_pred eeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCC Q lcl|NC_020883. 213 RVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNET 292 (589) Q Consensus 213 ~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~ 292 (589) ... -.. .+|-...+ .++....|.... ..... . ...+......+..++.|++++|+ T Consensus 184 ~~~---~~~--~~yt~~~~---~~y~~~~~~~~~-~~~~~-------------~-~~~~~~~~~~~~g~vPvv~~~nn-- 238 (474) T protein:vir:97 184 NNE---EKV--EFWTDTTV---TYYVLENGGLIP-DYYYG-------------A-NHVQSHFSNGNWGRVPFIAFKNN-- 238 (474) T ss_pred cCe---EEE--EEEeCCeE---EEEEEcCCcccc-ccccC-------------c-CcccccccccCCCccceEEecCC-- Confidence 100 000 12210000 001111111100 00000 0 00122233456677778888885 Q ss_pred CCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 293 FMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITT 372 (589) Q Consensus 293 ~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~ 372 (589) ++|+|||+++.+++|++|.++|+.+..++.++.|.+.+.-..+ +..++... ++ ....++. T Consensus 239 ---~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~--------~~~~~~~~--~~-------~~~~~i~ 298 (474) T protein:vir:97 239 ---PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEG--------EDLEEFMR--GL-------KYYKAIN 298 (474) T ss_pred ---cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc--------ccchhhhh--hh-------hccceee Confidence 5799999999999999999999999999999999887642211 11111111 00 0111222 Q ss_pred cccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_020883. 373 FDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDF 452 (589) Q Consensus 373 ~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~a 452 (589) .+ .+..+++++|++..+.+..+++.|.+.|+..+++|..+++..+ ++.||+|+++++..+..|+.+++..|..+ T Consensus 299 ~~-~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 372 (474) T protein:vir:97 299 VD-GDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFG-----SAPSGIALKFLYGNLDLKANKLKNKATVA 372 (474) T ss_pred cc-CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-----cccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 2345889999999999999999999999999999987766432 34599999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHH Q lcl|NC_020883. 453 LKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIE 532 (589) Q Consensus 453 Lk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~ 532 (589) |+++++++..+... ..+.....|.|.+.+|.++++. |+++ ..++++|.+|+++++ |..++ +++|++||+ T Consensus 373 l~~~~~li~~~~~~---~~d~~~i~v~f~~~~p~~~~e~--a~~~---~~~g~iS~et~l~~l-~~v~D--~~~E~eri~ 441 (474) T protein:vir:97 373 IQELISFIIDFNNL---KTDVKDIEISFNFNRMMNDAEQ--SQII---AQSQYLSRETLVKSS-PLVDD--YKAELERIE 441 (474) T ss_pred HHHHHHHHHHHhCC---CcccceeeEEeccCcccCHHHH--HHHH---HHcCCCCHHHHHHhC-CCCCC--HHHHHHHHH Confidence 99999887665432 2344556799999999886544 4443 456899999999876 65655 568999999 Q ss_pred hhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 533 EEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 533 ~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) +|+...... .+.+. +. ..|.+++.+++..++.| T Consensus 442 ~E~~~~~~~-----~~~~~----~~-----~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 442 QEQMEYNKQ-----LPNLD----DG-----GADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHhh-----ccccC----CC-----CCCCcccCCCCcccccC Confidence 888542110 01111 11 11233344444444444 No 36 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=2.3e-41 Score=243.42 Aligned_cols=447 Identities=13% Similarity=0.105 Sum_probs=269.0 Q ss_pred Ccccee----------------------ccchhHHHHhhcch---hhhhhhhhcCCccccCHHHHHHHhhccccceeccC Q lcl|NC_020883. 1 MIDWTV----------------------RGWTDKTTKNVHGD---YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDS 55 (589) Q Consensus 1 ~~~~~~----------------------~~~~~~~~~~~~~~---~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~ 55 (589) ||.-.- .-|-.+.|+..-.. |.+.++.|.|+| ++..|....-.++..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~-~i~~~~~~~~~~~~~~------ 73 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDN-DINYQAYKQDLHGNID------ 73 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccC-ccccccchhhhccccc------ Confidence 222211 12333444333223 444467899998 5666632211111111 Q ss_pred cceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhH Q lcl|NC_020883. 56 SQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEI 135 (589) Q Consensus 56 ~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~ 135 (589) +...+..|+.|+++.|++..+.++ +|+=.+.-. + +. .-+++ T Consensus 74 ----~~~~~~ki~~n~~k~Iv~~~~~yl---~g~p~~~~~----------~------------------~~----~~~~~ 114 (474) T protein:vir:96 74 ----YTKPDWRITTNFHQNLVDQKVSYV---AGKPVTYAH----------D------------------DD----KVLDV 114 (474) T ss_pred ----ccccccccccchHHHHHHhhhhhh---cccCceecc----------C------------------Ch----HHHHH Confidence 111234588999999999999998 554333211 0 00 11356 Q ss_pred HHHHHhhccccccchhhHHHHHHcCceeEEEEEe-cCceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEee Q lcl|NC_020883. 136 IEQITKNSKLERRHWSNIVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRER 213 (589) Q Consensus 136 i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~ 213 (589) ++++.. ++|..........+.+-|..+..+|++ ++.+++.+.+|.+.|| +|... ..+--.+++.|+. T Consensus 115 l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~---------v~d~~~~~~~~a~ir~~~~- 183 (474) T protein:vir:96 115 IHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIP---------IWTDKEREQLNAFIRIFTF- 183 (474) T ss_pred HHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEE---------EEcCCCCCceEEEEEEEee- Confidence 666665 467777788888899999999999998 4459999999999998 33221 1111112222221 Q ss_pred eccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCC Q lcl|NC_020883. 214 VEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETF 293 (589) Q Consensus 214 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~ 293 (589) ..... ..+|-...+.. +....+. ........ ...........+..+..|++++|+ T Consensus 184 --~~~~~--~~vy~~~~i~~---~~~~~~~-~~~~~~~~--------------~~~~~~~~~~~~~~~vPvv~~~nn--- 238 (474) T protein:vir:96 184 --NGETK--VEYWTAETVTY---YVYENGG-LIPDFYYG--------------DEHIQTHFSTGSWERVPFIAFKNN--- 238 (474) T ss_pred --cCeeE--EEEEeCCeEEE---EEEcCCc-eeeccccc--------------cccccCcccccCCCccceEEecCC--- Confidence 11111 11331111111 1111121 11000000 000111222345666667888874 Q ss_pred CCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 294 MNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTF 373 (589) Q Consensus 294 ~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~ 373 (589) +.|.|||+++.+++|++|.++|+.+..++.|+.|.+.+. |...+..++... +.....++.. T Consensus 239 --~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--------g~~~~~~~~~~~---------~~~~~~~i~~ 299 (474) T protein:vir:96 239 --PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--------GYEGEDLSEFME---------GLKYYKAINV 299 (474) T ss_pred --CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--------CCCcccccchhh---------hhhccceeec Confidence 579999999999999999999999999999999976432 221111111110 0111112222 Q ss_pred ccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_020883. 374 DENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFL 453 (589) Q Consensus 374 de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aL 453 (589) ++ +..+.+++|++..+....+++.|.++||..+++|..+++.. +++.||+|++++++.+..|+.+++..|.++| T Consensus 300 ~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-----~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l 373 (474) T protein:vir:96 300 SS-DGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-----GSATSGIALKFLYTNLNLKANKLKNKANVAL 373 (474) T ss_pred cC-CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-----ccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 34578999999999999999999999999999997766532 2346999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHh Q lcl|NC_020883. 454 KELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEE 533 (589) Q Consensus 454 k~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~ 533 (589) +++++++..+.. .......+.|.|.+.+|+++.+. |++++ .++++|.||++.++ |..++ +++|++||++ T Consensus 374 ~~~~~~i~~~~g---~~~d~~~i~i~f~~~~p~~~~e~--a~~~~---~~giiS~et~~~~l-p~v~D--~~~E~eri~~ 442 (474) T protein:vir:96 374 QELMQFILDFNK---IKLDAKEIEITFNFNVMVNDLEQ--SQIGA---QSQYLSKETLVRHH-PWVDD--PKAELERLDE 442 (474) T ss_pred HHHHHHHHHHhC---CCcccceeeEEecCCCccCHHHH--HHHHH---HcCCCChHHHHHhC-CCCCC--HHHHHHHHHH Confidence 999988766542 23445566899999999886654 55543 46899999999775 75655 5799999998 Q ss_pred hccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 534 EQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 534 E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) |+...... +... . +...+.+.++.++.+++.| T Consensus 443 E~~~~~~~-~~~~---~----------~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 443 EQLELNKQ-LPNL---D----------DGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHHHHhh-cccc---c----------cccCCCCCCcCCCCccccC Confidence 87543111 0000 0 0112222333444444444 No 37 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=2.3e-41 Score=243.42 Aligned_cols=447 Identities=13% Similarity=0.105 Sum_probs=269.0 Q ss_pred Ccccee----------------------ccchhHHHHhhcch---hhhhhhhhcCCccccCHHHHHHHhhccccceeccC Q lcl|NC_020883. 1 MIDWTV----------------------RGWTDKTTKNVHGD---YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDS 55 (589) Q Consensus 1 ~~~~~~----------------------~~~~~~~~~~~~~~---~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~ 55 (589) ||.-.- .-|-.+.|+..-.. |.+.++.|.|+| ++..|....-.++..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~-~i~~~~~~~~~~~~~~------ 73 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDN-DINYQAYKQDLHGNID------ 73 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccC-ccccccchhhhccccc------ Confidence 222211 12333444333223 444467899998 5666632211111111 Q ss_pred cceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhH Q lcl|NC_020883. 56 SQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEI 135 (589) Q Consensus 56 ~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~ 135 (589) +...+..|+.|+++.|++..+.++ +|+=.+.-. + +. .-+++ T Consensus 74 ----~~~~~~ki~~n~~k~Iv~~~~~yl---~g~p~~~~~----------~------------------~~----~~~~~ 114 (474) T protein:vir:95 74 ----YTKPDWRITTNFHQNLVDQKVSYV---AGKPVTYAH----------D------------------DD----KVLDV 114 (474) T ss_pred ----ccccccccccchHHHHHHhhhhhh---cccCceecc----------C------------------Ch----HHHHH Confidence 111234588999999999999998 554333211 0 00 11356 Q ss_pred HHHHHhhccccccchhhHHHHHHcCceeEEEEEe-cCceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEee Q lcl|NC_020883. 136 IEQITKNSKLERRHWSNIVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRER 213 (589) Q Consensus 136 i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~ 213 (589) ++++.. ++|..........+.+-|..+..+|++ ++.+++.+.+|.+.|| +|... ..+--.+++.|+. T Consensus 115 l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~---------v~d~~~~~~~~a~ir~~~~- 183 (474) T protein:vir:95 115 IHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIP---------IWTDKEREQLNAFIRIFTF- 183 (474) T ss_pred HHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEE---------EEcCCCCCceEEEEEEEee- Confidence 666665 467777788888899999999999998 4459999999999998 33221 1111112222221 Q ss_pred eccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCC Q lcl|NC_020883. 214 VEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETF 293 (589) Q Consensus 214 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~ 293 (589) ..... ..+|-...+.. +....+. ........ ...........+..+..|++++|+ T Consensus 184 --~~~~~--~~vy~~~~i~~---~~~~~~~-~~~~~~~~--------------~~~~~~~~~~~~~~~vPvv~~~nn--- 238 (474) T protein:vir:95 184 --NGETK--VEYWTAETVTY---YVYENGG-LIPDFYYG--------------DEHIQTHFSTGSWERVPFIAFKNN--- 238 (474) T ss_pred --cCeeE--EEEEeCCeEEE---EEEcCCc-eeeccccc--------------cccccCcccccCCCccceEEecCC--- Confidence 11111 11331111111 1111121 11000000 000111222345666667888874 Q ss_pred CCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 294 MNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTF 373 (589) Q Consensus 294 ~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~ 373 (589) +.|.|||+++.+++|++|.++|+.+..++.|+.|.+.+. |...+..++... +.....++.. T Consensus 239 --~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--------g~~~~~~~~~~~---------~~~~~~~i~~ 299 (474) T protein:vir:95 239 --PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--------GYEGEDLSEFME---------GLKYYKAINV 299 (474) T ss_pred --CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--------CCCcccccchhh---------hhhccceeec Confidence 579999999999999999999999999999999976432 221111111110 0111112222 Q ss_pred ccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_020883. 374 DENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFL 453 (589) Q Consensus 374 de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aL 453 (589) ++ +..+.+++|++..+....+++.|.++||..+++|..+++.. +++.||+|++++++.+..|+.+++..|.++| T Consensus 300 ~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-----~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l 373 (474) T protein:vir:95 300 SS-DGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-----GSATSGIALKFLYTNLNLKANKLKNKANVAL 373 (474) T ss_pred cC-CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-----ccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 34578999999999999999999999999999997766532 2346999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHh Q lcl|NC_020883. 454 KELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEE 533 (589) Q Consensus 454 k~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~ 533 (589) +++++++..+.. .......+.|.|.+.+|+++.+. |++++ .++++|.||++.++ |..++ +++|++||++ T Consensus 374 ~~~~~~i~~~~g---~~~d~~~i~i~f~~~~p~~~~e~--a~~~~---~~giiS~et~~~~l-p~v~D--~~~E~eri~~ 442 (474) T protein:vir:95 374 QELMQFILDFNK---IKLDAKEIEITFNFNVMVNDLEQ--SQIGA---QSQYLSKETLVRHH-PWVDD--PKAELERLDE 442 (474) T ss_pred HHHHHHHHHHhC---CCcccceeeEEecCCCccCHHHH--HHHHH---HcCCCChHHHHHhC-CCCCC--HHHHHHHHHH Confidence 999988766542 23445566899999999886654 55543 46899999999775 75655 5799999998 Q ss_pred hccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 534 EQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 534 E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) |+...... +... . +...+.+.++.++.+++.| T Consensus 443 E~~~~~~~-~~~~---~----------~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 443 EQLELNKQ-LPNL---D----------DGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHHHHhh-cccc---c----------cccCCCCCCcCCCCccccC Confidence 87543111 0000 0 0112222333444444444 No 38 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=4.9e-41 Score=241.59 Aligned_cols=448 Identities=15% Similarity=0.124 Sum_probs=268.3 Q ss_pred Cccce---eccchhHHHHhhcchhhhh---hhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchh Q lcl|NC_020883. 1 MIDWT---VRGWTDKTTKNVHGDYERY---RQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKV 74 (589) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~~~~~~~---r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~ 74 (589) .+++- +.-|-.+.|......+.|| ++.|+|+| +++.|.......+..- +...+-.|+.|+++. T Consensus 28 ~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~-~i~~~~~~~~~~~~~~----------~~~~~~ki~~n~~k~ 96 (483) T protein:vir:12 28 RTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRP-DIVKEPKPVDATGAVD----------PLKPDDRMITNFHAN 96 (483) T ss_pred ccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccccccccccccccc----------ccccccccccchHHH Confidence 11111 1113334444434444444 57889998 6776632221111100 111234688999999 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) |++..+.++ +|+=.+.-. + +. ..+++++.+.+| +|.....+-.. T Consensus 97 Ivd~~~~~l---~G~p~~~~~----------~------------------d~----~~~~~l~~~~~n-~~~~~~~~~~~ 140 (483) T protein:vir:12 97 LVDQKVSYI---VGKPIAFKH----------T------------------DD----EVVKRIDEVLGN-RFDDKLHSVLT 140 (483) T ss_pred HHHHHhhhh---cccCceecc----------C------------------Ch----HHHHHHHHHHhc-cHHHHHHHHHH Confidence 999999888 553222111 0 00 113466677665 56667777778 Q ss_pred HHHHcCceeEEEEEecC-ceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhcccccc Q lcl|NC_020883. 155 QHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAK 232 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~ 232 (589) ++.+-|..+..+|++.+ .+++.+.+|.+.|| +|... .++---++++|+.... .. . .+|-...+. T Consensus 141 ~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~~---------v~d~~~~~~~~~~ir~~~~~~~-~~--~--~~y~~~~v~ 206 (483) T protein:vir:12 141 GASNKGIEWLHPYLDEEGEFKLFRVPAEQGIP---------IWTDKEHEELEAFIRMYKLENE-TK--V--EYWDKVTVN 206 (483) T ss_pred HHhhCCeEEEEEEEcCCCceEEEEEcccceEE---------EEcCCCCCceEEEEEEEEeecc-eE--E--EEEecCeEE Confidence 88999999999999854 59999999999998 45332 1111223333332211 11 1 122111111 Q ss_pred chhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHH Q lcl|NC_020883. 233 GDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEI 312 (589) Q Consensus 233 ~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeL 312 (589) .. ....|....-.. ...+. .+......+.....|++++|+ ++|.|||+++.+++|++ T Consensus 207 ~~---~~~~~~~~~~~~------------~~~~~---~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~v~~liDa~ 263 (483) T protein:vir:12 207 YY---VYENGSLIPDYS------------NNLEN---SKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLIDAY 263 (483) T ss_pred EE---EEeCCeeeeccc------------ccccc---cccccccCCCCccceEEecCC-----CCCCCchhhHHHHHHHH Confidence 11 111121110000 00001 111223345666668888884 57999999999999999 Q ss_pred HHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHH Q lcl|NC_020883. 313 NWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGD 392 (589) Q Consensus 313 d~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh 392 (589) |.++|+.++.++.++.|.+.+. |...+..++... +.....+...++ +..+++++|++..+.. T Consensus 264 d~~~S~~~~~~~~~~~~~lv~~--------g~~~~~~~~~~~---------~~~~~~~~~~~~-~~~~~~l~~~~~~~~~ 325 (483) T protein:vir:12 264 NRRLSDLSNTFKDSNELTYVLT--------NYDDQELPEFKR---------LLRYYGAIKVSD-NGGVDTIQVEVPVENS 325 (483) T ss_pred HHHHHHHHHHHHHhcCceeeee--------cCCcccchhHHH---------hhhhccccccCC-CCcceEEeecCCHHHH Confidence 9999999999999999988653 221111111111 001111222223 3457899999999999 Q ss_pred HHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccC Q lcl|NC_020883. 393 MDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIR 472 (589) Q Consensus 393 ~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~ 472 (589) ..+++.|.+.|+..+++|..+++... ++.||+|+++.+..+..|+.+++..|..+|+++++++..+... ... T Consensus 326 ~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~---~~~ 397 (483) T protein:vir:12 326 KKYLDELYQKIMLFGQAVDFSSDKFG-----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGE 397 (483) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccc-----cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCc Confidence 99999999999999999988876422 2459999999999999999999999999999999876655432 223 Q ss_pred cccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc Q lcl|NC_020883. 473 IEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ 552 (589) Q Consensus 473 ~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~ 552 (589) .....|.|++.+|.++++. |++++.+ +|++|.||+++++ |..++ +++|++||++|+..... .+... ... T Consensus 398 ~~~i~v~f~~~~p~~~~~~--a~~~~kl--~GiiS~et~~~~~-~~v~d--~~~E~~ri~~E~~~~~~-~~~~~---~~~ 466 (483) T protein:vir:12 398 HKDVDISFNYNKVANTELQ--VQTAQQS--MGIVSHETVLENH-PFVED--LQAELERIEQEQMEYNK-QLPNL---DDG 466 (483) T ss_pred cceeeEEeCCCCCCCHHHH--HHHHHHH--hccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHh-hcccc---ccc Confidence 4556799999999987765 6677665 4689999999876 65554 67899999888753211 01000 000 Q ss_pred ccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 553 MNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 553 ~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ..|+. +.+ +++..+|.| T Consensus 467 ~~d~~-----~~~-----~~~~~~e~e 483 (483) T protein:vir:12 467 GADGA-----QQQ-----ERSNNKESE 483 (483) T ss_pred ccCCc-----ccC-----CCCCcccCC Confidence 11111 111 111111111 No 39 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=4.9e-41 Score=241.55 Aligned_cols=448 Identities=15% Similarity=0.146 Sum_probs=269.0 Q ss_pred Cccce-----eccchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcc Q lcl|NC_020883. 1 MIDWT-----VRGWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP 72 (589) Q Consensus 1 ~~~~~-----~~~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~ 72 (589) .+-++ +..|-.+.|...- ..|.+.++.|+|+| +++.|.......+..- +.-.+..|++|++ T Consensus 15 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~~~----------~~~~~~ri~~n~~ 83 (472) T protein:vir:93 15 IVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRP-DIVKEPKPVDATGAVD----------PLKPDDRMITNFH 83 (472) T ss_pred eeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-ccccccchhhcccccc----------ccccccccccchH Confidence 11111 1122223332222 34444568899998 4666633222211100 1112346789999 Q ss_pred hhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhh Q lcl|NC_020883. 73 KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSN 152 (589) Q Consensus 73 ~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~ 152 (589) +.|++..+.++ +|+=.+.-. + +.. .+++++.+.+| +|.....+. T Consensus 84 ~~ivd~~~~~l---~g~~~~~~~----------~------------------d~~----~~~~l~~~~~n-~~~~~~~~~ 127 (472) T protein:vir:93 84 ANLVDQKVSYI---VGKPIAFKH----------T------------------DDE----VVKRIDEVLGN-RFDDKLHSV 127 (472) T ss_pred HHHHHHHhhhh---cccCeeecc----------C------------------ChH----HHHHHHHHHhc-cHHHHHHHH Confidence 99999999988 553222111 0 001 13466666655 677777777 Q ss_pred HHHHHHcCceeEEEEEecC-ceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhcccc Q lcl|NC_020883. 153 IVQHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVK 230 (589) Q Consensus 153 l~~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~ 230 (589) ..++.+-|.....+|++.+ .+++.+.+|.+.|| +|... .++---++++|+.... .. + .+|-... T Consensus 128 ~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~~~---------i~d~~~~~~~~~~ir~~~~~~~--~~-~--~~~~~~~ 193 (472) T protein:vir:93 128 LTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIP---------IWTDKEHEELEAFIRMYKLENE--TK-V--EYWDKVT 193 (472) T ss_pred HHHHhhcCeEEEEEEECCCCceEEEEEcccceEE---------EEcCCCCCceEEEEEEEEeecc--ee-E--EEEecCe Confidence 8889999999999999854 59999999999998 44322 1111122333322111 11 1 1221000 Q ss_pred ccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHH Q lcl|NC_020883. 231 AKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQD 310 (589) Q Consensus 231 ~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~D 310 (589) +.. +....+....... ...+ ..+......++.++.|++++|+ ++|+|||+++.+++| T Consensus 194 ~~~---~~~~~~~~~~~~~------------~~~~---~~~~~~~~~~~~~vPvv~~~nn-----~~g~s~~e~v~~liD 250 (472) T protein:vir:93 194 VNY---YVYENGSLIPDYS------------NNLE---NSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLID 250 (472) T ss_pred EEE---EEEecCeeeeccc------------cccc---ccccccccCCCCCcceEEecCC-----CCCCCchhhhHHHHH Confidence 111 0111111111000 0001 1122334567788889999984 479999999999999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHH Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKI 390 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirve 390 (589) ++|.++|+.++.++.++.|.+.+. |...+..++... ......+...+++ ..+.+++|++..+ T Consensus 251 a~~~~~s~~~~~~~~~~~~~~~~~--------g~~~~~~~~~~~---------~~~~~~~~~~~~~-~~~~~l~~~~~~~ 312 (472) T protein:vir:93 251 AYNRRLSDLSNTFKDSNELTYVLT--------NYDDQELPEFKR---------LLRYYGAIKVSDN-GGVDTIQVEVPVE 312 (472) T ss_pred HHHHHHHHHHHHHHHhcCceeEee--------cCCcccchhhHH---------HHhhccccccCCC-CcceeEeecCCHH Confidence 999999999999999999988763 221111111111 0011122322333 3478899999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_020883. 391 GDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSS 470 (589) Q Consensus 391 eh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~ 470 (589) ....+++.|.+.|+..+++|..+++..+ ++.||+|++..+..+..|+.+++..|..+|+++++++..+... . T Consensus 313 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~---~ 384 (472) T protein:vir:93 313 NSKKYLDELYQKIMLFGQAVDFSSDKFG-----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---K 384 (472) T ss_pred HHHHHHHHHHHHHHHHhCCCCCCccccc-----cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---C Confidence 9999999999999999999988886432 2459999999999999999999999999999999887655432 2 Q ss_pred cCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccc Q lcl|NC_020883. 471 IRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTF 550 (589) Q Consensus 471 ~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l 550 (589) .......|.|.+.+|.++++. |+++..+ ++++|.+|+++++ |..++ +++|++||++|+.... ..++.. T Consensus 385 ~~~~~i~v~f~~~~p~~~~~~--~~~~~k~--~giis~et~l~~l-~~~~d--~~~E~~ri~~E~~~~~-~~~~~~---- 452 (472) T protein:vir:93 385 GEHKDVDISFNYNKVANTELQ--VQTAQQS--MGIVSHETVLENH-PFVED--LQAELERIEQEQMEYN-KQLPNL---- 452 (472) T ss_pred cccceeeEEeCCCCCCCHHHH--HHHHHHH--hccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHH-HhccCc---- Confidence 234456799999999887755 6666665 4689999999877 65554 5789999988864321 111111 Q ss_pred ccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 551 EQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 551 ~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) +..+.++.+.+ +.+..++.| T Consensus 453 ----~~~~~d~~~~~-----~~~~~~~~e 472 (472) T protein:vir:93 453 ----DDGGADGAQQQ-----ERSNNKESE 472 (472) T ss_pred ----CcccCCCCCCC-----CCCCcccCC Confidence 11111111111 112222222 No 40 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=3.8e-41 Score=242.20 Aligned_cols=434 Identities=14% Similarity=0.105 Sum_probs=270.4 Q ss_pred Cccceecc-chhHHHHhh---c-c---hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcc Q lcl|NC_020883. 1 MIDWTVRG-WTDKTTKNV---H-G---DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP 72 (589) Q Consensus 1 ~~~~~~~~-~~~~~~~~~---~-~---~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~ 72 (589) |+..+-.. -+.++|..+ | . -|.++++.|+|+|. +..|... .+..-+..|+.|++ T Consensus 8 ~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~-i~~~~~~-----------------~~~~~~~ki~~n~~ 69 (452) T protein:vir:36 8 LMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMA-IDDEPAK-----------------DSWKPDNRLAVNFT 69 (452) T ss_pred eEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccCccc-----------------cccCccceeecchH Confidence 33322111 111222221 2 2 23444588999995 4333110 01112346889999 Q ss_pred hhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhh Q lcl|NC_020883. 73 KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSN 152 (589) Q Consensus 73 ~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~ 152 (589) +.|++..+.++ +|+=.+.-+. + . . -++++..+.+.++|....... T Consensus 70 ~~ivd~~~~~l---~g~~~~~~~~---------d-~------------------~----~~~~l~~~~~~n~~~~~~~~~ 114 (452) T protein:vir:36 70 KYIVDTFTGYF---NGIPVKKSHS---------D-K------------------E----ILTKLQEFDNLNDMEDEESEL 114 (452) T ss_pred HHHHHHHhhhh---cccCceeecC---------C-h------------------h----HHHHHHHHHhhcChhHHHHHH Confidence 99999999888 4433331110 0 0 0 134788889989999999999 Q ss_pred HHHHHHcCceeEEEEEe-cCceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhcccc Q lcl|NC_020883. 153 IVQHQVDGGIVAAPVID-ELGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVK 230 (589) Q Consensus 153 l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~ 230 (589) ..++.+-|.+...+|++ ++.+++.+.++.+.||. |... ..+-..++++|.... ....+ .+|- T Consensus 115 ~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v---------~d~~~~~~~~~~i~~~~~~~--~~~~~--~vyt--- 178 (452) T protein:vir:36 115 AKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMV---------YDDTVKQEPLFAVRYGVDED--KKLQG--EVYT--- 178 (452) T ss_pred HHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEE---------EcCCCCCceEEEEEEEEecC--ceEEE--EEEe--- Confidence 99999999999999998 45699999999999993 3221 000011122221110 01111 1220 Q ss_pred ccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHH Q lcl|NC_020883. 231 AKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQD 310 (589) Q Consensus 231 ~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~D 310 (589) +..+++....+..+.+ ....+.+..++.|++++|++ .|+|||+++.+++| T Consensus 179 -~~~i~~~~~~~~~~~~------------------------~~~~~~~~g~iPvv~~~n~~-----~g~sd~e~v~~liD 228 (452) T protein:vir:36 179 -LLETIKISGENDEISF------------------------GEGTYNPYPDLPVVEFYFNE-----ERMSIFESVISLVN 228 (452) T ss_pred -cCeEEEEEEcCCceEE------------------------ecceeccCCcccEEEecCCC-----CCCcchHHHHHHHH Confidence 0111111111111110 01122345555688888853 69999999999999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccccc-ccccCccceeeecccH Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTF-DENGRSMEIHQIDISK 389 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~-de~g~~~~~iq~Dirv 389 (589) ++|.++|+.+..++.++.|.+.+.-..++ .+..+.... ...+.+... ...+..+.+++|+... T Consensus 229 a~d~~~s~~~~~~~~~~~p~~~~~g~~~~------~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~l~~~~~~ 292 (452) T protein:vir:36 229 AFNKAISEKANDVDYFSDQYLTFLGAAVE------EEDLKNIRS----------NRVINYYADGEGKNVDVKFLEKPDSD 292 (452) T ss_pred HHHHHHHHHHHHHHHhcCceeEeecCCcC------chhhhhhhh----------cceEEecCCCCccCCcceeEeecCCH Confidence 99999999999999999998876422211 111111110 011122221 2234557889999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc Q lcl|NC_020883. 390 IGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS 469 (589) Q Consensus 390 eeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~ 469 (589) +....+++.|.+.|+..+++|..+++.. +..||+|++..+..+..|+..++..|..+|++++++++.+.+..+. T Consensus 293 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~------gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 366 (452) T protein:vir:36 293 SQTENLLDRLTKLIFQTTMVANISDESF------GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSN 366 (452) T ss_pred HHHHHHHHHHHHHHHHHhCccccCcccc------cCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 9999999999999999999997665421 1348999999999999999999999999999999999888877666 Q ss_pred ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020883. 470 SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQT 549 (589) Q Consensus 470 ~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~ 549 (589) ......+.|.|++.+|.+++++ |+++..+ ++++|.||+++++ |.+++ +++|++||++|++.+..... T Consensus 367 ~~~~~~i~i~f~~~~p~d~~~~--a~~~~k~--~g~iS~et~~~~~-~~~~d--~~~E~~ri~~E~~~~~~~~~------ 433 (452) T protein:vir:36 367 KDSWKDIEYTFTRNEPKDIKEQ--AETANIL--MGITSQETALSVI-SVIPD--VQAEMEKIKKEEASTAIFDK------ 433 (452) T ss_pred ccccccceEEeCCCCCcCHHHH--HHHHHHH--hccCChHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHHh------ Confidence 6666677899999999886655 6677665 4689999999766 76654 67999999999865311100 Q ss_pred cccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 550 FEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 550 l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) +. .+.+.|.+ ++....++| T Consensus 434 ------~~----~~~~~~~~-~~~~~~~~e 452 (452) T protein:vir:36 434 ------DK----QPSEKGTD-TVVSETNEE 452 (452) T ss_pred ------hc----cCCCCccc-ccCccccCC Confidence 00 11222211 111111111 No 41 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=8.2e-41 Score=240.36 Aligned_cols=436 Identities=15% Similarity=0.140 Sum_probs=272.7 Q ss_pred Cccceec----------------cchhHHHHhh---cc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCc Q lcl|NC_020883. 1 MIDWTVR----------------GWTDKTTKNV---HG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~~~~~~----------------~~~~~~~~~~---~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~ 56 (589) |-|-+-+ .||.+.++.+ |- -|.++++.|+|+|. +..|... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~-i~~~~~~--------------- 64 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHK-ILTAPEK--------------- 64 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccc-cccCccc--------------- Confidence 3332211 3555655554 32 35666788999985 4444210 Q ss_pred ceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHH Q lcl|NC_020883. 57 QTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEII 136 (589) Q Consensus 57 ~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i 136 (589) +..-+..|++|+++.|++..+.++ +|+=.+... ..+ + ..++.+ T Consensus 65 ---~~~~~~ki~~n~~~~Ivd~~~~~l---~g~p~~~~~--------~~d-------------------~----~~~~~l 107 (470) T protein:vir:99 65 ---ETGADNRIVVNSAKYVVDVYNGYF---CGIEPKLAL--------LND-------------------S----SKIDEI 107 (470) T ss_pred ---ccCCcceeecchHHHHHHHHhhhh---ccCCeeEee--------CCc-------------------h----hHHHHH Confidence 111234689999999999999988 554222100 000 0 012356 Q ss_pred HHHHhhccccccchhhHHHHHHcCceeEEEEEecC-ceeEEEecCceeccc-ccCcc----eeEEEeecCCCcc--ceEE Q lcl|NC_020883. 137 EQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPH-DDEKG----ADLAYYIDHGQYG--QFLH 208 (589) Q Consensus 137 ~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~-~d~~~----~div~~~e~~~~~--~~l~ 208 (589) ..+.+.++|..........+.+-|.....+|++.+ .++|.+.+|.+.||. ++... +.+-|....+... .|+. T Consensus 108 ~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~ 187 (470) T protein:vir:99 108 ARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGV 187 (470) T ss_pred HHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEE Confidence 77888889999999999999999999999999854 599999999999994 22111 1111211111111 1222 Q ss_pred EEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEec Q lcl|NC_020883. 209 IYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWA 288 (589) Q Consensus 209 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvP 288 (589) +|... ..|. ....+- + +..........+..++.|++++ T Consensus 188 ~~~~~----------~~~~----------~~~~~~---------------------~-~~~~~~~~~~~~~g~vPvv~~~ 225 (470) T protein:vir:99 188 IQYAD----------KFYK----------FKGYDI---------------------E-EDTNAAGYAINPYGLVPAVEFF 225 (470) T ss_pred EEecC----------eEEE----------EEeccc---------------------c-cccccccccccCCCccceEeec Confidence 22111 1110 000000 0 0000111223455566688888 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDM 368 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dl 368 (589) |+ .+|+|||+++.+++|++|.++|+.+..++.++.|.+.++-..+. .+++|+.... ++. ...+ T Consensus 226 n~-----~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~------~~~~g~~~~~--~~~----~~~~ 288 (470) T protein:vir:99 226 EN-----EERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLP------EDDEGNPKFD--FKN----NRVL 288 (470) T ss_pred CC-----CCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc------cccccchhhh--hhh----ccee Confidence 74 47999999999999999999999999999999999988643332 2233332211 111 1111 Q ss_pred ccc-ccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHH Q lcl|NC_020883. 369 EIT-TFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQK 447 (589) Q Consensus 369 ev~-~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~ 447 (589) .+. ...+.+..+++++|++..+.+..+++.|.+.|+..+++|..+++.. +++.||+|++.++..+..|+.+++. T Consensus 289 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-----~~n~Sg~Ai~~~~~~l~~k~~~~~~ 363 (470) T protein:vir:99 289 YVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNF-----AGNSSGVALQYKLFAMKNKADSKER 363 (470) T ss_pred eecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-----ccCchHHHHHHHHHHHHHHHHHHHH Confidence 111 1123455678999999999999999999999999999998777642 2345999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhc-CcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHH Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQ-DSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQE 526 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~-~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~e 526 (589) .|..+|++++++++.+.+.. ........+.|.|.+.+|.++++. |++++.++ +++|.||++.++ |..+ +++ T Consensus 364 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~--a~~~~kl~--giis~et~l~~l-~~vd---~~~ 435 (470) T protein:vir:99 364 KFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASA--IDNAKNAE--GIVSKKTQLGMI-PDIE---PDA 435 (470) T ss_pred HHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHH--HHHHHHHh--ccCCHHHHHHhC-CCCC---HHH Confidence 99999999998877665543 334455677899999999886655 66776654 589999999886 6443 458 Q ss_pred HHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchh Q lcl|NC_020883. 527 EIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEEN 578 (589) Q Consensus 527 Ev~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~ 578 (589) |++||++|+......... . ..+.|.+ +..|.++|+ T Consensus 436 E~eri~~E~~~~~~~~~~-~--------------~~~~d~~--~~d~~~ee~ 470 (470) T protein:vir:99 436 EMKQIAKEKADAIKQTQQ-L--------------SMPIDIL--KRDNNAEEE 470 (470) T ss_pred HHHHHHHHHHHHHHHHHh-h--------------cCCCCcC--CCCCCccCC Confidence 999999887532111110 0 0011100 000111111 No 42 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=8.5e-41 Score=240.25 Aligned_cols=452 Identities=13% Similarity=0.088 Sum_probs=272.4 Q ss_pred ccceeccchhHHHHhh---c----chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchh Q lcl|NC_020883. 2 IDWTVRGWTDKTTKNV---H----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKV 74 (589) Q Consensus 2 ~~~~~~~~~~~~~~~~---~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~ 74 (589) +|- -...++|..+ | ..|.+.++.|+|+|.-++.+ +..-.++...-.. .....+...+-+|+.|+.+. T Consensus 1 ~~~---e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~ki~~n~~~~ 74 (471) T protein:vir:10 1 MEI---EVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKR-KPADKKGAENEAK--AEDNAFRNADNRISHNWHQL 74 (471) T ss_pred CCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-chhhhhccccccc--ccccccccccceeccchhHH Confidence 111 0112222222 2 35777889999999555433 2211111111000 11111122345899999999 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) |++..+.++ +|+=.+.-. + + +-.+++++.+.. ++|......... T Consensus 75 Ivd~~~~yl---~G~p~~~~~----------~------------------~----~~~~~~l~~~~~-n~~~~~~~~~~~ 118 (471) T protein:vir:10 75 LLDQKKAYA---LTYPPTFDV----------D------------------D----KKVNDMIVDVLG-DDYERISKQLCV 118 (471) T ss_pred HHHhhhhhh---cccCceecc----------C------------------C----hHHHHHHHHHHh-cCHHHHHHHHHH Confidence 999999888 554333111 0 0 112345666655 467777777788 Q ss_pred HHHHcCceeEEEEEe--cCceeEEEecCceeccc-ccCc----ceeEEEeec----CCCccceEEEEEeeeccccceeeh Q lcl|NC_020883. 155 QHQVDGGIVAAPVID--ELGPRIVFKARDVYFPH-DDEK----GADLAYYID----HGQYGQFLHIYRERVEKDGLRTTN 223 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~--~~~~~i~f~~~d~~~P~-~d~~----~~div~~~e----~~~~~~~l~~~~~~~~~~~~~~~~ 223 (589) .+.+-|..+..+|++ ++.+++.+.+|.+.||. ++.. -+.|-|... .++...|+.+|... ..+ T Consensus 119 ~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~---~~~---- 191 (471) T protein:vir:10 119 NAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDK---ECS---- 191 (471) T ss_pred HHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCC---cEE---- Confidence 889999999999998 35699999999999984 2221 222322221 11222344444221 111 Q ss_pred hhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchh Q lcl|NC_020883. 224 MLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALD 303 (589) Q Consensus 224 ~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~ 303 (589) .|. ..++................... ....+......|..+..|++++|+ ..|.|||+ T Consensus 192 -~y~-----------~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~g~iPvv~~~n~-----~~~~sd~e 249 (471) T protein:vir:10 192 -FYR-----------HEKEKPLEELETFQAISLIDTMN-----GDRSSDNSFKHDFGLVPFIPFKNN-----EIETNDLK 249 (471) T ss_pred -EEE-----------ecCCccccccccccccccccccc-----ccccccccccCCCCceeEEEeccC-----CCCCCchH Confidence 111 01111111111111010000000 011233445567788889999984 47999999 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccccc-ccccCccce Q lcl|NC_020883. 304 NLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTF-DENGRSMEI 382 (589) Q Consensus 304 ~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~-de~g~~~~~ 382 (589) .+.+++|++|.++|+.+..++.+++|.+.+.- ...+..++..... .....+.+... ...+..+++ T Consensus 250 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g--------~~~~~~~~~~~~~------~~~~~i~~~~~~~~~~~~~~~ 315 (471) T protein:vir:10 250 PIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTN--------YGGQDKQEFLEDL------KRYKMIKMDNDGMGDQSGVTT 315 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCceeeeec--------CCccccchhHHHh------hcCCeEEecCCCCccCccceE Confidence 99999999999999999999999999775532 1111111111100 00111111111 234456889 Q ss_pred eeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 383 HQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLW 462 (589) Q Consensus 383 iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~ 462 (589) ++|++..+....+++.|.+.|+..+++|..+++.. | ..||+|++.+++.+..|+..++..|.++|++++++++. T Consensus 316 l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~---g---n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 389 (471) T protein:vir:10 316 IAIDIPTEARNLILERTKKQIFISGQGVNPETDKL---G---NSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILK 389 (471) T ss_pred EeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc---c---CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998766531 2 34999999999999999999999999999999988776 Q ss_pred HHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccc Q lcl|NC_020883. 463 LLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSS 542 (589) Q Consensus 463 L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~ 542 (589) +.+.. +.....|.|.+.+|.++++. +++++.+ ++++|.||++.++ |.+++ +++|++||++|+.... T Consensus 390 ~~~~~----d~~~i~i~f~~~~p~n~~e~--~~~~~kl--~g~iS~et~~~~~-p~v~D--~~~E~eri~~E~~~~~--- 455 (471) T protein:vir:10 390 HLGLS----DKLKIKQTWTRNSINNDTEM--AQVVSTL--ATITSRENVAKSN-PIVED--WQDELRLQKAEQEGRS--- 455 (471) T ss_pred HhccC----CCceeEEEeCCCCCCCHHHH--HHHHHHH--hccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHH--- Confidence 55432 33456799999999987755 6777775 4689999999875 87775 5789999998875420 Q ss_pred ccccccccccccCcccCCCCCCCCCCCCCCCC Q lcl|NC_020883. 543 LMGINQTFEQMNDNRDEDGNIIEEGDTEEEPS 574 (589) Q Consensus 543 ~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~ 574 (589) ..++ +..++.+++|.. T Consensus 456 -----~~~~-----------~~~~~~~~~e~~ 471 (471) T protein:vir:10 456 -----EKLY-----------DMEEVEHESEVE 471 (471) T ss_pred -----hccc-----------ccCCCCCccccC Confidence 0011 112222222222 No 43 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=2e-39 Score=232.70 Aligned_cols=448 Identities=14% Similarity=0.130 Sum_probs=265.4 Q ss_pred Cccceec---cchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchh Q lcl|NC_020883. 1 MIDWTVR---GWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKV 74 (589) Q Consensus 1 ~~~~~~~---~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~ 74 (589) ..+.-.- .+-.+.|...- ..|.+.++.|+|+| +++.|.......+... +.-.+-.|+.|+++. T Consensus 37 ~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~-~i~~~~~~~~~~~~~~----------~~~~~~ri~~n~~k~ 105 (492) T protein:vir:97 37 RTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRP-DIVKEPKPVDATGAVD----------PLKPDDRMITNFHAN 105 (492) T ss_pred cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccC-cccccccccccccccc----------ccccccccccchHHH Confidence 1222111 11222222222 23444458899998 5776632221111110 111334688999999 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) |++..+.++ +|+=.+.-. .+.. .+++++.+.+| +|.....+... T Consensus 106 Ivd~~~~yl---~g~p~~~~~----------------------------~d~~----~~~~l~~~~~n-~~~~~~~~~~~ 149 (492) T protein:vir:97 106 LVDQKVSYI---VGKPIAFKH----------------------------TDDE----VVKRIDEVLGN-RFDDKLHSVLT 149 (492) T ss_pred HHHHHhhhh---cccCceecc----------------------------CchH----HHHHHHHHHhc-cHHHHHHHHHH Confidence 999999888 443222111 0011 13466667655 67777777788 Q ss_pred HHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhcccccc Q lcl|NC_020883. 155 QHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAK 232 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~ 232 (589) ++.+-|.+...+|++. +.+++.+.+|.+.|| +|... .++---++++|+... .-.+ .+|-...+. T Consensus 150 ~~~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~---------i~d~~~~~~~~~~vr~~~~~~---~~~~--~~y~~~~v~ 215 (492) T protein:vir:97 150 GASNKGIEWLHPYLDEEGEFKLFRVPAEQGIP---------IWTDKEHEELEAFIRMYKLEN---ETKV--EYWDKVTVN 215 (492) T ss_pred HHhhcCeEEEEEEecCCCceEEEEEcccceEE---------EEcCCCCCceEEEEEEEeecc---ceeE--EEEecCeEE Confidence 8899999989999984 459999999999998 34322 111111233332211 1011 122111111 Q ss_pred chhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHH Q lcl|NC_020883. 233 GDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEI 312 (589) Q Consensus 233 ~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeL 312 (589) .. ....|....... .. +...+......+.....|++++|+ +.|+|||+++.+++|++ T Consensus 216 ~~---~~~~~~~~~~~~------------~~---~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~v~~liDa~ 272 (492) T protein:vir:97 216 YY---VYENGSLIPDYS------------NN---LENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLIDAY 272 (492) T ss_pred EE---EEecCeeeeccc------------cc---ccccccccccCCCCCcceEEecCC-----CCCCCchHhHHHHHHHH Confidence 11 111121100000 00 001112233456666778888884 47999999999999999 Q ss_pred HHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHH Q lcl|NC_020883. 313 NWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGD 392 (589) Q Consensus 313 d~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh 392 (589) |.++|+.++.++.++.|.+.+. |...+..+++.. ......++..+++ ..+++++|++..+.. T Consensus 273 d~~~S~~~~~~~~~~~~~l~~~--------g~~~~~~~~~~~---------~~~~~~~~~~~~~-~~~~~l~~~~~~~~~ 334 (492) T protein:vir:97 273 NRRLSDLSNTFKDSNELTYVLK--------NYDDQELPEFKR---------LLRYYGAIKVSDN-GGVDTIQVEVPVENS 334 (492) T ss_pred HHHHHHHHHHHHHhccceeeee--------cCCcccchhHHH---------HHhhccceecCCC-CcceeEeccCCHHHH Confidence 9999999999999999977652 221111111110 0011112222333 347889999999999 Q ss_pred HHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccC Q lcl|NC_020883. 393 MDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIR 472 (589) Q Consensus 393 ~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~ 472 (589) ..+++.|.+.|+..+++|..+++... ++.||+|++..++.+..|+.+++..|..+|+++++++..+.... .. T Consensus 335 ~~~~~~L~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~---~~ 406 (492) T protein:vir:97 335 KKYLDELYQKIMLFGQAVDFSSDKFG-----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---GE 406 (492) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccc-----cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---cc Confidence 99999999999999999987776422 24599999999999999999999999999999998766554322 23 Q ss_pred cccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc Q lcl|NC_020883. 473 IEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ 552 (589) Q Consensus 473 ~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~ 552 (589) .....|.|.+.+|.++++. |++++.+ +|++|.||+++++ |..++ +++|++||++|+..... .+... . T Consensus 407 ~~~i~v~f~~~~p~~~~e~--a~~~~kl--~G~iS~et~l~~l-~~v~d--~~~Eleri~~E~~~~~~-~~~~~----~- 473 (492) T protein:vir:97 407 HKDVDISFNYNKVANTELQ--VQTAQQS--MGIVSHETVLENH-PFVED--LQAELERIEQEQTEYNK-QLPNL----D- 473 (492) T ss_pred cceeeEEecCCCCCCHHHH--HHHHHHH--hccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHH-hhhcc----c- Confidence 3456799999999987765 6677665 4689999999876 65554 56899999888753211 11100 0 Q ss_pred ccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 553 MNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 553 ~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) .-+ .+-+.+++.+..++.| T Consensus 474 ---~~~-----~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 474 ---DGG-----ADSAQQQERSNNKESE 492 (492) T ss_pred ---cCC-----CCCCcccccccccccC Confidence 000 1111222222222222 No 44 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=2e-39 Score=232.79 Aligned_cols=453 Identities=13% Similarity=0.072 Sum_probs=269.8 Q ss_pred Cccceeccchh----HHHHhhcchhhhhh---hhhcCCcc--ccCHHHHHHHhhccccceeccCcce-eeecCcceEEEE Q lcl|NC_020883. 1 MIDWTVRGWTD----KTTKNVHGDYERYR---QLYEGKHE--LLFPRAKRLIEEGDAVGRFLDSSQT-ARETQTPYVIFN 70 (589) Q Consensus 1 ~~~~~~~~~~~----~~~~~~~~~~~~~r---~l~~g~~~--~~f~ra~~~~~~~~~~~~~~~~~~~-~~~~~~~y~~~n 70 (589) +-+-.-.+-|. +.|+......+|+. ..|+|.+. .+..| .-+.+...... .++.. .+..-+-.|+.| T Consensus 7 ~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~--~~~~~~~~~~~--~~~~~~~~~~~~~ki~~n 82 (474) T protein:vir:10 7 IDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKR--RPIEEKEDFET--GGNVRRLDVSVNNKLNNS 82 (474) T ss_pred HhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcc--hhhhhhhhhhh--cccccccccCcccccccc Confidence 11111222233 33333344444443 55666432 22222 11111000000 00000 011122368999 Q ss_pred cchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccch Q lcl|NC_020883. 71 LPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHW 150 (589) Q Consensus 71 ~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~ 150 (589) +++.|++..+.++ +|+=.+... +...+.++ .+ +++++.+.++++|+.... T Consensus 83 ~~~~ivd~~~~yl---~g~pv~~~~------~~~~~~~e-----------------~~----~~~l~~~~~~n~~~~~~~ 132 (474) T protein:vir:10 83 FDSEIVDTRVGYL---HGVPVTYDL------DENAEKNE-----------------KL----KKFITNFAIRNSVDDEDS 132 (474) T ss_pred hHHHHHHhHhhhe---eccceeEee------CCCCcchH-----------------HH----HHHHHHHHhhcCHhHHHH Confidence 9999999999988 443222111 00001111 11 457888999999999999 Q ss_pred hhHHHHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeeh-hhhcc Q lcl|NC_020883. 151 SNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTN-MLYPV 228 (589) Q Consensus 151 ~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~-~~y~~ 228 (589) .....+.+-|.+...+|++. +.+++...+|.+.||. |.+ .++.-.++++|....+..+-.+.+ .+|-. T Consensus 133 ~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v---------~d~-~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~ 202 (474) T protein:vir:10 133 EIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFV---------GDN-ILEPTYSLRYFYEKDDDNGTDYVYAEFYDN 202 (474) T ss_pred HHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEE---------EcC-CCceEEEEEEEEEeeCCCceEEEEEEEEcC Confidence 99999999999999999985 4599999999999983 311 111111233333322222211110 11100 Q ss_pred ccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHH Q lcl|NC_020883. 229 VKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESK 308 (589) Q Consensus 229 ~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l 308 (589) ...+.....+ ++...+.....++..++.|++++|+ +.|.|||+++.++ T Consensus 203 ----~~~~~~~~~~-----------------------~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~v~~l 250 (474) T protein:vir:10 203 ----AYYYVFRGEG-----------------------IDALQEVGRYEHLFDYNPLFGVPNN-----KEMIGDAEKVIHL 250 (474) T ss_pred ----ceEEEEeecC-----------------------CCcccccccccCCCCccceEEecCC-----CCCCCchHHHHHH Confidence 0000000000 0111122233466777778888884 5799999999999 Q ss_pred HHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeeccc Q lcl|NC_020883. 309 QDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDIS 388 (589) Q Consensus 309 ~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dir 388 (589) +|++|.++|+.+..++.+++|.+.+. |...+.+. .. .++. . .+....+++..+++++|++. T Consensus 251 iDa~d~~~S~~~~~~~~~~~~~l~i~--------g~~~~~~~--~~--~~~~----~---~~i~~~~~~~~~~~l~~~~~ 311 (474) T protein:vir:10 251 IDAYDLTMSDASSEISQTRLAYLVLR--------GMGMSEEM--IQ--ETQK----S---GAFELFDKDMDVKYLTKDVN 311 (474) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhc--------cCCCCchh--hh--hhhh----c---ceeEecCCCCceeEEeccCC Confidence 99999999999999999999987653 21111111 00 0010 1 11112233456889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD 468 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~ 468 (589) .+.+..+++.|.+.|+..+++|..+++... ++.||+|+++.+..+..|+.+++..|..+|+++++++..+.+..+ T Consensus 312 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:10 312 DTMIENHLDRIEKNIMRFAKSVNFNSDEFN-----GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred HHHHHHHHHHHHHHHHHHhCCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 999999999999999999999988776432 245999999999999999999999999999999988876654432 Q ss_pred c---ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020883. 469 S---SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMG 545 (589) Q Consensus 469 ~---~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~ 545 (589) . ......+.|.|.+.+|.++++. |+++..+. +++|.+|+++++ |..++ +++|++||++|+.... T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~--a~~~~kl~--g~iS~et~~~~l-~~v~d--~~~E~eri~~E~~e~~------ 453 (474) T protein:vir:10 387 YNLDDDSYLNLIFKFTRNIPVNKLEE--SQVLINLK--GQVSERTRLGQS-QLVDD--VDYELDEMEKESLEFN------ 453 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHH--HHHHHHHh--ccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHH------ Confidence 2 2334467899999999886655 67777764 689999999987 64543 7899999988875310 Q ss_pred cccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 546 INQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 546 ~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ++ .++ ...|+..+++...+.| T Consensus 454 ---------~~-~~~---~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 454 ---------DK-LPD---IDEGDANDKSQNNQSE 474 (474) T ss_pred ---------hh-ccc---ccCCCcCCCCccccCC Confidence 00 010 1111222222222222 No 45 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=2e-39 Score=232.79 Aligned_cols=453 Identities=13% Similarity=0.072 Sum_probs=269.8 Q ss_pred Cccceeccchh----HHHHhhcchhhhhh---hhhcCCcc--ccCHHHHHHHhhccccceeccCcce-eeecCcceEEEE Q lcl|NC_020883. 1 MIDWTVRGWTD----KTTKNVHGDYERYR---QLYEGKHE--LLFPRAKRLIEEGDAVGRFLDSSQT-ARETQTPYVIFN 70 (589) Q Consensus 1 ~~~~~~~~~~~----~~~~~~~~~~~~~r---~l~~g~~~--~~f~ra~~~~~~~~~~~~~~~~~~~-~~~~~~~y~~~n 70 (589) +-+-.-.+-|. +.|+......+|+. ..|+|.+. .+..| .-+.+...... .++.. .+..-+-.|+.| T Consensus 7 ~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~--~~~~~~~~~~~--~~~~~~~~~~~~~ki~~n 82 (474) T protein:vir:94 7 IDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKR--RPIEEKEDFET--GGNVRRLDVSVNNKLNNS 82 (474) T ss_pred HhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcc--hhhhhhhhhhh--cccccccccCcccccccc Confidence 11111222233 33333344444443 55666432 22222 11111000000 00000 011122368999 Q ss_pred cchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccch Q lcl|NC_020883. 71 LPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHW 150 (589) Q Consensus 71 ~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~ 150 (589) +++.|++..+.++ +|+=.+... +...+.++ .+ +++++.+.++++|+.... T Consensus 83 ~~~~ivd~~~~yl---~g~pv~~~~------~~~~~~~e-----------------~~----~~~l~~~~~~n~~~~~~~ 132 (474) T protein:vir:94 83 FDSEIVDTRVGYL---HGVPVTYDL------DENAEKNE-----------------KL----KKFITNFAIRNSVDDEDS 132 (474) T ss_pred hHHHHHHhHhhhe---eccceeEee------CCCCcchH-----------------HH----HHHHHHHHhhcCHhHHHH Confidence 9999999999988 443222111 00001111 11 457888999999999999 Q ss_pred hhHHHHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeeh-hhhcc Q lcl|NC_020883. 151 SNIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTN-MLYPV 228 (589) Q Consensus 151 ~~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~-~~y~~ 228 (589) .....+.+-|.+...+|++. +.+++...+|.+.||. |.+ .++.-.++++|....+..+-.+.+ .+|-. T Consensus 133 ~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v---------~d~-~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~ 202 (474) T protein:vir:94 133 EIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFV---------GDN-ILEPTYSLRYFYEKDDDNGTDYVYAEFYDN 202 (474) T ss_pred HHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEE---------EcC-CCceEEEEEEEEEeeCCCceEEEEEEEEcC Confidence 99999999999999999985 4599999999999983 311 111111233333322222211110 11100 Q ss_pred ccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHH Q lcl|NC_020883. 229 VKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESK 308 (589) Q Consensus 229 ~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l 308 (589) ...+.....+ ++...+.....++..++.|++++|+ +.|.|||+++.++ T Consensus 203 ----~~~~~~~~~~-----------------------~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~v~~l 250 (474) T protein:vir:94 203 ----AYYYVFRGEG-----------------------IDALQEVGRYEHLFDYNPLFGVPNN-----KEMIGDAEKVIHL 250 (474) T ss_pred ----ceEEEEeecC-----------------------CCcccccccccCCCCccceEEecCC-----CCCCCchHHHHHH Confidence 0000000000 0111122233466777778888884 5799999999999 Q ss_pred HHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeeccc Q lcl|NC_020883. 309 QDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDIS 388 (589) Q Consensus 309 ~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dir 388 (589) +|++|.++|+.+..++.+++|.+.+. |...+.+. .. .++. . .+....+++..+++++|++. T Consensus 251 iDa~d~~~S~~~~~~~~~~~~~l~i~--------g~~~~~~~--~~--~~~~----~---~~i~~~~~~~~~~~l~~~~~ 311 (474) T protein:vir:94 251 IDAYDLTMSDASSEISQTRLAYLVLR--------GMGMSEEM--IQ--ETQK----S---GAFELFDKDMDVKYLTKDVN 311 (474) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhc--------cCCCCchh--hh--hhhh----c---ceeEecCCCCceeEEeccCC Confidence 99999999999999999999987653 21111111 00 0010 1 11112233456889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD 468 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~ 468 (589) .+.+..+++.|.+.|+..+++|..+++... ++.||+|+++.+..+..|+.+++..|..+|+++++++..+.+..+ T Consensus 312 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:94 312 DTMIENHLDRIEKNIMRFAKSVNFNSDEFN-----GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred HHHHHHHHHHHHHHHHHHhCCccccccccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 999999999999999999999988776432 245999999999999999999999999999999988876654432 Q ss_pred c---ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020883. 469 S---SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMG 545 (589) Q Consensus 469 ~---~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~ 545 (589) . ......+.|.|.+.+|.++++. |+++..+. +++|.+|+++++ |..++ +++|++||++|+.... T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~--a~~~~kl~--g~iS~et~~~~l-~~v~d--~~~E~eri~~E~~e~~------ 453 (474) T protein:vir:94 387 YNLDDDSYLNLIFKFTRNIPVNKLEE--SQVLINLK--GQVSERTRLGQS-QLVDD--VDYELDEMEKESLEFN------ 453 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHH--HHHHHHHh--ccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHH------ Confidence 2 2334467899999999886655 67777764 689999999987 64543 7899999988875310 Q ss_pred cccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 546 INQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 546 ~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ++ .++ ...|+..+++...+.| T Consensus 454 ---------~~-~~~---~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 454 ---------DK-LPD---IDEGDANDKSQNNQSE 474 (474) T ss_pred ---------hh-ccc---ccCCCcCCCCccccCC Confidence 00 010 1111222222222222 No 46 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=1.2e-39 Score=233.91 Aligned_cols=424 Identities=15% Similarity=0.106 Sum_probs=267.5 Q ss_pred CccceeccchhHHHHhhc---chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVH---GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAE 77 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~ 77 (589) |. .-|--+.|+..- ..|.++++.|+|+|.-+..+++ .+...+-+|+.|+++.|++ T Consensus 1 l~----~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~------------------~~~~~~~ki~~n~~~~ivd 58 (429) T protein:vir:98 1 MT----KDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQK------------------EQYKPDNRLVVNFAKYIVD 58 (429) T ss_pred CC----HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc------------------ccCCCcceeecchHHHHHH Confidence 21 112233333322 2344455799999853322211 0112345789999999999 Q ss_pred cchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHH Q lcl|NC_020883. 78 IPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQ 157 (589) Q Consensus 78 ~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~ 157 (589) ..+.++ +|+=.+.-. +. .. -++++..+.+.++|..+......++. T Consensus 59 ~~~~~l---~g~~~~~~~----------~~------------------~~----~~~~l~~~~~~n~~~~~~~~~~~~~~ 103 (429) T protein:vir:98 59 TFNGYF---IGVPVQTSH----------EN------------------KQ----VSNYLELLDGYNDQDDNNAELSKICS 103 (429) T ss_pred HHhhhh---cccCceeec----------CC------------------hH----HHHHHHHHHhhcCHhHHHHHHHHHHh Confidence 999888 553222111 00 01 13467888888899999999999999 Q ss_pred HcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeecCCCccc-eEEEEEeeeccccceeehhhhccccccchh Q lcl|NC_020883. 158 VDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQ-FLHIYRERVEKDGLRTTNMLYPVVKAKGDV 235 (589) Q Consensus 158 v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~-~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~ 235 (589) +-|.+...+|+++ +.+++.+.+|.+.||. |......+-. ++++|+. +.+... ..+|....+. T Consensus 104 ~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v---------~dd~~~~~~~~~i~~~~~---~~~~~~-~~~~~~~~~~--- 167 (429) T protein:vir:98 104 IYGHGYELVFNDENAEAGITYLTPLEAFIV---------YDDSIRQKPLFAVRYFYN---KGGVLE-GSYSDASNIT--- 167 (429) T ss_pred hcCeEEEEEEecCCCcEEEEEEcccceEEE---------EeCCCCCceEEEEEEEEe---cCceEE-EEEEeCceEE--- Confidence 9999999999985 4599999999999983 3221111000 1122211 111100 0111100000 Q ss_pred heeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHH Q lcl|NC_020883. 236 KKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWT 315 (589) Q Consensus 236 ~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t 315 (589) +....+..+. + ......++.+..|++++|+ ++|+|||+++.+++|++|.+ T Consensus 168 -~~~~~~~~~~-------------------~-----~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~v~~liD~~d~~ 217 (429) T protein:vir:98 168 -YFKDGEKGIE-------------------I-----GESEPHPFDGVPMIEYVEN-----EERQSLLASVVTLINAFNKA 217 (429) T ss_pred -EEEecCCceE-------------------e-----cccccccCCccceEEecCC-----CCCCCcHHHHHHHHHHHHHH Confidence 0001110000 0 0122345666678888984 57999999999999999999 Q ss_pred HhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHH Q lcl|NC_020883. 316 ITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDH 395 (589) Q Consensus 316 ~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ 395 (589) +|+.+..++.++.|.+.+. |...+ .+... .++ ..+-+.+...+..+..+++++|++..+.+..+ T Consensus 218 ~s~~~~~~~~~~~p~~~i~--------g~~~~--~~~~~--~~~----~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 281 (429) T protein:vir:98 218 ISEKANDVEYFADAYLKIL--------GAELD--DETLK--SLR----DTRIINLKDTDAQQLTVEFLQKPDADATQEHL 281 (429) T ss_pred HHHHHHHHHHhcCceeeee--------cCCCC--cchhh--hHh----hCceeeccCCCCCCcceeEEeecCCHHHHHHH Confidence 9999999999999988753 11111 11111 010 11112222222334457889999999999999 Q ss_pred HHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCccc Q lcl|NC_020883. 396 VKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEE 475 (589) Q Consensus 396 ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~ 475 (589) ++.|.+.|+..+++|..+++.. +..||+|++..+.++..|+.+++..|..+|++++++++.+.+..+....... T Consensus 282 ~~~l~~~i~~~s~~p~~~~~~~------gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~ 355 (429) T protein:vir:98 282 LDRLENLIFRTAMVANISDESF------GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIG 355 (429) T ss_pred HHHHHHHHHHHhCccccCcccc------ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccc Confidence 9999999999999997665421 2359999999999999999999999999999999998888776555556666 Q ss_pred ceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccC Q lcl|NC_020883. 476 PNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMND 555 (589) Q Consensus 476 p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~ 555 (589) ..|.|.+.+|.++++. |+++..+ ++++|.||++.++ |..++ +++|++||++|+........++ ++ T Consensus 356 i~v~f~~~~p~~~~~~--a~~~~kl--~g~is~et~~~~l-~~v~d--~~~E~~ri~~E~~~~~~~~~~~--------~~ 420 (429) T protein:vir:98 356 IKYKFTRNLPANLLEE--SQIAGNL--AGIVSEETQVGVL-SIVEN--PQKEIERKNSDKSTLISRQAGG--------LN 420 (429) T ss_pred ceEEeCCCCCcCHHHH--HHHHHHH--hccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHHHhh--------hc Confidence 7899999999887655 5666554 5689999999887 66665 4689999999987432111111 11 Q ss_pred cccCCCCCCCCCCCCCCCCcc Q lcl|NC_020883. 556 NRDEDGNIIEEGDTEEEPSAE 576 (589) Q Consensus 556 ~~~~~~~p~deg~~~eep~~~ 576 (589) ..+. +.| ++ T Consensus 421 ~~~~---~~~---------~~ 429 (429) T protein:vir:98 421 GQNT---TTI---------LE 429 (429) T ss_pred CCCC---CCC---------CC Confidence 1111 111 00 No 47 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=1e-39 Score=234.38 Aligned_cols=437 Identities=14% Similarity=0.072 Sum_probs=271.0 Q ss_pred Cccce-eccchhHHHHhh---c----chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcc Q lcl|NC_020883. 1 MIDWT-VRGWTDKTTKNV---H----GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP 72 (589) Q Consensus 1 ~~~~~-~~~~~~~~~~~~---~----~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~ 72 (589) +++.. -..-|.+.|..+ | ..|.++++.|+|+|. +..|-. . .+...+-.|++|++ T Consensus 8 ~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~-i~~~~~-------~----------~~~~~~~ki~~n~~ 69 (453) T protein:vir:73 8 LMTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIME-ISSQKA-------K----------DSWKPDNRLTNNFA 69 (453) T ss_pred eeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccc-hhcCCC-------C----------CccCccceeecchH Confidence 22211 111122222111 1 245556789999985 433310 0 01113456899999 Q ss_pred hhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhh Q lcl|NC_020883. 73 KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSN 152 (589) Q Consensus 73 ~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~ 152 (589) +.|++..+.++ +|+=.+.-+ +. .. -++++..+.+.++|..+.... T Consensus 70 ~~ivd~~~~~l---~g~~~~~~~----------~d------------------~~----~~~~l~~~~~~n~~~~~~~~~ 114 (453) T protein:vir:73 70 KYIVDTFVGYF---NGIPIKKTH----------DD------------------KS----VLEAMQLFDNLNDMEDEESEL 114 (453) T ss_pred HHHHHHhhhhh---cccCceeec----------CC------------------hH----HHHHHHHHHHhcChhHHHHHH Confidence 99999999888 554322111 00 01 145788899999999999999 Q ss_pred HHHHHHcCceeEEEEEecC-ceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccc Q lcl|NC_020883. 153 IVQHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKA 231 (589) Q Consensus 153 l~~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~ 231 (589) ..++.+-|.....+|++.+ .++|.+.+|.+.|| +|..... ..++.+.+.+.+.++- +.-.+|- T Consensus 115 ~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~---------v~dd~~~--~~~~~~i~~~~~~~~~-~~~~vyt---- 178 (453) T protein:vir:73 115 AKIACVYGRAYELMYQNESTESEVIYCSPLNVFM---------VYDDSIK--QKPLFAVYYGFDEEGN-LSGTVYT---- 178 (453) T ss_pred HHHHHhcCeEEEEEEeCCCCceEEEEEcccceEE---------EEeCCCC--ceeEEEEEEEEecCce-EEEEEEe---- Confidence 9999999999999999854 59999999999998 3322211 1111111111111111 1111221 Q ss_pred cchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHH Q lcl|NC_020883. 232 KGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDE 311 (589) Q Consensus 232 ~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~De 311 (589) +..+++....+..+... ...+.+..+..|++++|+ ++|+|+|+++.+++|+ T Consensus 179 ~~~i~~~~~~~~~~~~~------------------------~~~~~~~g~vPvv~~~n~-----~~g~s~~~~v~~liDa 229 (453) T protein:vir:73 179 LLETISITGKAGEVKFG------------------------ESTYNVYSDLPIVEYNFN-----EERQSIFEPVHSLINS 229 (453) T ss_pred CCeEEEEEecCCceEEc------------------------cceeccCCceeEEEecCC-----CCCCcchhhHHHHHHH Confidence 01111111111111100 112234455567888885 4799999999999999 Q ss_pred HHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHH Q lcl|NC_020883. 312 INWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIG 391 (589) Q Consensus 312 Ld~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirvee 391 (589) +|.++|+.+..++.++.|.+.+.-..+. . .+.......+..... ... .........+..+++++|++..+. T Consensus 230 ~~~~~S~~~~~~~~~~~~~l~~~g~~~~------~-~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~d~~~l~~~~~~~~ 300 (453) T protein:vir:73 230 YNKVTSEKANDVEYFSDQYLVFLGAEVD------E-EDAKNIKDNRLINFF-DKN-SNGQGTNAAKVDVKFLDKPDSDVQ 300 (453) T ss_pred HHHHHHHHHHHHHHhccceeeeecCCCC------c-hhhhccccccccccc-ccc-cccccccccCceeEEeeecCCHHH Confidence 9999999999999999998876321111 0 111111111110000 000 001111233445778999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Q lcl|NC_020883. 392 DMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSI 471 (589) Q Consensus 392 h~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~ 471 (589) ...+++.|...|+..|++|..+++.. +..||+|++..+..+..|+.+++..|..+|++++++++.+.+..+... T Consensus 301 ~~~~~~~l~~~I~~~s~~p~~~~~~~------gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 374 (453) T protein:vir:73 301 TENLLNRLERSIFQFTMAANISDENF------GNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKD 374 (453) T ss_pred HHHHHHHHHHHHHHHhCCcccCcccc------cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc Confidence 99999999999999999997666421 134999999999999999999999999999999999988877666666 Q ss_pred CcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccc Q lcl|NC_020883. 472 RIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFE 551 (589) Q Consensus 472 ~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~ 551 (589) ....+.|.|++.+|.+++++ |+++..++ +++|.||++.++ |..++ +++|++||++|+.........+. T Consensus 375 ~~~~i~v~f~~~~p~~~~~~--a~~~~k~~--giis~et~~~~~-~~~~d--~~~E~~ri~~E~~~~~~~~~~~~----- 442 (453) T protein:vir:73 375 AWKDIEYTFTRNEPKDIKEQ--AETANILK--GITSEETALSVI-SVIPD--VQAEMEKIKKKKLLQLSLTRTSN----- 442 (453) T ss_pred ccccceEEeCCCCCCCHHHH--HHHHHHHh--ccCcHHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHHHhcc----- Confidence 66677899999999886654 67777765 589999998765 76665 57899999998775433322211 Q ss_pred cccCcccCCCCCCCC Q lcl|NC_020883. 552 QMNDNRDEDGNIIEE 566 (589) Q Consensus 552 ~~~~~~~~~~~p~de 566 (589) ..+++..-.|. T Consensus 443 ----~~~~~~~~~~~ 453 (453) T protein:vir:73 443 ----LVRMKQMRGNL 453 (453) T ss_pred ----CCcchhhhcCC Confidence 11111111121 No 48 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=1.9e-39 Score=232.91 Aligned_cols=499 Identities=9% Similarity=0.008 Sum_probs=276.1 Q ss_pred CccceeccchhHHHHhhcc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcce-eeecCcceEEEEcchh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQT-ARETQTPYVIFNLPKV 74 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~-~~~~~~~y~~~n~~~~ 74 (589) |.==.+..+--+-+.+.|. .|.+.++.|+|+| ++..|-.....+. +.+. -+...+-.|+.|+.+. T Consensus 8 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h-~Il~r~~~~~~~~--------~~~~~d~~~~nnki~~nf~k~ 78 (537) T protein:vir:78 8 KPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQEN-DIEKSRIFYMNDK--------GQLREDNYASNVKISHGFFTE 78 (537) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-hhhhccccccccc--------ccccccccccccccccchHHH Confidence 2222334444333444442 2334458899999 4544421111100 0001 1122345799999999 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) |++..+.++ +|+=.+.-+.+ ..+++. ++++..++. ++|...+..-.. T Consensus 79 Ivd~~~~yl---~G~Pv~~~~~d-------------------------~~~~e~----~~~l~~~~~-~~~~~~~~el~~ 125 (537) T protein:vir:78 79 LVDQLAQYL---LSNGVEVKVKD-------------------------EDNTQL----DEILQEYFD-EDFQATIDTLVT 125 (537) T ss_pred HHHHHhhhh---cccCceeecCc-------------------------chhHHH----HHHHHHHhh-ccHHHHHHHHHH Confidence 999999998 66533321100 011111 334555554 456666777778 Q ss_pred HHHHcCceeEEEEEecC-ceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccc----cceeeh-hhhcc Q lcl|NC_020883. 155 QHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKD----GLRTTN-MLYPV 228 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~----~~~~~~-~~y~~ 228 (589) ++.+-|..+..+|++.+ .+++...++.+.|| ||.+ .++-..++++|....... +-.+.+ .+|-. T Consensus 126 ~~s~~G~ay~~~y~de~~~~~~~~i~p~~~~p---------v~d~-~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~ 195 (537) T protein:vir:78 126 NASKKGFEGIFARTTSEGKLKFQTVDGLTLIP---------VFDD-YGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNE 195 (537) T ss_pred HHhhcCeeEEEeeecCCCceEEEEEccceeEE---------EEcC-CCCceeEEEEEeeeeccccccCcceEEEEEEEcC Confidence 88899999999999955 49999999999999 3321 222223334333222111 011110 12210 Q ss_pred ccccchhheeeccc---ccccccccccccchhhhhhcccCCc----cccccccccCCCCcceEEEecCCCCCCCcccCcc Q lcl|NC_020883. 229 VKAKGDVKKEIKKG---ELVTNVEGAEDLEGEELIREVLNIP----DDRPLENFYPGRNRPFISYWANNETFMNPYGISA 301 (589) Q Consensus 229 ~~~~~~~~~~~~~g---d~~~~~~e~~d~e~e~~i~~~i~ip----~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD 301 (589) ..+.. +....| ..+....................+. ..........+..+..|+++.|+ ..|.|| T Consensus 196 ~~i~~---y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn-----~~~~sd 267 (537) T protein:vir:78 196 EAVCY---YIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNN-----KDGMSD 267 (537) T ss_pred CcEEE---EEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEeccC-----ccCCCc Confidence 00000 111111 0111111111111111110000000 01112223345666667777774 479999 Q ss_pred hhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccc Q lcl|NC_020883. 302 LDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSME 381 (589) Q Consensus 302 ~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~ 381 (589) |+++.+++|++|.++|+.+..++.|+.|.+.+.-..+ +..++.... ++ ...++..+.++..++ T Consensus 268 ~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~--------~~~~~~~~~--l~-------~~~~i~v~~d~~~v~ 330 (537) T protein:vir:78 268 VKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSG--------DSTDKLRQN--IK-------AKKMIGVNGDNAGME 330 (537) T ss_pred hhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCC--------ccchhHHHH--Hh-------hcCceeecCCCCcee Confidence 9999999999999999999999999999887643221 122221110 00 111222233445678 Q ss_pred eeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 382 IHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCL 461 (589) Q Consensus 382 ~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l 461 (589) +++|++..+....++++|.+.||..+.++.....+ . +..||+|++++++.+..|+..+++.|.++|++++++++ T Consensus 331 ~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~---~---gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~ 404 (537) T protein:vir:78 331 IQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVG---D---GNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVV 404 (537) T ss_pred EEEecCCHHHHHHHHHHHHHHHHHhcCCCCCcccc---c---cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999998877654321 1 23599999999999999999999999999999998887 Q ss_pred HHHhhc-CcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccc Q lcl|NC_020883. 462 WLLNDQ-DSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDT 540 (589) Q Consensus 462 ~L~~~~-~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p 540 (589) .+.+.. ...+....+.|.|.+.+|.+++|. |+++..+.+++++|.+|++.++ |..++.+ +.+++++|...... T Consensus 405 ~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~--a~~~~~l~~~giiS~eT~l~~~-p~vdd~e---~ek~~~ee~~~~~~ 478 (537) T protein:vir:78 405 SDIALRGLGEYDSNDICFEIEPHVLANELDI--ATTRKTEAETEALKIGNIMTVA-PRIGDDE---TLKLIAEELDLDYN 478 (537) T ss_pred HHHhhcCCcccccceeeEEeccCCCCCHHHH--HHHHHHHHhcCcchHHHHHHhC-CCCCCHH---HHHHHHHHHHhhhh Confidence 776543 345666778999999999997766 7788888899999999999875 7667642 22333333221111 Q ss_pred ccccccccccccccCcccCCCCCCCC--CCCCCCCCcchhhhhh-----cccccCC Q lcl|NC_020883. 541 SSLMGINQTFEQMNDNRDEDGNIIEE--GDTEEEPSAEENEEIE-----KEGEPIA 589 (589) Q Consensus 541 ~~~g~~~~~l~~~~~~~~~~~~p~de--g~~~eep~~~~~e~~~-----~~~~~~~ 589 (589) .......+.-.+..+-. ++..+.-+ ++..+.||+++++-.. -+++|-+ T Consensus 479 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 533 (537) T protein:vir:78 479 ELKDALAEQDAQSLDVS-PDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNA 533 (537) T ss_pred hhhhhhhhhcccccCcC-cchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCcc Confidence 11111100000000000 00001101 1111222222222222 2344444 No 49 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=5.3e-39 Score=230.44 Aligned_cols=447 Identities=15% Similarity=0.139 Sum_probs=265.9 Q ss_pred Cc--cceeccchhHHHHhh---c-c---hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEc Q lcl|NC_020883. 1 MI--DWTVRGWTDKTTKNV---H-G---DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNL 71 (589) Q Consensus 1 ~~--~~~~~~~~~~~~~~~---~-~---~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~ 71 (589) ++ +-- .-.+.+.|+.+ | . .|.++++.|+|+| +++.|.......+. .. +...+-.|+.|+ T Consensus 35 ~~~~~~~-~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~-~I~~~~~~~~~~~~---------~~-~~~~~~ri~~n~ 102 (492) T protein:vir:94 35 IVRTNNK-PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRP-DIVKEPKPVDATGA---------VD-PLKPDDRMITNF 102 (492) T ss_pred ccccCCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccccccccccccc---------cc-ccccccccccch Confidence 11 110 01222222222 2 2 3455557899998 67776322221110 00 111233588999 Q ss_pred chhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchh Q lcl|NC_020883. 72 PKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWS 151 (589) Q Consensus 72 ~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~ 151 (589) ++.|++..+.++ +|+=.+.-.. + .. .+++++.+.+| +|...... T Consensus 103 ~k~Ivd~~~~yl---~G~p~~~~~~---------d-------------------~~----~~~~l~~~~~n-~~~~~~~~ 146 (492) T protein:vir:94 103 HANLVDQKVSYI---VGKPIAFKHT---------D-------------------DE----VVKRIDEVLGN-RFDDKLHS 146 (492) T ss_pred HHHHHHHHHhhh---cccCceeccC---------c-------------------hH----HHHHHHHHHhc-cHHHHHHH Confidence 999999998888 5532221110 0 01 13456666654 57777777 Q ss_pred hHHHHHHcCceeEEEEEec-CceeEEEecCceecccccCcceeEEEeecCC-CccceEEEEEeeeccccceeehhhhccc Q lcl|NC_020883. 152 NIVQHQVDGGIVAAPVIDE-LGPRIVFKARDVYFPHDDEKGADLAYYIDHG-QYGQFLHIYRERVEKDGLRTTNMLYPVV 229 (589) Q Consensus 152 ~l~~~~v~Gg~~~~~~~~~-~~~~i~f~~~d~~~P~~d~~~~div~~~e~~-~~~~~l~~~~~~~~~~~~~~~~~~y~~~ 229 (589) ...++.+-|.....+|++. +.+++.+.+|.+.|| +|..... +---++++|+..... . + .+|-.. T Consensus 147 ~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~---------v~d~~~~~~~~a~ir~~~~~~~~--~-~--~~y~~~ 212 (492) T protein:vir:94 147 VLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIP---------IWTDKEHEELEAFIRMYKLENET--K-V--EYWDKV 212 (492) T ss_pred HHHHHhhCCeEEEEEEecCCCceEEEEEcccceEE---------EEcCCCCCceEEEEEEEeeccce--e-E--EEEecC Confidence 7888899999999999984 559999999999998 4432211 111233333322111 0 0 122100 Q ss_pred cccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHH Q lcl|NC_020883. 230 KAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQ 309 (589) Q Consensus 230 ~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~ 309 (589) .+.. +....|........ . ...........+.....|++++|+ .+|.|||+++.+++ T Consensus 213 ~v~~---~~~~~~~~~~~~~~------------~---~~~~~~~~~~~~~g~vPvv~~~nn-----~~~~sd~e~v~~li 269 (492) T protein:vir:94 213 TVNY---YVYENGSLIPDYSN------------N---LENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLI 269 (492) T ss_pred eEEE---EEEecCeeeecccc------------c---cccccccccccCCCccceEEecCC-----CCCCCchHHHHHHH Confidence 0111 11111111100000 0 000111223355666668888885 36999999999999 Q ss_pred HHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccH Q lcl|NC_020883. 310 DEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISK 389 (589) Q Consensus 310 DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirv 389 (589) |++|.++|+.++.++.++.|.+.+. |...+..++... ......+...+++ ..+++++|+... T Consensus 270 Da~d~~~S~~~~~~~~~~~p~lv~~--------g~~~~~~~~~~~---------~~~~~~~~~~~~~-~~~~~l~~~~~~ 331 (492) T protein:vir:94 270 DAYNRRLSDLSNTFKDSNELTYVLK--------NYDDQELPEFKR---------LLRYYGAIKVSDN-GGVDTIQVEVPV 331 (492) T ss_pred HHHHHHHHHHHHHHHHhcCceeeee--------cCCcccchhhHH---------HHhhccceecCCC-CcceeEeccCCH Confidence 9999999999999999999977652 221111111111 0011122222333 457889999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc Q lcl|NC_020883. 390 IGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS 469 (589) Q Consensus 390 eeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~ 469 (589) +....+++.|.+.|+..+++|..+++..+ ++.||+|+++.+..+..|+.+++..|..+|+++++++..+.... T Consensus 332 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-----~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~-- 404 (492) T protein:vir:94 332 ENSKKYLDELYQKIMLFGQAVDFSSDKFG-----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-- 404 (492) T ss_pred HHHHHHHHHHHHHHHHHhCCcCCCccccc-----cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-- Confidence 99999999999999999999987776432 23599999999999999999999999999999998766554322 Q ss_pred ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020883. 470 SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQT 549 (589) Q Consensus 470 ~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~ 549 (589) .......|.|.+.+|.++++. +++++.++ +++|.+|+++++ |..++ +++|++||++|+...... + T Consensus 405 -~~~~~i~v~f~~~~p~~~~e~--~~~~~kl~--giiS~et~~~~l-~~v~d--~~~E~eri~~E~~~~~~~-~------ 469 (492) T protein:vir:94 405 -GEHKDVDISFNYNKVANTELQ--VQTAQQSM--GIVSHETVLENH-PFVED--LQAELERIEQEQMEYNKQ-L------ 469 (492) T ss_pred -cccceeeEEecCCCCCCHHHH--HHHHHHHh--ccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHhh-c------ Confidence 234456799999999887765 66776654 689999999876 65654 678999998886532111 1 Q ss_pred cccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 550 FEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 550 l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) .. +++..+++.+.+ +++...|.| T Consensus 470 -~~-~~~~~~~~~~~~-----~~~~~~e~e 492 (492) T protein:vir:94 470 -PN-LDDGGADSAQQQ-----ERSNNKESE 492 (492) T ss_pred -cc-cccccCCCCccc-----cCCccccCC Confidence 11 112222222222 222222222 No 50 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=1e-38 Score=228.82 Aligned_cols=428 Identities=13% Similarity=0.074 Sum_probs=264.9 Q ss_pred chhHHHHh-------hcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchh Q lcl|NC_020883. 9 WTDKTTKN-------VHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPAT 81 (589) Q Consensus 9 ~~~~~~~~-------~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~ 81 (589) -|-+.|+. -+..|.++++.|+|+| +++.|......+.+.- +..-+-.|+.|+++.|++..+. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~----------~~~~~~ki~~n~~~~Ivd~~~~ 69 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKN-DILKKGVVVQNRDENP----------LRNADNRISHNFHEILVDEKAS 69 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccC-cccccccccccccccc----------ccccccccccchHHHHHHhhhh Confidence 22222222 2456778889999998 5555533322211111 1112347999999999999998 Q ss_pred hhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcCc Q lcl|NC_020883. 82 MVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDGG 161 (589) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg 161 (589) ++ +|+-.+.... +.. ...+++..+.+ ++|.........++.+-|. T Consensus 70 yl---~G~p~~~~~~---------~~~----------------------~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~ 114 (451) T protein:vir:10 70 YM---FTYPVLFDID---------NNK----------------------ELNEKVTDVLG-NEFTRKAKNLAIEASNCGS 114 (451) T ss_pred he---ecccceeecC---------CcH----------------------HHHHHHHHHhc-cCHHHHHHHHHHHHhhcCe Confidence 88 5544432110 000 01234455554 4677677777788889999 Q ss_pred eeEEEEEec---------CceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceee-----hhhh Q lcl|NC_020883. 162 IVAAPVIDE---------LGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTT-----NMLY 226 (589) Q Consensus 162 ~~~~~~~~~---------~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~-----~~~y 226 (589) ....+|+++ +.+++...+|.+.|| +|... .++-.-+++.|.......+.... -.+| T Consensus 115 a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~---------vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~y 185 (451) T protein:vir:10 115 AWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIP---------IYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFW 185 (451) T ss_pred EEEEEeecCCcccccccccceeEEEEcccceEE---------EEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEE Confidence 999999985 457777779999998 33322 11111233333222221111000 0011 Q ss_pred ccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhh Q lcl|NC_020883. 227 PVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLE 306 (589) Q Consensus 227 ~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie 306 (589) - ...+++....+. . ..++..+......|..+..|++++|+. .|.|||+++. T Consensus 186 t----~~~~~~~~~~~~--~------------------~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~~~~d~e~v~ 236 (451) T protein:vir:10 186 T----DKILDKYKFFGV--S------------------CCGSQIEHITVQHRFNSVPFVEFSNNI-----KKQSDLSKYK 236 (451) T ss_pred e----CCeEEEEEeccc--C------------------ccccccccccccCCCCeeeEEEeccCC-----CCCCchhhHH Confidence 0 000000000000 0 001112233445677888899988853 5899999999 Q ss_pred HHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccccc-ccccCccceeee Q lcl|NC_020883. 307 SKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTF-DENGRSMEIHQI 385 (589) Q Consensus 307 ~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~-de~g~~~~~iq~ 385 (589) +++|++|.++|+.+..++.|++|.+.++- ...+.+.+..... + ....+.+... +..+..+++++| T Consensus 237 ~liDa~~~~~S~~~~~~~~~~~~~l~~~g--------~~~~~~~~~~~~~--~----~~~~i~~~~~~~~~~~~~~~l~~ 302 (451) T protein:vir:10 237 KILDLYDRVMSGFANDLEDIQQIIYILEN--------FGGEDTSEFLKEL--K----RYKTIKTETDSEGDSGGLKTMQI 302 (451) T ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeec--------CCcccchhhHHHH--h----hCCeEEecCcCCccCCcceEEee Confidence 99999999999999999999999887642 1111112111110 0 0011111111 223456889999 Q ss_pred cccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020883. 386 DISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLN 465 (589) Q Consensus 386 Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~ 465 (589) ++..+....+++.|.+.|+..+++|..+++. . +..||+|+++.++.+..|+..++..|.++|+++++++..+.. T Consensus 303 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-----~-gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~ 376 (451) T protein:vir:10 303 EIPTEARKIILEILKKQIYESGQGLQQDTEN-----F-GNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG 376 (451) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCcccccccc-----c-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 9999999999999999999999999765542 1 135999999999999999999999999999999988776553 Q ss_pred hcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccc Q lcl|NC_020883. 466 DQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMG 545 (589) Q Consensus 466 ~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~ 545 (589) . .......|.|++.+|.+++|. |+++..++ +++|.||+++++ |..++ +++|++||++|...+...... T Consensus 377 ~----~d~~~i~i~f~~~~p~n~~e~--~~~~~kl~--g~iS~et~~~~~-p~v~d--~~~e~~~~~ee~~~~~~~~~~- 444 (451) T protein:vir:10 377 V----TDYKKIQQTYTRNMMSNDLED--ADIATKSV--GIIPTKIILRHH-PWVDD--VEEAEKLYLEEKKIQASKVSD- 444 (451) T ss_pred C----CCccceeEEecCCCCCCHHHH--HHHHHHHh--ccCchHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHHHh- Confidence 2 234556799999999997765 67777764 589999998876 76665 468889997776543222211 Q ss_pred cccccccccC Q lcl|NC_020883. 546 INQTFEQMND 555 (589) Q Consensus 546 ~~~~l~~~~~ 555 (589) . ++.+.| T Consensus 445 ~---~~~~~~ 451 (451) T protein:vir:10 445 D---YNNFTE 451 (451) T ss_pred h---cCCCCC Confidence 1 221111 No 51 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=1.6e-37 Score=222.26 Aligned_cols=460 Identities=14% Similarity=0.073 Sum_probs=268.5 Q ss_pred Cccc-eeccchhHHHHhhcc---hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhh Q lcl|NC_020883. 1 MIDW-TVRGWTDKTTKNVHG---DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIA 76 (589) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~---~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~ 76 (589) |.+= +..-|-+..++..-. -+.++++.|+|+|.... +.+. .-...++..+++|+++.|+ T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~------------~~~~-----~~~~~~~~~~~~n~~~~iv 70 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEA------------IGVT-----VPIQMQSLLAHVGYPRLYV 70 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchh------------cCCC-----CChhhhhhhhhcCcHHHHH Confidence 2111 111122333333222 24555688999995321 1111 1112234567889999999 Q ss_pred ccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH Q lcl|NC_020883. 77 EIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH 156 (589) Q Consensus 77 ~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~ 156 (589) +..+..+. + .+|... +.. -.++.+.+|.+.++|..+......++ T Consensus 71 d~~~~~l~--~----~g~~~~--------~~~----------------------~~~~~~~~i~~~N~~d~~~~~~~~~a 114 (485) T protein:vir:10 71 DSIAERQA--V----EGFRFG--------DAD----------------------EADEELWQWWQANNLDIEAPLGYTDA 114 (485) T ss_pred HHHHhhhc--c----cceecC--------CCc----------------------hhHHHHHHHHHhcCHhHHHHHHHHHH Confidence 99888771 1 112100 000 00235567778888998888999999 Q ss_pred HHcCceeEEEEEec---------CceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhc Q lcl|NC_020883. 157 QVDGGIVAAPVIDE---------LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYP 227 (589) Q Consensus 157 ~v~Gg~~~~~~~~~---------~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~ 227 (589) ++-|.++..+|.++ +.++|.+.+|...|| +|.....+.-.+++++.. ++...+ ....+|- T Consensus 115 ~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~---------~~D~~~~~~~~~~~~~~~-~~~~~~-~~~~~y~ 183 (485) T protein:vir:10 115 YVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYA---------EIDPRIGRVSKAIRVAYD-AEGNEI-QAATLYT 183 (485) T ss_pred hhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEE---------EEcCCCCceeEEEEEEEe-eCCCeE-EEEEEEe Confidence 99999988888763 347788888888887 332111111112222211 111111 1111221 Q ss_pred cccccchhheeec-ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchh-hh Q lcl|NC_020883. 228 VVKAKGDVKKEIK-KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALD-NL 305 (589) Q Consensus 228 ~~~~~~~~~~~~~-~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~-~i 305 (589) +..+++... .|. +. .....+.+..++.|++++|++....|+|+|+++ .+ T Consensus 184 ----~~~~~~~~~~~~~-~~------------------------~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v 234 (485) T protein:vir:10 184 ----PNDIFGWYRVENE-WQ------------------------EWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPEL 234 (485) T ss_pred ----CCeEEEEEEcCCc-eE------------------------EeccccCCCCcccEEEeccccccCCCCCccchhHHH Confidence 001111111 111 11 111234567778899999999999999999997 58 Q ss_pred hHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeee Q lcl|NC_020883. 306 ESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQI 385 (589) Q Consensus 306 e~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~ 385 (589) .+++|++|.++|+.+.+.+.++.|.+++--.-++. ....++.... .. +...-.++... +....+.|| T Consensus 235 ~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~----~~~~~~~~~~-----~~--~~~~~~i~~~~--~~d~k~~q~ 301 (485) T protein:vir:10 235 RSMTDAAARILMLMQATAELMGVPQRLIFGIKPEE----IGVDPETGQT-----LF--DAYLARILAFE--DAEGKIQQF 301 (485) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCccc----ccccccccch-----hh--hhcccceeccC--CCCceEEee Confidence 99999999999999999999999977652100100 0001111110 00 11111111111 122445566 Q ss_pred cc-cHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 386 DI-SKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLL 464 (589) Q Consensus 386 Di-rveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~ 464 (589) +. .++.|++.++.++.+++.++++|+..||... .+ ..||+|+++.+..+..|+++++..|..+|++++++++.+. T Consensus 302 ~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~-~n---~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~ 377 (485) T protein:vir:10 302 SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA-DN---PASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMM 377 (485) T ss_pred cccchHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 53 4788999999999999999999999998522 11 2489999999999999999999999999999999988877 Q ss_pred hhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccc--hhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccc Q lcl|NC_020883. 465 NDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQ--GQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSS 542 (589) Q Consensus 465 ~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~--~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~ 542 (589) .............|.|.+.+|.+..+. |+++..+++++ .+|.+++++++ | ++++.+ +|++|+.+++.+..... T Consensus 378 ~~~~~~~~~~~i~v~w~~~~~~~~~~~--ada~~kl~~ag~~~~s~et~~~~l-g-~~~~~~-~~~~~~~ee~~~~~~~~ 452 (485) T protein:vir:10 378 KGGDVPPDMLRMETVWRDPSTPTYAAK--ADAASKLYNGGTGVIPRERARKDM-G-YSIAER-EEMRRWDEEEAAMGLGL 452 (485) T ss_pred CCCCCcccceeeeEEecCCCCCCHHHH--HHHHHHHHhccccCCCHHHHHHhC-C-CCHhHH-HHHHHHHHHHHHHHHHH Confidence 654333334456789999999886655 66666666655 88999998765 4 777664 68899988776543333 Q ss_pred ccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccC Q lcl|NC_020883. 543 LMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPI 588 (589) Q Consensus 543 ~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~ 588 (589) +... .+..+. ..|.++++++|.+... +.+++.- T Consensus 453 ~~~~-------~~~~~~---~~~~~~~~~~~~~~~~---~~~~~~~ 485 (485) T protein:vir:10 453 IGTM-------VDPNPT---VPGSPSPAPAPKPAAL---ESGGDAA 485 (485) T ss_pred HHHh-------hccCCC---CCCCCCccccccCcCC---CCCCCCC Confidence 3322 111111 1222222222222111 1122211 No 52 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=3.9e-37 Score=220.19 Aligned_cols=459 Identities=15% Similarity=0.077 Sum_probs=267.2 Q ss_pred ccceeccc---------hhHHHHhhcchhhhhh---hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEE Q lcl|NC_020883. 2 IDWTVRGW---------TDKTTKNVHGDYERYR---QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIF 69 (589) Q Consensus 2 ~~~~~~~~---------~~~~~~~~~~~~~~~r---~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 69 (589) +---+-|+ .++.|+..-..+.||+ +.|+|+|.-. ++++. .-+..++.+++. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~------------~~~~~-----~~~~~~~~~~~~ 63 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPE------------AIGVT-----VPVQMQSLLAHV 63 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchh------------hcCcc-----cchhhhhhhhcc Confidence 22222233 2334444434355555 6699998421 11111 111224567889 Q ss_pred EcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccc Q lcl|NC_020883. 70 NLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRH 149 (589) Q Consensus 70 n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~ 149 (589) |+++.|++..+.++. ...+.. . +.. + .++.+..+.+.++|.... T Consensus 64 n~~~~ivd~~~~~l~--~~g~~~----~--------~~~---------------~-------~~~~l~~i~~~N~~d~~~ 107 (485) T protein:vir:24 64 GYPRLYVDSIAERQA--VEGFRL----G--------DAD---------------E-------ADEELWQWWQANNLDIEA 107 (485) T ss_pred chHHHHHHHHhhhhc--cCceec----C--------CCc---------------h-------hHHHHHHHHHhcChhHHH Confidence 999999999888771 111111 0 000 0 022456777778888888 Q ss_pred hhhHHHHHHcCceeEEEEEecC---------ceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccce Q lcl|NC_020883. 150 WSNIVQHQVDGGIVAAPVIDEL---------GPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLR 220 (589) Q Consensus 150 ~~~l~~~~v~Gg~~~~~~~~~~---------~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~ 220 (589) .....++.+-|.....+|.+++ .++|.+.+|.+.|| +|.....+.-.++.++.. +..+.. T Consensus 108 ~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~---------i~D~~~~~~~~~~~~~~~--~~~~~~ 176 (485) T protein:vir:24 108 PLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYA---------EIDPRIGRPAKAIRVAYD--AEGNEI 176 (485) T ss_pred HHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEE---------EeeCCcCceeEEEEEEEe--ecCCeE Confidence 8899999999999999988753 35788888888887 332221111111222211 111111 Q ss_pred eehhhhccccccchhheeec-ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccC Q lcl|NC_020883. 221 TTNMLYPVVKAKGDVKKEIK-KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGI 299 (589) Q Consensus 221 ~~~~~y~~~~~~~~~~~~~~-~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~ 299 (589) +...+|-. ..+++... +|+ + ......+.+..++.||+++|++...+|+|+ T Consensus 177 ~~~~~y~~----~~~~~~~~~~~~-~------------------------~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~ 227 (485) T protein:vir:24 177 QAATLYTP----NETFGWFRAEGE-W------------------------VEWFSDPHGLGAVPVVPLPNRTRLSDLYGT 227 (485) T ss_pred EEEEEEcC----CcEEEEEecCCc-e------------------------EeecccccCCCcccEEEeccCcccCCcCCc Confidence 11122310 01111111 121 1 011123356677889999999999999999 Q ss_pred cchh-hhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC Q lcl|NC_020883. 300 SALD-NLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR 378 (589) Q Consensus 300 SD~~-~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~ 378 (589) |+++ .+.+++|++|.++|+.+.+.+.++.|.+++--.-++. ....++..... .+...-.++...+ . T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~----~~~~~~~~~~~-------~~~~~~~i~~~~~--~ 294 (485) T protein:vir:24 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEE----IGVDPETGQTL-------FDAYLARILAFED--A 294 (485) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccc----cccccccccch-------hhhcccceeccCC--C Confidence 9997 5999999999999999999999999988652100000 00011111110 0111111111111 1 Q ss_pred ccceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 379 SMEIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELY 457 (589) Q Consensus 379 ~~~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li 457 (589) .+.+.|++ ..++.|++.++.++.++...+++|...||... . ...||+|+++.+..+..|+++++..|..+|++++ T Consensus 295 ~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~-~---n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 370 (485) T protein:vir:24 295 EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA-D---NPASAEAIRAAESRLIKKVERKNAIFGGAWEEAM 370 (485) T ss_pred CceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhcccc-C---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23455655 46788999999999999999999999998422 1 1249999999999999999999999999999999 Q ss_pred HHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhcc--chhhHHHHHHHhCCCCCHHHHHHHHHHHHhhc Q lcl|NC_020883. 458 ESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASK--QGQSLETTVRRMNPDASEDWIQEEIARIEEEQ 535 (589) Q Consensus 458 ~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a--~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~ 535 (589) ++++.+....+.........|.|.+..|.+..+. |.....+.++ +.+|.+|++.++ | |+++.+ +|++|+++++ T Consensus 371 ~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~--ad~~~kl~~~g~~~~s~et~~~~l-~-~~~d~~-~e~~~~~ee~ 445 (485) T protein:vir:24 371 RLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAK--ADAATKLYGNGQGVIPRERARKDM-G-YSIAER-EEMRRWDEEE 445 (485) T ss_pred HHHHHHhcCCCCccccceeeEEecCCCCCCHHHH--HHHHHHHHhcccccCCHHHHHhhC-C-CCHhHH-HHHHHHHHHH Confidence 9988776554444445566789999988776654 5544445544 478999988764 5 776665 5899998877 Q ss_pred cccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccC Q lcl|NC_020883. 536 AGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPI 588 (589) Q Consensus 536 a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~ 588 (589) .+.....+.+.....+. +..++++ .++.++++.+. +++.- T Consensus 446 ~~~~~~~~~~~~~~~~~--~~~~~~~--~e~~~~~~~~~---------~~~~a 485 (485) T protein:vir:24 446 AAMGLGLLGTMVDADPT--VPGSPNP--TPAPKPQPAIE---------GGDSA 485 (485) T ss_pred hhhhhhHHHhhcccCCC--CCCCCCC--CCCCCCccCCC---------CCCCC Confidence 65433333332221110 0111111 11111111111 11111 No 53 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=4.8e-37 Score=219.71 Aligned_cols=466 Identities=12% Similarity=0.031 Sum_probs=266.6 Q ss_pred CccceeccchhHHHHhhcchhhhh---hhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERY---RQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAE 77 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~---r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~ 77 (589) -+| -.-|.++.++..-....|| .+.|+|+|. +... ++. .-+..++..++.|+++.|++ T Consensus 6 ~~d--~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~-i~~~-----------~~~-----~~~~~~~~~~~~n~~~~ivd 66 (488) T protein:vir:23 6 SID--PEKLRDQLLDAFENKQNELKSSKAYYDAERR-PDAI-----------GLA-----VPLDMRKYLAHVGYPRTYVD 66 (488) T ss_pred CCC--HHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-hhhc-----------Ccc-----cchhhhhhhhhcchHHHHHH Confidence 222 1235555554443334444 477999984 4221 110 01112344688999999999 Q ss_pred cchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHH Q lcl|NC_020883. 78 IPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQ 157 (589) Q Consensus 78 ~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~ 157 (589) ..|..+. +-.+..+.+ ...+.+...+ ++. ++.+..+.+.++|..++.....+++ T Consensus 67 ~~a~~l~--~~Gf~~~~~---~~~~~~~~~d-----------------~~~----~~~l~~i~~~N~~~~~~~~~~~~a~ 120 (488) T protein:vir:23 67 AIAERQE--LEGFRIPSA---NGEEPESGGE-----------------NDP----ASELWDWWQANNLDIEATLGHTDAL 120 (488) T ss_pred HHHHhhh--ccceeccCC---cccccccccc-----------------hhH----HHHHHHHHHhcChhHHHHHHHHHHh Confidence 9887551 001111111 1111110000 111 2356778888899999999999999 Q ss_pred HcCceeEEEEEe---------cCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhcc Q lcl|NC_020883. 158 VDGGIVAAPVID---------ELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPV 228 (589) Q Consensus 158 v~Gg~~~~~~~~---------~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~ 228 (589) +-|.....++.+ .+.++|.+.++.+.||. |.......-..+++|.....+.. +.-.+|-. T Consensus 121 i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~---------~d~~~~~~~~~~~~~~~~~~~~~--~~~~~y~~ 189 (488) T protein:vir:23 121 IYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAE---------VDPRTRKVLYAIRAIYGADGNEI--VSATLYLP 189 (488) T ss_pred hcCceEEEEecCCcccccCCCCCcceEEEeccceeEEE---------EecCCCceEEEEEEEEecCCCcE--EEEEEEec Confidence 988887776653 23367888888888873 32111111111222211111111 11113310 Q ss_pred ccccchhheeec-ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchh-hhh Q lcl|NC_020883. 229 VKAKGDVKKEIK-KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALD-NLE 306 (589) Q Consensus 229 ~~~~~~~~~~~~-~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~-~ie 306 (589) ..++.... .|. +. .....+.+..++.|++++|++....++|+|+++ .+. T Consensus 190 ----~~~~~~~~~~~~-~~------------------------~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~ 240 (488) T protein:vir:23 190 ----DTTMTWLRAEGE-WE------------------------APTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELR 240 (488) T ss_pred ----CcEEEEEecCCc-eE------------------------eccccccCCCCcceEEeccccccCCcCCccchhhhHH Confidence 00111111 111 10 011234666778889999999999999999997 589 Q ss_pred HHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec Q lcl|NC_020883. 307 SKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID 386 (589) Q Consensus 307 ~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D 386 (589) +++|++|.++|+.+..++.++.|.+++- |...+... .....+......... .++.. ++|..+.+.||+ T Consensus 241 ~l~Da~~~~~s~~~~~~~~~a~p~~~i~--------G~~~~~~~-~~~~~~~~~~~~~~~--~v~~~-~~g~~~~~~q~~ 308 (488) T protein:vir:23 241 SVTDAAAQILMNMQGTANLMAIPQRLIF--------GAKPEELG-INAETGQRMFDAYMA--RILAF-EGGEGAHAEQFS 308 (488) T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHh--------CCCccccc-ccccccchhhhhhhh--hhccC-CCCCCceeEecC Confidence 9999999999999999999999877651 11111000 000001111110011 12211 344556677776 Q ss_pred -ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020883. 387 -ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLN 465 (589) Q Consensus 387 -irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~ 465 (589) ..++.|...++.++.++...+++|..+||... .+ ..||.|++..+..+..|+.+++..|..+|++++++++.+.. T Consensus 309 ~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~-~n---~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 384 (488) T protein:vir:23 309 AAELRNFVDALDALDRKAASYSGLPPQYLSSSS-DN---PASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVK 384 (488) T ss_pred CCChHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 47788999999999999999999999998522 11 24899999999999999999999999999999999877654 Q ss_pred hcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccc--hhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccc Q lcl|NC_020883. 466 DQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQ--GQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSL 543 (589) Q Consensus 466 ~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~--~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~ 543 (589) ............|.|.+..|.+..+. |.++..+.+++ .+|++|++.++ | |++.. .+|++|+++++.......+ T Consensus 385 ~~~~~~~~~~i~v~f~~~~~~s~~~~--ada~~kl~~~g~~~~s~et~~~~l-~-~~~d~-~~~~~~~~~~~~~~~~~~~ 459 (488) T protein:vir:23 385 GGDIPTEYYRMETVWRDPSTPTYAAK--ADAAAKLFANGAGLIPRERGWVDM-G-YTIVE-REQMRQWLEQDQKQGLGLI 459 (488) T ss_pred CCCcchhhccceEEecCCCCCCHHHH--HHHHHHHHhcccccCCHHHHHHhC-C-CCchH-HHHHHHHHHHHHHHHHHHH Confidence 33222333456789999988776654 55555555544 78999888776 6 44433 3577777555433222222 Q ss_pred cccccccccccCcccCCCCCCCCCCCCCCCCcc Q lcl|NC_020883. 544 MGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAE 576 (589) Q Consensus 544 g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~ 576 (589) .+. +.+ ..+.+..+.....+.+.+||+++ T Consensus 460 ~~~---~~~-~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 460 GSL---YGA-STPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HHH---hcc-CCCcccCCCCCCCCCCCCCCCCC Confidence 221 110 00111111111233455666666 No 54 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=2.7e-36 Score=215.61 Aligned_cols=460 Identities=13% Similarity=0.034 Sum_probs=263.8 Q ss_pred Cc-------cceeccchhHHHHhh---cchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEE Q lcl|NC_020883. 1 MI-------DWTVRGWTDKTTKNV---HGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFN 70 (589) Q Consensus 1 ~~-------~~~~~~~~~~~~~~~---~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n 70 (589) |- .++-.-|..++++.+ .-.+.++++-|+|+|. +... ++. .-+..++..++.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~-i~~~-----------~~~-----~~~~~~~~~~~~n 63 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERR-PDAV-----------GVT-----VPQQMQKLLAHVG 63 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-chhc-----------ccc-----cchhHHhhhhhcC Confidence 10 011111112222221 1123455677999985 3211 110 1122345567899 Q ss_pred cchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccch Q lcl|NC_020883. 71 LPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHW 150 (589) Q Consensus 71 ~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~ 150 (589) +++.|++..+..+. +-.+++ + +.. +. ++.+..+.+.++|+.... T Consensus 64 ~~~~ivd~~~~~l~--~~g~~~--~----------~~~---------------------~~-~~~l~~i~~~N~~d~~~~ 107 (484) T protein:vir:77 64 YPRLYIDAIAARQE--LEGFRL--G----------GAD---------------------KA-DEQLWDWWQANDLDIEST 107 (484) T ss_pred cHHHHHHHHHhhhc--cCceec--C----------Ccc---------------------hh-HHHHHHHHHhcCHhHHHH Confidence 99999999887661 011111 0 000 00 224567788888998889 Q ss_pred hhHHHHHHcCceeEEEEEecCc---------eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeecccccee Q lcl|NC_020883. 151 SNIVQHQVDGGIVAAPVIDELG---------PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRT 221 (589) Q Consensus 151 ~~l~~~~v~Gg~~~~~~~~~~~---------~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~ 221 (589) ....++.+-|.....+|.++++ ++|.+.+|.+.|+ +|.....+--..++++..... +-.+ T Consensus 108 ~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~---------~~D~~~~~~~~a~~~~~~~~~--~~~~ 176 (484) T protein:vir:77 108 LGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYA---------QIDPRTRQVMRAIRAIEDEEG--NEVI 176 (484) T ss_pred HHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEE---------EecCCCCceEEEEEEEEeecC--CcEE Confidence 9999999999999999988665 4688888888776 332211111123333322211 1111 Q ss_pred ehhhhccccccchhheeec-ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCc Q lcl|NC_020883. 222 TNMLYPVVKAKGDVKKEIK-KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGIS 300 (589) Q Consensus 222 ~~~~y~~~~~~~~~~~~~~-~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~S 300 (589) ...+|-. ..+++... .|. +. ..+..+++..++.|++++|+....+|+|+| T Consensus 177 ~~~~y~~----~~~~~~~~~~~~-~~------------------------~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s 227 (484) T protein:vir:77 177 GATLYLP----NNTVIWNREDGQ-WV------------------------QVANVAHNLEMVPVIPIPNRTRLSDLYGTT 227 (484) T ss_pred EEEEEec----CeEEEEEecCCc-eE------------------------eeccccCCCCCcceEEeccccccCccCCcc Confidence 1123310 01111111 111 11 111234667777789889999999999999 Q ss_pred chh-hhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCc Q lcl|NC_020883. 301 ALD-NLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRS 379 (589) Q Consensus 301 D~~-~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~ 379 (589) +++ .+.+++|++|.++|+.+...+.++.|.+++- |...+.. ......+..........+-..+. .. T Consensus 228 ~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~--------G~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~----~~ 294 (484) T protein:vir:77 228 EITPELRSVTDAAARTLMLMQATAELMGVPQRLLF--------GVKGEEL-GVDPETGQTLFDAYLARILAFED----HE 294 (484) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh--------CCCcchh-cccccccchhhhhhhhhhcccCC----CC Confidence 997 5999999999999999999998998887651 1111100 00111111111111111111111 12 Q ss_pred cceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 380 MEIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYE 458 (589) Q Consensus 380 ~~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~ 458 (589) +.+.||+ ..++.|++.++.++.++..++++|+.+||... .+ ..||.|++..+.++..|+++++..|.++|+++++ T Consensus 295 ~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~-~n---~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~ 370 (484) T protein:vir:77 295 SKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSS-EN---PASAEAIRSSESRLVKTVERKNKIFGGAWEQAMR 370 (484) T ss_pred ceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4456666 45788999999999999999999999998422 22 2489999999999999999999999999999999 Q ss_pred HHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccc--hhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcc Q lcl|NC_020883. 459 SCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQ--GQSLETTVRRMNPDASEDWIQEEIARIEEEQA 536 (589) Q Consensus 459 ~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~--~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a 536 (589) +++.+.+............|.|.+..|.+..+. |.....+++++ ++|.+|+++++ | |++..+ +|++|+++|+. T Consensus 371 l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~--ad~~~kl~~~g~gi~s~et~~~~l-~-~~~~~~-~e~~~~~~ee~ 445 (484) T protein:vir:77 371 VAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAK--ADAATKLYNNGQGVIPKERARIDM-G-YSITER-EEMRKWDEEEQ 445 (484) T ss_pred HHHHHhCCCCcccccccceEEecCCCCCCHHHH--HHHHHHHHhccCCCCCHHHHHhcC-C-CChhHH-HHHHHHHHHHH Confidence 887776543322333456789999988776544 55555566554 88999988886 4 776654 47899987765 Q ss_pred ccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 537 GSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 537 ~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) +.....++++...-+ + ..+++..-..++.+|++++ +.. | T Consensus 446 ~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~~~~~~~~-~~~-------~ 484 (484) T protein:vir:77 446 AQGLGLMGTMFGTDP----S--GGGNPDNPETPEPQPNPAE-EAA-------A 484 (484) T ss_pred HHHHHHHhhhccccc----c--CCCCCCCCCcccccCCCcc-ccC-------C Confidence 433333332211111 1 1111110001111111111 111 1 No 55 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=1.1e-35 Score=212.20 Aligned_cols=432 Identities=11% Similarity=0.036 Sum_probs=265.0 Q ss_pred ccceeccchhHHHHhhcchhhhhh---hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhcc Q lcl|NC_020883. 2 IDWTVRGWTDKTTKNVHGDYERYR---QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEI 78 (589) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~r---~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~ 78 (589) ++=+.+-|..+.++..-.-..||+ +-|+|+|...+ . ++ ..-+..++..++.|+++.|++. T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~-~-----------~~-----~~~~~~~~~k~~~n~~~~ivd~ 63 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRD-L-----------GV-----AIPPELQRVQTVVSWPGIAVDA 63 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchh-c-----------Cc-----ccchhhhhhhhhcchHHHHHHH Confidence 666666677777766544455555 77999985311 1 10 0111223456899999999998 Q ss_pred chhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHH Q lcl|NC_020883. 79 PATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQV 158 (589) Q Consensus 79 pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v 158 (589) .+..+. ..+|.. +.+ +-+..+.+.++|.......+.++.+ T Consensus 64 ~~~~l~------~~g~~~-----------------~d~-----------------~~l~~i~~~n~~~~~~~~~~~~~~~ 103 (441) T protein:vir:80 64 LEERLD------WLGWTN-----------------GDG-----------------YGLDGVYAANRLATASCDVHLDALI 103 (441) T ss_pred HHhhhc------cccccC-----------------CCh-----------------HHHHHHHHhcCHHHHHHHHHHHHhh Confidence 888771 111110 000 1134555667788888888889999 Q ss_pred cCceeEEEEEecCc-eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhhe Q lcl|NC_020883. 159 DGGIVAAPVIDELG-PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKK 237 (589) Q Consensus 159 ~Gg~~~~~~~~~~~-~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~ 237 (589) -|-.+..+|.++++ ++|.+.+|.+.|| +|....++...++++|.......... .+|-. ..+++ T Consensus 104 ~G~a~~~v~~d~~g~~~i~~~~p~~~~~---------i~d~~~~~~~~~~~~~~~~~~~~~~~---~vy~~----~~~~~ 167 (441) T protein:vir:80 104 FGLSFVAIIPHGDGTVSVRPQSPKNCTG---------KFSADGSRLDAGLVVQQTCDPEVVEA---ELLLP----DVIVQ 167 (441) T ss_pred cCeeEEEEEeCCCCceEEEEEccceEEE---------EEeCCCCceeEEEEEEEEecCceEEE---EEEec----CeEEE Confidence 99999999998555 9999999999887 44333222222333332221111111 12200 00011 Q ss_pred eecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchh-hhhHHHHHHHHHH Q lcl|NC_020883. 238 EIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALD-NLESKQDEINWTI 316 (589) Q Consensus 238 ~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~-~ie~l~DeLd~t~ 316 (589) ....|.. ..........+..++.|++++|++...+++|+|++. .+.+++|++|.++ T Consensus 168 ~~~~~~~-----------------------~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~ 224 (441) T protein:vir:80 168 VERRGSR-----------------------EWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTL 224 (441) T ss_pred EEEcCCc-----------------------ceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHH Confidence 1111100 001112344566777889999999999999999996 5999999999999 Q ss_pred hHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHHH Q lcl|NC_020883. 317 TRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMDH 395 (589) Q Consensus 317 S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~~ 395 (589) |+.+.+.+.++.|.+++- |..-+......... . ...+-..+.+++|..+.+.+++ ..++.|+.. T Consensus 225 s~~~~~~~~~~~~~~~i~--------G~~~~~~~~~~~~~----~---~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (441) T protein:vir:80 225 LGQSVNRDFYAYPQRWVT--------GVSADEFSQPGWVL----S---MASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQ 289 (441) T ss_pred HHHHHHHHhhcCceeeee--------cCCccccccchhhh----c---ccccccCCCCCCCCcceeEecCccchHHHHHH Confidence 999999999999988762 21111111100000 0 0001112233445556666666 468889999 Q ss_pred HHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-ccCcc Q lcl|NC_020883. 396 VKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS-SIRIE 474 (589) Q Consensus 396 ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~-~~~~e 474 (589) ++.++.+++..+++|+..||... . ...||+|++.++.++..|+.+++..|..+|++++++++.+...... ..... T Consensus 290 l~~~i~~~~~~~~~p~~~~g~~~-~---~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~ 365 (441) T protein:vir:80 290 MRLLAQLTAGEAAVPERYFGFIT-S---NPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFG 365 (441) T ss_pred HHHHHHHHhcccCCCHHHhccCC-C---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccce Confidence 99999999999999999998522 1 2348999999999999999999999999999999887766543322 22234 Q ss_pred cceeeeCCcCCCCCCHHHHHHHHHHHhccch--hhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc Q lcl|NC_020883. 475 EPNIETQDMILKPRAELVAENMAAYAASKQG--QSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ 552 (589) Q Consensus 475 ~p~I~f~D~lPvde~El~~A~t~~~l~~a~~--~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~ 552 (589) ...|.|.+.+|.+..+. |+.+..+.+++. +|+++++..+ | +++. |++||++|+..+ ..+++..++.. T Consensus 366 ~i~~~f~~~~~~~~~e~--ad~~~kl~~~g~~~~s~~~~~~~l-~-~~~~----e~~~~~~e~~e~-~~~~~~~~~~~-- 434 (441) T protein:vir:80 366 DVGLRWRDASTPTRAAT--ADAVTKLVGAGILPADSRTVLEML-G-LDDV----QVEAVMRHRAES-SDPLAVLAGAI-- 434 (441) T ss_pred eeeEEeCCCCCcCHHHH--HHHHHHHHhcCcccccHHHHHHhC-C-CCHH----HHHHHHHHHHHH-HHHHHHHhhhh-- Confidence 56789999999886654 666666667665 4778777655 5 4443 334444433221 12223332222 Q ss_pred ccCcccCCCC Q lcl|NC_020883. 553 MNDNRDEDGN 562 (589) Q Consensus 553 ~~~~~~~~~~ 562 (589) +++++.- T Consensus 435 ---~~~~~~~ 441 (441) T protein:vir:80 435 ---SRQTNEV 441 (441) T ss_pred ---hcccccC Confidence 2222222 No 56 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=6.4e-36 Score=213.52 Aligned_cols=457 Identities=13% Similarity=0.042 Sum_probs=270.6 Q ss_pred Ccc--ceeccchhHHHHhhcchhhhhh---hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhh Q lcl|NC_020883. 1 MID--WTVRGWTDKTTKNVHGDYERYR---QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVI 75 (589) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~r---~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i 75 (589) .++ =+..-|..++++.+-.-..|++ +.|+|+|.-. . .++. .-...++..++.|+++.| T Consensus 7 ~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~-~-----------~~~~-----~~~~~~~~~~v~n~~~~i 69 (486) T protein:vir:42 7 GMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPE-A-----------IGVT-----VPREMQQLLAHVGYPRLY 69 (486) T ss_pred CCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcch-h-----------cccc-----cchhHhhhhhccchHHHH Confidence 000 0111245566666544445544 6699998421 0 1111 011224557889999999 Q ss_pred hccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHH Q lcl|NC_020883. 76 AEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQ 155 (589) Q Consensus 76 ~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~ 155 (589) ++..+..+. ..+|... +. +-.++.+..|.+.++|..+..+...+ T Consensus 70 Vd~~~~~l~------~~g~~~~--------~~----------------------~~~~~~~~~i~~~N~~d~~~~~~~~~ 113 (486) T protein:vir:42 70 VDSVAERQA------VEGFRLG--------DA----------------------DEADEELWQWWQANNLDIEAPLGYTD 113 (486) T ss_pred HHHHHhhhc------ccceecC--------CC----------------------chhHHHHHHHHHhcChhHHHHHHHHH Confidence 998776651 1112100 00 00123467778888899888999999 Q ss_pred HHHcCceeEEEEEec---------CceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhh Q lcl|NC_020883. 156 HQVDGGIVAAPVIDE---------LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLY 226 (589) Q Consensus 156 ~~v~Gg~~~~~~~~~---------~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y 226 (589) +.+-|.....++.++ ..++|.+.+|...|+ +|.....+.-.++++|... ..+..+...+| T Consensus 114 a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~---------i~d~~~~~~~~~~~~~~~~--~~~~~~~~~~y 182 (486) T protein:vir:42 114 AYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHA---------EIDPRINRVSKAIRVAYDK--EGNEIQAATLY 182 (486) T ss_pred HhhcCceEEEEecCCcccccccCCCeeEEEEecccceEE---------EEeCCCCCeEEEEEEEEec--CCCeEEEEEEE Confidence 999998888887653 226778888887776 4532222222345544322 12222222344 Q ss_pred ccccccchhheeec-ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchh-h Q lcl|NC_020883. 227 PVVKAKGDVKKEIK-KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALD-N 304 (589) Q Consensus 227 ~~~~~~~~~~~~~~-~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~-~ 304 (589) - +..++.... .|. + .....++.+..++.|+.++|++....++|+|+++ . T Consensus 183 ~----~~~~~~~~~~~~~-~------------------------~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~ 233 (486) T protein:vir:42 183 T----PMETIGWFRADGE-W------------------------AEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPE 233 (486) T ss_pred c----CCcEEEEEecCCc-E------------------------EeecceecCCCCceEEEeccccccCCCCCcccchhh Confidence 1 111111111 121 1 1112344667778899889999999999999998 5 Q ss_pred hhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccc----cccccccccccccccccccccccccccccccCcc Q lcl|NC_020883. 305 LESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAY----ERDGHSAKEASMMTPRIDHRDMEITTFDENGRSM 380 (589) Q Consensus 305 ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~----d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~ 380 (589) +.+++|++|.++|+.+...+.++.|.+++- |... ..++...+.. ......+-+.+. ..+ T Consensus 234 v~~liDa~~~~~s~~~~~~e~~a~p~~~i~--------G~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~----~~~ 296 (486) T protein:vir:42 234 LRSMTDAAARILMLMQATAELMGVPQRLIF--------GIKPEEIGVDSETGQTLF-----DAYLARILAFED----AEG 296 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcchHHHhh--------cCCccccccccccccchh-----hhhhchhcccCC----CCc Confidence 899999999999999999999999877652 1110 0111111100 000111111111 224 Q ss_pred ceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 381 EIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYES 459 (589) Q Consensus 381 ~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~ 459 (589) .+.||+ ..++.|++.++.++.++...+++|..+||... .+ ..||+|+++.+..+..|+.+++..|..+|++++++ T Consensus 297 ~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~-~n---~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l 372 (486) T protein:vir:42 297 KIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA-DN---PASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRI 372 (486) T ss_pred eEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 466766 46788999999999999999999999998522 11 24999999999999999999999999999999998 Q ss_pred HHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhcc--chhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccc Q lcl|NC_020883. 460 CLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASK--QGQSLETTVRRMNPDASEDWIQEEIARIEEEQAG 537 (589) Q Consensus 460 ~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a--~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~ 537 (589) ++.+..............|.|.+.+|.+..+. |+.+..++++ +++|.+|.+..+ | |++++ .+|++|+++|+.+ T Consensus 373 ~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~--ad~~~kl~~~~~g~~s~et~~~~l-g-~~~d~-~~e~~~~~~e~~~ 447 (486) T protein:vir:42 373 AYRIMKGGDVPPDMLRMETVWRDPSTPTYAAK--ADAATKLYGNGQGVIPRERARIDM-G-YSVKE-REEMRRWDEEEAA 447 (486) T ss_pred HHHHhcCCCccccceeeeEEecCCCCCCHHHH--HHHHHHHHhcccCCCCHHHHHhcC-C-CChhH-HHHHHHHHHHHHH Confidence 87776543322233456789999988776654 6666666654 678999887654 5 77665 4589999888766 Q ss_pred cccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccccc Q lcl|NC_020883. 538 SDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEP 587 (589) Q Consensus 538 ~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~ 587 (589) .....+.+.+..-... ..++.++..+.+++.-+. ..++- T Consensus 448 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~---------~~~~~ 486 (486) T protein:vir:42 448 MGLGLLGTMVDADPTV--PGSPSPTAPPKPQPAIES---------SGGDA 486 (486) T ss_pred HHHHHHHHhhcCCCCC--CCCCCCCCCCCCCcccCC---------CCCCC Confidence 4443333321111100 001111111111111111 11111 No 57 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=1.3e-34 Score=206.31 Aligned_cols=464 Identities=11% Similarity=0.001 Sum_probs=264.1 Q ss_pred cceeccchhHHHHhhcch---hhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccc Q lcl|NC_020883. 3 DWTVRGWTDKTTKNVHGD---YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 3 ~~~~~~~~~~~~~~~~~~---~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~p 79 (589) ==|-+-|-.+.++..=.. +.++++.|+|+|. |-.. .+- .-+...+..++.|+++.|++.+ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~-i~~~-----------~~~-----~~~~~~~~~~~~n~~~~ivd~~ 63 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRR-LKTI-----------GIG-----APPELAYLDVQPGWVATYLRTL 63 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccc-----------ccc-----cchhHhhhhhhcchHHHHHHHH Confidence 124444444444433222 3445578999985 3111 110 0011234468899999999998 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) +..+. +-.+.. + .++ +. ++.+..|...++|+.+....+.++.+- T Consensus 64 ~~~l~--~~g~~~--~---------~d~-------------------~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~ 107 (480) T protein:vir:78 64 SDRLD--IEGFRI--S---------EDS-------------------EG----LEELWNWWQANDLDEESVLGHDDSLTF 107 (480) T ss_pred Hhhhc--cCceec--C---------CCc-------------------hh----HHHHHHHHHhcCHHHHHHHHHHHHhhc Confidence 88771 111111 0 000 00 123455666678888888888888898 Q ss_pred CceeEEEEE------e-cCceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhccccc Q lcl|NC_020883. 160 GGIVAAPVI------D-ELGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVKA 231 (589) Q Consensus 160 Gg~~~~~~~------~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~ 231 (589) |.....+|. | ++.++|.+.++.+.|| +|... ..+--.+|++|..+.+.... ..-.+|-. T Consensus 108 G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~---------~~D~~~~~~~~~~i~~~~~~~~~~~~-~~~~~y~~--- 174 (480) T protein:vir:78 108 GRSYITVSHPDVESGDPAGIPLIRVESPLYMYA---------ELDPRNTRRVTRAVRLYTTRDDVAVP-DRATLYLP--- 174 (480) T ss_pred CceEEEEecCccccCCCCCeeEEEEEcccceEE---------EEcCCCccceEEEEEEEEeecCCCce-EEEEEEeC--- Confidence 887777774 2 4559999999999998 33221 22222345555443322211 11123310 Q ss_pred cchhheee-cccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhh-hhHHH Q lcl|NC_020883. 232 KGDVKKEI-KKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDN-LESKQ 309 (589) Q Consensus 232 ~~~~~~~~-~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~-ie~l~ 309 (589) +.+++.. ..+.. .....+.+.++++..++.|+.++|++...+|+|+|+++. +.+++ T Consensus 175 -~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) T protein:vir:78 175 -DETVPLRRNGGLN---------------------DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) T ss_pred -CeEEEEEecCCCc---------------------cccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHH Confidence 0111111 11110 001122234567888888999999999999999999974 99999 Q ss_pred HHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecc-c Q lcl|NC_020883. 310 DEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDI-S 388 (589) Q Consensus 310 DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Di-r 388 (589) |++|.++|+.+.+++.++.|.+++. |...+....- ..+ ........ .+. ...|..+.+.||+. . T Consensus 233 Da~~~~~s~~~~~~~~~a~p~~~i~--------G~~~~~~~~~--~~~-~~~~~~~~--~~~--~~~~~~~~~~~~~~~~ 297 (480) T protein:vir:78 233 DAASRTLMNLQSASQILGTPLRVIS--------GVTTDELTND--GEN-TTLDIYYG--RIL--TLASEAAKISEFKAAE 297 (480) T ss_pred HHHHHHHHHHHHHHHhhcchhhhhh--------cCCccccccc--ccc-chhhhhhh--hhc--cCCCCCceEEecCccC Confidence 9999999999999999999987762 1111110000 000 00000000 011 12234466788886 7 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD 468 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~ 468 (589) .+.|++.++.++.+++..+++|...||... . ...||.|+++.+..+..|+++++..|..+|++++++++.+.... T Consensus 298 ~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~-~---n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~- 372 (480) T protein:vir:78 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSS-E---NPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE- 372 (480) T ss_pred HHHHHHHHHHHHHHHhcccCCChHHhcccc-C---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC- Confidence 899999999999999999999999998522 1 23489999999999999999999999999999999887765321 Q ss_pred cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhc--cchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020883. 469 SSIRIEEPNIETQDMILKPRAELVAENMAAYAAS--KQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGI 546 (589) Q Consensus 469 ~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~--a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~ 546 (589) .........|.|.+..+.+..+. |..+..+.+ .+++|.+|++..+ | |+++++ +|+++++++++.. ++... T Consensus 373 ~~~~~~~i~v~f~~~~~~s~~~~--ad~~~kl~~~g~~~~s~et~~~~l-g-~~~d~~-~~~~~~~~e~~~~---~~~~~ 444 (480) T protein:vir:78 373 VTEEYTRLETVWRDPSTPTVAAK--ADAVSKLYANGQGPIPKEQARIDL-G-YTATQR-EQMRDWDKQETED---MIDTL 444 (480) T ss_pred ccccceeeeEEecCCCCCCHHHH--HHHHHHHHHhccccCCHHHHHhcC-C-CCHhHH-HHHHHHHHHHHHH---HHHHh Confidence 11223345789999887775543 433333344 3478999988876 5 665543 4555555554432 11111 Q ss_pred ccccccccCcccCCCCCCCCCCCCCCCCcchhhhhh Q lcl|NC_020883. 547 NQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIE 582 (589) Q Consensus 547 ~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~ 582 (589) ...-.+..+....+.-+.+.+++++.|.|--.-++- T Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 445 YSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred hccccccCCCCCCCCCCCCCCccccccCCCCcccCC Confidence 101000000000000000111122222222222111 No 58 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=7.4e-34 Score=202.21 Aligned_cols=467 Identities=12% Similarity=0.082 Sum_probs=258.4 Q ss_pred CccceeccchhHHHHhhcchhhhhh---hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYR---QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAE 77 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r---~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~ 77 (589) -+.=.-+-|....+...-.-+.||+ +.|+|+|.--+ +.+ ..-...+..++++|+++.|++ T Consensus 17 ~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~------------~~~-----~~p~~~~~~~~v~n~~~~iVd 79 (504) T protein:vir:99 17 ELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQ------------IGN-----LIPPEYLRTATVLGWSAKAVD 79 (504) T ss_pred CCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchh------------ccc-----cccHHHHHHhhccCcHHHHHH Confidence 1111122334444544444455666 45999995311 110 000122344689999999999 Q ss_pred cchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHH Q lcl|NC_020883. 78 IPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQ 157 (589) Q Consensus 78 ~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~ 157 (589) ..|+.+. +- +|.. +.+ ... ++-+.++.+.|+|.-+....+.++. T Consensus 80 ~~a~rl~--~~----Gf~~-----------------~d~------~~~-------~~~l~~i~~~N~ld~~~~~~~~~a~ 123 (504) T protein:vir:99 80 TLARRCN--LE----SFVW-----------------PDG------DYG-------SIGGPDVWDENFFATKANNAMVSSL 123 (504) T ss_pred HHHhhhc--cc----eeeC-----------------CCC------Chh-------hHHHHHHHHhcChhhHHHHHHHHHH Confidence 9888661 11 1110 000 000 1235667777888888889999999 Q ss_pred HcCceeEEEEEecCc---eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccch Q lcl|NC_020883. 158 VDGGIVAAPVIDELG---PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGD 234 (589) Q Consensus 158 v~Gg~~~~~~~~~~~---~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~ 234 (589) +-|.....+|-++++ ++|.+.+|.+.|. +|.....+...+++++ ++. .++..+.-.+|- .+. T Consensus 124 iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~---------iyD~~~~~~~~a~~~~-~~d-~~g~~~~~~~y~----~~~ 188 (504) T protein:vir:99 124 IHGPAFLINTEGGAGEPDSLIHVKSAMQATG---------EWNSRRNAMDSLLSIT-SRD-AEGHPTGIALYE----DGV 188 (504) T ss_pred hhCceeEEEecCCCCCceeEEEEeccceeEE---------EEeCCCCceeEEEEEE-Eec-CCCeEEEEEEEc----CCc Confidence 999988888877544 5688888877774 4532111111122222 222 222222222331 111 Q ss_pred hheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchh-hhhHHHHHHH Q lcl|NC_020883. 235 VKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALD-NLESKQDEIN 313 (589) Q Consensus 235 ~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~-~ie~l~DeLd 313 (589) +++....+..... .+......|+ | ||.++|++...+|+|+|++. .+.+++|++| T Consensus 189 ~~~~~~~~~~~~~----------------------~~~~~~~~gv--P-vV~~~n~~~~~~~~G~sei~~~v~~l~Da~~ 243 (504) T protein:vir:99 189 TVTADMDDDGDWH----------------------ADVRTHKLGV--P-VEVLPYKPREDRPLGSSRITRPVMSLQQRAL 243 (504) T ss_pred EEEEEEcCCceee----------------------eccccCCCCc--c-eEEecccccCccccCcccchhhHHHHHHHHH Confidence 1111111110000 0001112354 3 67779999999999999995 8999999999 Q ss_pred HHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccc-----cCccceeeec-c Q lcl|NC_020883. 314 WTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDEN-----GRSMEIHQID-I 387 (589) Q Consensus 314 ~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~-----g~~~~~iq~D-i 387 (589) +++++.....+.++.|..++= ++- .....++|++..........++ -..+.+++ +..+.+.|++ . T Consensus 244 ~~~~~~~~~~e~~a~p~r~i~-G~~---~~~~~~~d~~~~~~~~~~~~~i-----~~~~~~~~~~~~~~~~~~~~q~~~~ 314 (504) T protein:vir:99 244 KGCIRMDGHADVYSFPQLILL-GAD---AKNFRNKDGSMKPAWQIALARV-----FALPDDEDEPDAARARADVKQFPAS 314 (504) T ss_pred HHHHHHHHHHHHhcchhhhhc-cCC---ccccccccccccchhhhhhhhh-----hcCCCccccccccCccceeeecCCC Confidence 999999999999999987651 100 0011123333222111111111 11122222 2235566666 4 Q ss_pred cHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020883. 388 SKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQ 467 (589) Q Consensus 388 rveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~ 467 (589) .++.|.+.++.++.+|..++++|..+||+...++ ..||.|++..+.++.+|+.++++.|..+|++++++++.+.... T Consensus 315 ~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n---~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~ 391 (504) T protein:vir:99 315 SPQPHIEMLEQIAMMFSGETSIPVESLGFSNRAN---PTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGL 391 (504) T ss_pred ChHHHHHHHHHHHHHHHhhhCCCHHHhccccccc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 5888999999999999999999999999743322 2489999999999999999999999999999999988776543 Q ss_pred Cc-ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchh--h-HHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccc Q lcl|NC_020883. 468 DS-SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQ--S-LETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSL 543 (589) Q Consensus 468 ~~-~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~--S-~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~ 543 (589) +. ........|.|.|..|.+..+ .|..+..+.+++.+ + .++.+.++ .++++++++..+....+++.+ .+ T Consensus 392 ~~~~~~~~~~~v~w~d~~~~s~a~--~aDa~~Kl~~ag~~l~~~~~~l~~~l--g~~~~ei~r~~~e~~~~~~~~---~~ 464 (504) T protein:vir:99 392 DRIPPEWKTIDSKFRSPLYLSKAA--QADAGAKMLGAGPEWLKETEVGLELL--GLTPQQAKRALAERRRASSVS---II 464 (504) T ss_pred CccccccccceeEecCCCccCHHH--HHHHHHHHHhhccccccchHHHHhhc--CCCHHHHHHHHHHHHHHhhHH---HH Confidence 32 222345678999988877554 46666667777653 3 45555555 377765544333333333221 11 Q ss_pred cccccccccccCcccCCCCCCCCCCCCCCCCc----chhhhhhccc Q lcl|NC_020883. 544 MGINQTFEQMNDNRDEDGNIIEEGDTEEEPSA----EENEEIEKEG 585 (589) Q Consensus 544 g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~----~~~e~~~~~~ 585 (589) .+..++.+ . +.+.+.++.++..||++ .-...-.++| T Consensus 465 ~~l~~~~~-----~-~~~~~~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 465 EALNRRQQ-----E-AATAGEDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred HHHhcccC-----C-CCCCCCCCCcCCCCCCCCCCCccCCCcccCC Confidence 21111111 0 11111222222222222 1112222222 No 59 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=1.6e-33 Score=200.45 Aligned_cols=459 Identities=10% Similarity=0.001 Sum_probs=263.7 Q ss_pred cceeccchhHHHHhhcch---hhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccc Q lcl|NC_020883. 3 DWTVRGWTDKTTKNVHGD---YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 3 ~~~~~~~~~~~~~~~~~~---~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~p 79 (589) ==|-+-|..+.++..-.. +.++++.|+|+|.-.+ .++. .-...++..+++|+++.|++.. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~------------~~~~-----~~~~~~~~~~~~n~~~~ivd~~ 63 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT------------IGIG-----APPELAYLDVQPGWVATYLRTL 63 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchh------------cccc-----cchhhhhhhhhcchHHHHHHHH Confidence 123444555555543222 3445578999985211 1111 0012234568899999999998 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) +..+. +. +|... .++ +. ++.+..+...++|..+......++.+- T Consensus 64 ~~~l~---~~---g~~~~-------~d~----------------~~-------~~~l~~i~~~N~~~~~~~~~~~~a~~~ 107 (480) T protein:vir:78 64 SDRLD---IE---GFRIS-------EDS----------------EG-------LEELWNWWQANDLDEESVLGHDDSLTF 107 (480) T ss_pred Hhhhc---cC---ceecC-------CCc----------------hh-------HHHHHHHHHhcCHHHHHHHHHHHHhhc Confidence 88771 11 11100 000 00 234566677778888888888898999 Q ss_pred CceeEEEEE------e-cCceeEEEecCceecccccCcceeEEEeec-CCCccceEEEEEeeeccccceeehhhhccccc Q lcl|NC_020883. 160 GGIVAAPVI------D-ELGPRIVFKARDVYFPHDDEKGADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYPVVKA 231 (589) Q Consensus 160 Gg~~~~~~~------~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~ 231 (589) |.+...+|- + ++.++|.+.+|.+.||. |... ..+--.+|++|+.+.+.... +.-.+|-. T Consensus 108 G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i---------~D~~~~~~~~~~i~~~~~~d~~~~~-~~~~~y~~--- 174 (480) T protein:vir:78 108 GRAYITVSHPDVESGDPAGIPLIRVESPLYMYAE---------LDPRNTRRVTRAVRLYTTRDDVAVP-DRATLYLP--- 174 (480) T ss_pred CceEEEeecCccccCCCCCeeEEEEEcccceEEE---------EcCCCccceEEEEEEEEeecCCcce-EEEEEEeC--- Confidence 888877763 3 45599999999999983 3221 11112244554433322221 11122210 Q ss_pred cchhhee-ecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchh-hhhHHH Q lcl|NC_020883. 232 KGDVKKE-IKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALD-NLESKQ 309 (589) Q Consensus 232 ~~~~~~~-~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~-~ie~l~ 309 (589) ..+++. .+.+... + ...+.+..+++..++.|++++|+....+|+|+|+++ .+.+++ T Consensus 175 -~~~~~~~~~~~~~~-------------------~--~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~ 232 (480) T protein:vir:78 175 -DETVPLRRNGGLND-------------------Q--WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) T ss_pred -CeEEEEEecCCCcc-------------------c--ccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHH Confidence 000111 0111100 0 011223456788888899999999999999999997 599999 Q ss_pred HHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecc-c Q lcl|NC_020883. 310 DEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDI-S 388 (589) Q Consensus 310 DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Di-r 388 (589) |++|.++|+.+..++.++.|.+++. |...+...+- ..+ ........ .+. ...|..+.+.||+. . T Consensus 233 Da~~~~~s~~~~~~~~~a~p~~~i~--------G~~~~~~~~~--~~~-~~~~~~~~--~~~--~~~~~~~~~~~~~~~~ 297 (480) T protein:vir:78 233 DAASRTLMNLQSASQILGTPLRVIS--------GVTTDELTND--GEN-TTLDIYYG--RIL--TLASEAAKISEFKAAE 297 (480) T ss_pred HHHHHHHHHHHHHHHhhcchhhhhh--------CCCccccccc--ccc-chhhhhhh--hhc--cCCCCCceEEecCccC Confidence 9999999999999999999987653 1111110000 000 00000010 011 12234466788875 6 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD 468 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~ 468 (589) .+.|.+.++.++.+++..+++|...||... .+ ..||+|+++++..+..|+.+++..|..+|++++++++.+.... T Consensus 298 ~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~-~n---~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~- 372 (480) T protein:vir:78 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN---PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE- 372 (480) T ss_pred HHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC- Confidence 788999999999999999999999998521 11 2489999999999999999999999999999998887765321 Q ss_pred cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhcc--chhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccc Q lcl|NC_020883. 469 SSIRIEEPNIETQDMILKPRAELVAENMAAYAASK--QGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGI 546 (589) Q Consensus 469 ~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a--~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~ 546 (589) .........|.|.+..|.+..+. |..+..+.++ +++|.+|++.++ | |+++++++ +++++.+++... +.+. T Consensus 373 ~~~~~~~i~v~w~~~~~~s~~~~--ad~~~kl~~~g~~~~s~et~~~~l-g-~~~d~~~e-~~~~~~~~~~~~---~~~~ 444 (480) T protein:vir:78 373 VTEEYTRLETVWRDPSTPTVAAK--ADAVSKLYANGQGPIPKEQARIDL-G-YTATQREQ-MRDWDKQETEDM---IDTL 444 (480) T ss_pred ccccceeeeEEecCCCCCCHHHH--HHHHHHHHHhcccCCCHHHHHhcC-C-CCHhHHHH-HHHHHHHHHHHH---HHHh Confidence 12233456799999888776654 4434334433 467998877766 5 77765544 445544443321 1222 Q ss_pred ccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 547 NQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 547 ~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) +.+. +.+..+...++.++.+ ++++.+....- T Consensus 445 ~~~~-------~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 475 (480) T protein:vir:78 445 YSTT-------KAQADATPKPTVTETK-----TETQTSPSGFN 475 (480) T ss_pred hccc-------cCCCccccCCCCCCCC-----CccCCCcccCC Confidence 1111 1111111111111111 01111111111 No 60 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=3.3e-34 Score=204.17 Aligned_cols=452 Identities=11% Similarity=0.010 Sum_probs=259.7 Q ss_pred CccceeccchhHHHHh--------hc-ch---hhhhhhhhcCCccccCHHHHHHHhhccccceeccCccee-eecCcceE Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKN--------VH-GD---YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTA-RETQTPYV 67 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~--------~~-~~---~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~-~~~~~~y~ 67 (589) ||+|-.-.+|.+-++. .| .. +.++++.|+|+|.-++ +.. .+..+ .+.-.-++ T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-~~~--------------~~~~~~~~~~~~~~ 65 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPD-LAT--------------RHKNKEREVLQQLS 65 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccc-ccc--------------ccCChhHHHHHHHh Confidence 9999988888775432 23 23 4455578999985322 100 00000 00012235 Q ss_pred EEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccc Q lcl|NC_020883. 68 IFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER 147 (589) Q Consensus 68 ~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~ 147 (589) ++|+++.|++..+..+. +..++. + +.. . ++.+..+.+.++|.. T Consensus 66 ~~n~~~~iVd~~~~~l~--~~gf~~--~----------d~~-------------------~----~~~~~~i~~~N~~d~ 108 (479) T protein:vir:99 66 RKPWMGLMVNSFAQQLI--VDGYRK--T----------GTN-------------------E----NAKGWDTWRLNQMDK 108 (479) T ss_pred hcCcHHHHHHHHHhhcc--cccccC--C----------Cch-------------------h----hHHHHHHHHhcChhH Confidence 68999999998887661 111110 0 000 0 112345666677887 Q ss_pred cchhhHHHHHHcCceeEEEEE-----e-cCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeecccccee Q lcl|NC_020883. 148 RHWSNIVQHQVDGGIVAAPVI-----D-ELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRT 221 (589) Q Consensus 148 ~~~~~l~~~~v~Gg~~~~~~~-----~-~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~ 221 (589) .+.....++.+-|.....+|. | ++.++|.+.+|-+.|| +|..... ....++..+.+ ..+... T Consensus 109 ~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~---------iydd~~~-~~~~~~~~~~~--~~~~~~ 176 (479) T protein:vir:99 109 QQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPRDAFA---------IWEDPYW-DEWPKYLLERQ--PNGQYW 176 (479) T ss_pred HHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechhheEE---------EecCCcc-cceeeEEEeec--CceeEE Confidence 777778888888876666663 3 3448888888888777 4432211 11111221111 111110 Q ss_pred ehhhhccccccchhheee-cccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCc Q lcl|NC_020883. 222 TNMLYPVVKAKGDVKKEI-KKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGIS 300 (589) Q Consensus 222 ~~~~y~~~~~~~~~~~~~-~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~S 300 (589) +|-. ....... ..|. +. .....+.+..++.|++++|++.. +++|+| T Consensus 177 ---~~~~----~~~~~~~~~~~~-~~------------------------~~~~~~h~~g~vPvv~f~n~~~~-~~~g~s 223 (479) T protein:vir:99 177 ---WWTE----EDYSIFEFKQGK-FI------------------------YRETVSHDYGHIPFVRYVNVMDL-RGVCYG 223 (479) T ss_pred ---EEec----ceEEEEEecCCc-ee------------------------eccccccCCCCcceEEeecCCCc-CcCCcc Confidence 1100 0000001 1111 11 00122345677779999999877 457999 Q ss_pred chhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCcc Q lcl|NC_020883. 301 ALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSM 380 (589) Q Consensus 301 D~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~ 380 (589) ||+.+.+++|++|.++|+.+..++.++.|.+++.-..+. .+.++...+ ... ... .+.... +... T Consensus 224 d~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~------~~~~~~~~~---~~~--~~~---~i~~~~--~~~~ 287 (479) T protein:vir:99 224 DVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLP------EGANADQEK---MRF--AQE---SMLISQ--NEKA 287 (479) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcc------cccccchhc---ccc--ccc---cceeec--CCCc Confidence 999999999999999999999999999998766421111 011111000 000 001 111111 1224 Q ss_pred ceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 381 EIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYES 459 (589) Q Consensus 381 ~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~ 459 (589) .+.|++ ..++.|.+.++.++.+|...+++|...||+.. ..||+|++.++.++..|+++++..|..+|++++++ T Consensus 288 ~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~------n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l 361 (479) T protein:vir:99 288 SFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQIV------NVAADALAAGTRQTMQKLFEKQATWKASHNQTMRL 361 (479) T ss_pred eEEEecccchHHHHHHHHHHHHHHhccCCCCHHHccccc------chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 566775 56899999999999999999999999998521 25899999999999999999999999999999988 Q ss_pred HHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccc Q lcl|NC_020883. 460 CLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSD 539 (589) Q Consensus 460 ~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~ 539 (589) ++.+..... ........+.|.+..+.+..+ .|..+..+.+++.+|.+|++.++ |.+++++++ ++.++++++.+. T Consensus 362 ~~~~~~~~~-~~~~~~i~~~w~~~~~~s~~~--~ad~~~kl~~ag~is~et~l~~l-~gv~~~~~e-~~~~~~~~~~~~- 435 (479) T protein:vir:99 362 VNKIEGRTE-EATDLDFTITWQDVTIQSLAQ--FADAWAKMVESLKIPAEGVWDMI-PNLDQSTVN-GWKEIYDREGDF- 435 (479) T ss_pred HHHHcCCCc-cccceeeeEEecCCCCCCHHH--HHHHHHHHHhcCCCCHHHHHHhc-CCCCHHHHH-HHHHHHHHHHHH- Confidence 776653211 122234568999987766554 46677777788899999999777 778876543 333333333221 Q ss_pred cccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 540 TSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 540 p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) +.....+.+-.++.++. ...++..++.+..+++++|-+ T Consensus 436 ----~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~ 473 (479) T protein:vir:99 436 ----GKYMRKLQNGPDPAEQR--------GGPNGATNMQQANNKTGEPAS 473 (479) T ss_pred ----HHHHHHHhcccCccccc--------CCCCCCCCCCCCCCCCcchhc Confidence 11111111101111111 111122222233445555555 No 61 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=1.5e-33 Score=200.59 Aligned_cols=475 Identities=13% Similarity=0.086 Sum_probs=258.5 Q ss_pred Cc------c--ceeccchhHHHHhhcchhh--------hhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCc Q lcl|NC_020883. 1 MI------D--WTVRGWTDKTTKNVHGDYE--------RYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQT 64 (589) Q Consensus 1 ~~------~--~~~~~~~~~~~~~~~~~~~--------~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (589) |- + =-+||=.+- +-|+-.+++ -|-.+|-+.|.++= .+++-|+.-++ - T Consensus 1 ~~~~~~~~~~~~~~~~g~~~-~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~----~~lrg~~~~~~------------r 63 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEAN-FPNAVTDFDKARLASYRLYEDMYLTNTSDYQ----VILRGGDEGDQ------------R 63 (527) T ss_pred CCccccccCCCcCcCCcccc-CcccCCHHHHHHHHHHHHHHHHhcCchhhee----eecCCcccccc------------c Confidence 00 0 000000000 000001111 11133334433321 11111111111 1 Q ss_pred ceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcc Q lcl|NC_020883. 65 PYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSK 144 (589) Q Consensus 65 ~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~ 144 (589) + |..+-. +-.+|+...+...+.. ....++ ...+ ++.+...+.-.+ T Consensus 64 ~-~~~ps~-----------~~~~~~~~~~~~~g~~-----~~~~~~--------------~e~v----~~~lr~~~~~e~ 108 (527) T protein:vir:10 64 P-IYVPNG-----------EKLIEAKMRFLGQGLK-----WEFSKK--------------DAKV----DDAIKVLFDREN 108 (527) T ss_pred e-eeehhh-----------HHhhCCcceeeccCcc-----ccccch--------------hHHH----HHHHHHHHHHhh Confidence 1 111111 2224444443331111 000100 0112 334455666688 Q ss_pred ccccchhhHHHHHHcCceeEEEEEecCc-----eeEEEecCceecccccCcc------eeEEEeecCCCccceEEEEEee Q lcl|NC_020883. 145 LERRHWSNIVQHQVDGGIVAAPVIDELG-----PRIVFKARDVYFPHDDEKG------ADLAYYIDHGQYGQFLHIYRER 213 (589) Q Consensus 145 ~~~~~~~~l~~~~v~Gg~~~~~~~~~~~-----~~i~f~~~d~~~P~~d~~~------~div~~~e~~~~~~~l~~~~~~ 213 (589) +.++|++.-..+.|.|--|++..||..+ +++.-++|-+|||..|+.+ +++| |-|++- T Consensus 109 l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~------------~~~~~P 176 (527) T protein:vir:10 109 WEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLV------------DEYPHP 176 (527) T ss_pred hHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEe------------eeccCC Confidence 9999999999999999999999999544 8888889999999766653 4444 224333 Q ss_pred ecccccee----ehhhhcccccc-----chhhe---eecccccccccccccccchhhhhhcccCCccccccccccCCCCc Q lcl|NC_020883. 214 VEKDGLRT----TNMLYPVVKAK-----GDVKK---EIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNR 281 (589) Q Consensus 214 ~~~~~~~~----~~~~y~~~~~~-----~~~~~---~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~ 281 (589) .+++.... .-..|.+++-. |.+.+ .-..|+-....+.+-+ .+-+... .+..+.+...-++.. T Consensus 177 ~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~---~~~~~~~---~~~~~l~~lp~pi~f 250 (527) T protein:vir:10 177 DSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLE---PDDIKKL---STLTEEEPLPEQITT 250 (527) T ss_pred ccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccc---hhhhhhh---cCceeeecccCCCCc Confidence 33332211 00112111000 00000 0001111100000000 0001111 122334445556777 Q ss_pred ceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccc Q lcl|NC_020883. 282 PFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTP 361 (589) Q Consensus 282 plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~ 361 (589) +.|||+||.+...+.||+|+++++..++++||.++|..++++.-.|.|.+.. +. + ... +.+|+.. T Consensus 251 iPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~-tg-~---~~v--d~~G~~~-------- 315 (527) T protein:vir:10 251 LPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYAT-DS-A---PPR--DSRGNMV-------- 315 (527) T ss_pred cceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeee-cc-c---ccc--cccCCcC-------- Confidence 8899999999999999999999999999999999999999999889997654 11 1 111 2344433 Q ss_pred ccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHH Q lcl|NC_020883. 362 RIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILK 441 (589) Q Consensus 362 ~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~K 441 (589) ++...+..++..++++. +..+.---.+..+..+++.|.+.|+.++++|..|||..+.+ .+.||+|++.+|.++++| T Consensus 316 ~~~VgPG~iweL~e~ak-~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s---~~~SG~ALeL~L~PLlar 391 (527) T protein:vir:10 316 PWTISPLGMVEHGQNNK-IYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAA---VAESGIALDLKLSAILSS 391 (527) T ss_pred ccccCCceeEecCCCcc-eeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCC---cCcHHHHHHHHHHHHHHH Confidence 24455556676666644 33333333788899999999999999999999999965433 357999999999999999 Q ss_pred HHHHHHHHHHHHHHHH--HHHHHHHhhcC---cccC-cccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh Q lcl|NC_020883. 442 SRRLQKEYIDFLKELY--ESCLWLLNDQD---SSIR-IEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM 515 (589) Q Consensus 442 v~~~R~~~~~aLk~li--~~~l~L~~~~~---~~~~-~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L 515 (589) +.++|-.+.-.+++.. ++..||.+..+ .... .-...|.|.+.+|+|++.. .+-+.++.+++++|++|||++| T Consensus 392 ~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av--ie~v~tL~~aGi~S~~tAv~~L 469 (527) T protein:vir:10 392 CAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKR--FNQLLQLWEAGLIPAKKLTEEL 469 (527) T ss_pred HHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHH--HHHHHHHHHcCchhHHHHHHHH Confidence 9999865555554432 22345555322 1111 1245799999999997765 4455578899999999999999 Q ss_pred C-CCCCHHHHHHHHHHHHhhcccc------ccccccccccccccccCcccCCCCCCCCCCCCCC Q lcl|NC_020883. 516 N-PDASEDWIQEEIARIEEEQAGS------DTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEE 572 (589) Q Consensus 516 h-pdw~dE~v~eEv~RI~~E~a~~------~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ee 572 (589) - --| .+..++|++||.++.+++ ...++|+-...+.+|-+.. +-|-|++.+= T Consensus 470 ~~~~g-~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~-----~d~~~~~~~~ 527 (527) T protein:vir:10 470 SKIMG-FELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEE-----DDQALNGQPL 527 (527) T ss_pred HhccC-CCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCC-----cccccCCCCC Confidence 1 112 334456777887775543 2333333322333331111 1111111111 No 62 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=1.6e-33 Score=200.37 Aligned_cols=475 Identities=13% Similarity=0.083 Sum_probs=258.2 Q ss_pred Cc------c--ceeccchhHHHHhhcchhh--------hhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCc Q lcl|NC_020883. 1 MI------D--WTVRGWTDKTTKNVHGDYE--------RYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQT 64 (589) Q Consensus 1 ~~------~--~~~~~~~~~~~~~~~~~~~--------~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (589) |- + =-+||=.+- +-|+-.+++ -|-.+|-+.|.++= .+++-|+.-++ - T Consensus 1 ~~~~~~~~~~~~~~~~g~~~-~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~----~~lrg~~~~~~------------r 63 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEAN-FPNAVTDFDKARLASYRLYEDMYLTNTSDYQ----VILRGGDEGDQ------------R 63 (527) T ss_pred CCccccccCCCcCcCCcccc-CcccCCHHHHHHHHHHHHHHHHhcCchhhee----eecCCcccccc------------c Confidence 00 0 000000000 000001111 11133334333321 11111111111 1 Q ss_pred ceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcc Q lcl|NC_020883. 65 PYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSK 144 (589) Q Consensus 65 ~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~ 144 (589) + |..+-. +-.+|+...+...+.. ....++ ...+ ++.+...+.-.+ T Consensus 64 ~-~~~ps~-----------~~~~~~~~~~~~~g~~-----~~~~~~--------------~e~v----~~~lr~~~~~e~ 108 (527) T protein:vir:10 64 P-IYVPNG-----------EKLIEAKMRFLGQGLK-----WEFSKK--------------DAKV----DDAIRVLFDREN 108 (527) T ss_pred e-eeehhh-----------HHhhCCcceeeccCcc-----ccccch--------------hHHH----HHHHHHHHHHhh Confidence 1 111111 2224444443331111 000100 0112 334455666688 Q ss_pred ccccchhhHHHHHHcCceeEEEEEecCc-----eeEEEecCceecccccCcc------eeEEEeecCCCccceEEEEEee Q lcl|NC_020883. 145 LERRHWSNIVQHQVDGGIVAAPVIDELG-----PRIVFKARDVYFPHDDEKG------ADLAYYIDHGQYGQFLHIYRER 213 (589) Q Consensus 145 ~~~~~~~~l~~~~v~Gg~~~~~~~~~~~-----~~i~f~~~d~~~P~~d~~~------~div~~~e~~~~~~~l~~~~~~ 213 (589) +.++|++.-..+.|.|--+++..||..+ +++.-++|-+|||..|+.+ +++| |-|++- T Consensus 109 l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~------------~~~~~P 176 (527) T protein:vir:10 109 WEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLV------------DEYPHP 176 (527) T ss_pred hHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEe------------eeccCC Confidence 9999999999999999999999999544 8888889999999766653 4444 224333 Q ss_pred ecccccee----ehhhhcccccc-----chhhe---eecccccccccccccccchhhhhhcccCCccccccccccCCCCc Q lcl|NC_020883. 214 VEKDGLRT----TNMLYPVVKAK-----GDVKK---EIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNR 281 (589) Q Consensus 214 ~~~~~~~~----~~~~y~~~~~~-----~~~~~---~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~ 281 (589) .+++.... .-..|.+++-. |.+.+ .-..|+-....+.+-+ .+-+... .+..+.+...-++.. T Consensus 177 ~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~---~~~~~~~---~~~~~l~~lp~pi~f 250 (527) T protein:vir:10 177 DSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLE---PDDIKKL---STLTEEEPLPEQITT 250 (527) T ss_pred ccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccc---hhhhhhh---cCceeeecccCCCCc Confidence 33332211 00112111000 00000 0001111100000000 0001111 122334445556777 Q ss_pred ceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccc Q lcl|NC_020883. 282 PFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTP 361 (589) Q Consensus 282 plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~ 361 (589) +.|||+||.+...+.||+|+++++..++++||.++|..++++.-.|.|.+.. +. + ... +.+|+.. T Consensus 251 iPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~-tg-~---~~v--d~~G~~~-------- 315 (527) T protein:vir:10 251 LPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYAT-DS-A---PPR--DSRGNMV-------- 315 (527) T ss_pred cceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeee-cc-c---ccc--cccCCcC-------- Confidence 8899999999999999999999999999999999999999999889997654 11 1 111 2344433 Q ss_pred ccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHH Q lcl|NC_020883. 362 RIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILK 441 (589) Q Consensus 362 ~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~K 441 (589) ++...+..++..++++. +..+.---.+..+..+++.|.+.|+.++++|..|||..+.+ .+.||+|++.+|.++++| T Consensus 316 ~~~VgPG~iweL~e~ak-~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s---~~~SG~ALeL~L~PLlar 391 (527) T protein:vir:10 316 PWTISPLGMVEHGQNNK-IYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAA---VAESGIALDLKLSAILSS 391 (527) T ss_pred ccccCCceeEecCCCcc-eeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCC---cCcHHHHHHHHHHHHHHH Confidence 24455556676666644 33333333788899999999999999999999999965433 357999999999999999 Q ss_pred HHHHHHHHHHHHHHHH--HHHHHHHhhcC---cccC-cccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh Q lcl|NC_020883. 442 SRRLQKEYIDFLKELY--ESCLWLLNDQD---SSIR-IEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM 515 (589) Q Consensus 442 v~~~R~~~~~aLk~li--~~~l~L~~~~~---~~~~-~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L 515 (589) +.++|-.+.-.+++.. ++..||.+..+ .... .-...|.|.+.+|+|++.. .+-+.++.+++++|++|||++| T Consensus 392 ~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av--ie~v~tL~~aGiiS~etAv~~L 469 (527) T protein:vir:10 392 CAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKR--FAQLLELWEAGLIPAKKLTEEL 469 (527) T ss_pred HHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHH--HHHHHHHHHcCchhHHHHHHHH Confidence 9999865555554432 22345555322 1111 1245799999999997765 4455568899999999999999 Q ss_pred C-CCCCHHHHHHHHHHHHhhcccc------ccccccccccccccccCcccCCCCCCCCCCCCCC Q lcl|NC_020883. 516 N-PDASEDWIQEEIARIEEEQAGS------DTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEE 572 (589) Q Consensus 516 h-pdw~dE~v~eEv~RI~~E~a~~------~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ee 572 (589) - --| .+..++|++||.++.+++ ...++|+-...+.+|-+.. +-|-|++.+= T Consensus 470 ~~~~g-~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~-----~d~~~~~~~~ 527 (527) T protein:vir:10 470 SKIMG-FELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEE-----DDQALNGQPL 527 (527) T ss_pred HhccC-CCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCC-----cccccCCCCC Confidence 1 112 334456777777775543 2333333322333331111 1111111111 No 63 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=5.5e-33 Score=197.45 Aligned_cols=443 Identities=12% Similarity=0.051 Sum_probs=260.8 Q ss_pred CccceeccchhHHHHhhcchhhhhh---hhhcCCccccC-HHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYR---QLYEGKHELLF-PRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIA 76 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r---~l~~g~~~~~f-~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~ 76 (589) |--=|---|..+.++..-....||+ +.|+|+|.-.+ ++. +-. .. -..+-.++.|+++.|+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~--~~~------------~~--~~~~~~~~~n~~~~iv 64 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRN--TSA------------AW--RSFQREARTNWGLMVR 64 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcc--cCh------------hh--chhhhhhhcchHHHHH Confidence 4444444455566666444445555 56999996322 110 000 00 0123357789999999 Q ss_pred ccchhhhcccccc-ccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHH Q lcl|NC_020883. 77 EIPATMVSGSIGQ-IKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQ 155 (589) Q Consensus 77 ~~pa~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~ 155 (589) +.++..+ +|+ ++.+.. .+.. . ++.+.++.+.|+|.........+ T Consensus 65 d~~~~~l---~~~g~~~~~~---------~d~~-------------------~----~~~~~~~~~~n~~d~~~~~~~~~ 109 (456) T protein:vir:79 65 DSVADRI---IPNGITVGGS---------ADSD-------------------L----ALRARRIWRDNRMDSVCKQWVKY 109 (456) T ss_pred HHHHhhh---ccCCeecCCC---------CCcc-------------------H----HHHHHHHHHhcChhHHHHHHHHH Confidence 9998877 333 221100 0000 0 11345667777888888888889 Q ss_pred HHHcCceeEEEEEecC-ceeEEEecCceeccc-ccCc----ceeEEEeecCCCccceEEEEEeeeccccceeehhhhccc Q lcl|NC_020883. 156 HQVDGGIVAAPVIDEL-GPRIVFKARDVYFPH-DDEK----GADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVV 229 (589) Q Consensus 156 ~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~-~d~~----~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~ 229 (589) +.+-|.+...+|.+++ .++|.+.+|.+.||. |+.. .+.+.|..+..+...|.-+|.. ++..+ .|+ T Consensus 110 a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~----~~~~~---~~~-- 180 (456) T protein:vir:79 110 GLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSG----DGWQK---FAR-- 180 (456) T ss_pred HhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCceEEEEEEEEecCCceeEEEEEcC----CceEE---EEE-- Confidence 9999998888888854 499999999888884 2222 1233332221111111111100 00000 000 Q ss_pred cccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHH Q lcl|NC_020883. 230 KAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQ 309 (589) Q Consensus 230 ~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~ 309 (589) .+.+........ ...+.+.........++..+|.|+++.| +.|.|||+++.+++ T Consensus 181 -----------~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~pvv~~~N------~~~~gd~e~v~~li 234 (456) T protein:vir:79 181 -----------PCFVQSSSRRRL---------VTRISDSWVPVGDAVVTGSPPPVVVYQN------PDGMGEVEPHIDII 234 (456) T ss_pred -----------EEEeecccccee---------eeccCCceeecccccCCCCceeEEEecC------CCCCchhhhhHHHH Confidence 000000000000 0000011112223456777888888755 58999999999999 Q ss_pred HHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeee-ccc Q lcl|NC_020883. 310 DEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQI-DIS 388 (589) Q Consensus 310 DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~-Dir 388 (589) |++|.++++.+...+.++.|...+.-.-+. ....+..|........ .....-.++..+++ ..+.|+ +.. T Consensus 235 D~~~~~~s~~~~~~~~~a~~~~~~~G~~~~---~~~~d~~g~~i~~~~~----~~~~~~~~~~~~~~---~~~~q~~~~~ 304 (456) T protein:vir:79 235 NRINRAELQLLSTMAIQAFRQRALKSSEHR---LPKVDENGNAIDYASI----FEAAPGALWELPPG---VDIWESQTND 304 (456) T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHhcCCcc---cccccccccccchhhh----hhhhccccccCCCC---cceeeecccC Confidence 999999999888888888887765321110 0011222221111000 00111112222222 222333 467 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD 468 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~ 468 (589) ++.|...++.++.+|+..+++|...||... ...||+|++..+..+..|+++++..|..+|++++++++.+.. T Consensus 305 ~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~-----~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g--- 376 (456) T protein:vir:79 305 FTPMLSAIKEHIRQLSSATKTPLPMLMPDS-----ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG--- 376 (456) T ss_pred hHHHHHHHHHHHHHHHhhcCCChhHhcccc-----cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--- Confidence 888999999999999999999999998532 124899999999999999999999999999999998876652 Q ss_pred cccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHH-HHHHHHHHhhccccccccccccc Q lcl|NC_020883. 469 SSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWI-QEEIARIEEEQAGSDTSSLMGIN 547 (589) Q Consensus 469 ~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v-~eEv~RI~~E~a~~~p~~~g~~~ 547 (589) ........|.|.+..|.+.. ..|+++..+.++|++|.+++++.+ .++++++ ++|++|+++|..+.... T Consensus 377 -~~~~~~i~v~w~~~~~~s~~--~~ada~~kl~~~G~~~~~~~~~~l--g~~~~~i~~~e~~r~~~e~~~~~~~------ 445 (456) T protein:vir:79 377 -ESVEDTVDVSFESPDRVTLG--EKYSAASLAKAAGESWASIRRNIL--NYNADQIKQDDLDRAREQITLFAGN------ 445 (456) T ss_pred -CCccccceEEeCCCCCcCHH--HHHHHHHHHHhcCCChHHHHHhcC--CCCHHHHHHHHHHHHHHHHHHHhhh------ Confidence 12233467999998887754 557888888899999999887765 4776554 57999999987753111 Q ss_pred cccccccCcccCCCCC Q lcl|NC_020883. 548 QTFEQMNDNRDEDGNI 563 (589) Q Consensus 548 ~~l~~~~~~~~~~~~p 563 (589) +.+ .-+++++- T Consensus 446 --~~~---~~~~~~~~ 456 (456) T protein:vir:79 446 --PVQ---RPQEDGSR 456 (456) T ss_pred --Hhh---cCCCCCCC Confidence 111 12222222 No 64 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=2.4e-32 Score=193.98 Aligned_cols=441 Identities=13% Similarity=0.056 Sum_probs=256.4 Q ss_pred CccceeccchhHHHHhhcchhhhhhhh---hcCCcccc-CHH-HHHHHhhccccceeccCcceeeecCcceEEEEcchhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQL---YEGKHELL-FPR-AKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVI 75 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l---~~g~~~~~-f~r-a~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i 75 (589) |--=|--.|-.+.+...-+.+.||++| |+|+|.-. .+| +..-.+ ...-.++.|+++.| T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~-----------------~~~~k~~~n~~~~i 63 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWR-----------------SFQREARTNWGLMV 63 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhh-----------------hhhhhhhcchHHHH Confidence 544444455566666665667777766 99998532 122 110000 01234789999999 Q ss_pred hccchhhhcccccc-ccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 76 AEIPATMVSGSIGQ-IKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 76 ~~~pa~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) ++.++..+ +|+ ++.... .++. . .+.+.++.+.++|......... T Consensus 64 vd~~~~~l---~~~~~~~~~~---------~d~~-------------------~----~~~~~~i~~~N~~d~~~~~~~~ 108 (456) T protein:vir:10 64 RDSVADRI---IPNGITVGGS---------ADSD-------------------L----ALRARRIWRDNRMDSVCKQWVK 108 (456) T ss_pred HHHHHhhh---ccCCeecCCC---------CCcc-------------------h----HHHHHHHHHhcChhhHHHHHHH Confidence 99998877 332 221000 0000 0 1124455666777777778888 Q ss_pred HHHHcCceeEEEEEecC-ceeEEEecCceecccccCc-----ceeEEEeec-CCCccceEEEEEeeeccccceeehhhhc Q lcl|NC_020883. 155 QHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPHDDEK-----GADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYP 227 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~~d~~-----~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~ 227 (589) ++++-|.....+|.+++ .++|.+.+|-..||.-|.. .+.+.|... .+....+++++... ..+ .|+ T Consensus 109 ~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~-----~~~---~~~ 180 (456) T protein:vir:10 109 YGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDG-----WQK---FAR 180 (456) T ss_pred HHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccc-----eeE---EEE Confidence 88899988888888754 4899999998888732211 122222211 11111111111100 000 000 Q ss_pred cccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhH Q lcl|NC_020883. 228 VVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLES 307 (589) Q Consensus 228 ~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~ 307 (589) . .. .........+. ... ...........+...|.|+++.| +.|+|||+.+.+ T Consensus 181 ~----~~-~~~~~~~~~~~---~~~--------------~~~~~~~~~~~~~~~~pvv~~~N------~~g~gd~e~vi~ 232 (456) T protein:vir:10 181 P----CF-VQSSSRRRLVT---RIS--------------DSWVPVGDAVVTGSPPPVVVYQN------PDGMGEVEPHID 232 (456) T ss_pred E----EE-Eeecccceeee---ecC--------------CceeeccccCCCCCceeEEEecC------CCCCchhhhhHH Confidence 0 00 00000000000 000 00011112233446677887655 489999999999 Q ss_pred HHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeee-c Q lcl|NC_020883. 308 KQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQI-D 386 (589) Q Consensus 308 l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~-D 386 (589) ++|++|.++|+.+...+.++.|.+++.- +-.. ....+..|........ .....-.++..+++ . .+.|+ . T Consensus 233 liDa~~~~~s~~~~~~~~~a~~~~~i~G-~~~~--~~~~d~~g~~~~~~~~----~~~~~~~~~~~~~~-~--~~~q~~~ 302 (456) T protein:vir:10 233 IINRINRAELQLLSTMAIQAFRQRALKS-TEHG--LPNVDENGNAIDYASI----FEAAPGALWELPPG-V--DIWESQA 302 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhhHhHhhhc-cCcc--cccccccccccchhhh----hhhhccccccCCCC-c--ceEEecc Confidence 9999999999888888888888776521 1000 0011222211110000 00111112222222 2 22333 3 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND 466 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~ 466 (589) ..++.|...++.++.++++.+++|...||... ...||+|+++.+..+..|+.+++..|..+|++++++++.+... T Consensus 303 ~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-----~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~ 377 (456) T protein:vir:10 303 NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-----ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE 377 (456) T ss_pred cChhHHHHHHHHHHHHHHhccCCChHHhcccc-----cChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 56788999999999999999999999998522 2348999999999999999999999999999999988765531 Q ss_pred cCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHH-HHHHHHHHhhccccccccccc Q lcl|NC_020883. 467 QDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWI-QEEIARIEEEQAGSDTSSLMG 545 (589) Q Consensus 467 ~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v-~eEv~RI~~E~a~~~p~~~g~ 545 (589) .......|.|.+..|.+..+. |+++..+.++|++|.+++..++ .++++++ ++|++|+++|++++ ++ T Consensus 378 ----~~~~~~~v~w~~~~~~~~~~~--ada~~kl~~~gi~~~~~~~~~l--g~~~~~i~~~e~er~~~e~~~~-----~~ 444 (456) T protein:vir:10 378 ----SVEDTVDVSFESPDRVTLGEK--YSAASLAKAAGESWASIRRNIL--NYNADQIKQDDLDRAREQITLF-----AG 444 (456) T ss_pred ----CcccceeEEecCCCCcCHHHH--HHHHHHHHHcCCChHHHHHhhC--CCCHHHHHHHHHHHHHHHHHHH-----hh Confidence 222346799999988876654 7777778888999999887765 4776665 47999999988653 11 Q ss_pred cccccccccCcccCCCCCCCCCCC Q lcl|NC_020883. 546 INQTFEQMNDNRDEDGNIIEEGDT 569 (589) Q Consensus 546 ~~~~l~~~~~~~~~~~~p~deg~~ 569 (589) . +. ++ |.++|+- T Consensus 445 ~---~~----~~-----~~~~~~~ 456 (456) T protein:vir:10 445 N---PV----QR-----PQEDGSR 456 (456) T ss_pred h---hh----hc-----CCCCCCC Confidence 1 11 11 2222222 No 65 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=2.4e-32 Score=193.98 Aligned_cols=441 Identities=13% Similarity=0.056 Sum_probs=256.4 Q ss_pred CccceeccchhHHHHhhcchhhhhhhh---hcCCcccc-CHH-HHHHHhhccccceeccCcceeeecCcceEEEEcchhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQL---YEGKHELL-FPR-AKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVI 75 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l---~~g~~~~~-f~r-a~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i 75 (589) |--=|--.|-.+.+...-+.+.||++| |+|+|.-. .+| +..-.+ ...-.++.|+++.| T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~-----------------~~~~k~~~n~~~~i 63 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWR-----------------SFQREARTNWGLMV 63 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhh-----------------hhhhhhhcchHHHH Confidence 544444455566666665667777766 99998532 122 110000 01234789999999 Q ss_pred hccchhhhcccccc-ccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 76 AEIPATMVSGSIGQ-IKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 76 ~~~pa~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) ++.++..+ +|+ ++.... .++. . .+.+.++.+.++|......... T Consensus 64 vd~~~~~l---~~~~~~~~~~---------~d~~-------------------~----~~~~~~i~~~N~~d~~~~~~~~ 108 (456) T protein:vir:10 64 RDSVADRI---IPNGITVGGS---------ADSD-------------------L----ALRARRIWRDNRMDSVCKQWVK 108 (456) T ss_pred HHHHHhhh---ccCCeecCCC---------CCcc-------------------h----HHHHHHHHHhcChhhHHHHHHH Confidence 99998877 332 221000 0000 0 1124455666777777778888 Q ss_pred HHHHcCceeEEEEEecC-ceeEEEecCceecccccCc-----ceeEEEeec-CCCccceEEEEEeeeccccceeehhhhc Q lcl|NC_020883. 155 QHQVDGGIVAAPVIDEL-GPRIVFKARDVYFPHDDEK-----GADLAYYID-HGQYGQFLHIYRERVEKDGLRTTNMLYP 227 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~~~-~~~i~f~~~d~~~P~~d~~-----~~div~~~e-~~~~~~~l~~~~~~~~~~~~~~~~~~y~ 227 (589) ++++-|.....+|.+++ .++|.+.+|-..||.-|.. .+.+.|... .+....+++++... ..+ .|+ T Consensus 109 ~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~-----~~~---~~~ 180 (456) T protein:vir:10 109 YGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDG-----WQK---FAR 180 (456) T ss_pred HHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccc-----eeE---EEE Confidence 88899988888888754 4899999998888732211 122222211 11111111111100 000 000 Q ss_pred cccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhH Q lcl|NC_020883. 228 VVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLES 307 (589) Q Consensus 228 ~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~ 307 (589) . .. .........+. ... ...........+...|.|+++.| +.|+|||+.+.+ T Consensus 181 ~----~~-~~~~~~~~~~~---~~~--------------~~~~~~~~~~~~~~~~pvv~~~N------~~g~gd~e~vi~ 232 (456) T protein:vir:10 181 P----CF-VQSSSRRRLVT---RIS--------------DSWVPVGDAVVTGSPPPVVVYQN------PDGMGEVEPHID 232 (456) T ss_pred E----EE-Eeecccceeee---ecC--------------CceeeccccCCCCCceeEEEecC------CCCCchhhhhHH Confidence 0 00 00000000000 000 00011112233446677887655 489999999999 Q ss_pred HHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeee-c Q lcl|NC_020883. 308 KQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQI-D 386 (589) Q Consensus 308 l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~-D 386 (589) ++|++|.++|+.+...+.++.|.+++.- +-.. ....+..|........ .....-.++..+++ . .+.|+ . T Consensus 233 liDa~~~~~s~~~~~~~~~a~~~~~i~G-~~~~--~~~~d~~g~~~~~~~~----~~~~~~~~~~~~~~-~--~~~q~~~ 302 (456) T protein:vir:10 233 IINRINRAELQLLSTMAIQAFRQRALKS-TEHG--LPNVDENGNAIDYASI----FEAAPGALWELPPG-V--DIWESQA 302 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhhHhHhhhc-cCcc--cccccccccccchhhh----hhhhccccccCCCC-c--ceEEecc Confidence 9999999999888888888888776521 1000 0011222211110000 00111112222222 2 22333 3 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND 466 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~ 466 (589) ..++.|...++.++.++++.+++|...||... ...||+|+++.+..+..|+.+++..|..+|++++++++.+... T Consensus 303 ~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-----~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~ 377 (456) T protein:vir:10 303 NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-----ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE 377 (456) T ss_pred cChhHHHHHHHHHHHHHHhccCCChHHhcccc-----cChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 56788999999999999999999999998522 2348999999999999999999999999999999988765531 Q ss_pred cCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHH-HHHHHHHHhhccccccccccc Q lcl|NC_020883. 467 QDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWI-QEEIARIEEEQAGSDTSSLMG 545 (589) Q Consensus 467 ~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v-~eEv~RI~~E~a~~~p~~~g~ 545 (589) .......|.|.+..|.+..+. |+++..+.++|++|.+++..++ .++++++ ++|++|+++|++++ ++ T Consensus 378 ----~~~~~~~v~w~~~~~~~~~~~--ada~~kl~~~gi~~~~~~~~~l--g~~~~~i~~~e~er~~~e~~~~-----~~ 444 (456) T protein:vir:10 378 ----SVEDTVDVSFESPDRVTLGEK--YSAASLAKAAGESWASIRRNIL--NYNADQIKQDDLDRAREQITLF-----AG 444 (456) T ss_pred ----CcccceeEEecCCCCcCHHHH--HHHHHHHHHcCCChHHHHHhhC--CCCHHHHHHHHHHHHHHHHHHH-----hh Confidence 222346799999988876654 7777778888999999887765 4776665 47999999988653 11 Q ss_pred cccccccccCcccCCCCCCCCCCC Q lcl|NC_020883. 546 INQTFEQMNDNRDEDGNIIEEGDT 569 (589) Q Consensus 546 ~~~~l~~~~~~~~~~~~p~deg~~ 569 (589) . +. ++ |.++|+- T Consensus 445 ~---~~----~~-----~~~~~~~ 456 (456) T protein:vir:10 445 N---PV----QR-----PQEDGSR 456 (456) T ss_pred h---hh----hc-----CCCCCCC Confidence 1 11 11 2222222 No 66 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=4.2e-32 Score=192.60 Aligned_cols=470 Identities=11% Similarity=0.066 Sum_probs=262.1 Q ss_pred Cccce------------eccchhHHHHhhcchhhhhhhhhcCCcc--ccCHHHHHHHhhccccceeccCcceeeecCcce Q lcl|NC_020883. 1 MIDWT------------VRGWTDKTTKNVHGDYERYRQLYEGKHE--LLFPRAKRLIEEGDAVGRFLDSSQTARETQTPY 66 (589) Q Consensus 1 ~~~~~------------~~~~~~~~~~~~~~~~~~~r~l~~g~~~--~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y 66 (589) -|+|- ++.|-..... -+..+.++.+.|+|+|. .+..++..-.+ . .+-- T Consensus 15 ~~~~p~~~~~~~~~~~l~~~l~~~~~~-~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~----------------~-~~~~ 76 (501) T protein:vir:25 15 DVEFPEDSMSREQLGALVADMWRLHIS-ERQWLDRIYEYTKGLRGRPEVPEGASDEVK----------------E-LAKL 76 (501) T ss_pred cccCCcccCChHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCchhccccCChhhh----------------h-hHhh Confidence 12221 2222222111 22345556678999986 33333221111 0 0112 Q ss_pred EEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccc Q lcl|NC_020883. 67 VIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLE 146 (589) Q Consensus 67 ~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~ 146 (589) ++.|+++.|++..+.++. +- +|... +.. . ++-+..+.+.|+|. T Consensus 77 ~v~n~~~~ivd~~a~~l~--~~----gf~~~--------d~~------~-----------------~~~l~~i~~~N~~d 119 (501) T protein:vir:25 77 SVKNVLSLVRDSFAQNLS--VV----GYRNA--------LAK------E-----------------NDPAWEMWQRNRMD 119 (501) T ss_pred hhcChHHHHHHHHHhhhc--cc----ceecC--------Ccc------c-----------------hHHHHHHHHhcChh Confidence 456999999999888772 11 12100 000 0 01234566677788 Q ss_pred ccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCc-c-ceEEEEEeeeccccceeehh Q lcl|NC_020883. 147 RRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQY-G-QFLHIYRERVEKDGLRTTNM 224 (589) Q Consensus 147 ~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~-~-~~l~~~~~~~~~~~~~~~~~ 224 (589) .+..+...++.+-|.....+|.++++..|.+.+|.+.|+ +|....+.. - ..|++|+...+.... ..-. T Consensus 120 ~~~~~~~~~a~i~G~ay~~v~~de~~~~i~~~sp~~~~~---------iy~D~~~~~~~~~ai~~~~~~~~~~~~-~~~~ 189 (501) T protein:vir:25 120 ARQAEVHRPALTYGASYVTVTPTDEGPVFRTRSPRQILA---------VYADPSVDAWPQYALETWVAQKDAKPH-RRGV 189 (501) T ss_pred HHHHHHHHHHhhcCceEEEEecCCCCCeEEEeccccEEE---------EEecCCCCcceeEEEEEEeeccccCcc-eeEE Confidence 888888899999999999999998888899888877765 332211111 1 112222222111111 0001 Q ss_pred hhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhh Q lcl|NC_020883. 225 LYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDN 304 (589) Q Consensus 225 ~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ 304 (589) +|-.. .+++. ..+..+.......... ....... .+.+..+....+.+.....|++++|.+.. +++|+||++. T Consensus 190 ~y~~~----~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~vPiv~f~N~~~~-~~~g~sdie~ 261 (501) T protein:vir:25 190 LYDDT----YMYEL-DLGEVVLGDAGGGQAT-QQPVNVR-EVTDVIEHGATFEGKPVCPVVRFVNGRDA-DDMIVGEVAP 261 (501) T ss_pred EecCe----eEEEE-ecCceeeeeccccccc-ccccccc-ccccccccccccCCccceeeEeccCcccc-Cccccchhhh Confidence 12000 00000 0000000000000000 0000000 01112233344567777789999998765 5679999999 Q ss_pred hhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceee Q lcl|NC_020883. 305 LESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQ 384 (589) Q Consensus 305 ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq 384 (589) +.+++|++|.++++.....+.++.|..++ .|...+.. ....+. . -.++..++ ..+.+.| T Consensus 262 v~~l~Da~~~~~s~~~~~~e~~a~p~~~i--------~G~~~~~~----~~~~~~-----~--~~i~~~~~--~~~~~~q 320 (501) T protein:vir:25 262 LILLQQAINSVNFDRLIVSRFGANPQRVI--------SGWTGSKA----EVLKAS-----A--LRVWTFED--PEVKAQA 320 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHhhccHHHHH--------hCCCCCcc----chhhhc-----c--cceeccCC--CCceEEE Confidence 99999999999999999999999886654 12221111 111111 1 11222111 2244567 Q ss_pred ec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 385 ID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWL 463 (589) Q Consensus 385 ~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L 463 (589) |+ ..++.|.+.++.++++|+..+++|...||... ...||.|+++.+.++..|+.+++..|..+|++++++++.+ T Consensus 321 ~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~-----~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~ 395 (501) T protein:vir:25 321 FPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKM-----INVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEM 395 (501) T ss_pred ecccChHHHHHHHHHHHHHHHhhcCCChhhhcccc-----CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66 46788999999999999999999999998432 2349999999999999999999999999999999988776 Q ss_pred HhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccc Q lcl|NC_020883. 464 LNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSL 543 (589) Q Consensus 464 ~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~ 543 (589) ...... .......|.|.+..|.+..+ .|+++..+.+++ +|.++.+.++ |.++++++++..+..+++.+. ++ T Consensus 396 ~~~~~~-~~~~~i~v~w~~~~~~s~~~--~ada~~kl~~~g-is~et~~~~~-~g~~~~~ie~~~~~~~e~~~~----~~ 466 (501) T protein:vir:25 396 DDDPDT-AADSGAEVLWRDTEARSFGA--VVDGITKLASAG-IPIEHLLSMV-PGMTQQTIQAIKDSLRGGEVK----SL 466 (501) T ss_pred hCCCcc-ccceeeeEEecCCCCCCHHH--HHHHHHHHHhcC-CCHHHHHHHc-CCCCHHHHHHHHHHHHHHhHH----HH Confidence 643221 22234678999998877654 477777777776 4888877766 779987765544433333322 11 Q ss_pred cccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccccc Q lcl|NC_020883. 544 MGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEP 587 (589) Q Consensus 544 g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~ 587 (589) - .. +.++++ .|.++...++++.++...+....+-+ T Consensus 467 ~------~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 467 V------DK-LLSNEP--APVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred H------HH-hhccCc--CCCCCCCCCCCccccccccCCCCCCC Confidence 0 00 111221 22233333333333333333333322 No 67 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.96 E-value=2.5e-30 Score=182.90 Aligned_cols=493 Identities=16% Similarity=0.110 Sum_probs=264.6 Q ss_pred CccceeccchhHHHHhhcchhhhhh---hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcch--hh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYR---QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPK--VI 75 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r---~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~--~i 75 (589) .+-=+.--|-..-=|. -+-+|| .+|.|++.++=+ +..|+ + ..-+|++. .| T Consensus 14 ~fp~~~a~wV~~~D~~---RlaaY~ly~d~y~n~~~el~~-----il~G~--d---------------r~~~~~ps~r~~ 68 (563) T protein:vir:74 14 FLRGGDDNIVDENDKN---RVRAYDLYENIYLNSAETLKL-----VLRGD--D---------------SVPILMPSGRKI 68 (563) T ss_pred cccccccccCCHHHHH---HHHHHHHHHHhhcCchhhhhh-----hcCCC--c---------------eeeeccchHHHH Confidence 1111111221111111 234455 778998888643 11222 1 12344433 67 Q ss_pred hccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHH Q lcl|NC_020883. 76 AEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQ 155 (589) Q Consensus 76 ~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~ 155 (589) +++=+-+ ||.=-...- +++. .+ +++.+ . + +..+....+-.+.-++|.++-.. T Consensus 69 V~~~~~~----Lg~~~~~~V--e~~~---~d-----e~~~~----------a---v-q~~Lr~~~~~e~l~~~~~~~~r~ 120 (563) T protein:vir:74 69 VEAVHRF----LGVGFDYLV--EPDM---GD-----EGIRQ----------S---L-NAYFRTTFKREAIKAKFTSNKRW 120 (563) T ss_pred HHHHHHh----cCCCcEEec--Cccc---cC-----cchHH----------H---H-HHHHHHHHHHhhhHHHHHHHHHh Confidence 7663333 332111110 0000 00 10000 0 1 55778888888999999999999 Q ss_pred HHHcCceeEEEEEec-----CceeEEEecCceecccccCcceeEEE--------eecCCCccceE---EEEEeee-cccc Q lcl|NC_020883. 156 HQVDGGIVAAPVIDE-----LGPRIVFKARDVYFPHDDEKGADLAY--------YIDHGQYGQFL---HIYRERV-EKDG 218 (589) Q Consensus 156 ~~v~Gg~~~~~~~~~-----~~~~i~f~~~d~~~P~~d~~~~div~--------~~e~~~~~~~l---~~~~~~~-~~~~ 218 (589) +.|.|--|+++.||. ..+++.=+++.+|||-+++-.+.=+| ..+.- ..+++ +-|+... ++.. T Consensus 121 a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd-~~~~~~r~~~~~~~lndeg~ 199 (563) T protein:vir:74 121 GLIRGDAHFYIHADPNKKAGERISVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDD-PSKKLARRRTFRRVRNDEGM 199 (563) T ss_pred hhhhcceeEEEeeccccccCCCceEeecCCceeeeccCCCCcccceeeecccCCCCCcc-hhccceeeeeeeeeeCCCCC Confidence 999999999999994 24888888999999977765432222 11111 11211 1122221 1112 Q ss_pred ce----eehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCC Q lcl|NC_020883. 219 LR----TTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFM 294 (589) Q Consensus 219 ~~----~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~ 294 (589) ++ +.-.+|..++... .+...+..-.+-+.....+- +.+.+..+.-+..+.++++||.+... T Consensus 200 ~~~~~~~dae~w~lg~wd~-----------r~~~~~~~~~~~~~~~~~~~----d~e~~~LP~pi~~iPiv~~~tip~~~ 264 (563) T protein:vir:74 200 FTGRISSELTHWTLGNWDD-----------RGAISDEQARRKEQVRSAQH----DEEEEELPEPISQLPLYRWRNKPPQN 264 (563) T ss_pred ccceeeeccchhccccccc-----------cCccchhhhcccchhhhhhh----hchhhhccccccCccEEEcCCCCCcc Confidence 11 1112232221100 01111111111111111110 11223333344556688889999999 Q ss_pred CcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 295 NPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFD 374 (589) Q Consensus 295 ~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~d 374 (589) +.||+|++++++.+|++||.+.|..++++.-+|.|-+ |=++.- ..|| ..+. ..+....+..++... T Consensus 265 s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~-vl~~~~--------p~d~----~~g~-~~~w~vgpG~i~El~ 330 (563) T protein:vir:74 265 SSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMY-VTNASA--------PVDP----NTGE-LTDWNIGPMQIVEIA 330 (563) T ss_pred cccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeE-Eecccc--------cccc----cccc-ccccccCCceeEecc Confidence 9999999999999999999999999999999999944 322111 1121 1111 011222334444444 Q ss_pred cccCc--cceeeecccHHHHHHHHHHHHH-HHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_020883. 375 ENGRS--MEIHQIDISKIGDMDHVKNLIK-LMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYID 451 (589) Q Consensus 375 e~g~~--~~~iq~Dirveeh~~~ie~L~~-~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~ 451 (589) +++.. +..++---++.....+++.|.+ -++..+++|..|||..+ ++.+.||+|+..+|.++++|++++|-.+.. T Consensus 331 ~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD---~~~~~SGiALeL~L~PL~a~~~ek~l~l~~ 407 (563) T protein:vir:74 331 GNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVD---VTSAESGISLELQLKPLLAANEEKELEMIV 407 (563) T ss_pred CCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccc---cccccchhhhhhhhhHHHHhhhhhHHHHHH Confidence 44322 2223222355555556666555 66788999999999654 445789999999999999999999977777 Q ss_pred HHHHHH--HHHHHHHhh------cC--cccCccc------ceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh Q lcl|NC_020883. 452 FLKELY--ESCLWLLND------QD--SSIRIEE------PNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM 515 (589) Q Consensus 452 aLk~li--~~~l~L~~~------~~--~~~~~e~------p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L 515 (589) .++.+. ...+||.+. +. .-+++++ +.|.|.+.+|+|.+.. -+-..++.+++++|++|||++| T Consensus 408 ~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~v--v~~~~tl~~aGiiSretAv~~L 485 (563) T protein:vir:74 408 VMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQV--TQDTLLLQQAHLILRKMAVAKL 485 (563) T ss_pred HHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHH--HHHHHHHHHcCchhHHHHHHHH Confidence 665521 222333221 11 1122222 4678999999997754 3444578899999999999999 Q ss_pred ---C---CCCCHHHHHHHHHHHHhh--ccccccccccc---cccccccccCcccCCCCCCC-CCCCCCCCCcchhhhhhc Q lcl|NC_020883. 516 ---N---PDASEDWIQEEIARIEEE--QAGSDTSSLMG---INQTFEQMNDNRDEDGNIIE-EGDTEEEPSAEENEEIEK 583 (589) Q Consensus 516 ---h---pdw~dE~v~eEv~RI~~E--~a~~~p~~~g~---~~~~l~~~~~~~~~~~~p~d-eg~~~eep~~~~~e~~~~ 583 (589) . ||.+.|-...|.++|..- +.+....++|. ++..++..-+.. -++|.| =|++-|=|++- +.- T Consensus 486 ~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd--~g~p~~~~~~~~~~~~~~----~~~ 559 (563) T protein:vir:74 486 RSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDD--QGNPIDQFGNPVEIPPDV----TQV 559 (563) T ss_pred HhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccc--cCCchhHcCCcccCCccc----ccc Confidence 3 555555555566666541 11112233332 222333221111 156664 34454545432 223 Q ss_pred cccc Q lcl|NC_020883. 584 EGEP 587 (589) Q Consensus 584 ~~~~ 587 (589) ++.| T Consensus 560 ~~~~ 563 (563) T protein:vir:74 560 PLSP 563 (563) T ss_pred CCCC Confidence 3334 No 68 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.96 E-value=6.4e-30 Score=180.64 Aligned_cols=419 Identities=11% Similarity=0.005 Sum_probs=238.6 Q ss_pred ceeeecCcce------EEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhh Q lcl|NC_020883. 57 QTARETQTPY------VIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVID 130 (589) Q Consensus 57 ~~~~~~~~~y------~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 130 (589) +..+++..-| .++|+++.|++..+.++. +. +|.. . |+. T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~--~~----gf~~----~----d~~---------------------- 44 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLL--AL----GVTG----P----DGE---------------------- 44 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhc--cC----ceec----C----CCc---------------------- Confidence 1112222111 467999999999888662 11 1110 0 000 Q ss_pred hhhhHHHHHHhhccccccchhhHHHHHHcCceeEEEEEecCc--------eeEEEecCceecccccCcceeEEEeecCCC Q lcl|NC_020883. 131 LQNEIIEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDELG--------PRIVFKARDVYFPHDDEKGADLAYYIDHGQ 202 (589) Q Consensus 131 ~~~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~--------~~i~f~~~d~~~P~~d~~~~div~~~e~~~ 202 (589) .++-+.++.+.|+|..+......++++-|.....+|.++++ ++|.+.+|.+.|+ +|.....+ T Consensus 45 -~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~---------i~D~~~~~ 114 (434) T protein:vir:98 45 -PDTRASRWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIV---------EYDPETGE 114 (434) T ss_pred -hHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEE---------EEeCCCCc Confidence 01234556777888888888899999999999999876543 5688888877776 44221111 Q ss_pred ccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcc Q lcl|NC_020883. 203 YGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRP 282 (589) Q Consensus 203 ~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~p 282 (589) .--.|++|....+.....+.+ +|- ..... +.....+. .....++.. +.........+.+..+. T Consensus 115 ~~~ai~~~~~~~~~~~~~~~~-~~~-~~~~~--~~~~~~~~---~~~~~~~~~----------~~~~~~~~~~~h~~g~v 177 (434) T protein:vir:98 115 PLVGLKVWHNDIDGFGYARVF-FDD-TSFPY--RTRERTGA---RLPWGPDSW----------VYTGTADSGDVHDLGGM 177 (434) T ss_pred eEEEEEEEEeccCCceEEEEE-EeC-cEEEE--EEeecccc---ccccccccc----------eecccccccccCCCCcc Confidence 111233333222111111110 000 00000 00000000 000000000 00001112233466777 Q ss_pred eEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccc Q lcl|NC_020883. 283 FISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPR 362 (589) Q Consensus 283 lvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~ 362 (589) .|++++|++.... +|+|||+.+.+++|++|.++|+.....+.++.|.+++.-.-++ ...+........... T Consensus 178 Pvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~------~~~~~~~~~~~~~~~-- 248 (434) T protein:vir:98 178 QLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFA------KRTDPATGMTVVDQP-- 248 (434) T ss_pred ceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcc------cccccccccchhhhh-- Confidence 8998999988766 5999999999999999999999999999999998876421111 111111000000000 Q ss_pred cccccccccccccccCccceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHH Q lcl|NC_020883. 363 IDHRDMEITTFDENGRSMEIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILK 441 (589) Q Consensus 363 ~d~~dlev~~~de~g~~~~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~K 441 (589) .....-.++..+ +..+.+.|++ ..++.|...++.++.+++..+++|...||. . ....||+|++..+..+..| T Consensus 249 ~~~~~~~i~~~~--~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~-~----~~n~Sg~Al~~~~~~l~~k 321 (434) T protein:vir:98 249 FVPSPSAVWASE--GENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYAT-D----LVNISADTIGALDILHVAK 321 (434) T ss_pred hhccccccccCC--CCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhcc-c----cCChHHHHHHHHHHHHHHH Confidence 000011111111 2235566765 578899999999999999999999999983 2 1235999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCH Q lcl|NC_020883. 442 SRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASE 521 (589) Q Consensus 442 v~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~d 521 (589) +.++++.|..+|++++++++.+. +.........|.|.+..|.+..+ .|+++..+.++++ |.++.+.++ | +++ T Consensus 322 ~~~k~~~f~~~l~~~~rl~~~~~---g~~~~~~~~~v~w~~~~~~s~~~--~ada~~kl~~~g~-~~e~~~~~l-g-~~~ 393 (434) T protein:vir:98 322 VREHIASFSEGLESVLALAAAQA---GVPEDYTEAEVRWANPAHVTMAV--KADAATKLKSIGY-PLDVIAEEL-D-ESP 393 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHhc---CCChhheeeeEEecCCCCCCHHH--HHHHHHHHHhcCC-cHHHHHHhC-C-CCH Confidence 99999999999999998876654 22333445689999998877664 4777778877664 777665555 5 555 Q ss_pred HHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCc Q lcl|NC_020883. 522 DWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSA 575 (589) Q Consensus 522 E~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~ 575 (589) + |++||++|..+..-.. ..+.....++++...++++..++| T Consensus 394 ~----e~~r~~~e~~~~~~~~---------~~~~~~~~~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 394 A----RVRRIVAGAASQALLA---------ASLLPAPGAPSAGNVPDSGGAVDG 434 (434) T ss_pred H----HHHHHHHHHHHHHHHH---------HhhhccCCCCCCCCCCcccCCCCC Confidence 4 4555544433211000 001112222233344445555555 No 69 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.95 E-value=3.7e-28 Score=170.95 Aligned_cols=400 Identities=12% Similarity=-0.006 Sum_probs=247.6 Q ss_pred HHhhcchhhhhhhhhcCCccc--cCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhcccccccc Q lcl|NC_020883. 14 TKNVHGDYERYRQLYEGKHEL--LFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIK 91 (589) Q Consensus 14 ~~~~~~~~~~~r~l~~g~~~~--~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~ 91 (589) +--.+.-+.++.+.|+|+|.- +-+.+..-++ ....+++|+++++++..|+.+. +- T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~------------------~~~~~v~nw~~~~Vds~a~rl~--~~--- 57 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIR------------------AKYQAVLGWAAKGVDSLADRLI--FR--- 57 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHH------------------hHHHhhcchhHHHHHHhHhhhc--cc--- Confidence 333334455556889999854 2121111111 0113678999999999888661 11 Q ss_pred ccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcCceeEEEEEecC Q lcl|NC_020883. 92 SSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDEL 171 (589) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~ 171 (589) +|.. + +. -+.++..-|+|..+......++++-|.+...++-+++ T Consensus 58 -Gf~~-----------~-------d~-----------------~l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d 101 (410) T protein:vir:95 58 -AFAN-----------D-------DF-----------------NVTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGED 101 (410) T ss_pred -cccC-----------C-------Cc-----------------hHHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCC Confidence 1110 0 00 0244556677777788888888999988888887644 Q ss_pred -ceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeeccccccccccc Q lcl|NC_020883. 172 -GPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEG 250 (589) Q Consensus 172 -~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e 250 (589) .++|.+.+|.+.+. +|.....+--..|.++ ++. ..+-.+.-.+|- .+.+.+....| T Consensus 102 ~~~~i~~~sP~~~~~---------i~Dp~~~~~~~al~~~-~~~-~~~~~~~~~~~~----~~~~~~~~~~~-------- 158 (410) T protein:vir:95 102 DEVRLQVIESSNATG---------VIDPITGLLVEGYAVL-ARD-DYNRPTLEAYFE----PNATHFIPKDG-------- 158 (410) T ss_pred CceEEEEEcccceEE---------EEeCCCCceEEEEEEE-Eec-CCCeEEEEEEEe----CCcEEEEeeCC-------- Confidence 49999998877774 4422111100111111 111 111111112331 11111111111 Q ss_pred ccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch-hhhhHHHHHHHHHHhHHHHHHHHhCCC Q lcl|NC_020883. 251 AEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL-DNLESKQDEINWTITRSAVIYEQNGKP 329 (589) Q Consensus 251 ~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~-~~ie~l~DeLd~t~S~~srildk~gkp 329 (589) ..+.++.+..++.||.++|.+...+|+|+|++ +.+.+++|++|+++++.....+.++.| T Consensus 159 --------------------~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~p 218 (410) T protein:vir:95 159 --------------------EPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWP 218 (410) T ss_pred --------------------ccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcch Confidence 11234567788889999999999999999999 579999999999999999999999999 Q ss_pred cEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020883. 330 RISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMDHVKNLIKLMLIETQ 408 (589) Q Consensus 330 RI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~~ie~L~~~Il~~a~ 408 (589) ...+ .| .+.+++...... .....+-..+.+++|..+.+-||+ ..++.|.+.++.+++++..+++ T Consensus 219 qr~i--------~G--~d~d~~~~~~~~-----~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~ 283 (410) T protein:vir:95 219 QKYI--------LG--LDPDAEPMEKWK-----ATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMG 283 (410) T ss_pred hhee--------ec--cCCCCCcCchhh-----hhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcC Confidence 8876 12 244443322111 111112233445566667777766 5788899999999999999999 Q ss_pred CCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-ccCcccceeeeCCcCCC- Q lcl|NC_020883. 409 TSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS-SIRIEEPNIETQDMILK- 486 (589) Q Consensus 409 ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~-~~~~e~p~I~f~D~lPv- 486 (589) +|..+||.... . ..|+.|++....+..+|+++++..|-.++++++++++.+...... ........|.|....+. T Consensus 284 lP~~~lg~~~~-N---psSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~ 359 (410) T protein:vir:95 284 LTLDDLGFVSD-N---PSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEAD 359 (410) T ss_pred CCHHHhccccC-c---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcc Confidence 99999996332 1 248999999999999999999999999999999998887654321 22233456889743221 Q ss_pred CCCHHHHHHHHHHHhcc--chhhHHHHHHHhCCCCCHHHHHHHHHHH-Hhhccccccccccc Q lcl|NC_020883. 487 PRAELVAENMAAYAASK--QGQSLETTVRRMNPDASEDWIQEEIARI-EEEQAGSDTSSLMG 545 (589) Q Consensus 487 de~El~~A~t~~~l~~a--~~~S~etaVr~Lhpdw~dE~v~eEv~RI-~~E~a~~~p~~~g~ 545 (589) ..+.-+.|..+..+.++ ++++++++..+|+ +++++ ++|| .+|+.+. |. T Consensus 360 ~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg--~~~~~----~~~~~~~e~~~~-----g~ 410 (410) T protein:vir:95 360 ANTMTMIGDGVVKLNQALPGYINAETIRDLTG--IAGDM----SAKPVVSEGGSN-----GE 410 (410) T ss_pred hhhHHHHHHHHHHHHHhccCCccHHHHHHhcC--CChHH----HHHHHHHHHHhC-----CC Confidence 12333334444445555 7889999999985 66553 2344 3343321 11 No 70 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.94 E-value=6.5e-28 Score=169.64 Aligned_cols=409 Identities=12% Similarity=0.023 Sum_probs=253.7 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcc-eEEEEcchhhhccc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTP-YVIFNLPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~-y~~~n~~~~i~~~p 79 (589) .|++-.+-|.++.- -+.++.+.|+|+|..-+ +++- .-+..+.. .+++|+++.+++.. T Consensus 5 ~i~~L~~~~~~~~~-----r~~~~~~yy~g~~~~~~------------~~~~-----~p~~~~~~~~~v~nw~~~~Vd~~ 62 (422) T protein:vir:97 5 GMGYLRRKLALFKT-----GVDKRYRYYAMDDRDDT------------RSIV-----MPNNVREMYRSVLEWTAKGVDSL 62 (422) T ss_pred HHHHHHHHHHHHHH-----HHHHHHHHHhcCCChhh------------cCcc-----ccHHHHHHHHhhcchhHHHHHHH Confidence 45555555544322 24445588999986411 1000 00111111 25779999999998 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) |+.+. + .+|.. . +. + ..++..-|+|..+......++++- T Consensus 63 a~rl~--~----~Gf~~----~--------------d~------------~-----l~~~w~~N~ld~~~~~~~~~al~~ 101 (422) T protein:vir:97 63 ADRII--F----REFTN----D--------------DF------------N-----AWEIFKANNPDIFFDTAIQSALIA 101 (422) T ss_pred Hhccc--c----ceeeC----C--------------ch------------h-----HHHHHHhcChHHHHHHHHHHHHHh Confidence 77440 0 11110 0 00 0 124455566777777788888999 Q ss_pred CceeEEEEEec--CceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhhe Q lcl|NC_020883. 160 GGIVAAPVIDE--LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKK 237 (589) Q Consensus 160 Gg~~~~~~~~~--~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~ 237 (589) |.+...++.++ +.++|.+.+|.+.+. +|. ..+. .....+.+...+.++..+.-..|.. ..+.+ T Consensus 102 G~sf~~v~~~~~~~~p~i~~~sp~~~~~---------i~D-~~~~-~~~~a~~~~~~~~~~~~~~~~~~~~----~~~~~ 166 (422) T protein:vir:97 102 SCCFVYIMPGAEDGLPKMQVIEASKATG---------ILD-PTTF-LLTEGYAILESDSNGNPTLEAYFTD----KDIWY 166 (422) T ss_pred cceeEEEeeCCCCCeeEEEEechhhEEE---------EEe-CCCC-cceeeEEEEEecCCCcEEEEEEEcC----ceEEE Confidence 99999998864 348899988877764 442 1111 1111221111222222222111210 00000 Q ss_pred eecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch-hhhhHHHHHHHHHH Q lcl|NC_020883. 238 EIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL-DNLESKQDEINWTI 316 (589) Q Consensus 238 ~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~-~~ie~l~DeLd~t~ 316 (589) ....| ..+.++.+..++.||.++|.+...+++|+|++ +.+.+++|++|+++ T Consensus 167 ~~~~~----------------------------~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~ 218 (422) T protein:vir:97 167 YPKKG----------------------------KPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTL 218 (422) T ss_pred EcCCC----------------------------ccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHH Confidence 00111 11234567778889999999999999999999 67999999999999 Q ss_pred hHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHHH Q lcl|NC_020883. 317 TRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMDH 395 (589) Q Consensus 317 S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~~ 395 (589) ++.....+.++.|...+ .| .+.++........ ....+-..+.+++|..+.+-||+ ..++.|.+. T Consensus 219 ~~~~~~~e~~a~pqr~i--------~G--~d~d~~~~~~~~~-----~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~ 283 (422) T protein:vir:97 219 ERAEVTAEFYSFPQKYV--------LG--MDPDAKPMEKWRA-----TVSTLLEISKDEDGDKPTVGQFTTASMAPFMEH 283 (422) T ss_pred HHHHHHHHHhcchhhhh--------cc--cCcccccCchhhh-----hhhhhhccCCCCCCCcceeeecCCCChhHHHHH Confidence 99999999999998865 12 2344432221111 11112234445666667776766 567889999 Q ss_pred HHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc-cCcc Q lcl|NC_020883. 396 VKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSS-IRIE 474 (589) Q Consensus 396 ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~-~~~e 474 (589) ++.++.++..++++|..+||.... + ..||.|++.++.+..+|+++++..|-.++++++++++.+....... .... T Consensus 284 l~~~~~~~a~~s~lP~~~lg~~~~-N---psSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~ 359 (422) T protein:vir:97 284 LKMYASLFAGGSGLTLDDLGFPSD-N---PSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFM 359 (422) T ss_pred HHHHHHHHhcccCCCHHHhccccC-c---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhc Confidence 999999999999999999996321 1 2489999999999999999999999999999999987766532211 1123 Q ss_pred cceeeeCCcCCCCCCHHH-HHHHHHHHhcc--chhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccc Q lcl|NC_020883. 475 EPNIETQDMILKPRAELV-AENMAAYAASK--QGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGS 538 (589) Q Consensus 475 ~p~I~f~D~lPvde~El~-~A~t~~~l~~a--~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~ 538 (589) ...+.|.+..|.+...++ .|.-+..+.++ +++++++...+|. +++ .+.|+.||.+.++-. T Consensus 360 ~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg--~~~--~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 360 DTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTG--VKG--ADKPIPAITEVTTDG 422 (422) T ss_pred cceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcC--CCc--hhHHHHHHHhhhccC Confidence 467899877666633321 13323334455 7889999999984 443 356888888776642 No 71 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.94 E-value=1.8e-27 Score=167.26 Aligned_cols=397 Identities=12% Similarity=-0.006 Sum_probs=249.1 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) .|++-.+-|.++. .-+.++.+.|+|+|.. ..-...+-. +.+ ....+++|+++.|++..| T Consensus 5 ~i~~L~~~~~~~~-----~r~~~~~~yY~g~~~~-~~~~~~~p~------------~~~---~~~~~v~nw~~~iVds~a 63 (409) T protein:vir:94 5 GIGYLRFKLSVHK-----RRAEMRYDQYAMKYVD-RFKGITIPQ------------ALS---QQYRSILGWCAKGVDSLA 63 (409) T ss_pred HHHHHHHHHHHHh-----HHHHHHHHHhcccCch-hhcChhhhH------------HHH---HHHhhhcchhHHHHHHhH Confidence 4555555554432 2344555899999853 111111000 010 112367899999999877 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcC Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDG 160 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~G 160 (589) +.+. +- +|. . .+ .+ ..++..-|+|..++.....++++-| T Consensus 64 ~rl~--~~----Gf~-----------~--------~d-----------~~-----l~~i~~~N~ld~~~~~~~~~aliyG 102 (409) T protein:vir:94 64 DRLV--FR----EFE-----------N--------DD-----------FT-----VNEIFEENNPDIFFDSAVLSSLIAS 102 (409) T ss_pred hhcc--cC----ccc-----------C--------Cc-----------hH-----HHHHHHhcChhHHHHHHHHHHHHhc Confidence 7551 11 111 0 00 01 2356666778888888888999999 Q ss_pred ceeEEEEEecCc-eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeee-ccccceeehhhhccccccchhhee Q lcl|NC_020883. 161 GIVAAPVIDELG-PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERV-EKDGLRTTNMLYPVVKAKGDVKKE 238 (589) Q Consensus 161 g~~~~~~~~~~~-~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~-~~~~~~~~~~~y~~~~~~~~~~~~ 238 (589) .....++-++++ ++|.+.+|.+.+- +|.... .+-+..++... +..+..+...+|- .+.++.. T Consensus 103 ~sf~~v~~~~dg~~~i~~~sp~~~~~---------i~D~~~---~~~~~a~~~~~~d~~~~~~~~~~~~----~~~~~~~ 166 (409) T protein:vir:94 103 CSFTYISKGENDAVRLQVIEAVNATG---------IIDPIT---GLLTEGYAVLERDENNNVVLEAHFL----PDRTDYY 166 (409) T ss_pred ceeEEEecCCCCceEEEEeccceEEE---------EEecCC---CceeeeEEEEEecCCCceEEEEEEe----cCcEEEE Confidence 888888876444 8999988877763 553221 12222222221 1223222323331 1111111 Q ss_pred ecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch-hhhhHHHHHHHHHHh Q lcl|NC_020883. 239 IKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL-DNLESKQDEINWTIT 317 (589) Q Consensus 239 ~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~-~~ie~l~DeLd~t~S 317 (589) ...+.. ...+..+..++.||.++|++...+++|+|++ +.+.+++|++|++++ T Consensus 167 ~~~~~~---------------------------~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~ 219 (409) T protein:vir:94 167 YRDSRN---------------------------NISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLE 219 (409) T ss_pred EecCce---------------------------eEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHH Confidence 111100 0122345667889999999999999999999 579999999999999 Q ss_pred HHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHHHH Q lcl|NC_020883. 318 RSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMDHV 396 (589) Q Consensus 318 ~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~~i 396 (589) +.....+.++.|...+- | .+++++..... ......+-..+.+++|..+.+-||+ ..++.|...+ T Consensus 220 ~~~~~~e~~a~pqr~i~--------G--~d~d~~~~~~~-----~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l 284 (409) T protein:vir:94 220 RADVTAEFYSFPQKYVT--------G--LSDDAEPMETW-----KATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQL 284 (409) T ss_pred HHHHHHHHhcChhheeE--------e--cCCCCcccchh-----hhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHH Confidence 99999999999988761 2 24444322211 1111112233445666667777776 4778899999 Q ss_pred HHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-ccCccc Q lcl|NC_020883. 397 KNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS-SIRIEE 475 (589) Q Consensus 397 e~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~-~~~~e~ 475 (589) +.+++++..++++|..+||.... . ..||.|++..+.+..+|+++++..|-.++++++++++.+...... ...... T Consensus 285 ~~~~~~~a~~t~lP~~~lg~~~~-N---psSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~ 360 (409) T protein:vir:94 285 RTAAAGFAGETGLTLDDLGFVSD-N---PSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRK 360 (409) T ss_pred HHHHHHHhhhcCCCHHHhccccC-c---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccc Confidence 99999999999999999996332 1 248999999999999999999999999999999998877654321 112234 Q ss_pred ceeeeCCcCCCCCCHH-HHHHHHHHHhccc--hhhHHHHHHHhCCCCCHHH Q lcl|NC_020883. 476 PNIETQDMILKPRAEL-VAENMAAYAASKQ--GQSLETTVRRMNPDASEDW 523 (589) Q Consensus 476 p~I~f~D~lPvde~El-~~A~t~~~l~~a~--~~S~etaVr~Lhpdw~dE~ 523 (589) ..+.|.+..|.+-..+ +.|.-+..+.+++ ++++++...+++ +++.. T Consensus 361 ~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG--~~~~d 409 (409) T protein:vir:94 361 TKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTG--IEGGE 409 (409) T ss_pred ceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcC--CCCCC Confidence 6789998766664333 2244455667777 667888888885 33222 No 72 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.93 E-value=4.5e-26 Score=159.56 Aligned_cols=445 Identities=13% Similarity=0.093 Sum_probs=251.0 Q ss_pred Cc----------cceeccchhHHHHhhcchhhhhh---hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceE Q lcl|NC_020883. 1 MI----------DWTVRGWTDKTTKNVHGDYERYR---QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYV 67 (589) Q Consensus 1 ~~----------~~~~~~~~~~~~~~~~~~~~~~r---~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~ 67 (589) || +=+-+-|....++..-+-..||+ +-|+|+|..-+ ++.- .-...+...+ T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~------------~~~~-----~p~~~r~~~~ 63 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQY------------VGTL-----IPPQYFNLGL 63 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhh------------cccc-----ccHHHHHHHh Confidence 32 22233455556665555444554 77999986311 1100 0011123346 Q ss_pred EEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccc Q lcl|NC_020883. 68 IFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER 147 (589) Q Consensus 68 ~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~ 147 (589) ++|+++++++..|+.+. +- +|-.. +.. .+ ..-..++.+-|+|.. T Consensus 64 v~nw~~~~Vd~~a~rl~--~~----Gf~~~--------d~~------~~----------------~~~l~~iw~~N~ld~ 107 (474) T protein:vir:81 64 VLGWTGKAVDALARRCN--LE----GFVWP--------DGD------LD----------------SLGGTEVVDDNHLLS 107 (474) T ss_pred hcChHHHHHHHHHhhhc--cc----ceECC--------CCC------cc----------------chHHHHHHHhcChhH Confidence 89999999999888661 11 11100 000 00 012456777788888 Q ss_pred cchhhHHHHHHcCceeEEEEEecCc---eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehh Q lcl|NC_020883. 148 RHWSNIVQHQVDGGIVAAPVIDELG---PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNM 224 (589) Q Consensus 148 ~~~~~l~~~~v~Gg~~~~~~~~~~~---~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~ 224 (589) +....+.++++-|.....++-.+++ ++|.+.+|.+.|. +|.......-..+.++ ..+.++-.+.-. T Consensus 108 ~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~---------~~D~~~~~~~~al~~~--~~~~~g~~~~~~ 176 (474) T protein:vir:81 108 EIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATG---------EWNRRRRGLNNLLSII--DKDKEGKVLSLA 176 (474) T ss_pred HHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEE---------EEeCCCCcceeeeEEE--EEcCCCcEEEEE Confidence 8888888889998888888875444 7788888877774 4422111111112111 111222222222 Q ss_pred hhccccccchhheeec-ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch- Q lcl|NC_020883. 225 LYPVVKAKGDVKKEIK-KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL- 302 (589) Q Consensus 225 ~y~~~~~~~~~~~~~~-~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~- 302 (589) +| ..+.+++... ++. .. + ..+......| .| ||.++|++...+|+|+|.+ T Consensus 177 ly----~~~~~~~~~~~~~~-~~-------------------w--~~~~~~~~~g--vP-vV~~~n~~~~~~~~G~s~i~ 227 (474) T protein:vir:81 177 LY----LDNETVTAQRDKAT-LK-------------------W--QVDRDEHVYG--VP-AQVLPYKPAPKRPFGQSRIT 227 (474) T ss_pred EE----eCCcEEEEEEcCcc-ce-------------------e--eeccCCCCCC--cc-eEEecccccccCcCCccccc Confidence 33 0111111111 110 00 0 0011111224 34 6778999999999999999 Q ss_pred hhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCcc-- Q lcl|NC_020883. 303 DNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSM-- 380 (589) Q Consensus 303 ~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~-- 380 (589) ..+.+++|++|+++++.....+.++.|..+|- ++-. ....+++++......-.... +-..+.+++|..+ T Consensus 228 e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~~~---~~~~d~d~~~~~~~~~~~~~-----i~~~~~d~d~~~~~~ 298 (474) T protein:vir:81 228 KPMMGLQDAGVRELARREGHMDVFSYPEFWLL-GADE---SALKNADGTIKSVWEARLGR-----IKGLPDDADADIPQL 298 (474) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhheee-cCCh---hhcccccccccchhhhhHHH-----HhcCCCccccccccc Confidence 58999999999999999888889999988761 1100 00112233322211100000 1112233333222 Q ss_pred ---ceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 381 ---EIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKEL 456 (589) Q Consensus 381 ---~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~l 456 (589) .+-|++ ..++.|.+.++.++.++...+++|...||+...++. .|+.|++....+..+|+++++..|..+++++ T Consensus 299 ~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np---~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~ 375 (474) T protein:vir:81 299 ARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNP---TSAESYDASQYELIAEAEGAVDDFTPALRKA 375 (474) T ss_pred ccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234544 678889999999999999999999999997533332 3889999999999999999999999999999 Q ss_pred HHHHHHHHhhcC-cccCc--ccceeeeCCcCCCCCCHHHHHHHHHHHhccc--hhhHHHHHHHhCCCCCHHHHHHHHHHH Q lcl|NC_020883. 457 YESCLWLLNDQD-SSIRI--EEPNIETQDMILKPRAELVAENMAAYAASKQ--GQSLETTVRRMNPDASEDWIQEEIARI 531 (589) Q Consensus 457 i~~~l~L~~~~~-~~~~~--e~p~I~f~D~lPvde~El~~A~t~~~l~~a~--~~S~etaVr~Lhpdw~dE~v~eEv~RI 531 (589) +++++.+..... ..... ....+.|.|.-..... +.|..+..+.+++ +.+.++..++ |.++++++++..+.. T Consensus 376 ~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a--~~aDa~~Kl~~a~~~~~~~~~~~~~--lg~t~~~i~~~~~~~ 451 (474) T protein:vir:81 376 FIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKS--AQADAGMKQLAAVPWLAETEVGLEL--IGLTPQQARRAMADK 451 (474) T ss_pred HHHHHHHhCCCCccccchhhccceeEecCCCccCHH--HHHHHHHHHHhcccCCCcHHHHHhh--cCCCHHHHHHHHHHH Confidence 999988764321 11111 2356889986554433 4466666666665 3445555554 468887665544444 Q ss_pred HhhccccccccccccccccccccCcccCCCCCC Q lcl|NC_020883. 532 EEEQAGSDTSSLMGINQTFEQMNDNRDEDGNII 564 (589) Q Consensus 532 ~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~ 564 (589) ..++++. .+. ..++...+.++.+ T Consensus 452 ~~~~~~~---~~~-------~l~~~~~~~~~aq 474 (474) T protein:vir:81 452 RRVQGRG---TLQ-------ALIDRSNNGATAQ 474 (474) T ss_pred HHHhHHH---HHH-------HHHhcCCCCCCCC Confidence 4444321 111 1123222222222 No 73 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.93 E-value=6e-26 Score=158.86 Aligned_cols=396 Identities=12% Similarity=-0.015 Sum_probs=242.6 Q ss_pred ccceeccchhHHHHhhcchhhhhh---hhhcCCccc--cCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhh Q lcl|NC_020883. 2 IDWTVRGWTDKTTKNVHGDYERYR---QLYEGKHEL--LFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIA 76 (589) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~r---~l~~g~~~~--~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~ 76 (589) .+ .-.-++.++..-..+.||+ +.|+|+|.. +-+.+..-+. ....+++|+++.|+ T Consensus 1 ~~---~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~------------------~~~~~v~nw~~~iV 59 (409) T protein:vir:16 1 MT---EKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALS------------------QQYRSILGWCAKGV 59 (409) T ss_pred CC---HHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHH------------------HHHhhhcChhHHHH Confidence 11 1122334444444445555 569999864 1111111111 11236789999999 Q ss_pred ccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH Q lcl|NC_020883. 77 EIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH 156 (589) Q Consensus 77 ~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~ 156 (589) +..|+.+. +- +|. .++ +-+.++..-|+|..+......++ T Consensus 60 ds~a~rl~--~~----Gf~-------------------~~d----------------~~l~~i~~~N~ld~~~~~~~~~a 98 (409) T protein:vir:16 60 DSLADRLV--FR----EFE-------------------NDD----------------FTVNEIFEENNPDIFFDSTVLSA 98 (409) T ss_pred HHhHhhcc--cc----ccc-------------------Ccc----------------hHHHHHHHhcChhHHHHHHHHHH Confidence 99877551 11 111 000 01245666788888888889999 Q ss_pred HHcCceeEEEEEecCc-eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEee-eccccceeehhhhccccccch Q lcl|NC_020883. 157 QVDGGIVAAPVIDELG-PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRER-VEKDGLRTTNMLYPVVKAKGD 234 (589) Q Consensus 157 ~v~Gg~~~~~~~~~~~-~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~-~~~~~~~~~~~~y~~~~~~~~ 234 (589) ++-|.....++-++++ ++|.+.+|.+.+. +|.... . +-...++.+ .+..+..+...+|- .+. T Consensus 99 l~yG~sf~~v~~~~dg~~~i~~~sP~~~~~---------i~D~~~-~--~~~~a~~~~~~d~~~~~~~~~~~~----~~~ 162 (409) T protein:vir:16 99 LIASCSFTYISKGENDAVRLQVIEATNATG---------IIDPIT-G--LLTEGYAVLERDENNNVVLEAHFL----PDR 162 (409) T ss_pred HHhCceeEEEecCCCCceEEEEEcccceEE---------Eeeccc-c--cceeeeEEEEecCCCceEEEEEEe----cCc Confidence 9999988888876444 8999998887775 442211 1 111111111 12233333333331 011 Q ss_pred hheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcch-hhhhHHHHHHH Q lcl|NC_020883. 235 VKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISAL-DNLESKQDEIN 313 (589) Q Consensus 235 ~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~-~~ie~l~DeLd 313 (589) +......+ .....++.+..++.||.++|++...+++|+|++ +.+.+++|++| T Consensus 163 ~~~~~~~~---------------------------~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~ 215 (409) T protein:vir:16 163 TDYYYRDS---------------------------RNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAK 215 (409) T ss_pred EEEEEecC---------------------------ccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHH Confidence 11100000 001123456677889999999999999999999 57999999999 Q ss_pred HHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHH Q lcl|NC_020883. 314 WTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGD 392 (589) Q Consensus 314 ~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh 392 (589) +++++.....+.++.|...+- | .+.+++...... .....+-..+.+++|..+.+-||+ ..++.| T Consensus 216 r~~~~~~~~~e~~a~pqr~i~--------G--~d~d~~~~~~~~-----~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~ 280 (409) T protein:vir:16 216 RTLERADVTAEFYSFPQKYVT--------G--LSDDAEPMETWK-----ATVSSMLQFTKDEDGDKPTLGQFTQPSMSPF 280 (409) T ss_pred HHHHHHHHHHHHhcChhheeE--------e--cCCCCCccchhh-----hhhhHhhccCCCCCCCCceEEecCCCChhHH Confidence 999999999999999988761 2 234443222111 111112233445666667776766 567899 Q ss_pred HHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc-c Q lcl|NC_020883. 393 MDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSS-I 471 (589) Q Consensus 393 ~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~-~ 471 (589) .+.++.+++++..++++|..+||.... . ..|+.|++..+.+..+|+++++..|-.++++++++++.+....+.. - T Consensus 281 ~~~l~~~~~~~a~~s~lP~~~lg~~~~-N---psSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~ 356 (409) T protein:vir:16 281 TEQLRTAAAGFAGETGLTLDDLGFVSD-N---PSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLRE 356 (409) T ss_pred HHHHHHHHHHHhhhcCCCHHHcccccC-c---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccch Confidence 999999999999999999999996332 1 2489999999999999999999999999999999988876543221 1 Q ss_pred CcccceeeeCCcCCCCCC-HHHHHHHHHHHhccc--hhhHHHHHHHhCCCCCHHH Q lcl|NC_020883. 472 RIEEPNIETQDMILKPRA-ELVAENMAAYAASKQ--GQSLETTVRRMNPDASEDW 523 (589) Q Consensus 472 ~~e~p~I~f~D~lPvde~-El~~A~t~~~l~~a~--~~S~etaVr~Lhpdw~dE~ 523 (589) ......|.|.+..+.+-. .-+.|..+..+.+++ ++.+++...+++ +++.. T Consensus 357 ~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g--~~~~d 409 (409) T protein:vir:16 357 QFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTG--IKGAE 409 (409) T ss_pred hhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhcc--CCCCC Confidence 112356889987654432 222344444455554 333566666664 33222 No 74 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.19 E-value=6.7e-11 Score=76.36 Aligned_cols=512 Identities=10% Similarity=0.068 Sum_probs=207.5 Q ss_pred Ccccee-------------------------------------ccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHH Q lcl|NC_020883. 1 MIDWTV-------------------------------------RGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLI 43 (589) Q Consensus 1 ~~~~~~-------------------------------------~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~ 43 (589) |.|-.- +-+-.+....+..+++.+++ + |+ +-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~---------~-r~-~a~ 69 (776) T protein:vir:93 1 MFDLNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQD---------N-RA-EMA 69 (776) T ss_pred CCCccccccccccccccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchH---------H-HH-HHH Confidence 111111 11111222222222222111 1 11 111 Q ss_pred hhccccceeccCcceee-------ecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccc Q lcl|NC_020883. 44 EEGDAVGRFLDSSQTAR-------ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQD 116 (589) Q Consensus 44 ~~~~~~~~~~~~~~~~~-------~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 116 (589) +... |.+|+||-. ....|-+++|+.+.+++. .+ |....+-+. -. +.|.+ T Consensus 70 ~d~~----fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~i~~--v~-----g~~~~nr~~------------~~-~~p~~ 125 (776) T protein:vir:93 70 VDED----YYDNIQWSQDEIDELKERGQAPTVYNVISQSVNW--II-----GSEKRGRSD------------FK-VLPRR 125 (776) T ss_pred HHHH----HhCCCCCCHHHHHHHHhcCCceEEecchHHHHHH--HH-----HHHHhCCcc------------eE-EecCC Confidence 2222 334666632 345677999999888876 33 322222110 00 11211 Q ss_pred cchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcCceeEEEEEec----CceeEEEecCceecccc----- Q lcl|NC_020883. 117 EEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDE----LGPRIVFKARDVYFPHD----- 187 (589) Q Consensus 117 ~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~----~~~~i~f~~~d~~~P~~----- 187 (589) . +.. +.+++=.++++.+...|+........+.++++.|=.+.+++++. +.+++..+++..+|+-- T Consensus 126 ~---~d~---~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~~Dp~a~~~ 199 (776) T protein:vir:93 126 K---DGG---KAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAGAESWRNILWDSTYRRL 199 (776) T ss_pred h---hHH---HHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEeeccChhheeeccccccC Confidence 1 111 22333355888899999999999999999999998889999983 22555666776666621 Q ss_pred cCcceeEEEeecCCCccceE---------------EEEEeee----------c-------------------cccceeeh Q lcl|NC_020883. 188 DEKGADLAYYIDHGQYGQFL---------------HIYRERV----------E-------------------KDGLRTTN 223 (589) Q Consensus 188 d~~~~div~~~e~~~~~~~l---------------~~~~~~~----------~-------------------~~~~~~~~ 223 (589) |..-|.++++..+-..+.+. |.+..+. . ...+++ . T Consensus 200 D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v-~ 278 (776) T protein:vir:93 200 DMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRM-I 278 (776) T ss_pred CHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccccccccccccccccccccccccccCCCeEEE-E Confidence 22234444433321111000 0000000 0 000000 0 Q ss_pred hhhccccccchhheeeccccc-ccccccccccchhhhhhcccCC---------------cc-ccccccccCCCCcceEEE Q lcl|NC_020883. 224 MLYPVVKAKGDVKKEIKKGEL-VTNVEGAEDLEGEELIREVLNI---------------PD-DRPLENFYPGRNRPFISY 286 (589) Q Consensus 224 ~~y~~~~~~~~~~~~~~~gd~-~~~~~e~~d~e~e~~i~~~i~i---------------p~-~~e~~~i~TGv~~plvvy 286 (589) ..|........+... ..|+. ..+.......+........+.+ +. +......+.+-..|+|++ T Consensus 279 E~~~r~~~~~~~~~~-~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~ 357 (776) T protein:vir:93 279 EAWFRMPVRVQRLKG-RNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPI 357 (776) T ss_pred EEEEeeeeehhhccc-ccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhccCCCCCCCccceEEe Confidence 011000000000000 00000 0000000000000000000000 00 011122234444556655 Q ss_pred ecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccc Q lcl|NC_020883. 287 WANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHR 366 (589) Q Consensus 287 vPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~ 366 (589) +......+-+|.|.+..+.+.++.+|.+.|+...++ ++.+++++.+.+...... ++. + ...+... T Consensus 358 -~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l---~~~~~~~~~gav~~~d~~-~~~---~-~rp~~vi------ 422 (776) T protein:vir:93 358 -WGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL---STNKVLMEEGAVDDIDEF-RRE---A-ARPDAVM------ 422 (776) T ss_pred -cCceecccccccchHHhhhHHHHHHHHHHHHHHHhh---cCCceeeccccccchHHH-HHh---c-ccCCcee------ Confidence 555566666899999999999999999988776664 778899998887643110 110 0 0000000 Q ss_pred cccccccccccCccceee-ecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHH Q lcl|NC_020883. 367 DMEITTFDENGRSMEIHQ-IDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRL 445 (589) Q Consensus 367 dlev~~~de~g~~~~~iq-~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~ 445 (589) .+.+. .-..+.+.. .+ -..++++.+..+...|-.+++.+..++|... .+.||+|+..+........... T Consensus 423 --~~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~-----n~~Sg~ai~~~~~~~~~~~~~~ 492 (776) T protein:vir:93 423 --TVKNG--KLGAVKMDVDRD-LAPAHLELASRSIQMIQQVGGVTDEMLGRTT-----NAVSGVAIQARQEQGSVATNKL 492 (776) T ss_pred --eeCCc--cccccccccCcC-ccHHHHHHHHHHHHHHHHhhCcChHHhCCCc-----chhhHHHHHHHHHHHHHHHHHH Confidence 00000 000111111 11 2467888888888888889999999999532 2458888776665555555555 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCc---ccCcc-----------------------cceeeeCCcCCC-CCCHHHHHHHHH Q lcl|NC_020883. 446 QKEYIDFLKELYESCLWLLNDQDS---SIRIE-----------------------EPNIETQDMILK-PRAELVAENMAA 498 (589) Q Consensus 446 R~~~~~aLk~li~~~l~L~~~~~~---~~~~e-----------------------~p~I~f~D~lPv-de~El~~A~t~~ 498 (589) -..+..+++++.+.++.|-..+.. .+.+. +-+|....+.-. ..++...+.+++ T Consensus 493 ~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~q 572 (776) T protein:vir:93 493 FDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELME 572 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHH Confidence 566677788877777666654311 01000 002222222111 113333444555 Q ss_pred HHhccchhhHH---HHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCc Q lcl|NC_020883. 499 YAASKQGQSLE---TTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSA 575 (589) Q Consensus 499 ~l~~a~~~S~e---taVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~ 575 (589) ++......... ..+..+-+.-.-+++. ++|+.-+...+|...-.- ....-....+. ....-..+.... T Consensus 573 l~~~~~p~~~~~~~~~~~e~~d~p~~~e~~---~~l~~~~~~~~p~q~~~~--~e~~~~qq~q~----~~~q~q~~~~~a 643 (776) T protein:vir:93 573 VIGKMPPEIALTMLDLLVENMDIPNRDELV---KRIRAVNGQKDPDQDEPT--PEEIAREQAQQ----QQQQYNDALAIA 643 (776) T ss_pred HHhhcChhhHHHHHHHHHHhcCccchHHHH---HHHHHhhcccccchhhcc--hhHHHHHHHhh----HHHHHHHHHhhh Confidence 54332221111 1111111111122222 333333222221111100 00000000000 000000000000 Q ss_pred chhhhhhc-------------ccccCC Q lcl|NC_020883. 576 EENEEIEK-------------EGEPIA 589 (589) Q Consensus 576 ~~~e~~~~-------------~~~~~~ 589 (589) .-.++..+ ...... T Consensus 644 ~~~~~qa~a~~~~aea~~~~aqa~~~~ 670 (776) T protein:vir:93 644 TLEEQQAKARKAAAEAQVAEAKAKHIS 670 (776) T ss_pred hhhHhhHHHHHHHHHHHHHhhhhhhhh Confidence 00000000 000000 No 75 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=98.97 E-value=2.6e-09 Score=67.64 Aligned_cols=425 Identities=14% Similarity=0.087 Sum_probs=176.6 Q ss_pred Ccc--------cee-ccch---hHHHHhhcc-----hhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecC Q lcl|NC_020883. 1 MID--------WTV-RGWT---DKTTKNVHG-----DYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQ 63 (589) Q Consensus 1 ~~~--------~~~-~~~~---~~~~~~~~~-----~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~ 63 (589) -++ ..+ -|+- |...-+..| +|...+.||.+ T Consensus 11 ~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~--------------------------------- 57 (461) T protein:vir:80 11 KIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYAS--------------------------------- 57 (461) T ss_pred hhhhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHh--------------------------------- Confidence 000 000 0110 111111111 11112222211 Q ss_pred cceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhc Q lcl|NC_020883. 64 TPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNS 143 (589) Q Consensus 64 ~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~ 143 (589) ++ |.+-+|++||..+-|---.+++ +.++. .+.+++..+.- T Consensus 58 ~~-----l~r~iVd~~a~d~~r~g~~i~~-------------~~~~~----------------------~~~~~~~~~~l 97 (461) T protein:vir:80 58 NS-----IAMNIVDIISEDMVRAGWSLKT-------------DNKEM----------------------KKNIESKWRKL 97 (461) T ss_pred CC-----ccchhhccchHHhhcCCeeeec-------------CCHHH----------------------HHHHHHHHHHh Confidence 11 3466888998888443222221 11111 12233334434 Q ss_pred cccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeeh Q lcl|NC_020883. 144 KLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTN 223 (589) Q Consensus 144 ~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~ 223 (589) +++.++.+.+....+.||.+.-+.+.+.+.+ .++..-|-+-++--.|+ ||++|-.. .+... T Consensus 98 ~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~----~~~~~~pl~~~~~~~~~----------~l~~~~~~----~i~~~- 158 (461) T protein:vir:80 98 KTKDRFQKLYADKRLYGDGFLSIGVVSSNRE----QADLSTAIDPKTIKSIP----------YINTFNTQ----KVTQL- 158 (461) T ss_pred hHHHHHHHHHHhhcccccEEEEEEeecCCcc----ccCccCCccccccccee----------EEEecccc----ccchh- Confidence 4455555555555566666555555443331 22222332111111233 34443111 11000 Q ss_pred hhhcc----ccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccC Q lcl|NC_020883. 224 MLYPV----VKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGI 299 (589) Q Consensus 224 ~~y~~----~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~ 299 (589) ..... .-.+...+.+...+.... + ...+..+.+ ... +-.--|+|+.|.+.....+|+ T Consensus 159 ~~~~dp~sp~fg~P~~y~i~~~~~~~~-------~----~~~~~~~~~----~~~----iH~SRii~~~~~~~~~~~~G~ 219 (461) T protein:vir:80 159 YLNQDMFSEHFGEVEFFEVNRVSQLGE-------E----ILSGTTAST----SEQ----IHRSRIIHEQGLRFEGETKGR 219 (461) T ss_pred hhcccCcCcccccceEEEEeccccccc-------c----ccccccCcc----ceE----EccccEEEecCCCCCccccCc Confidence 00000 000000011111000000 0 000000000 000 111226667787777788899 Q ss_pred cchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCc Q lcl|NC_020883. 300 SALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRS 379 (589) Q Consensus 300 SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~ 379 (589) |.++.+.+.+...+.+.-....++-+..-+.+.++. |....+ +..++....+ ... .+...+.+...+ T Consensus 220 S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~--l~~~~~---~~~~~~~~~~--~~~-~~~~g~~~~d~~----- 286 (461) T protein:vir:80 220 SIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDD--IDALNK---DDKANLTAML--DFM-FRTEALAIIKGD----- 286 (461) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecch--HHhhhc---hHHHHHHHHH--HHh-cCCceEEEEcCC----- Confidence 999999999999998876666666554555454431 211110 1111111111 100 011112222211 Q ss_pred cceeeecccHHHHHHHHHHHHHHHHHHhcCCchh-cccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_020883. 380 MEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKA-VDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDFLKELY 457 (589) Q Consensus 380 ~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~A-Fg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~aLk~li 457 (589) -++-+.+..+..-...++.+..+|.+.+++|..- ||.- .++-+||....+.+..+ |+++| ..+...|.+++ T Consensus 287 e~~e~~~~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s----~g~~asge~D~~~yyd~---i~~~qe~~l~p~le~l~ 359 (461) T protein:vir:80 287 EQLTKESTNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQE----AGTLTGAQYDVMNYYAR---VSSIQENRLRPQLEYLT 359 (461) T ss_pred cceEEEecCcCCHHHHHHHHHHHHhhhhcCCeeeeeccc----CCccccchHHHHHHHHH---HHHHHHHHHHHHHHHHH Confidence 2244556666677888999999999999999864 5542 22223555544444444 45555 34567777777 Q ss_pred HHHHHHHhhcCcccCc--ccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhc Q lcl|NC_020883. 458 ESCLWLLNDQDSSIRI--EEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQ 535 (589) Q Consensus 458 ~~~l~L~~~~~~~~~~--e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~ 535 (589) .++++-.-..+..+.+ .+..|.|++..+.+++|+ |++.+.... +..+++. ++-.+.+++.++++ ... T Consensus 360 ~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kek--Ae~~~~~a~----a~~~~~~--~g~is~~e~r~~l~---~~~ 428 (461) T protein:vir:80 360 RLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTD--AEVRKLTAE----ADQIYIV--NGVLDPDEVKETRF---GRF 428 (461) T ss_pred HHHHHHhcccccccCccccceEEEeCCCCCCCHHHH--HHHHHHHHH----HHHHHHh--cCCCCHHHHHHHHH---Hhc Confidence 6655433222222333 245699999887776655 666655332 2233332 23466666666553 111 Q ss_pred cccccc-cccccccccccccCcccCCCCCCCCCCCCCCCCc Q lcl|NC_020883. 536 AGSDTS-SLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSA 575 (589) Q Consensus 536 a~~~p~-~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~ 575 (589) ..++. .+.+..+ +.++-..-.++...+|+++| T Consensus 429 -~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 429 -GLENSSKFSGDSA-------EIDKLAKLVYDAYAKKNADG 461 (461) T ss_pred -CCCCCccCCCCCc-------hhhhhhhhccccccccCCCC Confidence 11111 1111100 00000001111222233333 No 76 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.79 E-value=3.5e-08 Score=61.48 Aligned_cols=431 Identities=10% Similarity=0.011 Sum_probs=186.3 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhcc-ccceec---cCcceeeecCcceEEEEcchhhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGD-AVGRFL---DSSQTARETQTPYVIFNLPKVIA 76 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~-~~~~~~---~~~~~~~~~~~~y~~~n~~~~i~ 76 (589) |- |.-=.- -....+-.+...|.+|.|..+ +...|+ |.-+.. +..|-+.-.|..|. |+++-++ T Consensus 1 m~---V~~~hp-~y~a~~~~W~~~rd~~~G~~~--------~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~--n~~~~t~ 66 (452) T protein:vir:94 1 MP---IETKHP-EYLAYENDWIDCRVASLGQRE--------VKKKGVRFLPKLSGQTDDMYNAYKQRALFY--SITSKTL 66 (452) T ss_pred CC---CCCcCH-HHHHHHHHHHHHHHHhcChHH--------HHcCCcccCCCCCCCCHHHHHHHHhhccCC--chHHHHH Confidence 21 111011 111223345566678888533 111111 222221 11112222333333 6665554 Q ss_pred ccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH Q lcl|NC_020883. 77 EIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH 156 (589) Q Consensus 77 ~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~ 156 (589) .. .+|.+-.. ++.-+.+..++ ...+|.+.--++++++. .+..+ T Consensus 67 ~~-------~~G~vf~k--------~p~~~~p~~l~-------------~~~~D~~G~~L~~~~~~---------~~~~~ 109 (452) T protein:vir:94 67 SA-------LSGMVLDQ--------PPVITHPDAMS-------------KYFEDQSGIQFYEVFTR---------AVEET 109 (452) T ss_pred HH-------HhchhhcC--------CceecccHHHH-------------HHHhcccCCCHHHHHHH---------HHHHH Confidence 43 34444331 11111111110 01123333333333333 44456 Q ss_pred HHcCceeEEEEEecC--ceeEEEecCceeccc---ccCcceeEEEeecCC---Ccc----ceEEEEEeeeccccceeehh Q lcl|NC_020883. 157 QVDGGIVAAPVIDEL--GPRIVFKARDVYFPH---DDEKGADLAYYIDHG---QYG----QFLHIYRERVEKDGLRTTNM 224 (589) Q Consensus 157 ~v~Gg~~~~~~~~~~--~~~i~f~~~d~~~P~---~d~~~~div~~~e~~---~~~----~~l~~~~~~~~~~~~~~~~~ 224 (589) ++.|++..-+=+... -+++.++.|.+.+=- .+++-+-+++.+... ..+ +.+..||.+.-.+|. ..-+ T Consensus 110 l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~-~~v~ 188 (452) T protein:vir:94 110 LLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGL-LQIT 188 (452) T ss_pred HhcCeEEEEEeeccCCCceEEEEechhhhcCccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCe-EEEE Confidence 778877765544422 377777777665531 222223233321100 011 112223322100111 0111 Q ss_pred hhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhh Q lcl|NC_020883. 225 LYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDN 304 (589) Q Consensus 225 ~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ 304 (589) +|+ ...|..+...++.. + ..-.+.+....||++........ .|.+.+-+ T Consensus 189 ~~~-----------~~~~~~~~~~~~~~--------------~-----~~~~~~l~~IP~v~~~~~~~~~~-~~~pPLl~ 237 (452) T protein:vir:94 189 VHE-----------TQDGKVWELAKTST--------------I-----QNVGVTMDYIPFFCITPSGLSMT-PAKPPMID 237 (452) T ss_pred EEE-----------ccCCceeeecccee--------------e-----cCCCcccceeEEEEEcCCCCCCC-CCccchHH Confidence 221 11222222211110 1 11124556556667665444333 47788888 Q ss_pred hhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceee Q lcl|NC_020883. 305 LESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQ 384 (589) Q Consensus 305 ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq 384 (589) |..+--++-..-|....++-..+.|.+.+. |. +....+ ...+..++..++.|..+.|++ T Consensus 238 LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~--------g~--~~~~~i-----------~iG~~~~~~lpe~~~~~~yie 296 (452) T protein:vir:94 238 IVDINYSHYRTSADLEHGRHFTGLPTPWIT--------GA--ESQSTM-----------HIGSTKAWVIPEVAAKVGFLE 296 (452) T ss_pred HHHHHHHHhcchhHHHHHHHHcccceeEee--------cC--cCCCce-----------EecccccccCCCCCCcceEEc Confidence 888877777777777777777788877653 11 111111 112223455566566688999 Q ss_pred ecccH-HHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 385 IDISK-IGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWL 463 (589) Q Consensus 385 ~Dirv-eeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L 463 (589) ++.+- ..|++.++.|.++|..+. ...+... +.+..|+.|...+.....+--..+-..+.++++++++++... T Consensus 297 ~~g~~i~~~~~~l~~le~~m~~~G---a~ll~~~----~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w 369 (452) T protein:vir:94 297 FTGQGLQSLEKALSEKQAQLASLS---ARLIDNS----TRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDM 369 (452) T ss_pred cCchhHHHHHHHHHHHHHHHHHHH---HHhhccC----CCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99765 679999999999986432 2333211 111223333322222111112222233446666666443332 Q ss_pred HhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh----CCCCCHHHHHHHHHHHHhhccccc Q lcl|NC_020883. 464 LNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM----NPDASEDWIQEEIARIEEEQAGSD 539 (589) Q Consensus 464 ~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L----hpdw~dE~v~eEv~RI~~E~a~~~ 539 (589) ... +....+ .++-+|.+. +.+..+ ...+.+ +..+|.+|.+|..+.| -++|++| .+||..|..+.. T Consensus 370 ~g~-~~~~~v-~~n~dF~~~-~~~~~~--~~al~~-~~~~G~is~~t~~~~L~~~gvl~~~~e-----~~~i~~E~~~~~ 438 (452) T protein:vir:94 370 ESM-GGTLNI-KLNSAFLDS-KLTAAE--LKAWVE-AYLSGGISKEIYIHALKVGKVLPPPGE-----SMGVIPDPPAPE 438 (452) T ss_pred cCC-CCceEE-Eeccccccc-cCCHHH--HHHHHH-HHhcCCCcHHHHHHHHHhCCCCCCccC-----HHHHHHHhhccC Confidence 211 111111 122334321 223222 223333 4577899999998877 3556543 356766655432 Q ss_pred cccccccccccccccCcccCCCCCCCCCCCC Q lcl|NC_020883. 540 TSSLMGINQTFEQMNDNRDEDGNIIEEGDTE 570 (589) Q Consensus 540 p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ 570 (589) |.+.+ .|.+-|++- T Consensus 439 ~~~~~-----------------~~~~~~~~~ 452 (452) T protein:vir:94 439 PSPSN-----------------TPPNPSSKA 452 (452) T ss_pred cccCC-----------------CCCCCccCC Confidence 22222 133333333 No 77 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=98.77 E-value=4.8e-09 Score=66.20 Aligned_cols=495 Identities=9% Similarity=0.016 Sum_probs=177.5 Q ss_pred Cccceeccch-hHH--HHhhcc-hhhhhh----hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcc Q lcl|NC_020883. 1 MIDWTVRGWT-DKT--TKNVHG-DYERYR----QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP 72 (589) Q Consensus 1 ~~~~~~~~~~-~~~--~~~~~~-~~~~~r----~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~ 72 (589) |-+=.+..-. .++ -+++|+ -+.+++ +-|.|+... .+ .. ..+.+ .++.+...+ T Consensus 10 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~---~~----~~--------~~s~~----~~~~v~~~v- 69 (705) T protein:vir:88 10 MDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFG---NE----RP--------GKSGI----VSRDVQETV- 69 (705) T ss_pred CCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCC---cc----cC--------CCCcc----ccHHHHHHH- Confidence 1111111110 111 133344 222222 445565321 10 00 01111 111111111 Q ss_pred hhhhccchhhhccccccccccccCCcccchhhccchhhc-ccccccchhhhhhhhhhhhhhhhHHHHHHhhc-cccccch Q lcl|NC_020883. 73 KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMI-EGPQDEEEAGKNENNTVIDLQNEIIEQITKNS-KLERRHW 150 (589) Q Consensus 73 ~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~-~~~~~~~ 150 (589) ...++...-.|. ..+.++ +-|.. .+....++.-.+.++-++.++ ....-+. T Consensus 70 -----------~~~~~~l~~~~~----------~~~~~~~~~p~~------~~D~~~a~~~~~~~~~~~~~~~~~~~~~~ 122 (705) T protein:vir:88 70 -----------DWIMPSLMKVFT----------SGGQVVKYEPDT------AEDVEQAEQETEYVNYLFMRKNEGFKVMF 122 (705) T ss_pred -----------HHHHHHHHHhhc----------CCCceEEEeeCC------hhHHHHHHHHHHHHhHHHhhccchhHHHH Confidence 111111111111 011111 11211 123344555566776654443 2234455 Q ss_pred hhHHHHHHcCceeEEEEEec-------------------------------------------------CceeEEEecCc Q lcl|NC_020883. 151 SNIVQHQVDGGIVAAPVIDE-------------------------------------------------LGPRIVFKARD 181 (589) Q Consensus 151 ~~l~~~~v~Gg~~~~~~~~~-------------------------------------------------~~~~i~f~~~d 181 (589) +.+-+++..|--+.+++|+. +.++|.-+++. T Consensus 123 ~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~ 202 (705) T protein:vir:88 123 DWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPE 202 (705) T ss_pred HHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHH Confidence 66666666666678888853 33555556666 Q ss_pred eecccccCc---ceeEEEeec-CCCccc----e-E---EEEEeee-ccccceeehhhhccccccchhhe----eecccc- Q lcl|NC_020883. 182 VYFPHDDEK---GADLAYYID-HGQYGQ----F-L---HIYRERV-EKDGLRTTNMLYPVVKAKGDVKK----EIKKGE- 243 (589) Q Consensus 182 ~~~P~~d~~---~~div~~~e-~~~~~~----~-l---~~~~~~~-~~~~~~~~~~~y~~~~~~~~~~~----~~~~gd- 243 (589) -||++-+.+ -|.++++-. .+..+- | . -..++.+ +....... -+........... +..+.. T Consensus 203 d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e--~~~~~~~d~~~~~~~~~~~~~~~~ 280 (705) T protein:vir:88 203 NFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPE--RLVRDNFDMTGQLQYNSGDDAEAN 280 (705) T ss_pred HceecCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhh--hccccccccccccccccccccCCc Confidence 666432222 222222111 111000 0 0 0000000 00000000 0000000000000 000000 Q ss_pred -ccccccccc--ccchhh--------hhhcccCCccccccccccCCCC-cceEEEecCCCCCCCcccCcchhhhhHHHHH Q lcl|NC_020883. 244 -LVTNVEGAE--DLEGEE--------LIREVLNIPDDRPLENFYPGRN-RPFISYWANNETFMNPYGISALDNLESKQDE 311 (589) Q Consensus 244 -~~~~~~e~~--d~e~e~--------~i~~~i~ip~~~e~~~i~TGv~-~plvvyvPN~~~~~~~lG~SD~~~ie~l~De 311 (589) .+.+.+... ++.+.. .+++.+ . .+. ... .|+++ ++-.+..++.||.|.++.+.++++. T Consensus 281 r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~i--l------~~~-~~~~~PF~~-~~~~p~~~~~~G~g~~~~~~d~Q~~ 350 (705) T protein:vir:88 281 REVWASECYTLLDVDGDGISELRRILYVGDYI--I------SNE-PWDCRPFAD-LNAYRIAHKFHGMSVYDKIRDIQEI 350 (705) T ss_pred eeEEEEEeeeEecccCCcceeeEEEEEeCccc--c------ccc-cCCCCCEEE-ecceeecCccccCChHHHHhHHHHH Confidence 011111100 011000 001100 0 001 112 34444 5777888999999999999999999 Q ss_pred HHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHH Q lcl|NC_020883. 312 INWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIG 391 (589) Q Consensus 312 Ld~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirvee 391 (589) ||..+++....+-++++|++.||.+++... .......| .+.... .+..+.++...--..+ T Consensus 351 ~n~~~~~~~d~~~~~~~~~~~~~~g~v~~~--------d~~~~~pg-----------~vv~~~-~~~~i~~~~~~~~~~~ 410 (705) T protein:vir:88 351 RSVLMRNIMDNIYRTNQGRSVVLDGQVNLE--------DLLTNEAA-----------GIVRVK-SMNSITPLETPQLSGE 410 (705) T ss_pred HHHHHHHHHHHHHhccCCceeccccccCcc--------cccccCCC-----------eeEEec-CCCccccccCCcCcHH Confidence 999999998888888999999999887421 11111111 111111 1122444444444455 Q ss_pred HHHHHHHHHHHHHHHhcCCchhcccccCc-ccchhHHHHHHHHHhhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCc Q lcl|NC_020883. 392 DMDHVKNLIKLMLIETQTSEKAVDFYLDG-GASGAQSGVAKFYDLLTTILKSRRLQKEYI-DFLKELYESCLWLLNDQDS 469 (589) Q Consensus 392 h~~~ie~L~~~Il~~a~ts~~AFg~~~~~-g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~-~aLk~li~~~l~L~~~~~~ 469 (589) ++..+..+...+-.+++.+..+.|...+. +.+.|.++++.. +..--......-+.|+ .+++++++..++|...+.. T Consensus 411 ~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~--~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~ 488 (705) T protein:vir:88 411 VYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQL--MTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQN 488 (705) T ss_pred HHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 66667777777778899999999853211 112233344332 2222223333334454 4567776666665554322 Q ss_pred c----------cCcc--c----ceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHh Q lcl|NC_020883. 470 S----------IRIE--E----PNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEE 533 (589) Q Consensus 470 ~----------~~~e--~----p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~ 533 (589) . +.+. . -+|...-++-....+...+.+..++.. .+.....-.++|..+...+.+-++++.. T Consensus 489 ~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~---~q~l~~~~~~~~~~~~~~~~~~~~el~e 565 (705) T protein:vir:88 489 QEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEM---AQAVVGGGGLGVLVSEQNLYNILKEVTE 565 (705) T ss_pred CceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHH---HHHhhcccchhhhcChHHHHHHHHHHHH Confidence 1 1110 0 011111111112233333343333221 0111111122233344433333333322 Q ss_pred hccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc-------cc-CC Q lcl|NC_020883. 534 EQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG-------EP-IA 589 (589) Q Consensus 534 E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~-------~~-~~ 589 (589) ......+.-+......+. ..+. ... ....++.. ..++..... |- .+ T Consensus 566 ~~~~k~~~~~~~~~~~~e----~~~~---~~~--~~q~e~~~-~~~~~~~q~e~~k~q~e~~~~ 619 (705) T protein:vir:88 566 NAGYKDPDRFWTNPNSPE----ALQA---KAI--REQKEAQP-KPEDIKAQADAQRAQSDALAK 619 (705) T ss_pred hhhhhhHHHHhhhhhhHH----HHHH---HHh--hhhhhhhH-HHHHHHHHHHHHHHHHHHHHH Confidence 211101111110000000 0000 000 00011100 000000000 00 00 No 78 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=98.64 E-value=1.6e-07 Score=57.87 Aligned_cols=502 Identities=12% Similarity=0.058 Sum_probs=182.1 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhh---cCCccccC--HHH-HHHHhhccccceeccCcceee--ecCcceEEEEc- Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLY---EGKHELLF--PRA-KRLIEEGDAVGRFLDSSQTAR--ETQTPYVIFNL- 71 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~---~g~~~~~f--~ra-~~~~~~~~~~~~~~~~~~~~~--~~~~~y~~~n~- 71 (589) |+| ++++...|+..+.+-+... +-+..+-+ .++ .+++ +|.......+.... .-++- ++.|- T Consensus 15 ~~~------~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~---~y~~~~~~~~~~~~~~~~rs~-~~~~~v 84 (651) T protein:vir:80 15 YDE------THDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQ---DYLRDQVLRSVGDVNADWRHK-ITTGKA 84 (651) T ss_pred hhh------hHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHH---HhhccccccccCCCCCCCCcc-ccChhH Confidence 444 6666666776666554332 11111100 000 0011 11111100000000 00111 11111 Q ss_pred chhhhccchhhhccccccccccccCCc-ccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHh----hcccc Q lcl|NC_020883. 72 PKVIAEIPATMVSGSIGQIKSSITTGE-IDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITK----NSKLE 146 (589) Q Consensus 72 ~~~i~~~pa~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~k----n~~~~ 146 (589) ...+--+=|.|+. .++|++. +.-.+..+.+ -.....+.++.++. .|+|. T Consensus 85 ~~~ve~~~~~l~~-------~~~~~~~~~~~~p~~~~d-------------------~a~~~~~~~~~~~~~~l~~~~~~ 138 (651) T protein:vir:80 85 FEAIETIHAYLMS-------ATFPNKNWFDVVPAKPGQ-------------------DNLLVSRLIKRYVQDKLTEGKFR 138 (651) T ss_pred HHHHHHHHHHHHH-------hhcCCCceeEeccCCchh-------------------HHHHHHHHHHHHHHHHhhccCcH Confidence 1111111112221 1122111 0001111111 01112345555544 67777 Q ss_pred ccchhhHHHHHHcCceeEEEEEec--------------------------------CceeEEEecCceecccccCc---c Q lcl|NC_020883. 147 RRHWSNIVQHQVDGGIVAAPVIDE--------------------------------LGPRIVFKARDVYFPHDDEK---G 191 (589) Q Consensus 147 ~~~~~~l~~~~v~Gg~~~~~~~~~--------------------------------~~~~i~f~~~d~~~P~~d~~---~ 191 (589) ..+...+-++.+.|=++.+++|+- +.++|.-+++.-||++-..+ - T Consensus 139 ~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d 218 (651) T protein:vir:80 139 AAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNR 218 (651) T ss_pred HHHHHHHHhhcccCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccc Confidence 777777788888999999999862 23566667777777642222 2 Q ss_pred eeEEEeecCCCc--------cceEEE-EEeeeccc-----ccee-ehhhhccccccchhheeeccccccccccccc--cc Q lcl|NC_020883. 192 ADLAYYIDHGQY--------GQFLHI-YRERVEKD-----GLRT-TNMLYPVVKAKGDVKKEIKKGELVTNVEGAE--DL 254 (589) Q Consensus 192 ~div~~~e~~~~--------~~~l~~-~~~~~~~~-----~~~~-~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~--d~ 254 (589) |.++++...+.. +.|..+ ...+.++. .+-. ...-+++....+. .--..+.+.++.. +. T Consensus 219 ~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~-----~~~~~v~v~E~~~~~d~ 293 (651) T protein:vir:80 219 GAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLW-----SPHQNVELLEYWGDIHL 293 (651) T ss_pred cceeeeeeeeHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCcccc-----ccccceEEEEEEEEeec Confidence 333322222211 111100 00000000 0000 0000000000000 0000111221111 11 Q ss_pred chhhhhhcc-cCCccc--cccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcE Q lcl|NC_020883. 255 EGEELIREV-LNIPDD--RPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRI 331 (589) Q Consensus 255 e~e~~i~~~-i~ip~~--~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI 331 (589) +...+ ... +..... .........-..|+++ ++-.+..+++||+|..+.+.+.+..||..+.+......+.++|++ T Consensus 294 e~~~~-~~~~v~~~g~~il~~~~~~~~~~~Pf~~-~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~ 371 (651) T protein:vir:80 294 ENKTY-HDVVVTIMGNEVLRFEQNPYWCGRPFVI-GTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMY 371 (651) T ss_pred cCCce-EEEEEEEcCcEEecccccCCCCCCCeee-ecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcE Confidence 11000 000 000000 0001111222346655 466778889999999999999999999998888888889999999 Q ss_pred EechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020883. 332 SITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMDHVKNLIKLMLIETQTS 410 (589) Q Consensus 332 ~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~~ie~L~~~Il~~a~ts 410 (589) .||...+.+.... ...| |+... + .. +..+..+++. ......++.+..|...+-.+++.+ T Consensus 372 ~v~~d~~~~~~~l------~~~p--g~vi~--------~---~~-~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~ 431 (651) T protein:vir:80 372 TLRSDGLLQPEDV------YTEP--GKVFL--------V---SD-HGDLQPLANQSSNFSITYQESSFLESTIDKNFGTG 431 (651) T ss_pred EecCCccccHHHh------hcCC--CceEE--------e---cC-CCCceeeccCcccchhHHHHHHHHHHHHHHHhcCC Confidence 9986654432111 1111 11100 0 11 1112333332 234556667777777777788899 Q ss_pred chhcccccCcccchhHHHHHHHHH-hhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccc----------C------ Q lcl|NC_020883. 411 EKAVDFYLDGGASGAQSGVAKFYD-LLTTILKSRRLQKEYI-DFLKELYESCLWLLNDQDSSI----------R------ 472 (589) Q Consensus 411 ~~AFg~~~~~g~~~A~Sg~A~r~~-~~~~~~Kv~~~R~~~~-~aLk~li~~~l~L~~~~~~~~----------~------ 472 (589) ....|....+....|+++++.+.. ...++. .+-..+. ++++.+++.+++|....+..- . T Consensus 432 ~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~---~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~ 508 (651) T protein:vir:80 432 NYVGANAARSGERVTAAEVAAVREAGGNRLS---GIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYE 508 (651) T ss_pred hHHhCCCccchhhccHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccc Confidence 888886433322334467766532 222221 1222233 366777666655554332110 0 Q ss_pred ccccee--eeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccc Q lcl|NC_020883. 473 IEEPNI--ETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTF 550 (589) Q Consensus 473 ~e~p~I--~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l 550 (589) +.+.++ .++ .++..-.. ..+-.+. .+++.+....+.. .|..... ..+.++..+-... +.++... T Consensus 509 i~~~dl~~~~~-iv~~g~~~--~~~r~~~--~~~l~~~~q~~~~-~p~~~~~---~~~~~~~~~l~~~--~g~~~~~--- 574 (651) T protein:vir:80 509 LDVEDLQKEVR-LVPIGSDH--VIERKQY--IEDRLTFIQAVAQ-VPEMGQL---VDYKRILVDLLQH--WGFEEPE--- 574 (651) T ss_pred cCccceeeeee-eeeccHHH--HHHHHHH--HHHHHHHHHhhcc-CCccchh---hhHHHHHHHHHHH--cCCCCcH--- Confidence 000111 110 01111111 0100000 0112222212211 2333221 1233332221111 1111111 Q ss_pred ccccCcccCCCCCCCCCCCCCCCCc----------chhhhhhcc------cccCC Q lcl|NC_020883. 551 EQMNDNRDEDGNIIEEGDTEEEPSA----------EENEEIEKE------GEPIA 589 (589) Q Consensus 551 ~~~~~~~~~~~~p~deg~~~eep~~----------~~~e~~~~~------~~~~~ 589 (589) .++.+.++- + ....++.. .+....+.. +.-++ T Consensus 575 -~~l~~~~q~--~----~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~ 622 (651) T protein:vir:80 575 -AYLKQQDQQ--A----PANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQMMS 622 (651) T ss_pred -HhcCCCccc--h----hhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111110 0 00000000 000000000 00000 No 79 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=98.62 E-value=1.9e-07 Score=57.46 Aligned_cols=519 Identities=13% Similarity=0.072 Sum_probs=204.6 Q ss_pred CccceeccchhHHHHh-hcchhhhhhhhhcCC---ccccCHHHHHHHhhccc-----------cceeccCccee------ Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKN-VHGDYERYRQLYEGK---HELLFPRAKRLIEEGDA-----------VGRFLDSSQTA------ 59 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~~~r~l~~g~---~~~~f~ra~~~~~~~~~-----------~~~~~~~~~~~------ 59 (589) |.. |--++ |.--|-+=-|++-+. ...+|.||.+.+....- =-.+.+|+||. T Consensus 1 ~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~ 72 (711) T protein:vir:10 1 MAK--------KQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTE 72 (711) T ss_pred CCc--------ccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHH Confidence 000 00000 000000000111110 00011111111000000 01122355552 Q ss_pred -eecCcceEEEEcchhhhccchhhhcccccccc-ccccC---Ccccchhhccchhhc--ccccccchhhhhhhhhhhhhh Q lcl|NC_020883. 60 -RETQTPYVIFNLPKVIAEIPATMVSGSIGQIK-SSITT---GEIDPDIEEDTDEMI--EGPQDEEEAGKNENNTVIDLQ 132 (589) Q Consensus 60 -~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~-~~~~~---~~~~~~~~~~~~~~i--~~~~~~~~~~~~~~~~~~~~~ 132 (589) +.-..|-+++|+.+.+++. .+..-.-..+. ...|. ..+-.+.+..+.+.. .+++ ..-...+.= T Consensus 73 l~~~g~p~~~~N~i~~~v~~--v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~--------~d~~~Ae~l 142 (711) T protein:vir:10 73 RELEQRPCLVNNVLPTFVDQ--VLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGK--------NDYELAEVF 142 (711) T ss_pred HHhcCCCcEEEcchHHHHHH--HhhhHhhCCcceEEecccccchhhhhhhhccccccccCCCh--------hHHHHHHHH Confidence 3446788999999998876 22211111110 11121 111112222222222 1112 222233333 Q ss_pred hhHHHHHHhhccccccchhhHHHHHHcCceeEEEEEe-------cCceeEEEe-cCceecc--cc---cCcceeEEEeec Q lcl|NC_020883. 133 NEIIEQITKNSKLERRHWSNIVQHQVDGGIVAAPVID-------ELGPRIVFK-ARDVYFP--HD---DEKGADLAYYID 199 (589) Q Consensus 133 ~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~-------~~~~~i~f~-~~d~~~P--~~---d~~~~div~~~e 199 (589) +.++..+..+|+......+.+.++++.|=-+.++++| ++.++|.-+ ++..+|. +. |..-|.++++.. T Consensus 143 ~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~ 222 (711) T protein:vir:10 143 TGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDD 222 (711) T ss_pred HHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeee Confidence 5588889999999988888898887777666677654 234666444 4554554 21 333455555444 Q ss_pred CCCccceEEEE----------------Eeeeccccceeehhhhccccccchhheeecccccccccccccccchh-----h Q lcl|NC_020883. 200 HGQYGQFLHIY----------------RERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGE-----E 258 (589) Q Consensus 200 ~~~~~~~l~~~----------------~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e-----~ 258 (589) +--.+.+.-.| ..+.+.+.+++..--|+. .+..... ....|.++...... .+..+ . T Consensus 223 ~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~-~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~g~ 299 (711) T protein:vir:10 223 TMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTRE-PVIREIA-LLSDGRSFWLDALE-DIVDELLEAGI 299 (711) T ss_pred cCCHHHHHHhCCchhhhhhhcccccccCcccCcceeeEEEEEeee-eeeeEEE-eecCCceeccCcch-hHHHHHHhcCc Confidence 32211100000 001111222221111110 0000000 00011111111000 00000 0 Q ss_pred -hhhcc------------cCCccccccccccCCCCcceEEEecCCC---CCCCcccCcchhhhhHHHHHHHHHHhHHHHH Q lcl|NC_020883. 259 -LIREV------------LNIPDDRPLENFYPGRNRPFISYWANNE---TFMNPYGISALDNLESKQDEINWTITRSAVI 322 (589) Q Consensus 259 -~i~~~------------i~ip~~~e~~~i~TGv~~plvvyvPN~~---~~~~~lG~SD~~~ie~l~DeLd~t~S~~sri 322 (589) .+..+ +| +.+.+...-++|-..|+|.++.+.. ....|+| .+.++.+.++.+|.++|+...+ T Consensus 300 ~~~~~~~~~~~~v~~~~~~G-~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G--~vr~~~d~Qr~~N~~~s~~~~~ 376 (711) T protein:vir:10 300 SIVRTRKVKTFKTYWRKITG-ANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRS--IIRHSKDAQRMANYWDSAATET 376 (711) T ss_pred hhhhhhhhceeeEEEEEEec-ceeecCCCCCCCCcccEEEEeeeeeccccccccch--hhhhhhhhHHHHHHHHHHHHHH Confidence 00000 01 1111222334566678888876643 3334455 6899999999999999999999 Q ss_pred HHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHH Q lcl|NC_020883. 323 YEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKL 402 (589) Q Consensus 323 ldk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~ 402 (589) +-+.++++++++.+.+...... |.+.. ..+ +..+.+.+.......++.++.-.-..++++.+...... T Consensus 377 l~~~~~~~~~~~~gai~~~~~~-~~e~~-~~~----------~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~ 444 (711) T protein:vir:10 377 VALAPKAPFIGSEGNVEGREDE-WEQAN-TKN----------FSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEK 444 (711) T ss_pred HHhcCCCceeecCcccCChHHH-HHhcc-ccC----------CCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHH Confidence 9899999999999888642110 11100 000 00011112222222344443333456677777777777 Q ss_pred HHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc---ccCcc----- Q lcl|NC_020883. 403 MLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS---SIRIE----- 474 (589) Q Consensus 403 Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~---~~~~e----- 474 (589) |-.+++.+..++|... .+.||+|+..+..+........-..+..+++++.+.++.|-..+.. .+.+. T Consensus 445 i~~~tGi~~~~~G~~~-----n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~ 519 (711) T protein:vir:10 445 IKSTMGMYDASLGAMG-----NETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDET 519 (711) T ss_pred HHHHhCCChHHcCCCc-----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCC Confidence 7778999999998632 2458888776665554444444455566777777666655543211 00000 Q ss_pred ---------------------------cceeeeCCcCCCCC-CHHHHHHHHHHHhcc---chhhHHHHHHHhCCCCC-HH Q lcl|NC_020883. 475 ---------------------------EPNIETQDMILKPR-AELVAENMAAYAASK---QGQSLETTVRRMNPDAS-ED 522 (589) Q Consensus 475 ---------------------------~p~I~f~D~lPvde-~El~~A~t~~~l~~a---~~~S~etaVr~Lhpdw~-dE 522 (589) +-+|....+...+. .+...+.+++++... ..+-....+..+ |+. .+ T Consensus 520 ~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~--d~p~~~ 597 (711) T protein:vir:10 520 EDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNM--DWPGAD 597 (711) T ss_pred cceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhc--CCCCHH Confidence 00122222211110 111112222221110 011111111111 122 22 Q ss_pred HHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 523 WIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 523 ~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) ++.+ +|+.-+..+++ ..++ .++++....+..+-+... .++ T Consensus 598 el~e---~lr~~~~~~~~----------------------~~~~-~~~~qq~~~e~qq~~~~~-q~~ 637 (711) T protein:vir:10 598 VIAE---RLKKIVPPNVL----------------------SKDE-REAIEEDMPEQTEPTPEQ-QVE 637 (711) T ss_pred HHHH---HHHhhcCcccC----------------------cchh-hhHHHHHHHHHHHHHHHH-HHH Confidence 2222 22211111100 0000 000000000000000000 000 No 80 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=98.57 E-value=2.6e-07 Score=56.66 Aligned_cols=457 Identities=14% Similarity=0.079 Sum_probs=192.4 Q ss_pred chhHHHHh---hc-------chhhhhhhhhcCCccccCHHHHHHHhhc-cccceeccCc---ceeeecCcceEEEEcchh Q lcl|NC_020883. 9 WTDKTTKN---VH-------GDYERYRQLYEGKHELLFPRAKRLIEEG-DAVGRFLDSS---QTARETQTPYVIFNLPKV 74 (589) Q Consensus 9 ~~~~~~~~---~~-------~~~~~~r~l~~g~~~~~f~ra~~~~~~~-~~~~~~~~~~---~~~~~~~~~y~~~n~~~~ 74 (589) -+|+-+++ .| -.+...|.+|.|.++ +...| .|.-+...-+ |-+.-.|..| .|+++- T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~--------~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~--~n~~~~ 70 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEA--------MREAGETYLPRHQEETDKGYQERLASAVL--LNMVEQ 70 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHH--------HHhhcccCCCCCCCCCHHHHHHHHhcccC--CChHHH Confidence 33333332 23 345566788888522 11222 2333332111 1111112222 244443 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) ++ +..+|.+-..=| ..+.+....+. +.+ .+=+|+|.--+++++++ .+. T Consensus 71 tl-------~~l~G~vf~k~p------~~~~~~p~~~~---------~~l-~~d~D~~G~~L~~f~~~---------~~~ 118 (513) T protein:vir:97 71 TL-------DTLSGKPFSEPI------KLNEDVPKAIE---------ETI-LPDVDLQGNNLDVFARQ---------WFR 118 (513) T ss_pred HH-------HHHhhhhhhcCc------ccCcCchHHHH---------HHH-hhccCCCCCCHHHHHHH---------HHH Confidence 33 333444433111 11222222221 001 01133333334444443 334 Q ss_pred HHHHcCceeEEEEEec-C------------------ceeEEEecCceecc----cccCc--ceeEEEeec----CCCccc Q lcl|NC_020883. 155 QHQVDGGIVAAPVIDE-L------------------GPRIVFKARDVYFP----HDDEK--GADLAYYID----HGQYGQ 205 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~~-~------------------~~~i~f~~~d~~~P----~~d~~--~~div~~~e----~~~~~~ 205 (589) .++..|++..-|=... . -+++.++.|.+.+= +.+|+ -+-+++.+. ++-..+ T Consensus 119 ~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~ 198 (513) T protein:vir:97 119 EGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEV 198 (513) T ss_pred HHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcce Confidence 4566666543331110 0 15566666655533 23333 233444322 222233 Q ss_pred eEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEE Q lcl|NC_020883. 206 FLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFIS 285 (589) Q Consensus 206 ~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvv 285 (589) .++.||.+.. + ..++|+. ...+.+.. .+.. ++ ....+++..+.|| T Consensus 199 ~~~q~rvL~~--g---~~~v~r~----------~~~~~~~~--~e~~-------------~~-----~~g~~~l~~IP~v 243 (513) T protein:vir:97 199 CKRRIRVLEP--G---LVQLWEP----------VKKSNAQK--EEWA-------------LA-----DEWATGLNYVPLV 243 (513) T ss_pred EEEEEEEEeC--c---eEEEEEe----------ecCCCccc--cceE-------------Ee-----cCCCCcCCceeEE Confidence 4455554421 1 1133321 11111110 0000 00 1122456666666 Q ss_pred EecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccc Q lcl|NC_020883. 286 YWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDH 365 (589) Q Consensus 286 yvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~ 365 (589) ++-+.... ..-|.+-|-+|..+-.+.=...|....++-..+.|.++++ |.. +.+++ + +-. T Consensus 244 ~~~~~~~~-~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~--------G~~-~~~~~--~--------i~i 303 (513) T protein:vir:97 244 TFYADRQG-FMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACS--------GAS-GEDSD--P--------VVV 303 (513) T ss_pred EEecCCCC-CCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeee--------cCC-cCCCC--c--------eEe Confidence 66443322 2236666666555544444455555566656688877764 111 01111 0 111 Q ss_pred ccccccccccccCccceeeecccH-HHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHH Q lcl|NC_020883. 366 RDMEITTFDENGRSMEIHQIDISK-IGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRR 444 (589) Q Consensus 366 ~dlev~~~de~g~~~~~iq~Dirv-eeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~ 444 (589) .+-.++..++.+..+.|++++.+. ..++..++.|.++|..+. ...... .+++.|+++.+.+..+..+.... T Consensus 304 G~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~G---a~ll~~-----~~~~~Ta~a~~~~~~~~~S~L~~ 375 (513) T protein:vir:97 304 GPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYG---AEFLKR-----KTGGQTATARALDSAEATSDLSA 375 (513) T ss_pred eccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHH---HHhhcc-----CCccccHHHHHHHHHHHHHHHHH Confidence 112344455666778899999665 568899999999985432 222221 12235666666555555555666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh------CCC Q lcl|NC_020883. 445 LQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM------NPD 518 (589) Q Consensus 445 ~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L------hpd 518 (589) +...+.++|++++.++............+ ..+-+|... +.+.-..+.+++ +..++.+|.+|.++.| .|+ T Consensus 376 ~a~~le~al~~~l~~~a~wlg~~~~~~~v-~in~dF~~~---~~~~~~~~al~~-a~~~G~is~~t~~~~L~r~gvl~~d 450 (513) T protein:vir:97 376 MTGLFEDALAQALDITADWLRLGPNGGTV-ELVKDYDLE---EMDAPGLQALQV-AREKRDISRKTYLNGLRLRGVLPED 450 (513) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCCCccEE-EeccccCcc---cCCHHHHHHHHH-HHhCCCCCHHHHHHHHHhccCCCcc Confidence 66778888888875544332211111111 111234332 112212233333 4567889988887665 577 Q ss_pred CCHHH-HHHHHHHHHhhcccc--ccccccccccccccccCcccCCCCCC---CCCCCCCCCCcch Q lcl|NC_020883. 519 ASEDW-IQEEIARIEEEQAGS--DTSSLMGINQTFEQMNDNRDEDGNII---EEGDTEEEPSAEE 577 (589) Q Consensus 519 w~dE~-v~eEv~RI~~E~a~~--~p~~~g~~~~~l~~~~~~~~~~~~p~---deg~~~eep~~~~ 577 (589) +++++ .+++.+||.+..+-. +-.+.+ .+|+-...-++.++ +.+- ..|+.+-.|-++- T Consensus 451 ~d~~~~~e~~~~~~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 451 FDEDEDWEELMEEISEAMGRAGLDLDPAQ-KNPPEGGEGEGEGE-GEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred CCHHHHHHHHHHhhhhccCCCCccccccC-CCCCCCCCCCCCCC-CCCCCCCCccccCCCCCCCC Confidence 77544 455666676554321 111111 11111111111111 1111 1112222222222 No 81 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=98.14 E-value=3.7e-06 Score=50.35 Aligned_cols=445 Identities=11% Similarity=0.005 Sum_probs=155.1 Q ss_pred Cccceeccch-hHHHHhhcchhhh-----------------hhhhhcCCccccCHHHHHHHhhccccceeccCcceeeec Q lcl|NC_020883. 1 MIDWTVRGWT-DKTTKNVHGDYER-----------------YRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARET 62 (589) Q Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~-----------------~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~ 62 (589) .|-=+..|-. .-+.-+.++++.. |..-|.| +..-.+.+ T Consensus 56 ~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~l~a~Y~------------------ 111 (537) T protein:vir:10 56 AMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIG------HQMCALIA------------------ 111 (537) T ss_pred cccccccccccchhccccccchhhhhhhccccccchhhhhccccCCcc------HHHHHHHH------------------ Confidence 0000000000 0001111211111 1111222 11111111 Q ss_pred CcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh Q lcl|NC_020883. 63 QTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN 142 (589) Q Consensus 63 ~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn 142 (589) +++ +.+-+|++||+.+.|---.++. +..+-+ .+-+.+.+++..+. T Consensus 112 ~~~-----l~r~iVd~~A~d~~r~~~~i~~-------------~~~~~~-----------------~~~~~~~l~~~~~~ 156 (537) T protein:vir:10 112 THW-----LVNKACSQMPRDAMRKGYKIIS-------------DDGNEL-----------------DPKDAKFIDRYDRA 156 (537) T ss_pred hCc-----hhhhhhhhhhHHhhcCCceeec-------------CCcccc-----------------cHHHHHHHHHHHHH Confidence 111 4567889998877443333322 000000 00011233444444 Q ss_pred ccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceee Q lcl|NC_020883. 143 SKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTT 222 (589) Q Consensus 143 ~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~ 222 (589) -+++.++.+.+.-.-.-||-++.+.++.. |-..+ +.++..+-+ ..+ .-++++++.-++-. .+.+. T Consensus 157 l~~~~~l~~a~~~~rlyG~~~i~i~v~~~---------D~~~~-~~Pl~~~~i---~kg-~~k~l~vidp~~~~-~~~~~ 221 (537) T protein:vir:10 157 FNIKKHAIQFVRKGRIFGIRIALFKVDSP---------DPYYY-EKPFNIDGV---MPG-AYKGIVQIDPYWCA-PLLDA 221 (537) T ss_pred hhHHHHHHHHHHhcccccceEEEEeecCc---------CCccc-ccccccccc---ccc-ceeEEEEechhhcc-cccch Confidence 33333333333322222333333333221 11111 111111000 000 01234443221111 11111 Q ss_pred hhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC-CCCCCcccCcc Q lcl|NC_020883. 223 NMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN-ETFMNPYGISA 301 (589) Q Consensus 223 ~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~-~~~~~~lG~SD 301 (589) +..- .+. ...+..... +. ..+.++ |..--.++.|.+ +|+. +....++|+|- T Consensus 222 ~~~~--dp~----------sp~fg~P~~---y~---v~g~~i----H~SRli~f~g~~------~p~~~~~~~~~~G~Sv 273 (537) T protein:vir:10 222 QASS--NPV----------SMHFYEPTY---WL---INGKKY----HRSHLAIYINDE------VVDFLKPSYIYGGVPL 273 (537) T ss_pred hhhc--cCC----------ccccCCcee---ee---ecCeEe----cceeEEEecCCC------CchhhhcccCcccccH Confidence 0000 000 000000000 00 000000 111001111111 1232 23445689999 Q ss_pred hhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccc Q lcl|NC_020883. 302 LDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSME 381 (589) Q Consensus 302 ~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~ 381 (589) +..+.+-+...+.+.-..+.++-+..-+.+.+. ++..+.+. ++ ...........-+.. .++-.+..+ .+ T Consensus 274 lq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~--~~~~l~~~----~~-~~~r~~~~~~~r~n~--g~~~id~e~--e~ 342 (537) T protein:vir:10 274 PQQIMERVYAAERTANEGPMLAMTKRQTVLKVD--AAQVLANK----QQ-FDETMSWWTATRDNY--QVRVVDKDN--ED 342 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceeeec--hHHhhcCH----HH-HHHHHHHHHhhcCCc--ceeEecCCC--ce Confidence 998888888888876666777655454444332 11111110 00 000000000000100 011112112 22 Q ss_pred eeeecccHHHHHHHHHHHHHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 382 IHQIDISKIGDMDHVKNLIKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESC 460 (589) Q Consensus 382 ~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~ 460 (589) |-+.++.+..--..++....+|-..++.|.. .||.- ..+-..||....+.+..+ |+..|..+..+|.+++.++ T Consensus 343 ~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~s---p~GlnatGe~D~~~yyd~---I~~~Qe~l~p~l~~l~~ll 416 (537) T protein:vir:10 343 VVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTV---PTGFNSTGDYEEASYHEE---CESTQDDMRPLIDRHHQLV 416 (537) T ss_pred eEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCC---ccccccchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHH Confidence 3344555555566777777778788888866 45531 112223566665555555 3444444567777777665 Q ss_pred HHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHH-------HHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHh Q lcl|NC_020883. 461 LWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAA-------YAASKQGQSLETTVRRMNPDASEDWIQEEIARIEE 533 (589) Q Consensus 461 l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~-------~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~ 533 (589) +.+. .+. .....|.|++....+++|+ |++.+ ++.++++ .+.+++++++..- T Consensus 417 ~~~~--~~~---~~~~~i~f~pL~~~s~kEk--Aei~~~~a~a~~~~~~~G~-------------i~~~Evr~~L~~~-- 474 (537) T protein:vir:10 417 CRSH--LRK---RIRVKVEFPPMDAPKESER--ADTFLKKMQAAKLAFEMGA-------------VDGVDVNEYLRMD-- 474 (537) T ss_pred HHhc--CCC---CcceEEEeCCCCCCCHHHH--HHHHHHHHHHHHHHHHcCC-------------CCHHHHHHHHhcc-- Confidence 5432 121 2245689999877776664 55544 3444444 4444444444321 Q ss_pred hccccccccccccccccccccCcccCCCCCCCCCCC-C-----CCCCcchhhhhhcccccCC Q lcl|NC_020883. 534 EQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT-E-----EEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 534 E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~-~-----eep~~~~~e~~~~~~~~~~ 589 (589) .... ...++...+.-..+.++.++.. +..++.+ . .-++..+.+++.++-++=| T Consensus 475 ~~~g--~~~l~~~~~~ed~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 533 (537) T protein:vir:10 475 PTLG--FTSITPAMRPTDAEDIDVDDEG-KPVRIIEDQPAPSEMFGATSSGESANDPRDSGA 533 (537) T ss_pred Cccc--cccccCCCChhhhhcccCCccC-CcCCCCCCCCCccccCCCCccccccCCCccCcc Confidence 0000 0011111000000011111111 1111111 0 1112222233333333333 No 82 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=98.02 E-value=6.8e-06 Score=48.90 Aligned_cols=447 Identities=11% Similarity=0.034 Sum_probs=154.0 Q ss_pred Cc-cceec--cchhHHHHhhcchhhhhhhhhcCCc---------cccCHHHH--HHHhhccccceeccCcceeeecCcce Q lcl|NC_020883. 1 MI-DWTVR--GWTDKTTKNVHGDYERYRQLYEGKH---------ELLFPRAK--RLIEEGDAVGRFLDSSQTARETQTPY 66 (589) Q Consensus 1 ~~-~~~~~--~~~~~~~~~~~~~~~~~r~l~~g~~---------~~~f~ra~--~~~~~~~~~~~~~~~~~~~~~~~~~y 66 (589) |+ .|.+. +-.-.+....|+.+.---.|.-|.. ...|++-. .+++ .-. T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~-------------------~~~ 95 (532) T protein:vir:94 35 LATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLA-------------------QLP 95 (532) T ss_pred hhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHH-------------------cCc Confidence 11 01110 1111111122221110001111110 01111111 0111 111 Q ss_pred EEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccc Q lcl|NC_020883. 67 VIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLE 146 (589) Q Consensus 67 ~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~ 146 (589) +.+-+|++||+-+-|---.++. +.++-++ +-+...++...+.-+++ T Consensus 96 ----l~r~~Vd~~aed~~r~~~~i~~-------------~~~~~~~-----------------~~~~~~i~~~~~~l~v~ 141 (532) T protein:vir:94 96 ----EYRTMHETPADECVRAWGKITC-------------SSKDELA-----------------ADKATRITQKLEQYNVR 141 (532) T ss_pred ----hhhhhhccchHHHhhCCceEee-------------CCccccc-----------------hHHHHHHHHHHHhhhHH Confidence 2355788888877555555443 1110000 00112334444444444 Q ss_pred ccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEE-Eeeeccccceeehhh Q lcl|NC_020883. 147 RRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIY-RERVEKDGLRTTNML 225 (589) Q Consensus 147 ~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~-~~~~~~~~~~~~~~~ 225 (589) .++-+.+.-.-.-||.+..+.++..+..-.+..+=..=|..-.+|+ + ++++++ +.++....+...+- T Consensus 142 ~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~-~----------~~l~vld~~~v~p~~~~~~dp- 209 (532) T protein:vir:94 142 TLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGC-L----------IGFATIEPMWLSPNAYNATDP- 209 (532) T ss_pred HHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccce-e----------eEEEeechheecccccccccc- Confidence 3343444433355555555555544332212111000010011111 1 122322 22211111100000 Q ss_pred hccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC-CCCCCcccCcchhh Q lcl|NC_020883. 226 YPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN-ETFMNPYGISALDN 304 (589) Q Consensus 226 y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~-~~~~~~lG~SD~~~ 304 (589) ....+...++..... +.++ |..--.++.|.+ +|+. +....++|+|.+.. T Consensus 210 ---------------~sp~fg~P~~y~v~~-----g~~i----H~SRli~f~g~~------~p~~~~~~~~~~G~Svlq~ 259 (532) T protein:vir:94 210 ---------------TLPSFYKPDSWIATS-----GKKI----HSSRIHTVVGRP------VGDMLKAAYSFRGVSISQL 259 (532) T ss_pred ---------------cccccCCceeEEEcc-----Ceee----ccceEEEecCCC------chhhhccccccccccHHHH Confidence 000011111100000 0000 111111111221 2332 23345689999988 Q ss_pred hhHHHHHHHHHHhHHHHHHHHhCCCcEEe-chhhhhccccccccccccccccccccccc-cccccccccccccccCccce Q lcl|NC_020883. 305 LESKQDEINWTITRSAVIYEQNGKPRISI-TKEMMDTLLNIAYERDGHSAKEASMMTPR-IDHRDMEITTFDENGRSMEI 382 (589) Q Consensus 305 ie~l~DeLd~t~S~~srildk~gkpRI~V-P~~~L~t~~g~~~d~dge~~~~~~~~~~~-~d~~dlev~~~de~g~~~~~ 382 (589) +.+-+...+.+.-..+.++.+..-..+.+ -..+|... .......++...-.. -....+ ++ +... .+| T Consensus 260 ~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~------~~~~~~~r~~~~~~~~~n~g~~-~i--d~~~--e~~ 328 (532) T protein:vir:94 260 AMPYVDNWLRTRQSVSDTVKQFSMTNLATDMAQLLAPG------GAQSLDARLQLFNLYRDNRNIG-AL--DKGT--EEI 328 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcch------hHHHHHHHHHHHHhhcCCccce-EE--cCCC--cee Confidence 88888888877555555553322222211 01111110 000000000000000 001111 11 1111 223 Q ss_pred eeecccHHHHHHHHHHHHHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_020883. 383 HQIDISKIGDMDHVKNLIKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDFLKELYESC 460 (589) Q Consensus 383 iq~Dirveeh~~~ie~L~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~aLk~li~~~ 460 (589) -+..+.+..-...++....+|-+.+++|.. .||.- . .+-..||-...+.+..+ |+..| ..+...|.+++.+. T Consensus 329 e~~~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~s-p--~GlnstGe~D~~~yyd~---I~s~Qe~~l~p~le~l~~~l 402 (532) T protein:vir:94 329 QQTNTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGIT-P--NGLNASSDGEIRVWYDF---IAGYQATNLTPLMEWIIDLI 402 (532) T ss_pred EEEecccCCHHHHHHHHHHHHHhHhCCCeeeeecCC-c--ccccccchHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH Confidence 345555666667788888888888898876 56641 1 12223455455445444 33333 22446667766554 Q ss_pred HHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHH-------HHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHh Q lcl|NC_020883. 461 LWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMA-------AYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEE 533 (589) Q Consensus 461 l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~-------~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~ 533 (589) ++.. .+ .. +....+.|++....+++|+ |++. +++.++++ .+.+++.+++.. T Consensus 403 ~~s~--~g-~~-~~d~~~~f~pL~~~s~kEk--Aei~~~~a~a~~~~~~~Gv-------------i~~~Evr~~l~~--- 460 (532) T protein:vir:94 403 QLSE--YG-QI-DPGLAWEWSPLMELDDKEL--AEVRQLNASTDSTLMELGV-------------IDAKMVQQRLAA--- 460 (532) T ss_pred HHHh--cC-CC-CCCceEEeCCCCCCCHHHH--HHHHHHHHHHHHHHHhcCC-------------CCHHHHHHHHhc--- Confidence 4322 12 11 2245688999877776665 4444 33444444 444444444421 Q ss_pred hcccccccccccccccc--------ccccCcccCCCCCCCCCCCCCCC--CcchhhhhhcccccCC Q lcl|NC_020883. 534 EQAGSDTSSLMGINQTF--------EQMNDNRDEDGNIIEEGDTEEEP--SAEENEEIEKEGEPIA 589 (589) Q Consensus 534 E~a~~~p~~~g~~~~~l--------~~~~~~~~~~~~p~deg~~~eep--~~~~~e~~~~~~~~~~ 589 (589) +|.+-..+..+. .+.-++.....+|.+-+.+.+.| .++++|. +..|-+ T Consensus 461 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~---~~~~~~ 518 (532) T protein:vir:94 461 -----DPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQT---DNQPDA 518 (532) T ss_pred -----CCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCC---CCccCC Confidence 000000000000 00000000111111111111112 2222222 222222 No 83 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=98.00 E-value=3e-07 Score=56.35 Aligned_cols=480 Identities=12% Similarity=0.043 Sum_probs=191.8 Q ss_pred Cc------cceeccchh------HHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MI------DWTVRGWTD------KTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~------~~~~~~~~~------~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) |+ -|.+.-|++ .+++..|--|+||+..++-+-... +.+++-.. T Consensus 11 ~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~-------------------~~~~r~~~------ 65 (584) T protein:vir:95 11 LLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQ-------------------GLPWKNST------ 65 (584) T ss_pred hccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhc-------------------cccccccc------ Confidence 44 233333332 233444555666665433211111 11121111 Q ss_pred EEcchhhhcc---chhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH----h Q lcl|NC_020883. 69 FNLPKVIAEI---PATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT----K 141 (589) Q Consensus 69 ~n~~~~i~~~---pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~----k 141 (589) +++++.... =+-|+ .+.||+... .+ +++-+. +.... .-++.++..+ . T Consensus 66 -~~~k~~~~~~~i~~~l~-------~~~Fp~~~w-----~~----~v~~~~-------~~~~~--~~~~ai~~~i~dkl~ 119 (584) T protein:vir:95 66 -TLPKLCQIRDNLHSNYF-------SSLFPNDDW-----LR----WVGYGK-------GDSTK--TKAKAIQAYMSNKCR 119 (584) T ss_pred -chhHHHHHHHHHHHHHH-------HhhcCccce-----ee----eecCCC-------chhhH--HHHHHHHHHHhhhhh Confidence 233332221 11111 233332211 11 111111 11111 1145555555 5 Q ss_pred hccccccchhhHHHHHHcCceeEEEEEecC--------------ceeEEEecCceecccccCc---ceeEEEeecCCC-- Q lcl|NC_020883. 142 NSKLERRHWSNIVQHQVDGGIVAAPVIDEL--------------GPRIVFKARDVYFPHDDEK---GADLAYYIDHGQ-- 202 (589) Q Consensus 142 n~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~--------------~~~i~f~~~d~~~P~~d~~---~~div~~~e~~~-- 202 (589) .|+|..-|..-|-+..+-|-+++++.|.-. .++|+=++|.-+||+-... .+.++.+--.|. T Consensus 120 e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~~~prieriSP~d~~~Dpsa~~i~d~~fivrs~~T~~~ 199 (584) T protein:vir:95 120 ESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVPDYIGPRLVRISPLDIVFNPLATSISDTFKIVRSVKTKGE 199 (584) T ss_pred hccHHHHHHHHHHhhccCCceEEEEeEeecceeeeccccccccccceEEeeChhheeecCCCCCccchhhhhhhhhhHHH Confidence 668888888999999999999999999865 4788888887777743222 222221111110 Q ss_pred ------cc-----------ceEEEEEeee--ccccceeehhhhccccccchhheeecccccccccccccccchhh---hh Q lcl|NC_020883. 203 ------YG-----------QFLHIYRERV--EKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEE---LI 260 (589) Q Consensus 203 ------~~-----------~~l~~~~~~~--~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~---~i 260 (589) ++ +..|..+... +.+.+-+. .-|- .++.+.++.. .....+..+++.-++..++ .. T Consensus 200 L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~~-~~~~-~d~~~~~~ey-~~~~~V~vl~~~g~~~~~~~~e~~ 276 (584) T protein:vir:95 200 LMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKA-AGFD-VDGFGNLYEY-YMSDWVEILEFYGDYHDKETGELQ 276 (584) T ss_pred HHHHHhhcCccccchHHHHHHHHhccCCCCCcccccccc-cccc-cccccccccc-cCCceeEEEeecccccccccCCCc Confidence 00 0111110000 00000000 0000 0000000000 1111122223222221111 11 Q ss_pred hccc-CC---ccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechh Q lcl|NC_020883. 261 REVL-NI---PDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKE 336 (589) Q Consensus 261 ~~~i-~i---p~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~ 336 (589) ...+ -+ -...-.+...+...++.+++.+..+..++.||.|+.+-+.++++.||.++-...-.+.+|++| |+.. T Consensus 277 ~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~p---v~k~ 353 (584) T protein:vir:95 277 TNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQP---PLKI 353 (584) T ss_pred ccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCc---ceee Confidence 1110 00 001111222233344334444667788899999999999999999999887777778889998 3444 Q ss_pred hhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcc Q lcl|NC_020883. 337 MMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVD 415 (589) Q Consensus 337 ~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg 415 (589) ++.. ++..+.++ .+...++.| +.+++.|. ..+-.....+..+...+-..++.|.++.| T Consensus 354 ~~~~-------~~~~~~pg-------------~~~~~~~~~-~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G 412 (584) T protein:vir:95 354 IGEV-------EEFVWGPG-------------AEIHLDQGG-DVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMG 412 (584) T ss_pred cccc-------chhcccCC-------------ceeecCCCC-CcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcc Confidence 4432 11111111 122223333 34566554 23344445566666666678999999998 Q ss_pred cccCcccchhHHHHHHHHHhhhHHH-HHHHHHHHHHHHH-HHHHHHHHHHHhhcCc---c-------------cCcccce Q lcl|NC_020883. 416 FYLDGGASGAQSGVAKFYDLLTTIL-KSRRLQKEYIDFL-KELYESCLWLLNDQDS---S-------------IRIEEPN 477 (589) Q Consensus 416 ~~~~~g~~~A~Sg~A~r~~~~~~~~-Kv~~~R~~~~~aL-k~li~~~l~L~~~~~~---~-------------~~~e~p~ 477 (589) .. .+++.|++|.+. ++.+.. -++++-..++..| ++++.++..+.+.... . +.+.+++ T Consensus 413 ~~--~~~~~TAtg~s~---l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~D 487 (584) T protein:vir:95 413 IR--TPGEKTAFEVQQ---LGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTRED 487 (584) T ss_pred cc--cchhhhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccChhh Confidence 63 223334455432 333332 2344445555555 7777666555432111 1 1122222 Q ss_pred eeeC-CcCCCC-----CCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccc Q lcl|NC_020883. 478 IETQ-DMILKP-----RAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFE 551 (589) Q Consensus 478 I~f~-D~lPvd-----e~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~ 551 (589) +.-+ +..+.- +.|...+++.+.+.. +.-..+.| --.+.++.+|.++...++... +. T Consensus 488 l~g~~~~va~Ga~~~~~keq~~q~l~~ilq~-------~~~~~i~p----~~~~~~l~~~ladl~~~p~~~-------~~ 549 (584) T protein:vir:95 488 ITANGKIRPIGARHFGKQAQDLQNLVGIFNS-------QIGQMILP----HTSGKALATFVDDVTGLQGYE-------IF 549 (584) T ss_pred hccCeeEEeehhhHHHHHHHHHHHHHHHHHh-------hhhhhccc----cchHHHHHHHHHHHhCCCccc-------cc Confidence 2111 000100 011111222222111 11222222 233456666665554332100 11 Q ss_pred cccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 552 QMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 552 ~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) ++ + - ...+ +-++.-....--+..+-.+.++ T Consensus 550 ~~--~--~---~~~~-Q~~~q~~~~~~q~~~~~~~~~~ 579 (584) T protein:vir:95 550 RP--N--V---AVAE-QAETQSLVAQAQEDLQLQAQMP 579 (584) T ss_pred CC--C--c---ccch-hHHHHhhhHHHHHHHHHHHhhh Confidence 10 0 0 0000 0000000000000001111111 No 84 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=97.88 E-value=1.3e-05 Score=47.37 Aligned_cols=440 Identities=12% Similarity=0.085 Sum_probs=187.0 Q ss_pred Ccc----ceeccchhHHHHhhcchhhhhhhhhcCCc----------------------cccCHHHHHHHhhccccceecc Q lcl|NC_020883. 1 MID----WTVRGWTDKTTKNVHGDYERYRQLYEGKH----------------------ELLFPRAKRLIEEGDAVGRFLD 54 (589) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~r~l~~g~~----------------------~~~f~ra~~~~~~~~~~~~~~~ 54 (589) +|| |.--+|.-+-...- +. +.-|+|.. ..|=-||+.|.+.-+|+... T Consensus 3 ~~dr~i~~~sP~~~~~R~~ar---~~--~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~a-- 75 (502) T protein:vir:79 3 ILDDVIGVFSPGWKAARLRSR---AV--IQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGV-- 75 (502) T ss_pred hHhhHHhhcChHHHHHHHhhH---HH--HhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHH-- Confidence 222 32222221111000 00 01122221 11222333333311111111 Q ss_pred CcceeeecCcceEEEEcchhhhccchhhhcccccc-ccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhh Q lcl|NC_020883. 55 SSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQ-IKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQN 133 (589) Q Consensus 55 ~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 133 (589) ++ .++++.+|. .-+ +...+..... ..+ +.++..+ . T Consensus 76 ---------------------v~---~~~~nvVG~ggi~------~~~~~~~~~~-----~~~-----~~~~~~i----e 111 (502) T protein:vir:79 76 ---------------------FD---KLEERVVGKNGII------VEPHPVLRNG-----AIA-----RDLAAEI----R 111 (502) T ss_pred ---------------------HH---HHHHhhccCCcee------eeeccCCCCh-----hHH-----HHHHHHH----H Confidence 11 123333332 111 0111111100 000 1233333 3 Q ss_pred hHHHHHHhhccccccchh------hHHHHHHcCceeEEEEEecC---------ceeEEEecCceecccccCcceeEEEee Q lcl|NC_020883. 134 EIIEQITKNSKLERRHWS------NIVQHQVDGGIVAAPVIDEL---------GPRIVFKARDVYFPHDDEKGADLAYYI 198 (589) Q Consensus 134 e~i~~v~kn~~~~~~~~~------~l~~~~v~Gg~~~~~~~~~~---------~~~i~f~~~d~~~P~~d~~~~div~~~ 198 (589) ..++.-..+|.+..++.- .+-..+++|=|.++..+... ..+|+..++|.. |+...-+-.|.-=. T Consensus 112 ~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l-~~~~~~~~~i~~GV 190 (502) T protein:vir:79 112 TRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFI-PMTSDESNRLNQGV 190 (502) T ss_pred HHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCCCcccceEEEEecchhc-CCCCCCCCeeEeee Confidence 356666777866644322 34455789999999887642 257888888764 43222122222212 Q ss_pred cCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCC Q lcl|NC_020883. 199 DHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPG 278 (589) Q Consensus 199 e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TG 278 (589) |..+.++ .+.|-+++. +-|+... .+ ..- T Consensus 191 e~d~~Gr--------------~~aY~i~~~-----------hPgd~~~--------------~~-------------~~r 218 (502) T protein:vir:79 191 FVDDWGR--------------PEKYLVYKS-----------RPVSGRQ--------------ME-------------TKE 218 (502) T ss_pred EECCCCc--------------eEEEEEeec-----------CCCCCcc--------------cc-------------eeE Confidence 2222222 222223321 1111000 00 011 Q ss_pred CCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHH-Hh-CCCcEEechhhhhcccccccc--ccccccc Q lcl|NC_020883. 279 RNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYE-QN-GKPRISITKEMMDTLLNIAYE--RDGHSAK 354 (589) Q Consensus 279 v~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srild-k~-gkpRI~VP~~~L~t~~g~~~d--~dge~~~ 354 (589) ++..-|+|+-+........|+|++.-+...+..|+.-.. +...- +. +--.++| ++..+.... ..+.... T Consensus 219 vpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~d--ael~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~ 291 (502) T protein:vir:79 219 VDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYED--SELTAARIAAALGMYI-----RKGDGQSYEPDGNGSKEN 291 (502) T ss_pred echhheEEeecccCCccccCCchHHHHHHHHHHHhHHHH--HHHHHHHHhhhheeee-----ecCCCcccccccCCCCCc Confidence 223348888777777788999999998888888886421 11110 11 2222222 221100000 0000000 Q ss_pred cccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHH Q lcl|NC_020883. 355 EASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYD 434 (589) Q Consensus 355 ~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~ 434 (589) ...+...+-.+...-..|..+++++..--...+-.++..+++.|-+-.++|+..++-.. + + +-+|.|.. T Consensus 292 -----~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~---s--~-nySs~R~~ 360 (502) T protein:vir:79 292 -----ERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNY---N--G-TYSAQRQE 360 (502) T ss_pred -----cccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc---c--c-hHHHHHHH Confidence 00000001111111233445666666655667788888999999988899999987421 1 1 56777888 Q ss_pred hhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhc-CcccCc-------ccceeeeCC-cCC-CCCCHHHHHHHHHHHhcc Q lcl|NC_020883. 435 LLTTILKSRRLQKEYIDFL-KELYESCLWLLNDQ-DSSIRI-------EEPNIETQD-MIL-KPRAELVAENMAAYAASK 503 (589) Q Consensus 435 ~~~~~~Kv~~~R~~~~~aL-k~li~~~l~L~~~~-~~~~~~-------e~p~I~f~D-~lP-vde~El~~A~t~~~l~~a 503 (589) ++...+.++++|..+...+ +.+.+. ||+... .+.+.. .-..+.|-. +.+ +| ++..++...+...+ T Consensus 361 ~~e~~r~~~~~q~~~~~~~~~pi~~~--~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iD--P~Ke~~a~~~~i~~ 436 (502) T protein:vir:79 361 LVESTDGYLILQDWFIGAVTRPMYRA--WLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWID--PVKEAEAWKIQIRG 436 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHcCCCCCCCCCCchhhcceeeecCCccccC--hHHHHHHHHHHHHc Confidence 8888888888887666543 333322 222221 111111 112455622 111 34 33444555556699 Q ss_pred chhhHHHHHHHhCCCCC--HHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCC-CCCCCCCCCCcchhhh Q lcl|NC_020883. 504 QGQSLETTVRRMNPDAS--EDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNII-EEGDTEEEPSAEENEE 580 (589) Q Consensus 504 ~~~S~etaVr~Lhpdw~--dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~-deg~~~eep~~~~~e~ 580 (589) |++|++..++...-||+ .++..+|.+++++-. +++.. +++-..+++. ....++..+.++++|+ T Consensus 437 Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~G-----l~~~~---------~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 437 GAATESDWVRAGGRNPDDVKRRRKAEIDENRKLD-----LVFDT---------DPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcC-----CCCCC---------CCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 99999999999865554 233334444443321 11111 1110000000 0001122222223333 No 85 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=97.78 E-value=6.8e-06 Score=48.91 Aligned_cols=409 Identities=11% Similarity=0.066 Sum_probs=156.1 Q ss_pred HHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcc------hhhhccchhhhcccc Q lcl|NC_020883. 14 TKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP------KVIAEIPATMVSGSI 87 (589) Q Consensus 14 ~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~------~~i~~~pa~~~~~~~ 87 (589) +-.. .-|+..+-|-+. + +.+-+-+ ..+++|-+.+++ +-+|++||..+-|-- T Consensus 1 ~~~~----D~~~n~~~gg~~--~---------~~~~~~~--------~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g 57 (422) T protein:vir:10 1 MVKT----DSYANIFLGGSD--G---------SEIYGSL--------QNQAPTILASLYADNALVRRIIDTIPETALAAG 57 (422) T ss_pred Cccc----hhhHHHHcCCCC--C---------ccccCcc--------cccCHHHHHHHHHhChhhHHHHhhhhHHHhcCC Confidence 1111 225555555221 0 0000000 112333333333 458888888774433 Q ss_pred ccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcCceeEEEE Q lcl|NC_020883. 88 GQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDGGIVAAPV 167 (589) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~ 167 (589) -+|++ +.++ . .+++..+.-+++.++.+.+.-.-+-||.+..+- T Consensus 58 ~~i~~-------------~~~~------------~------------~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~ 100 (422) T protein:vir:10 58 FHIDG-------------IDDE------------P------------AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAI 100 (422) T ss_pred ccccC-------------CCHH------------H------------HHHHHHHHhhHHHHHHHHHHhhccccceEEEEE Confidence 33321 1110 0 011111222222222223332223333333333 Q ss_pred EecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccc Q lcl|NC_020883. 168 IDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTN 247 (589) Q Consensus 168 ~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~ 247 (589) +.+.+- .--|-+.. + .=++++++..+.-.. ..+-..+ ...-+.. T Consensus 101 v~d~~~--------~~~Pl~~~-----------g-~~~~l~v~d~~~i~~------~~~~~dp----------~s~~fg~ 144 (422) T protein:vir:10 101 VKDNRA--------LTSPVREG-----------A-ELETVRVYDRTQVKV------QTREENP----------RNARFGE 144 (422) T ss_pred ecCCCC--------cccccccc-----------C-ceeeEEeeccccccc------hhcccCc----------cccccCc Confidence 322110 00121110 0 011344432221000 0110000 0000111 Q ss_pred cccccccchhhhhhcccCCccccccccccCCCCcceEEEecC-CCCCCCcccCcchhh-hhHHHHHHHHHHhHHHHHHHH Q lcl|NC_020883. 248 VEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWAN-NETFMNPYGISALDN-LESKQDEINWTITRSAVIYEQ 325 (589) Q Consensus 248 ~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN-~~~~~~~lG~SD~~~-ie~l~DeLd~t~S~~srildk 325 (589) ........ ..+. .++.-|..--.++.|.+ +|+ .+...++||+|-+.. +.+-+...+.+--....++.+ T Consensus 145 P~~y~v~~---~~~~-~~~~iH~SRli~~~g~~------~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~ 214 (422) T protein:vir:10 145 PLTYRITT---NESD-MFYDVHYSRIHIIDGER------IPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKR 214 (422) T ss_pred ceEEEEec---CCCC-cceeeccceeEEeCCCC------chhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 10100000 0000 00000111111222222 123 234566789999975 778777777776666666655 Q ss_pred hCCCcEEec--hhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHH Q lcl|NC_020883. 326 NGKPRISIT--KEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLM 403 (589) Q Consensus 326 ~gkpRI~VP--~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~I 403 (589) ..-..+.++ .+++.+. .....-.+..... ...+. ....+.+.. .+ -++-+.++.+..-...++....+| T Consensus 215 ~~~~v~~~~~l~~~~~~~--~~~~~~~~r~~~~-~~~~~-~~~~~~l~~---~~--e~~e~~~~~lsgl~~~~~~~~~~i 285 (422) T protein:vir:10 215 KQQAVWKAKGLAELCDDS--EGFGAARLRLAQV-DNNSG-VGQAIGIDA---ES--EEYSVLNSDIGGIDAFLDKKFDRI 285 (422) T ss_pred hccccccchhHHHhcCCc--cchHHHHHHHHHH-HHhcC-CccceeEec---CC--cceEEEecccCChHHHHHHHHHHH Confidence 454444443 1222111 0000000000000 00000 011111211 11 234455666667777788888888 Q ss_pred HHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeC Q lcl|NC_020883. 404 LIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQ 481 (589) Q Consensus 404 l~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~ 481 (589) -+.++.|.. .||.- ..+-..||-...+.+..+ |+..| ..+.++|.+++.+.++ .+...|+|+ T Consensus 286 aaa~~IP~t~L~G~s---~~Glnatgd~d~~~yyd~---i~~~Qe~~l~p~l~~l~~~i~~----------s~~~~~~f~ 349 (422) T protein:vir:10 286 VALSGIHEIILKNKN---VGGVSSSQNTALETFHKL---VDRKRNAELLPILEFLIPFIVN----------AEEWSVEFN 349 (422) T ss_pred HhhhCCCeeeeccCC---cccccccchHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhcc----------cCCcEEEeC Confidence 888888865 34431 111123455555445444 33334 2355677777665433 134568999 Q ss_pred CcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCC Q lcl|NC_020883. 482 DMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDG 561 (589) Q Consensus 482 D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~ 561 (589) +-...+++|+ |++.+.... +..+++. +.-.+.+++++++..+..+.. ++ .++.+ ++. T Consensus 350 pL~~~sekek--aei~~~~a~----a~~~~~~--~g~i~~~e~r~~L~~~~~~~~------~~------~~~~~---~~~ 406 (422) T protein:vir:10 350 PLAQESSKDK--AEILEKNVN----SIAALIA--AGAMDIDEARDTLRTIAPEVK------IN------DGSVE---TEV 406 (422) T ss_pred CCCCCCHHHH--HHHHHHHHH----HHHHHHh--cCCCCHHHHHHHhhhhccccc------CC------CCCCc---ccc Confidence 9887766654 667655332 2233333 345677777777754422211 11 11111 111 Q ss_pred CCCCCC-CCCCCCCcc Q lcl|NC_020883. 562 NIIEEG-DTEEEPSAE 576 (589) Q Consensus 562 ~p~deg-~~~eep~~~ 576 (589) ++.+.+ +++++|+++ T Consensus 407 ~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 407 TISETSNDPLEVPTDD 422 (422) T ss_pred chhhcCCCCCCCCCCC Confidence 111111 222333333 No 86 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=97.68 E-value=2.9e-05 Score=45.47 Aligned_cols=410 Identities=15% Similarity=0.127 Sum_probs=157.1 Q ss_pred HHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEE------cchhhhccchhhhcccc Q lcl|NC_020883. 14 TKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFN------LPKVIAEIPATMVSGSI 87 (589) Q Consensus 14 ~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n------~~~~i~~~pa~~~~~~~ 87 (589) +|+|+-|-+. .+.-| +..=.++.. + ..+.+|-+.+ +.+-+|++||+.+.|-- T Consensus 1 ~~~~~~d~~~--~~~~~-~~~~~~~~~-----------~--------~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g 58 (427) T protein:vir:10 1 MKIVKHDGYN--DIFNG-GADGSPKPF-----------F--------MSDASYHVGSFYNDNATAKRIVDVIPEEMVTAG 58 (427) T ss_pred CCccccchHH--HHhhc-CCCCcccCc-----------c--------ccCchHHHHHHHHcCchhhhhhccchHHhhcCC Confidence 7777665442 23322 111111100 0 0112221111 23457888887774433 Q ss_pred ccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcCceeEEEE Q lcl|NC_020883. 88 GQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDGGIVAAPV 167 (589) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~ 167 (589) -.|+ |..+. +.++...+.-+++.++.+.+.-.-+-||-++-+. T Consensus 59 ~~i~---------------------g~~~~----------------~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~ 101 (427) T protein:vir:10 59 FKMS---------------------GVKDE----------------KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAI 101 (427) T ss_pred cccc---------------------CccHH----------------HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEE Confidence 3332 21111 0122223322232223333332223343333333 Q ss_pred EecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeeh---hhhccccccchhheeeccccc Q lcl|NC_020883. 168 IDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTN---MLYPVVKAKGDVKKEIKKGEL 244 (589) Q Consensus 168 ~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~---~~y~~~~~~~~~~~~~~~gd~ 244 (589) +++... + --|.+ +.+ .| ++++++..+.-..+....+ .-| .+...+.+...+.+ T Consensus 102 v~d~~~-l-------~~p~~-~~g-~l----------~~l~v~d~~~~~~~~~~~dp~s~~f----g~P~~y~v~~~~~~ 157 (427) T protein:vir:10 102 IKDNRM-L-------TSQAK-PGA-KL----------EGVRVYDRFAITVEKRVTNARSPRY----GEPEIYKVSPGDNM 157 (427) T ss_pred ecCCCc-c-------ccccC-CCc-ce----------eEEEEechhcccccccccCcccccc----CcceEEEEecCCCC Confidence 333221 0 01111 111 11 1344432211000000000 000 00000011000000 Q ss_pred ccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC-CCCCCcccCcchh-hhhHHHHHHHHHHhHHHHH Q lcl|NC_020883. 245 VTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN-ETFMNPYGISALD-NLESKQDEINWTITRSAVI 322 (589) Q Consensus 245 ~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~-~~~~~~lG~SD~~-~ie~l~DeLd~t~S~~sri 322 (589) - ++--|..--.++.|.+ +|+. +...++||.|.+. -+.+-+...+.+......+ T Consensus 158 ~-------------------~~~iH~SRli~~~g~~------~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l 212 (427) T protein:vir:10 158 Q-------------------PYLIHHSRVFIADGER------VAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQI 212 (427) T ss_pred c-------------------ceEEccccEEEecCCC------chhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHH Confidence 0 0000111111122222 1332 3456779999985 4777777777776666666 Q ss_pred HHHhCCCcEEec--hhhhhcccccccccccccccccccc--ccccccccccccccccccCccceeeecccHHHHHHHHHH Q lcl|NC_020883. 323 YEQNGKPRISIT--KEMMDTLLNIAYERDGHSAKEASMM--TPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKN 398 (589) Q Consensus 323 ldk~gkpRI~VP--~~~L~t~~g~~~d~dge~~~~~~~~--~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~ 398 (589) +.+..-..+-++ ..++....+ ........... .... ...+.+. ..+ -+|-+.+..+..-...++. T Consensus 213 ~~k~~~~v~k~~~l~~~~~~~~~-----~~~~~~r~~~~~~~~~~-~~~~~l~---~~~--e~~e~~~~~lsgl~~~~~~ 281 (427) T protein:vir:10 213 LRRKQQAVWKVKGLAEMCDDDDA-----QYAARLRLAQVDDNSGV-GRAIGID---AET--EEYDVLNSDISGVPEFLSS 281 (427) T ss_pred HHHhccccccchhHHHHhcCccc-----hHHHHHHHHHHHHhcCc-ccceeee---cCC--CceeEEecccCChHHHHHH Confidence 655444444433 122221100 00000000000 0000 1111111 111 2244555666666777888 Q ss_pred HHHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCcccCcccc Q lcl|NC_020883. 399 LIKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDFLKELYESCLWLLNDQDSSIRIEEP 476 (589) Q Consensus 399 L~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~aLk~li~~~l~L~~~~~~~~~~e~p 476 (589) ...+|-+.++.|.. .||. +..+-+.||-...+....++ +..| ..+..+|.+++.+.++ + +.- T Consensus 282 ~~~~iaaa~~IP~t~L~G~---sp~Glnstgd~D~~nyyd~i---~~~Qe~~l~p~l~~l~~~i~~-------s---~~~ 345 (427) T protein:vir:10 282 KMDRIVSLSGIHEIIIKNK---NVGGVSASQNTALETFYKLV---DRKREEDYRPLLEFLLPFIVD-------E---EEW 345 (427) T ss_pred HHHHHHhhhCCCeeeeccC---CccccccchhHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhc-------C---CCc Confidence 88888888888865 4443 11122235555554454443 3333 2355677777655432 1 245 Q ss_pred eeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCc Q lcl|NC_020883. 477 NIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDN 556 (589) Q Consensus 477 ~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~ 556 (589) .|.|++....+++|+ |++..... .+..+++.. .=.+.+++.+++..+..+.+. .-+.. + +. T Consensus 346 ~~~f~pL~~~s~kEk--aei~~~~a----~a~~~~~~~--gvi~~~e~r~~L~~~~~~~~~----~~~~~------~-~~ 406 (427) T protein:vir:10 346 SIEFEPLSVPSKKEE--SEITKNNV----ESVTKAITE--QIIDLEEARDTLRSIAPEFKL----KDGNN------I-NI 406 (427) T ss_pred EEEeCCCCCCCHHHH--HHHHHHHH----HHHHHHHhc--CCCCHHHHHHHHHhhhccccC----CCCcc------c-cc Confidence 789999887776665 56654432 222223322 236667777777655333221 11110 0 00 Q ss_pred ccCCCCCCCCCCCCCCCCcchhhhhhc Q lcl|NC_020883. 557 RDEDGNIIEEGDTEEEPSAEENEEIEK 583 (589) Q Consensus 557 ~~~~~~p~deg~~~eep~~~~~e~~~~ 583 (589) + .+.+ .++.+|+..|.++-++ T Consensus 407 e----~~~~--~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 407 R----EPEE--TTEPEPGLGEKLEDEN 427 (427) T ss_pred c----ccch--hcCCCCCCCCCCCCCC Confidence 0 0111 1222333333333222 No 87 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=97.66 E-value=3.1e-05 Score=45.28 Aligned_cols=424 Identities=13% Similarity=0.079 Sum_probs=157.4 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) |==+- +--|..-.+++ .|..+|.|+---.. ++.+.....+.. +.-..|--==+.+-+|++|| T Consensus 1 ~~~~m---~~~~~~~~~~D---~~~~~~~~~~g~~~--------~~~~~~~~~~~~----~l~~~Y~~~~l~~~~Vd~~a 62 (435) T protein:vir:79 1 MGVFM---SDKVKAITKED---GYNEIFGSKDGTFR--------PNAFYMQRAAFK----ALSQFYEEDGMARRIVDVIP 62 (435) T ss_pred CCccc---ccccccchhhc---chhhhhcccccccc--------cCcccCCcCCHH----HHHHHHhcCchhhhhhccch Confidence 10000 00011111222 22222222100000 000111111100 11111211123356778887 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcC Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDG 160 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~G 160 (589) ..+.|---.| +|.++. +.+++..+.-+++.++.+.+.-.-+-| T Consensus 63 ed~~r~g~~i---------------------~g~~~~----------------~~~~~~~~~l~~~~~l~~a~~~~rl~G 105 (435) T protein:vir:79 63 EEMVTPGFKV---------------------DGVKNE----------------KSFKSRWDELRLNAKIIDALSWSRLFG 105 (435) T ss_pred HHhhcCCcee---------------------cCCChH----------------HHHHHHHHHhhHHHHHHHHHHhhhccc Confidence 7663322222 221111 123334444334434444444444555 Q ss_pred ceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeeh---hhhccccccchhhe Q lcl|NC_020883. 161 GIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTN---MLYPVVKAKGDVKK 237 (589) Q Consensus 161 g~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~---~~y~~~~~~~~~~~ 237 (589) |.+.-+.+.+++. ..-|-+.. + .| ++++++..+.=..+..-.+ .-| .+...+. T Consensus 106 ~~~i~i~~~d~~~--------~~~Pl~~~-g-~i----------~~i~v~d~~~i~~~~~~~dp~sp~f----g~P~~y~ 161 (435) T protein:vir:79 106 GSAILAVVADNKM--------LKSPVKPG-A-QL----------EDIRVYDRYQITIHERETNARSVRY----GEPKLYK 161 (435) T ss_pred cEEEEEEecCCCC--------cccccccC-C-ce----------eeEEeechhhccchhhccCCccccc----CcceEEE Confidence 5444443322211 11231111 1 11 1334432211000000000 000 0000001 Q ss_pred eecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC-CCCCCcccCcch-hhhhHHHHHHHHH Q lcl|NC_020883. 238 EIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN-ETFMNPYGISAL-DNLESKQDEINWT 315 (589) Q Consensus 238 ~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~-~~~~~~lG~SD~-~~ie~l~DeLd~t 315 (589) +...+..- +..-|..--.++.|.+ +|+. +...++||.|.+ +.+.+-+...+.+ T Consensus 162 v~~~~~~~-------------------~~~iH~SRli~~~g~~------~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~ 216 (435) T protein:vir:79 162 ISPGGDIP-------------------EFFVHYSRICIIDGER------VSNEKRRQNDGWGASILNKRLIEAIVDYNYC 216 (435) T ss_pred EecCCCCC-------------------ceEEcceeEEEecCCc------chhhhccccCcccchHHHHHHHHHHHHHHHH Confidence 10000000 0000111111222222 1222 345678999988 6788877778877 Q ss_pred HhHHHHHHHHhCCCcEEech--hhhhccccccccccccccccccc--cccccccccccccccccccCccceeeecccHHH Q lcl|NC_020883. 316 ITRSAVIYEQNGKPRISITK--EMMDTLLNIAYERDGHSAKEASM--MTPRIDHRDMEITTFDENGRSMEIHQIDISKIG 391 (589) Q Consensus 316 ~S~~srildk~gkpRI~VP~--~~L~t~~g~~~d~dge~~~~~~~--~~~~~d~~dlev~~~de~g~~~~~iq~Dirvee 391 (589) ......++.+..-+.+.++. .++.+. . ........... ..+. ....+-+...+ -++-+.+..+.. T Consensus 217 ~~~~~~l~~~~~~~v~~~~~l~~~~~~~--~---~~~~~~~r~~~~~~~~~-~~~~~~i~~~~-----e~~e~~~~~lsg 285 (435) T protein:vir:79 217 QELATQLLRRKQQAVWKARDLALMCDDE--E---GRYAARLRLAQVDDESG-VGKAIGIDATD-----EEYEVLNSDVSG 285 (435) T ss_pred HHHHHHHHHHhcCccccchhHHHhhcCc--c---chHHHHHHHHHHHHhcC-CCCceeEecCC-----cceEEEecccCC Confidence 66666666554544444432 222111 0 00000000000 0000 11112222111 123345556666 Q ss_pred HHHHHHHHHHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCc Q lcl|NC_020883. 392 DMDHVKNLIKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDFLKELYESCLWLLNDQDS 469 (589) Q Consensus 392 h~~~ie~L~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~aLk~li~~~l~L~~~~~~ 469 (589) -...++....+|-+.++.|.. .||.- ..+-+.||-...+.+..++. ..| ..+...|.+++.+.++ T Consensus 286 l~~~~~~~~~~iaaa~~IP~t~L~G~s---~~glnstgd~d~~~yyd~i~---~~Qe~~l~p~l~~l~~li~~------- 352 (435) T protein:vir:79 286 VPEFLQEKIDRIVALTGIHEIIIKNKN---TGGVSASQNTALETFYKLID---RKRVEDYKPILEFLLPFMIS------- 352 (435) T ss_pred HHHHHHHHHHHHHhhhCCCeeeeccCC---ccccccchhHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhhc------- Confidence 777788888898888999873 45541 11222355555555555532 233 2345666766655332 Q ss_pred ccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccc Q lcl|NC_020883. 470 SIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQT 549 (589) Q Consensus 470 ~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~ 549 (589) + ..-.|+|++.+..+++|+ |++...... +..+++. +.-.+.+++.++++-+-.+... .+.+.+. T Consensus 353 s---~d~~~~f~pL~~~sekEk--Aei~~~~a~----a~~~~~~--~g~i~~~e~r~~L~~~~~~~~~-----~~~~~~~ 416 (435) T protein:vir:79 353 E---TEWSIEFEPLSVPSDKDK--AEIMAKNVE----SVVKLKA--EQAINLKETRDTLRSICPDLKI-----MDNDNIE 416 (435) T ss_pred C---CCCeEEeCCCCCCCHHHH--HHHHHHHHH----HHHHHHh--cCCCCHHHHHHHHHHhccccCC-----CCccccc Confidence 1 244689999887776655 666655332 2222332 2346777777777422222211 1111111 Q ss_pred cccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 550 FEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 550 l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) ++ +..+.+.++..||- |+| T Consensus 417 ~~---~~~d~~~~~~~e~g--------~~~ 435 (435) T protein:vir:79 417 LP---EPEDLDPEPGQEGG--------LNK 435 (435) T ss_pred CC---ccccCCCCCCCCCC--------CCC Confidence 11 11111111222222 222 No 88 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=97.52 E-value=5.2e-05 Score=44.06 Aligned_cols=468 Identities=11% Similarity=0.091 Sum_probs=156.1 Q ss_pred CccceeccchhH-HHHhhcchhhhhh----------------hhhcCCccccCHHHHHHHhhccccceeccCcce----- Q lcl|NC_020883. 1 MIDWTVRGWTDK-TTKNVHGDYERYR----------------QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQT----- 58 (589) Q Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~r----------------~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~----- 58 (589) ----...-|.++ +..++|-++.+|+ .+|+|+- ++++.+ +.+++ T Consensus 15 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~------------grs~vv~~~v 79 (763) T protein:vir:95 15 SQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKA---KPPKVK------------GRSQVQPKLV 79 (763) T ss_pred cchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccC---cccccC------------CCccccCHHH Confidence 000001122222 1122222222222 1233322 222110 01111 Q ss_pred eeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhc-ccccccchhhhhhhhhhhhhhhhHHH Q lcl|NC_020883. 59 ARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMI-EGPQDEEEAGKNENNTVIDLQNEIIE 137 (589) Q Consensus 59 ~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~e~i~ 137 (589) ++.+. |++-.|+++||.=+ ..+ +-|.. .+..+.++.-.++++ T Consensus 80 ~~~ve--~~~~~l~~~f~~~~-----------------------------~~~~~~P~~------~~D~~~A~q~t~~~n 122 (763) T protein:vir:95 80 RRQAE--WRYSALTEPFLGSN-----------------------------KLFKVTPVT------WEDVQGARQNELVLN 122 (763) T ss_pred HHHHH--HHHHHHHHhhcCCC-----------------------------cEEEEecCC------cchHHHHHHHHHHHH Confidence 00001 12222233332221 111 22222 333444444455666 Q ss_pred HHHhh--ccccccchhhHHHHHHcCceeEEEEEecC-------------------------------------------- Q lcl|NC_020883. 138 QITKN--SKLERRHWSNIVQHQVDGGIVAAPVIDEL-------------------------------------------- 171 (589) Q Consensus 138 ~v~kn--~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~-------------------------------------------- 171 (589) -++.+ ..|+ -+.+++-+++..|=-+.|+||+-. T Consensus 123 ~~~~~~~~~~~-~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (763) T protein:vir:95 123 YQFRTKLNRVS-FIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVD 201 (763) T ss_pred HHHhhcCchhh-HHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhcccccccc Confidence 65433 2332 355677777777777788888621 Q ss_pred -----------------------------------ceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeecc Q lcl|NC_020883. 172 -----------------------------------GPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEK 216 (589) Q Consensus 172 -----------------------------------~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~ 216 (589) .++|.-+.+.-||+.-..++ +..+-+|+++-. +.++ T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~s--------D~~Da~~~~~~~-~~t~ 272 (763) T protein:vir:95 202 EAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQG--------DINKAMFAIVSF-ETCK 272 (763) T ss_pred chhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheecCCCCC--------chhhCceEeeEE-eccH Confidence 11111122222222100000 001112332211 1110 Q ss_pred ccceeehhhhcccc-----ccchhh--eeecc-------------ccccccccccc--ccchhhhhhcc----cCCcccc Q lcl|NC_020883. 217 DGLRTTNMLYPVVK-----AKGDVK--KEIKK-------------GELVTNVEGAE--DLEGEELIREV----LNIPDDR 270 (589) Q Consensus 217 ~~~~~~~~~y~~~~-----~~~~~~--~~~~~-------------gd~~~~~~e~~--d~e~e~~i~~~----i~ip~~~ 270 (589) .-|. .+-|.... ...... ..... .+.+...++.- ++++.....-. .| .... T Consensus 273 ~dL~--~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g-~~iL 349 (763) T protein:vir:95 273 ADLL--KEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIEGNGVLEPIVATWIG-STLI 349 (763) T ss_pred HHHH--hccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeeeccCCcceeEEEEEEEEc-Ceee Confidence 0010 00000000 000000 00000 00111111111 11111100000 00 0001 Q ss_pred ccccccCCCC-cceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccc Q lcl|NC_020883. 271 PLENFYPGRN-RPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERD 349 (589) Q Consensus 271 e~~~i~TGv~-~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~d 349 (589) .....+.... .|+++ ++-.+..+++||.|.++.+.+.++.+|..+++...++..+++|++.|+.+++... | T Consensus 350 ~~~~~p~~~~~~PFv~-~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~-------d 421 (763) T protein:vir:95 350 RLEKNPYPDGKLPFVL-IPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDAL-------N 421 (763) T ss_pred ecccccccCCCcCEEE-ecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccch-------h Confidence 1111112222 45554 4667888999999999999999999999999988888888999999998887431 1 Q ss_pred ccccccccccccccccccccccccccccCccceee---ecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhH Q lcl|NC_020883. 350 GHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQ---IDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQ 426 (589) Q Consensus 350 ge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq---~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~ 426 (589) ......|.. ..+.+.......+...+ .......-+...+..++ .+++.+..+.|... .+.+.+. T Consensus 422 -~~~~~pg~v--------~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e---~~TGv~~~~~G~~~-~~~~~ta 488 (763) T protein:vir:95 422 -SRRYREGED--------YEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAE---SLTGVKAFAGGVTG-ESYGDVA 488 (763) T ss_pred -hhcccCCce--------EEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHH---HhhCcchhhcCcCc-ccccchh Confidence 110110000 00000000000111111 11122233333333333 45677877777422 2223344 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc-----------cCcc------cceeeeCCcCCCCCC Q lcl|NC_020883. 427 SGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSS-----------IRIE------EPNIETQDMILKPRA 489 (589) Q Consensus 427 Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~-----------~~~e------~p~I~f~D~lPvde~ 489 (589) ||++... ...-.....+-+.+.++++.+++.++.|...+... +.+. +-+|...++.+ ... T Consensus 489 t~v~~l~--qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~~a-s~~ 565 (763) T protein:vir:95 489 AGIRGVL--DAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDISTA-EVD 565 (763) T ss_pred HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecccc-hHH Confidence 5554421 11111223334556677777777766665543111 1110 01122222211 000 Q ss_pred HHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCC Q lcl|NC_020883. 490 ELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT 569 (589) Q Consensus 490 El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~ 569 (589) ....+++.+ .+..+.|+.+..-...-+.++.+-.. +......+ +.. ++ T Consensus 566 ~q~~~~l~~------------ll~~l~~~~~~~~~~~il~~~~d~~~------~~~~~~~l------r~~--------q~ 613 (763) T protein:vir:95 566 NQKSQDLGF------------MLQTIGPNVDQQITLNILAEIADLKR------MPKLAHDL------RTW--------QP 613 (763) T ss_pred HHHHHHHHH------------HHHHhccccChHHHHHHHHHHHhhhc------hhhhHHHH------Hhc--------CC Confidence 000111111 11222343333221111112111000 00000001 000 00 Q ss_pred CCCCCcchhhhhhcc-cccCC Q lcl|NC_020883. 570 EEEPSAEENEEIEKE-GEPIA 589 (589) Q Consensus 570 ~eep~~~~~e~~~~~-~~~~~ 589 (589) +++|...--.+++.. .+.-+ T Consensus 614 ~~d~~~q~qaqle~~~~q~e~ 634 (763) T protein:vir:95 614 QPDPVQEQLKQLAVEKAQLEN 634 (763) T ss_pred CccchhhhHHHHHHHHHHHHH Confidence 011111000000000 00000 No 89 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.51 E-value=5.4e-05 Score=43.99 Aligned_cols=457 Identities=9% Similarity=0.004 Sum_probs=175.0 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhc-cccceec--------cCcceeeecCcceEEEEc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEG-DAVGRFL--------DSSQTARETQTPYVIFNL 71 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~-~~~~~~~--------~~~~~~~~~~~~y~~~n~ 71 (589) |-|-+-+ . .-....+-.+...|.++.|.++ +...| .|.-... +..|-+.-.|..| .|+ T Consensus 1 m~~V~~~--h-p~y~~~~~~W~~ird~~~G~~~--------~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~--~n~ 67 (501) T protein:vir:95 1 MPNVSFI--R-PELGKLLPLYYLIRDAIAGEPT--------VKGARTTYLPMPNAEDQSKENKARYEAYLKRAVF--YNV 67 (501) T ss_pred CCCCCCC--C-HHHHHHHHHHHHHHHHhcChHH--------HHhcccccCcCCCCCCCcccchHHHHHHhhcccc--Cch Confidence 5542211 1 1122344556777888988753 00011 1111110 0111111222222 344 Q ss_pred chhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchh Q lcl|NC_020883. 72 PKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWS 151 (589) Q Consensus 72 ~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~ 151 (589) ++-++ +..+|.+-.. ++.-+.+..++ .-.+=+|++.--+++++++ T Consensus 68 ~~~t~-------~~l~G~vf~k--------~p~~~~p~~l~-----------~l~~d~D~~G~~L~~f~~~--------- 112 (501) T protein:vir:95 68 ARRTL-------FGLVGQVFMR--------DPVVKVPALLN-----------PLVANATGSGINLTQLAKR--------- 112 (501) T ss_pred HHHHH-------HHHhhhhhcC--------CcceeCcHHHH-----------HHHhccCCCCCCHHHHHHH--------- Confidence 44333 3334444432 22212222221 0112233333344444443 Q ss_pred hHHHHHHcCceeEEEEEe--cCc--------------eeEEEecCceecc----cccCc--ceeEEEeecCCC-cc---- Q lcl|NC_020883. 152 NIVQHQVDGGIVAAPVID--ELG--------------PRIVFKARDVYFP----HDDEK--GADLAYYIDHGQ-YG---- 204 (589) Q Consensus 152 ~l~~~~v~Gg~~~~~~~~--~~~--------------~~i~f~~~d~~~P----~~d~~--~~div~~~e~~~-~~---- 204 (589) .+..++..|++..-+=.- ... |++.++.|.+.+= ..+|+ -+-+++.+.... ++ T Consensus 113 ~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~ 192 (501) T protein:vir:95 113 AVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEM 192 (501) T ss_pred HHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCccc Confidence 344445666665433211 111 5566666655543 12222 233444333221 12 Q ss_pred ceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceE Q lcl|NC_020883. 205 QFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFI 284 (589) Q Consensus 205 ~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plv 284 (589) +.+..||.+.--.......++|+....+.. .+++ +.........+ . .....-.+++....+ T Consensus 193 ~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~-----~~~~-~~~~~~~~~~~-----------~--~~~~~g~~~l~~IPf 253 (501) T protein:vir:95 193 KTSGQFRVLRLDEEGYYVHEIWREPQPTKA-----DGSK-IPKGNYQQYVV-----------Y--KPTDAQGKRLTEIPF 253 (501) T ss_pred ceeEEEEEEeeCCCceEEEEEEEecCCccc-----Ccce-ecCCcccccce-----------e--eeeccCCCcCCeeeE Confidence 234445544322222223355542111111 0000 11011100000 0 001111255655566 Q ss_pred EEecCCCCCCCcccCcchhhhhHHHHHHHHH---H-hHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccc Q lcl|NC_020883. 285 SYWANNETFMNPYGISALDNLESKQDEINWT---I-TRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMT 360 (589) Q Consensus 285 vyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t---~-S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~ 360 (589) |++-.....-. -|.+-+-+|.. ||-. . +....++-..+.|.+.+. -++ ..+...+... T Consensus 254 v~~~~~~~~~~-~~~pPLl~lA~----lni~hy~~ssd~~~~l~~~~~P~l~i~--G~~----~~~~~~~~~~------- 315 (501) T protein:vir:95 254 MFIGSENNDSN-PDNPNFYDLAS----LNMAHYRNSADYEESCYIVGQPTPVLI--GLT----EEWVTNVLKG------- 315 (501) T ss_pred EEEecCCCCCC-CCccchHHHHH----HHHHHHhhhhHHHHHHHHcccceeeee--CCc----ccccccCCCC------- Confidence 66522211111 23343433332 2322 1 122334434477776542 010 0111111110 Q ss_pred cccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHH Q lcl|NC_020883. 361 PRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTIL 440 (589) Q Consensus 361 ~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~ 440 (589) ++...+-..+..+. +....|++++..-.. ...++.+.++|..+ +...+.. .+++.|+++.+.+..+..+ T Consensus 316 -~i~~G~~~~~~lP~-~~~~~~ie~~~~~i~-~~~l~~l~~~m~~~---Ga~ll~~-----~~~~~Ta~~~~~~~~~~~S 384 (501) T protein:vir:95 316 -SVNFGSRGGIPLPV-GADAKLLQASENTML-KEAMDTKERQMVAL---GAKLVEQ-----KEVQRTATEAELEAASEGS 384 (501) T ss_pred -ceeecccccccCCC-CCceeEEecChhhHH-HHHHHHHHHHHHHH---HHhhccC-----CccchhHHHHHHHHHHHhH Confidence 11111122344443 346788888876654 67788888887643 2222221 1123455554444433333 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHhhcCcccCcccceeeeC-CcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC-C Q lcl|NC_020883. 441 KSRRLQKEYIDFLKELYESC-LWLLNDQDSSIRIEEPNIETQ-DMILKPRAELVAENMAAYAASKQGQSLETTVRRMN-P 517 (589) Q Consensus 441 Kv~~~R~~~~~aLk~li~~~-l~L~~~~~~~~~~e~p~I~f~-D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh-p 517 (589) -...+-..+.++|..+++++ .|+-.. ...+.|..+ |-.+.+.++.. ++.+..+..++.+|.+|.++.|- . T Consensus 385 ~L~~~a~~le~al~~~l~~~a~w~g~~------~~~~~v~i~~df~~~~~~~~~-~~al~~~~~~G~is~~t~~~~L~~~ 457 (501) T protein:vir:95 385 TLSSATKNVSAAFEWALKWAARWVGQA------DSGVKFELNTDFDIARMTPDE-RRSLVEEWQKGAITFEEMRTGLRKA 457 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCC------CCceEEEEecccccccCCHHH-HHHHHHHHhCCCCcHHHHHHHHHhC Confidence 34444455567777766443 343211 111222221 11112222211 23333456788999999977761 1 Q ss_pred CCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCC-CCCCCC Q lcl|NC_020883. 518 DASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNII-EEGDTE 570 (589) Q Consensus 518 dw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~-deg~~~ 570 (589) ..-+....+|.++|..+....++..-. . +...+++.. |-|+-| T Consensus 458 ~v~~~~~~~e~e~i~~~~~~~~~~~~~------~----~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 458 GVATEDDSKAKEKIAKDTAEAMALATP------A----NVPGDGSGGDNVGNSE 501 (501) T ss_pred CCCChhHHHHHHHHHhhhcCccccccc------C----CCCCCCcccccccCCC Confidence 233334445667777665432221111 1 111111111 112222 No 90 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=97.43 E-value=6.9e-05 Score=43.41 Aligned_cols=461 Identities=11% Similarity=0.028 Sum_probs=157.7 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCc-------------------------cccCHHHHHHHhhccccceeccC Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKH-------------------------ELLFPRAKRLIEEGDAVGRFLDS 55 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~-------------------------~~~f~ra~~~~~~~~~~~~~~~~ 55 (589) |.+|.+.+--..+ .|..+.|+=.. ...|+++.. .-|...-..+ T Consensus 43 ~~~~~~~~~~~~~--------~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~f~g 110 (765) T protein:vir:96 43 IRGWNVEPEKAPV--------IRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQ----DWYNSQGFIG 110 (765) T ss_pred HhhcccccccCCC--------CCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHH----hhhcccCCcc Confidence 6666655433222 22233333100 001111100 0000000000 Q ss_pred cceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhH Q lcl|NC_020883. 56 SQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEI 135 (589) Q Consensus 56 ~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~ 135 (589) . ++-..|---=+.+-+|++||..+-|---.|++ ...+..+-+.+. T Consensus 111 y----ql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~-------------------------------~~~e~~~~~~~~ 155 (765) T protein:vir:96 111 Y----QACAIISQHWLVDKACSMSGEDAARNGWELKS-------------------------------DGRKLSDEQSAL 155 (765) T ss_pred H----HHHHHHHhCchhhhhhhcchHHhhcCCceeec-------------------------------CccccCHHHHHH Confidence 0 01111111113345666666665332222221 111111223345 Q ss_pred HHHHHhhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeec Q lcl|NC_020883. 136 IEQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVE 215 (589) Q Consensus 136 i~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~ 215 (589) +++..+.-+++.++.+.+.-.-.-||-++...++....+ .. + -| +..+-+ .. ..-++++++.-.+. T Consensus 156 l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~-~l---~--~P----L~~~~I---~k-g~~kgl~vldp~~~ 221 (765) T protein:vir:96 156 IARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPD-YY---E--KP----FNPDGI---AP-GSYKGISQIDPYWA 221 (765) T ss_pred HHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcc-hh---h--cc----cccccc---cc-ceeeEEEEechhhc Confidence 555555554444444444433344444444444321110 00 0 11 100000 00 01123444422111 Q ss_pred cccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC-CCCC Q lcl|NC_020883. 216 KDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN-ETFM 294 (589) Q Consensus 216 ~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~-~~~~ 294 (589) . .+.+.+.. . +. ....|..... +. +-+.++ |..--.++.|.+ +|+. +... T Consensus 222 ~-~~~v~e~~-~--Dp---------~sp~fg~P~~---y~---i~g~~I----H~SRli~~~g~~------lpd~lk~~~ 272 (765) T protein:vir:96 222 M-PQLTAEST-A--DP---------SAEHFYEPDF---WI---ISGKKY----HRSHLVVVRGPQ------PPDILKPTY 272 (765) T ss_pred c-cccchhcc-c--cc---------cccccCccee---ee---ecCcee----ccceEEEecCCC------chhhhcccc Confidence 1 11110000 0 00 0000000000 00 000000 111001111111 1222 3445 Q ss_pred CcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 295 NPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFD 374 (589) Q Consensus 295 ~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~d 374 (589) .++|+|-++.+.+-+...+.+.-....++-+..-..+.+ .++..+.+ .+. ....+.......+..-+.++ + T Consensus 273 ~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~--~~~~~l~~----~~~-l~~r~~~~~~~r~n~g~~~i--d 343 (765) T protein:vir:96 273 IFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHV--DVEKAIAN----EDA-FNARLAFWIANRDNHGVKVI--G 343 (765) T ss_pred CccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeee--chHhhhcc----HHH-HHHHHHHHHHhcCCceeEEe--c Confidence 568999999888888888877655555554433333332 12211110 000 00000000000011101121 1 Q ss_pred cccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCc-hhcccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHH Q lcl|NC_020883. 375 ENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSE-KAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDF 452 (589) Q Consensus 375 e~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~-~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~a 452 (589) .. -+|-+.+..+..-...++....+|-+.++.|. ..||.- ..+.-.||-...+.+..+ |+..| ..+..+ T Consensus 344 -~e--e~~e~~s~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqs---p~GlnATGe~D~~nYyD~---I~s~Qe~~l~p~ 414 (765) T protein:vir:96 344 -ID--ETMEQFDTNLSDFDSVIMNQYQLVAAIAKTPATKLLGTS---PKGFNATGEHETISYHEE---LESIQEHIFDPL 414 (765) T ss_pred -CC--cceeEEecccCCHHHHHHHHHHHHHhhhCCCeeeeccCC---cccccCcchHHHHHHHHH---HHHHHHHHHHHH Confidence 11 23445666677777788888888888888885 445431 011113455444444444 33333 235677 Q ss_pred HHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHH-----HHHHHHHhccchhhHHHHHHHhC--C-----CCC Q lcl|NC_020883. 453 LKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVA-----ENMAAYAASKQGQSLETTVRRMN--P-----DAS 520 (589) Q Consensus 453 Lk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~-----A~t~~~l~~a~~~S~etaVr~Lh--p-----dw~ 520 (589) |.+++.++++.. .+. ....|.|++.+..+++|++. |++.+++.+++++|...+-.+|. | +.+ T Consensus 415 le~L~~li~~s~-----~i~-~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~ 488 (765) T protein:vir:96 415 LERHYLLLAKSE-----SID-VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLT 488 (765) T ss_pred HHHHHHHHHHhc-----CCC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCC Confidence 788776655431 122 24578999988777665532 23344444555554444433331 0 011 Q ss_pred HHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCC--------CCCCCcchhhhhhccc------- Q lcl|NC_020883. 521 EDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT--------EEEPSAEENEEIEKEG------- 585 (589) Q Consensus 521 dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~--------~eep~~~~~e~~~~~~------- 585 (589) +++.+ .+. ..+|..... +. ..+.++.+.+..+..+ +..|+...++....+. T Consensus 489 d~~~e-------~~~-~~~pe~~~~----~~--~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~ 554 (765) T protein:vir:96 489 DDQAE-------TEP-GMSPENLAE----LE--KAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEG 554 (765) T ss_pred ccccc-------ccc-CCCcccccc----cc--CCCcccccccCccccccCCCCccCCCCcccccCCcccCCcccccccc Confidence 11100 000 000000000 00 0000111111110000 0000000011110000 Q ss_pred ---------ccCC Q lcl|NC_020883. 586 ---------EPIA 589 (589) Q Consensus 586 ---------~~~~ 589 (589) +|.+ T Consensus 555 ~g~~~~~p~~~~p 567 (765) T protein:vir:96 555 AGEAATPPSRPNP 567 (765) T ss_pred CccccCccccccc Confidence 0010 No 91 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=97.43 E-value=6.9e-05 Score=43.40 Aligned_cols=450 Identities=13% Similarity=0.045 Sum_probs=184.4 Q ss_pred CccceeccchhHHHHhhcchhh-----hhhhhhc-------------CCccccCHHHHHHHhhccccceeccCcceeeec Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYE-----RYRQLYE-------------GKHELLFPRAKRLIEEGDAVGRFLDSSQTARET 62 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-----~~r~l~~-------------g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~ 62 (589) +|-|..+.=.-.. +..-..|+ |-.+.+- +.-..|=-||+.|.+. |.++ T Consensus 14 ~i~~~~~~~~~~~-~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rN----------n~~a--- 79 (505) T protein:vir:96 14 MVNWAWYRYVEPQ-KNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSIN----------NPYA--- 79 (505) T ss_pred ccchhhhhhHHHH-HHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhc----------ChHH--- Confidence 3322211000000 00000000 0001110 0111122334444431 1111 Q ss_pred CcceEEEEcchhhhccchhhhcccccc--ccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH Q lcl|NC_020883. 63 QTPYVIFNLPKVIAEIPATMVSGSIGQ--IKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT 140 (589) Q Consensus 63 ~~~y~~~n~~~~i~~~pa~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~ 140 (589) .-+++ .++++.+|. ++-.. .+... .+..+ ..++..+-+ .+..-. T Consensus 80 ----------~~av~---~~~~nvVG~~Gi~~~~-------~~~~~-----~~~~~-----~~~~~~ie~----~w~~Wa 125 (505) T protein:vir:96 80 ----------KRFYQ---LLKNNVIGPKGMTFQS-------RVKRR-----NGKPD-----DRANTLIEG----NWQQWI 125 (505) T ss_pred ----------HHHHH---HHHHHhcCCCcceeee-------cCCcc-----ccccc-----HHHHHHHHH----HHHHhc Confidence 11111 123333331 21100 00000 00111 113333322 233333 Q ss_pred h--hccccccc------hhhHHHHHHcCceeEEEEEecCc---eeEEEecCceec-ccccCc--ceeEEEeecCCCccce Q lcl|NC_020883. 141 K--NSKLERRH------WSNIVQHQVDGGIVAAPVIDELG---PRIVFKARDVYF-PHDDEK--GADLAYYIDHGQYGQF 206 (589) Q Consensus 141 k--n~~~~~~~------~~~l~~~~v~Gg~~~~~~~~~~~---~~i~f~~~d~~~-P~~d~~--~~div~~~e~~~~~~~ 206 (589) . +|.+..++ ...+...+++|=|.++.....+. .+|+..++|..= |.+... +-.|..=.|..+.++ T Consensus 126 ~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr- 204 (505) T protein:vir:96 126 KKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWER- 204 (505) T ss_pred CCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCc- Confidence 2 46555544 22445567899988887765433 577777776532 211111 111222222222222 Q ss_pred EEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEE Q lcl|NC_020883. 207 LHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISY 286 (589) Q Consensus 207 l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvy 286 (589) .+.|.+++. +-|+.+..... ....+.-++.+-|.| T Consensus 205 -------------~~aY~i~~~-----------hPgd~~~~~~~---------------------~~~~~~rvpa~~vlH 239 (505) T protein:vir:96 205 -------------PVAYHLLVN-----------HPGDNSYCYHY---------------------AGQTYERVPADEIIH 239 (505) T ss_pred -------------eEEEEEeec-----------CCCcccccccc---------------------ccccccccCHhHhhh Confidence 222223221 11221110000 001112234556888 Q ss_pred ecCCCCCCCcccCcchhhhhHHHHHHHHHHh-HHHHHHHHh-CCCcEEechhhhhccccccccccccccccccccccccc Q lcl|NC_020883. 287 WANNETFMNPYGISALDNLESKQDEINWTIT-RSAVIYEQN-GKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRID 364 (589) Q Consensus 287 vPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S-~~srildk~-gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d 364 (589) +-.........|+|+|.-+...+..|+.-.. -+.+. +. +-=..+|-. -.........+.+++....+ T Consensus 240 ~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a--~i~A~~a~fi~~-~~~~~~~~~~~~~~~~~~~l-------- 308 (505) T protein:vir:96 240 TFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAA--ELGAKKVGFYEQ-DPEAYDQPPEDDQGEIVEEV-------- 308 (505) T ss_pred hhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHH--HHhhhheeeeec-CCccCCCccccccCcccccc-------- Confidence 8777777888999999988888888886422 11111 11 111122210 00011111122222211111 Q ss_pred cccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHH Q lcl|NC_020883. 365 HRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRR 444 (589) Q Consensus 365 ~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~ 444 (589) .+.. +..-..|..+.+++.+.-...+..++..+++.|-+-.++|+..+.-.. ++ .+=+|.|..++..++.++. T Consensus 309 -~pG~-i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~-s~----~nYSS~R~~~~e~~r~~~~ 381 (505) T protein:vir:96 309 -EAGT-YQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDL-EG----VNFSSLRSGELDERDLYKL 381 (505) T ss_pred -CCce-eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-cc----ccHHHHHHHHHHHHHHHHH Confidence 1111 112234555667777766677888899999999999999999886311 11 1456777778777777788 Q ss_pred HHHHHHH-HHHHHHHHHHHHHhhcCc-ccCcc----cceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCC Q lcl|NC_020883. 445 LQKEYID-FLKELYESCLWLLNDQDS-SIRIE----EPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPD 518 (589) Q Consensus 445 ~R~~~~~-aLk~li~~~l~L~~~~~~-~~~~e----~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpd 518 (589) +|..+.. +++.+.+.-+......|. .+... -..+.|-..--.--+++..++.......+|++|++..++...-| T Consensus 382 ~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D 461 (505) T protein:vir:96 382 LQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDD 461 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCC Confidence 7776665 334443322211111111 11111 12355632111111344445555667799999999999998654 Q ss_pred CCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcch Q lcl|NC_020883. 519 ASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEE 577 (589) Q Consensus 519 w~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~ 577 (589) |+ ++.+|+++= ........+....+ ..+.+....++++.++++| T Consensus 462 ~~--~v~~q~a~e---~~~~~~~Gl~~~~~----------~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 462 PE--DVFDEIAWE---EQLMRDKGVNPTPP----------EQESKDATTDEEDDSASDD 505 (505) T ss_pred HH--HHHHHHHHH---HHHHHHcCCCCCCC----------CCCCCCCCCCCCCCCCCCC Confidence 44 333333322 11111111110000 0011111112222222222 No 92 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=97.34 E-value=8.9e-05 Score=42.78 Aligned_cols=462 Identities=9% Similarity=0.028 Sum_probs=177.5 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccc--eeccCcceeeecCcceEEEEcchhhhcc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVG--RFLDSSQTARETQTPYVIFNLPKVIAEI 78 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~--~~~~~~~~~~~~~~~y~~~n~~~~i~~~ 78 (589) |-|-+-+ . .-....+-.+...|.++.|.++ +..+...-|-+.+..+ .-.+..|-+.-.|..| .|+++-+++ T Consensus 32 m~dV~~~--h-p~y~a~~~~W~~ird~~~G~~~-~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~--~n~~~~tl~- 104 (535) T protein:vir:80 32 LPNVGYQ--R-VEFGEMLPKWRKIMDCLSGQEA-IKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIF--YNVTARTLD- 104 (535) T ss_pred CCCCCcC--C-HHHHHHHHHHHHHHHHhcChHH-HHhcccccCCCCCcccCCcCCHHHHHHHHhhccC--CChhHHHHH- Confidence 5442111 0 1122334556677788888632 1111111111111000 0000111111222222 355544443 Q ss_pred chhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHH Q lcl|NC_020883. 79 PATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQV 158 (589) Q Consensus 79 pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v 158 (589) ..+|.+-. +++.-+.+..++ ...+=+|++.--++++++. .+..++. T Consensus 105 ------~l~G~vfr--------k~p~~~~p~~l~-----------~l~~d~D~~G~~L~~f~~~---------~~~~~l~ 150 (535) T protein:vir:80 105 ------GMMGQVFS--------RDPIRQLPPALE-----------AIVEDIDGEGVSLDQQAKK---------ALGYTMG 150 (535) T ss_pred ------HHhchhhc--------CCcceeccHHHH-----------HHHhccCCCCCCHHHHHHH---------HHHHHHh Confidence 33444433 122222222221 1112234444344444443 3444556 Q ss_pred cCceeEEEEEecCc--------------eeEEEecCceecc----cccCc--ceeEEEeecCCC-----ccceEEEEEee Q lcl|NC_020883. 159 DGGIVAAPVIDELG--------------PRIVFKARDVYFP----HDDEK--GADLAYYIDHGQ-----YGQFLHIYRER 213 (589) Q Consensus 159 ~Gg~~~~~~~~~~~--------------~~i~f~~~d~~~P----~~d~~--~~div~~~e~~~-----~~~~l~~~~~~ 213 (589) .|+|..-+=.-..+ +++.++.|.+.+= ..+|+ -+-+++.+.... ..+.+..||.+ T Consensus 151 ~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL 230 (535) T protein:vir:80 151 FGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVL 230 (535) T ss_pred cCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEE Confidence 67665444222111 5666666655543 12222 233444332111 12344444443 Q ss_pred ec-cccceeehhhhccccccchhheeecccc-cccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCC Q lcl|NC_020883. 214 VE-KDGLRTTNMLYPVVKAKGDVKKEIKKGE-LVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNE 291 (589) Q Consensus 214 ~~-~~~~~~~~~~y~~~~~~~~~~~~~~~gd-~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~ 291 (589) .- .+|... -.+|+. ..++. .....+. .....-.+.+....||++-... T Consensus 231 ~~~~~G~y~-v~~~~~----------~~~~~~~~~~~~~-------------------~~~~~g~~~l~~IPfv~~~~~~ 280 (535) T protein:vir:80 231 QLNAEGNYQ-VERWRR----------ETQEEMYYSYSKH-------------------VPTDGNGNPFKEIPFQFIGPLD 280 (535) T ss_pred EecCCceEE-EEEEEe----------ecCCcccccccee-------------------ecccCCCcccCeeEEEEeecCC Confidence 21 111111 123320 00111 0110000 0111122455555566653322 Q ss_pred CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccc--cccccccccccccccccccccccc Q lcl|NC_020883. 292 TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIA--YERDGHSAKEASMMTPRIDHRDME 369 (589) Q Consensus 292 ~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~--~d~dge~~~~~~~~~~~~d~~dle 369 (589) . ....|.+-|-+|..+--+.=..-+....++-..+.|.+++. |+. |..++-... ++...+-. T Consensus 281 ~-~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~--------G~~~~~~~~~~~~~-------~i~iG~~~ 344 (535) T protein:vir:80 281 N-NADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFT--------GLTKDWVEDVFKDF-------KVHLGSRA 344 (535) T ss_pred C-CCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeee--------cCchhhhhcCCCCc-------ceEecCcc Confidence 2 22235555555444422221122233444445578877653 111 111110000 01111112 Q ss_pred ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020883. 370 ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEY 449 (589) Q Consensus 370 v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~ 449 (589) ++..+++ ....|+++..+-.. .+.++.+..+|.+ ++...+... .+. .+.++++..+ ....+-...+-..+ T Consensus 345 ~~~lP~~-~~~~~~e~~~~~~a-~~~l~~~e~qM~~---lGa~ll~~~--~~~-~Ta~~a~~~~--~~~~S~L~~~a~~l 414 (535) T protein:vir:80 345 IIPLPQG-ATAGILQITPNSVP-FEAMTHKESQMIA---MGANLLVKS--GGN-RTFGEAQQEE--ASEQSILSACTKNV 414 (535) T ss_pred cccCCCC-CCcceeeeccchhH-HHHHHHHHHHHHH---HHHHhhccC--ccc-ccHHHHHHHH--HHHhHHHHHHHHHH Confidence 3334443 44677787766544 4677777777653 444444322 222 2223333332 11122223334445 Q ss_pred HHHHHHHHHHHH-HHHhhcCcccCccccee----eeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh------CCC Q lcl|NC_020883. 450 IDFLKELYESCL-WLLNDQDSSIRIEEPNI----ETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM------NPD 518 (589) Q Consensus 450 ~~aLk~li~~~l-~L~~~~~~~~~~e~p~I----~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L------hpd 518 (589) .++|+.+++++. |+-.. ...+.+.| +|.+. ..+.. ..+.++ .+..++.+|.+|.++.| .|+ T Consensus 415 e~al~~aL~~~A~w~G~~----~~~~~~~i~~n~dF~~~-~ld~~--~~~all-~~~~~G~Is~et~~~~L~r~gvl~~~ 486 (535) T protein:vir:80 415 SMAFRKALRWANQFQTGI----VNDETVEYNLNTDFPAA-RLTPN--ERAELI-LEWQQGAITFKEMRAGLRRAGVASED 486 (535) T ss_pred HHHHHHHHHHHHHHcCCc----cCCCceEEEeccccccc-cCCHH--HHHHHH-HHHhcCCCCHHHHHHHHHhCCCCCcc Confidence 567766664443 43211 12222333 34321 12222 222333 34557889999998777 344 Q ss_pred CCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcc Q lcl|NC_020883. 519 ASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAE 576 (589) Q Consensus 519 w~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~ 576 (589) .+.+ +|..||+.|.... +...|-.+++- ....++-|.|-|+.+---.+. T Consensus 487 ~~~e---ee~~ri~~E~~~~-~~~~g~~~d~~-----~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 487 DAKA---ETEGKATVEFIAK-TAAAGKVGDAA-----SGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred cchH---HHHHHHHhhhhhc-cccCCCCCCCC-----CCCCCcCcccCCccccccCCC Confidence 4443 5778998874332 11122111111 112233345545544443333 No 93 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=97.16 E-value=0.00015 Score=41.58 Aligned_cols=469 Identities=12% Similarity=0.046 Sum_probs=193.6 Q ss_pred ccchhHHHHhhcc--hhhhh--h---hhhcCCccccCHHHHHHHhhccccceeccCcceee-------------ecCcce Q lcl|NC_020883. 7 RGWTDKTTKNVHG--DYERY--R---QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR-------------ETQTPY 66 (589) Q Consensus 7 ~~~~~~~~~~~~~--~~~~~--r---~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~-------------~~~~~y 66 (589) -+|-|++|.-|-- ...|- | .=|+|.... ++........ ..|.=+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~----------------r~~~~~~~~~s~~~~i~~~~~~lr~RaRd 64 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPG----------------RTHKAKRQPLGADTSLQKSAVSMREQCRK 64 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCcc----------------ccccccCCCCChHHHHHHHHHHHHHHHHH Confidence 5677777765521 11111 1 113332111 0000000000 001112 Q ss_pred EEEEcchh--hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcc Q lcl|NC_020883. 67 VIFNLPKV--IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSK 144 (589) Q Consensus 67 ~~~n~~~~--i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~ 144 (589) |+-|-+-+ +++ .++++.+|..=+ .+...+...++ .+ -..+++.+ ...+..-.++|. T Consensus 65 L~rNn~~a~~av~---~~~~nvVG~~G~-----~i~p~~l~~d~-----~~-----a~~l~~~i----e~~w~~Wa~~~D 122 (548) T protein:vir:95 65 LDEDHDLVTGLLD---RLEERVVGGSGI-----GVEPLPLRLDG-----SV-----HAELAMEI----RSAWAEWSLSPE 122 (548) T ss_pred HHhcChHHHHHHH---HHHHhccCcccc-----ceeeeecCCCH-----HH-----HHHHHHHH----HHHHHHhhcCcc Confidence 22221110 111 122222331100 01111111111 00 01222333 224555566777 Q ss_pred ccccchh------hHHHHHHcCceeEEEEEecC---------ceeEEEecCceecccccCc-ceeEEEeecCCCccceEE Q lcl|NC_020883. 145 LERRHWS------NIVQHQVDGGIVAAPVIDEL---------GPRIVFKARDVYFPHDDEK-GADLAYYIDHGQYGQFLH 208 (589) Q Consensus 145 ~~~~~~~------~l~~~~v~Gg~~~~~~~~~~---------~~~i~f~~~d~~~P~~d~~-~~div~~~e~~~~~~~l~ 208 (589) +..++.- .+-..+++|=|.++..+... ..+|+..++|.. ++.... +-.|.-=.|..+.++ T Consensus 123 ~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l-~~~~~~~~~~i~~GIE~D~~Gr--- 198 (548) T protein:vir:95 123 TSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYL-PFSYNNLSKGIVQGIERDTWRR--- 198 (548) T ss_pred ccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccCCcccceEEEEechhhc-CCCCCCCCCceeeeeEECCCCc--- Confidence 6655422 34445789999999888632 147888888764 321111 111221111111122 Q ss_pred EEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEec Q lcl|NC_020883. 209 IYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWA 288 (589) Q Consensus 209 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvP 288 (589) .+.|-+++. +-|+.+..... ....-++..-|+|+- T Consensus 199 -----------p~aY~i~~~-----------hPgd~~~~~~~-----------------------~~~~rvpA~~VlHif 233 (548) T protein:vir:95 199 -----------KRAYHLLKD-----------HPGNLQTLGGS-----------------------LAVKRVEAERIIHIA 233 (548) T ss_pred -----------eEEEEEeec-----------CCCcccccccc-----------------------cceeeechhHheecc Confidence 222222211 11221100000 000112445588987 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHh-HHHHHHHHh-CCCcEEechhhhhccccccccccccccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTIT-RSAVIYEQN-GKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHR 366 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S-~~srildk~-gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~ 366 (589) .........|+|.|.-+...+..|++-.. -+.+. |. +-=..+|=...=+.......+.++... +... T Consensus 234 ~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~a--ki~A~~a~fi~~~~~~~~~~~~~~~~~~~~---------~~~~ 302 (548) T protein:vir:95 234 YRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAA--RISAALAMYIKKGNPDSYTVEPGKDRKNRT---------IPIA 302 (548) T ss_pred cccCCccccCcchHHHHHHHHHHHhHHHHHHHHHH--HHhhhheeeeecCCCccccCCCCccccccc---------cccc Confidence 77777788999999998888888886432 11111 11 111122211000000000001111100 0011 Q ss_pred cccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_020883. 367 DMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ 446 (589) Q Consensus 367 dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R 446 (589) +-.+...-..|..+.+++.+--...+..++..+++.|-+-.++|+..++-.. + + |-+|.|..++..++.+..+| T Consensus 303 pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~-s----~-nYSS~R~~l~e~~r~~~~~q 376 (548) T protein:vir:95 303 PGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAY-D----G-TYSAQRQELVEGWLGYDLLQ 376 (548) T ss_pred CCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-c----h-hHHHHHHHHHHHHHHHHHHH Confidence 1111111233445666666555566777888999999888899999997421 1 1 56778888888887777777 Q ss_pred HHHHHHHHH-HHHHHHHHHhhc-CcccCc-------ccceeeeCC-cCC-CCCCHHHHHHHHHHHhccchhhHHHHHHHh Q lcl|NC_020883. 447 KEYIDFLKE-LYESCLWLLNDQ-DSSIRI-------EEPNIETQD-MIL-KPRAELVAENMAAYAASKQGQSLETTVRRM 515 (589) Q Consensus 447 ~~~~~aLk~-li~~~l~L~~~~-~~~~~~-------e~p~I~f~D-~lP-vde~El~~A~t~~~l~~a~~~S~etaVr~L 515 (589) ..+...+-+ +.+. ||+... .+.+.+ .-..+.|-. +.+ +| ++..++...++..+++.|++..++.. T Consensus 377 ~~~i~~~~~Pi~~~--wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iD--P~Kea~A~~~~i~~Gl~T~~~~~a~~ 452 (548) T protein:vir:95 377 HEFIDYWCRPVYRS--WLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWIN--PMHEANAWELLVKAGFADEAEVARAR 452 (548) T ss_pred HHHHHHHHHHHHHH--HHHHHHHcCCcCCCCCCCchhheeeeeecCCccccC--hHHHHHHHHHHHHcCCCCHHHHHHHh Confidence 666554433 3322 333221 111111 112456622 111 34 44445566667799999999999998 Q ss_pred CCCCCH--HHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCC--CCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 516 NPDASE--DWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNII--EEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 516 hpdw~d--E~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~--deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) .-||++ ++..+|.+++++-.- ++.+. +.........++-+++. .-|..-..|.++-.|+.-..+--|- T Consensus 453 G~D~~ev~~q~a~E~~~~~~~GL-----~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (548) T protein:vir:95 453 GRDPRELKKSRETEIKANRAAGL-----VFSSD-AYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLP 524 (548) T ss_pred CCCHHHHHHHHHHHHHHHHHcCC-----CCCCc-ccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCc Confidence 755553 444455555543221 11111 11100001111111111 0010111111222222222221111 No 94 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=97.03 E-value=0.0002 Score=40.84 Aligned_cols=479 Identities=12% Similarity=0.030 Sum_probs=186.7 Q ss_pred Ccccee-------ccchhHHHHhhcchhh---h-------hh-------hhhcCCccccCHHHHHHHhhccccceeccCc Q lcl|NC_020883. 1 MIDWTV-------RGWTDKTTKNVHGDYE---R-------YR-------QLYEGKHELLFPRAKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~~~~~-------~~~~~~~~~~~~~~~~---~-------~r-------~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~ 56 (589) |++=++ +||--.....-.+.|+ + ++ ..-......|=-||+.|.+ .| T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~r----------Nn 70 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMAD----------ND 70 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHh----------cC Confidence 333222 2222111111000000 0 00 0000011122234444444 22 Q ss_pred ceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHH Q lcl|NC_020883. 57 QTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEII 136 (589) Q Consensus 57 ~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i 136 (589) .+|+ + +++.-..-++|+ |=....-|+.. -+.+-+. +.-..++..+.++=++.. T Consensus 71 ~~a~------------~-av~~~~~nvVG~-Gi~~~~~~~~~---~l~g~~~----------~~~~~~~~~ie~~w~~wa 123 (553) T protein:vir:63 71 GFTN------------G-AVGYQRDSIVGA-QYRLNSMPDIN---VIPGATE----------EWAEEYQTIVEAKFELYA 123 (553) T ss_pred hHHH------------H-HHHHHHHhhccC-Cceeeeccchh---hhcCCCH----------HHHHHHHHHHHHHHHHhc Confidence 2211 1 111111222222 22222111100 0000011 011223333322211111 Q ss_pred HH------HHhhccccccchhhHHHHHHcCceeEEEEEecC-----ceeEEEecCceecc-cccCcceeEEEeecCCCcc Q lcl|NC_020883. 137 EQ------ITKNSKLERRHWSNIVQHQVDGGIVAAPVIDEL-----GPRIVFKARDVYFP-HDDEKGADLAYYIDHGQYG 204 (589) Q Consensus 137 ~~------v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~-----~~~i~f~~~d~~~P-~~d~~~~div~~~e~~~~~ 204 (589) +. +---..|..--.-.+-..+++|=|.++..+... .++|+..++|..=- .+..-+-.|..=.|..+.+ T Consensus 124 ~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~G 203 (553) T protein:vir:63 124 ESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRG 203 (553) T ss_pred CCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCC Confidence 11 111112332233355567889999998877632 26778887765321 1221122233222222222 Q ss_pred ceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceE Q lcl|NC_020883. 205 QFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFI 284 (589) Q Consensus 205 ~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plv 284 (589) + .+.|-+|+. +-|+.+....... -..+ ...++-++.+-| T Consensus 204 r--------------~vaY~i~~~-----------hPgd~~~~~~~~~-------~~~r---------~~~~~~v~a~~v 242 (553) T protein:vir:63 204 R--------------PQGYWIQVA-----------HPGDLYQMAPDMY-------KWKF---------VQQSKPWGRRQV 242 (553) T ss_pred c--------------eEEEEeecc-----------CCCcccccccccc-------ceee---------eccccccChhHh Confidence 2 222223321 1222111000000 0000 011234567779 Q ss_pred EEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHH-Hh-CCCcEEe-----chhhhhcccccccccccccccccc Q lcl|NC_020883. 285 SYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYE-QN-GKPRISI-----TKEMMDTLLNIAYERDGHSAKEAS 357 (589) Q Consensus 285 vyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srild-k~-gkpRI~V-----P~~~L~t~~g~~~d~dge~~~~~~ 357 (589) +|+-.........|+|.|.-+...+..|++-.. +...- +. +.=.++| ++...+...+.. .++......+ T Consensus 243 lH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~d--aeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 318 (553) T protein:vir:63 243 IHILEPREPDQSRGIADIVSGLKDMRMAKRFKE--MSLQNAVINASYAAAIESELPPEFIHSQMSGGS--PNADMVGIFG 318 (553) T ss_pred eecccccCCCcccCCchHHHHHHHHHHHhHHHH--HHHHHHHHhhhheeeeecCCChhhhhhhccccc--cccccccccc Confidence 998777778888999999988888888886422 11100 11 1111222 111111110000 0000000000 Q ss_pred cc-------c---cccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHH Q lcl|NC_020883. 358 MM-------T---PRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQS 427 (589) Q Consensus 358 ~~-------~---~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~S 427 (589) .. + ..+...+- .+..-..|..+.+++...-...+-.++..+++.|-+-.++|+..+.- +-+++ + T Consensus 319 ~~~~~~~~~~~~~~~~~l~pG-~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~-D~s~~----n 392 (553) T protein:vir:63 319 KYMDALKAYVGGANNIQIDGA-KIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTR-DFSKA----N 392 (553) T ss_pred ccccccccccccccceeecCc-eeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhh-hcccc----c Confidence 00 0 00000010 11112334456666666556677788889999999888999988863 21121 3 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcC-cccCc---------cc-------ceeeeCCcCCCCCC Q lcl|NC_020883. 428 GVAKFYDLLTTILKSRRLQKEYIDFL-KELYESCLWLLNDQD-SSIRI---------EE-------PNIETQDMILKPRA 489 (589) Q Consensus 428 g~A~r~~~~~~~~Kv~~~R~~~~~aL-k~li~~~l~L~~~~~-~~~~~---------e~-------p~I~f~D~lPvde~ 489 (589) =+|.|..++.-++.+..+|..+...+ +.+.+. ||+.... +.+.+ .. ..+.|-..-...-+ T Consensus 393 YSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~--wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iD 470 (553) T protein:vir:63 393 YSSIQAGIAMTRRFLEGRKKMCADRLATEFFTL--WLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQID 470 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccC Confidence 45667666666666666666555543 333222 2332211 11110 00 12455332111113 Q ss_pred HHHHHHHHHHHhccchhhHHHHHHHhCCCCC--HHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCC-C Q lcl|NC_020883. 490 ELVAENMAAYAASKQGQSLETTVRRMNPDAS--EDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIE-E 566 (589) Q Consensus 490 El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~--dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~d-e 566 (589) ++..++...+...+|+.|++..++...-||+ .++..+|.+++++-- +++.. +.+...+++.+ + T Consensus 471 P~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~G-----l~~~~---------~~~~~~~~~~~~~ 536 (553) T protein:vir:63 471 QLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYG-----LTFNL---------SAKRSLGDGRDAA 536 (553) T ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcC-----CCCCC---------CCccccCCCcccC Confidence 4444555556779999999999999975555 233334444433321 11111 11111111111 1 Q ss_pred CCCCCCCCcchhhhhhcccc Q lcl|NC_020883. 567 GDTEEEPSAEENEEIEKEGE 586 (589) Q Consensus 567 g~~~eep~~~~~e~~~~~~~ 586 (589) ..+.+.|.+++.+| ++| T Consensus 537 ~~~~~~~~~~~~~~---~~e 553 (553) T protein:vir:63 537 TGIAEDPAAAQTSQ---QGE 553 (553) T ss_pred CCCCCCCCCCCccc---ccC Confidence 11222222222222 222 No 95 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=96.77 E-value=0.00035 Score=39.53 Aligned_cols=419 Identities=10% Similarity=0.013 Sum_probs=154.5 Q ss_pred cceeccchhHHHHhhcchhhhhhhhhc--CCc-cccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccc Q lcl|NC_020883. 3 DWTVRGWTDKTTKNVHGDYERYRQLYE--GKH-ELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~r~l~~--g~~-~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~p 79 (589) =|++-|...-+. |.--+..+++- |-+ ..-|... -..|---=+.+-+|++| T Consensus 1 ~~~~D~~~~~~~----~~g~~~~~~~~~~~~~~~~~~~~l-----------------------~a~Y~~~~l~~~~vd~~ 53 (437) T protein:vir:52 1 MKFFDGIKSLAL----KLGSKQEQTYYSPSLSLTDDLVQL-----------------------EALWRDNWIANKVCIKR 53 (437) T ss_pred CchhhhhHhHHh----cCCCccccceeecCccccccHHHH-----------------------HHHHHhCchhhHHhhcc Confidence 111111111110 00000000000 000 0000010 11111111334577788 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH-HH Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH-QV 158 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~-~v 158 (589) |..+-|-=-.|++ +. ..+-|.+.+.+..+.-|++.++.+.+.-. ++ T Consensus 54 a~d~~r~~~~i~~-------------~d--------------------~~~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~ 100 (437) T protein:vir:52 54 PEDMVRNWREIYS-------------ND--------------------LNSKQLDLFTKFERSLKLRETLTKALQWSSLY 100 (437) T ss_pred hHHhhcCCceEec-------------CC--------------------CCHHHHHHHHHHHHhhcHHHHHHHHHHhcccc Confidence 7776443333322 00 00011234555555555544444444422 23 Q ss_pred cCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccc-cceeehh---hhccccccch Q lcl|NC_020883. 159 DGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKD-GLRTTNM---LYPVVKAKGD 234 (589) Q Consensus 159 ~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~-~~~~~~~---~y~~~~~~~~ 234 (589) +||+++ +.+++. +.-.|-+. ++.=++++++-.+.-.. .....+- -| .+.. T Consensus 101 G~a~i~-i~~d~~---------~~~~pl~~------------~~~~~~~~v~~~~~v~~~~~~~~dp~s~~f----g~p~ 154 (437) T protein:vir:52 101 GSVGLL-VVTDSQ---------NTSAPLKP------------TERLKRLIILPKWKISPTGTKDDDVLSPNF----GRYS 154 (437) T ss_pred cceEEE-EEecCC---------Cccccccc------------CCceeEEEEechhhcccccccccccccccc----Ccce Confidence 333332 333332 22233221 11112344443221110 0000000 00 0000 Q ss_pred hheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC---CCCCCcccCcchhhhhHHHHH Q lcl|NC_020883. 235 VKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN---ETFMNPYGISALDNLESKQDE 311 (589) Q Consensus 235 ~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~---~~~~~~lG~SD~~~ie~l~De 311 (589) .+.+..++.++ .-|. .| |+++.+. ......+|+|.++.+.+-+.. T Consensus 155 ~y~v~~~~~~~---------------------~iH~---------SR--ii~~~~~~~~~~~~~~~G~s~le~~~~~i~~ 202 (437) T protein:vir:52 155 EYSILGGSQSI---------------------TVHH---------SR--LIILNANDAPLSDNDIWGVSDLEKIIDVLKR 202 (437) T ss_pred EEEEecCCcce---------------------eEcc---------ce--eEEecCccCCCccccccCCchHHHHHHHHHH Confidence 11111011000 0011 11 2333221 233556899999999888888 Q ss_pred HHHHHhHHHHHHHHhCCCcEEec--hhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccH Q lcl|NC_020883. 312 INWTITRSAVIYEQNGKPRISIT--KEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISK 389 (589) Q Consensus 312 Ld~t~S~~srildk~gkpRI~VP--~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirv 389 (589) .+.+.-....++.+...+.+-++ ...|... ..+.-.+...... ..-+..-+-++. .+.. |-+.++.+ T Consensus 203 ~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~---~~~~~~~~~~~~~---~~~~~~~~~~~d---~~~~--~e~~~~~~ 271 (437) T protein:vir:52 203 FDSASVNVGDLIFESKIDIFKIAGLSDKIAAG---MENEVASVISAVQ---EIKSATNSLLLD---AENE--YDRKELTF 271 (437) T ss_pred HHHHHHHHHHHHHHcCCCceecchHHHHhcCC---cHHHHHHHHHHHH---HhcCCCceEEEc---CCcc--eEEEecCc Confidence 88876666666655555555443 2233221 0000000000000 000111111221 1122 33455555 Q ss_pred HHHHHHHHHHHHHHHHHhcCCc-hhcccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020883. 390 IGDMDHVKNLIKLMLIETQTSE-KAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDFLKELYESCLWLLNDQ 467 (589) Q Consensus 390 eeh~~~ie~L~~~Il~~a~ts~-~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~aLk~li~~~l~L~~~~ 467 (589) ..-...++....+|-+.+++|. ..||.- .+ +- .||....+.+..+ ++..| ..+...|.+++.+++ +... T Consensus 272 sgl~~~l~~~~~~iaaa~~iP~t~L~G~s-~~--Gl-asge~D~~~yyd~---i~~~Qe~~l~p~le~l~~~i~-~~~~- 342 (437) T protein:vir:52 272 TGLKDLLTEFRNAVAGAADMPVTILFGQS-VS--GL-ASGDEDIQNYHEA---IRRLQETRLRPIFEIIDPLIC-NELF- 342 (437) T ss_pred CCHHHHHHHHHHHHHHHhcCchhhhcCcC-cc--cc-cccHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH-HHhc- Confidence 5666777888888888889887 444531 11 22 3566665555555 34444 235566777766432 2222 Q ss_pred CcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020883. 468 DSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGIN 547 (589) Q Consensus 468 ~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~ 547 (589) + .. +....|+|++.+..+++|+ |++.+++.. +..+++. ++-.+.+++++++.+ ++. ++... T Consensus 343 g-~~-~~~~~~~f~pL~~~s~kek--ae~~~~~a~----a~~~~~~--~g~i~~~e~r~~L~~----~g~-----~~~i~ 403 (437) T protein:vir:52 343 G-GL-PADWWFEFVPLTTVKQEQQ--INMLNTFAT----AANTLIQ--NGVLNEYQIANELRE----SGL-----FANIS 403 (437) T ss_pred C-CC-CCcceEEeCCcCCcCHHHH--HHHHHHHHH----HHHHHHh--cCCCCHHHHHHHHHh----cCC-----CCCCC Confidence 2 12 2345789999877665554 666554322 2222222 234666777666632 111 11110 Q ss_pred c-ccccccCccc--CCCCCCCCCCCCCCCCcchhhh Q lcl|NC_020883. 548 Q-TFEQMNDNRD--EDGNIIEEGDTEEEPSAEENEE 580 (589) Q Consensus 548 ~-~l~~~~~~~~--~~~~p~deg~~~eep~~~~~e~ 580 (589) . ..+...+..+ ++..+.+ ...++++++-.+| T Consensus 404 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 437 (437) T protein:vir:52 404 AEHIEELKNADEFAGNFEEPE--KMEGAQVQNSEDQ 437 (437) T ss_pred ccccccccCCCCCCCccCCCC--CCCCCCCCCCCCC Confidence 0 0000000000 0000000 0001111111111 No 96 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=96.75 E-value=0.00036 Score=39.44 Aligned_cols=503 Identities=10% Similarity=0.001 Sum_probs=203.6 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee------ecCcceEEEEcchh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR------ETQTPYVIFNLPKV 74 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~------~~~~~y~~~n~~~~ 74 (589) |-| .++.+..+|+-|.++..- . .+ -|+ +-+++-.| .+|+||.. ..+. ..++|+.+. T Consensus 1 m~d------~~~~~~~~~~~~~~~~~~---~-~~--~r~-~a~~d~~f----y~G~Qw~~~~~~~l~~q~-rp~~N~i~~ 62 (725) T protein:vir:77 1 MAD------NENRLESILSRFDADWTA---S-DE--ARR-EAKNDLFF----SRVSQWDDWLSQYTTLQY-RGQFDVVRP 62 (725) T ss_pred CCc------hHHHHHHHHHHHHHHHHh---h-HH--HHH-HHHHHHHh----hCCCCCCHHHHHHHHhcC-CCccccHHH Confidence 444 233344444333333210 0 00 010 11111122 22445432 1111 226798888 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) +++. . +|.-..+-+ + .-+.|.+. ...+.+++ =..++..+...|+.....-+++. T Consensus 63 ~i~~--v-----~g~~~~nr~------------d-~~v~P~~~---~d~~~Ae~---l~~~~~~~~~~~~~~~a~s~Af~ 116 (725) T protein:vir:77 63 VVRK--L-----VSEMRQNPI------------D-VLYRPKDG---ARPDAADV---LMGMYRTDMRHNTAKIAVNIAVR 116 (725) T ss_pred HHHH--H-----HhhHHhCCc------------c-eEEecCCc---cHHHHHHH---HHHHHHHHHHhhCchhHHHHHHH Confidence 7665 2 343333222 1 11223221 11222332 24478888889999988888888 Q ss_pred HHHHcCceeEEEEEe---c----CceeEEEe----cCceecccccCc-----ceeEEEeecCCCcc-------------- Q lcl|NC_020883. 155 QHQVDGGIVAAPVID---E----LGPRIVFK----ARDVYFPHDDEK-----GADLAYYIDHGQYG-------------- 204 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~---~----~~~~i~f~----~~d~~~P~~d~~-----~~div~~~e~~~~~-------------- 204 (589) ++++.|=..+++..| + ..++|..+ ++.++|.--+.+ -|.++|+.++--.+ T Consensus 117 ~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~ 196 (725) T protein:vir:77 117 EQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDAD 196 (725) T ss_pred HHhhcCcceeeeeecccCCCCCCCceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchh Confidence 888877667776544 1 22444443 233233311111 12223322221111 Q ss_pred ------ceEEEEEeeeccccceeehhhhccccccchhheeec--ccccccccccccccchhhhhhcc--cCC-------- Q lcl|NC_020883. 205 ------QFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK--KGELVTNVEGAEDLEGEELIREV--LNI-------- 266 (589) Q Consensus 205 ------~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~--~gd~~~~~~e~~d~e~e~~i~~~--i~i-------- 266 (589) -+.+.+..+.+.+.+++. ..|......-......+ .|+.+.. ..+++ +...... -|+ T Consensus 197 ~~~~~~~~~~~~~~~~~~d~vrv~-E~~~r~~~~~~~~~~~~~~tg~~~~~--~~~~~--~~~~~~~~~~g~~~~~~~~~ 271 (725) T protein:vir:77 197 DIPSFQNPNDWVFPWLTQDTIQIA-EFYEVVEKKETAFIYQDPVTGEPVSY--FKRDI--KDVIDDLADSGFIKIAERQI 271 (725) T ss_pred hcccccccccccccccCCCeeEEE-EEEEEEEEeeEEEEecCCCCcceeec--ChhhH--HHHHHHhhhcCchhhhhccc Confidence 011111111122222221 11110000000000000 1111100 00000 0000000 000 Q ss_pred -----------ccc-cccccccCCCCcceEEEecCCC-CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEe Q lcl|NC_020883. 267 -----------PDD-RPLENFYPGRNRPFISYWANNE-TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISI 333 (589) Q Consensus 267 -----------p~~-~e~~~i~TGv~~plvvyvPN~~-~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~V 333 (589) +.+ -+....++|-..|+|-+|.... ..+.+++-+-+.++.+.++.+|.+.|....++-..++.+..+ T Consensus 272 ~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~ 351 (725) T protein:vir:77 272 KRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFF 351 (725) T ss_pred ceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhcccccccc Confidence 111 1112245566667777666643 345566667889999999999999999888887778888888 Q ss_pred chhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchh Q lcl|NC_020883. 334 TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKA 413 (589) Q Consensus 334 P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~A 413 (589) +.+.++......-..++..+.. ..++......-....+..++..-=..++++.+......|=.+++....+ T Consensus 352 ~~~~i~~~~~~~~~~~~~~~~~---------~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~ 422 (725) T protein:vir:77 352 WPEQIAGFEHMYDGNDDYPYYL---------LNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDT 422 (725) T ss_pred chhhhhHHHHHHHhccCCceec---------ccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHH Confidence 8877753211100001111100 0000000000000112223322234567777777777777788988888 Q ss_pred cccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC---cccCcc---------------- Q lcl|NC_020883. 414 VDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD---SSIRIE---------------- 474 (589) Q Consensus 414 Fg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~---~~~~~e---------------- 474 (589) +|.. +.+.||+|+..+-.+........=..+..+++++.+.++.|-..+. +.+-+. T Consensus 423 lG~~-----~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~ 497 (725) T protein:vir:77 423 EAVN-----GGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVV 497 (725) T ss_pred hCCC-----chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeeccccc Confidence 8853 2246888776544433333333444555666666666655543211 000000 Q ss_pred ---------------cceeeeCCcCCC-CCCHHHHHHHHHHHhccchh-h-HHHHHHHhCCCCCHHHHHHHHHHHHhhcc Q lcl|NC_020883. 475 ---------------EPNIETQDMILK-PRAELVAENMAAYAASKQGQ-S-LETTVRRMNPDASEDWIQEEIARIEEEQA 536 (589) Q Consensus 475 ---------------~p~I~f~D~lPv-de~El~~A~t~~~l~~a~~~-S-~etaVr~Lhpdw~dE~v~eEv~RI~~E~a 536 (589) .-+|...++... ...+...+.+++++...... . .-..+...-+..+-+.+++.++||+.... T Consensus 498 ~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~ 577 (725) T protein:vir:77 498 DLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI 577 (725) T ss_pred ccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhh Confidence 011222222110 11222233444443332211 1 11222222233445566777888866543 Q ss_pred ccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccccc--CC Q lcl|NC_020883. 537 GSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEP--IA 589 (589) Q Consensus 537 ~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~--~~ 589 (589) +... . ++ . ++++++...+..+.++.--. ++ T Consensus 578 ~~~~---~-------------q~----~---~~~e~q~~~~~qq~~~~q~~~e~~ 609 (725) T protein:vir:77 578 QMGV---K-------------KP----E---TPEEQQWLVEAQQAKQGQQDPAMV 609 (725) T ss_pred hhhc---c-------------CC----C---ChhhHHHHHHHHHHHHHhHHHHHH Confidence 2210 0 00 0 01111111111111111000 11 No 97 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=96.62 E-value=0.00046 Score=38.86 Aligned_cols=450 Identities=11% Similarity=0.009 Sum_probs=179.3 Q ss_pred ccceeccchhHHHHhhcchhhh-hhhhhcCCcccc----CH---HHHHHHhhccccceeccCcceeeecCcceEEEEcc- Q lcl|NC_020883. 2 IDWTVRGWTDKTTKNVHGDYER-YRQLYEGKHELL----FP---RAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP- 72 (589) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~-~r~l~~g~~~~~----f~---ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~- 72 (589) .+||=+|+.--. .+..-+ -+.-|+|....- ++ -..++......+ ..|.=+|+-|-+ T Consensus 1 m~~~~~~~~a~~----~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~l-----------r~RaRdl~rNn~~ 65 (495) T protein:vir:10 1 MNMTPSGYQSLA----SGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTL-----------RARSHHNVRNNPW 65 (495) T ss_pred CCcccccccccc----hhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHH-----------HHHHHHHHhcChH Confidence 566666653100 000000 001122221110 00 000000000000 001112222211 Q ss_pred -hhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccch- Q lcl|NC_020883. 73 -KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHW- 150 (589) Q Consensus 73 -~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~- 150 (589) +-+++ .++++.+|. ++. ..+..+ -+..+..+ ...+..-.++|.+..++. T Consensus 66 a~~av~---~~~~~vVG~---Gi~-----p~~~~~--------------~~~~~~~i----e~~w~~wa~~~D~~g~~~f 116 (495) T protein:vir:10 66 ATNAVA---TWVAAAVGN---GLT-----PRWRMK--------------EQELRQEL----QELWGDWVNEADFDEVQSF 116 (495) T ss_pred HHHHHH---HHHHhhcCC---Ccc-----cccCCc--------------hHHHHHHH----HHHHHHhhcCcccccccCH Confidence 11111 122223332 111 111000 01233333 334566666776654432 Q ss_pred -----hhHHHHHHcCceeEEEEEec--C----ceeEEEecCcee-cccccCc---ceeEEEeecCCCccceEEEEEeeec Q lcl|NC_020883. 151 -----SNIVQHQVDGGIVAAPVIDE--L----GPRIVFKARDVY-FPHDDEK---GADLAYYIDHGQYGQFLHIYRERVE 215 (589) Q Consensus 151 -----~~l~~~~v~Gg~~~~~~~~~--~----~~~i~f~~~d~~-~P~~d~~---~~div~~~e~~~~~~~l~~~~~~~~ 215 (589) ..+-...++|=|.++..+.. . ..+|+..++|.. -|.+... +--|..=.|..+.++ T Consensus 117 ~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr---------- 186 (495) T protein:vir:10 117 YGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGK---------- 186 (495) T ss_pred HHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCc---------- Confidence 24445678999888776652 2 268888888874 2322211 111222112222222 Q ss_pred cccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCC Q lcl|NC_020883. 216 KDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMN 295 (589) Q Consensus 216 ~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~ 295 (589) .+.|-+|+. +-|+.+...... + ..-++..-|+|+-. ..... T Consensus 187 ----~vaY~i~~~-----------hpgd~~~~~~~~-------------------~----~~rvpA~~vlH~f~-~r~gQ 227 (495) T protein:vir:10 187 ----RKAYCFYRN-----------HPAESSLIGDPV-------------------D----TVWIKAEHVLHVTV-LTVRS 227 (495) T ss_pred ----eEEEEEeec-----------CCCccccccccc-------------------c----eeeechhheEeccc-cCCCc Confidence 112122221 222211100000 0 01123344778754 45677 Q ss_pred cccCcchhhhhHHHHHHHHHHh-HHHHHHHHh-CCCcEEec----hhhhhcccc-ccccccccccccccccccccccccc Q lcl|NC_020883. 296 PYGISALDNLESKQDEINWTIT-RSAVIYEQN-GKPRISIT----KEMMDTLLN-IAYERDGHSAKEASMMTPRIDHRDM 368 (589) Q Consensus 296 ~lG~SD~~~ie~l~DeLd~t~S-~~srildk~-gkpRI~VP----~~~L~t~~g-~~~d~dge~~~~~~~~~~~~d~~dl 368 (589) ..|+|.++-+.. +..|+.--. -+.+. |. +--..+|- +.......+ ...+..+... ....+. T Consensus 228 ~RGis~la~i~~-l~~l~~y~dael~~a--~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~---------~~l~pG 295 (495) T protein:vir:10 228 DAGAPWFQLLLR-LNELDQYEDAELVRK--KTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRI---------TGLNPG 295 (495) T ss_pred ccCcchhHHHHH-HHHhhHHHHHHHHHH--HHhhhheeeeecCCCccccccccCccccccCcccc---------eecCCc Confidence 789999887765 566664321 11111 11 11112221 100000000 0000011000 001111 Q ss_pred cccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_020883. 369 EITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKE 448 (589) Q Consensus 369 ev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~ 448 (589) . +..-..|..+++++..--...+-.++..+++.|-+-.++||..+.- +-+++ +=+|.|..++.-.+.+.++|.. T Consensus 296 ~-i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltg-D~s~~----nYSS~R~~~~e~~r~~~~~q~~ 369 (495) T protein:vir:10 296 T-LQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTG-DLRGV----NYSSIRAGLLEFRRLCQQVQHH 369 (495) T ss_pred e-eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhc-ccccc----cHHHHHHHHHHHHHHHHHHHHH Confidence 1 1122345556666666556677788899999999888999998863 21121 3456777777777777776643 Q ss_pred -HHH-HHHHHHHHHHHHHhhcCcccCc----c----cceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCC Q lcl|NC_020883. 449 -YID-FLKELYESCLWLLNDQDSSIRI----E----EPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPD 518 (589) Q Consensus 449 -~~~-aLk~li~~~l~L~~~~~~~~~~----e----~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpd 518 (589) +.. +++.+.+.-+......|. +.+ . -..+.|-..-..--+++..++.......+|++|++..++...-| T Consensus 370 ~~~~~~~~pi~~~~l~~a~l~G~-i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D 448 (495) T protein:vir:10 370 MIIHQFCRPVGRWFMDFAVASGA-VVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYD 448 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHcCC-CCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCC Confidence 333 334443322211111111 110 0 12345522111111344445666667799999999999998644 Q ss_pred CCHHHHHHHHHHHHhhcccccc--ccccccccccccccCcccCCCCCCCCCCCCCCCCcchhh Q lcl|NC_020883. 519 ASEDWIQEEIARIEEEQAGSDT--SSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENE 579 (589) Q Consensus 519 w~dE~v~eEv~RI~~E~a~~~p--~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e 579 (589) | +++.+|+++ |...... +++... ++...++.. +.+..+++.+.+| T Consensus 449 ~--~~v~~q~a~---e~~~~~~~Gl~~~~~---------p~~~~~~~~--~~~~~~~~~~~~e 495 (495) T protein:vir:10 449 M--EELFDMISD---ANQLIDEYDLRLDSD---------PRYVNGSGA--EQKSVMEAALNNE 495 (495) T ss_pred H--HHHHHHHHH---HHHHHHHcCCCCCCC---------CCcCCCccC--CCCCCCCCCCCCC Confidence 4 444444433 2222211 222211 111111111 0122222222222 No 98 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=96.54 E-value=0.00053 Score=38.56 Aligned_cols=505 Identities=13% Similarity=0.105 Sum_probs=196.1 Q ss_pred Cc---cceeccchhHHHHhhc-chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCccee-------eecCcceEEE Q lcl|NC_020883. 1 MI---DWTVRGWTDKTTKNVH-GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTA-------RETQTPYVIF 69 (589) Q Consensus 1 ~~---~~~~~~~~~~~~~~~~-~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~-------~~~~~~y~~~ 69 (589) |- +=+--.=.++-..++| .-+.+|+.=-++. .-+ |+ +-.+.-.| .+|+||. +....|-+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~--~~~-r~-~a~~d~~f----y~G~Qw~~~~~~~l~~~g~p~~~~ 72 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQ--PLW-RD-AANKACAY----YDGDQLAPEVIQVLKDRGQPMTIH 72 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhh--HHH-HH-HHHHHHHh----hcCCCCCHHHHHHHHhcCCCcEEe Confidence 11 1000000111111122 1122222111111 111 21 12222222 2466773 2456788999 Q ss_pred EcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccc Q lcl|NC_020883. 70 NLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRH 149 (589) Q Consensus 70 n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~ 149 (589) |+.+.+++. . +|....+-+ ..-+.|.+... +..+ +.+.=..++..+..+|+..... T Consensus 73 N~i~~~v~~--v-----~g~~~~nr~-------------~~~v~pr~~~~-~~~~---~Ae~l~~~~~~~~~~~~~~~~~ 128 (714) T protein:vir:10 73 NLIAPTVDG--V-----LGMEAKTRT-------------DLIVMSDDPND-ETEK---LAEAINAEFADACRLGNMNKAR 128 (714) T ss_pred ccHHHHHHH--H-----HHHHHhCCc-------------ceEEecCCCCh-hhHH---HHHHHHHHHHHHHHhhchhHHH Confidence 999988876 2 333333222 11133322110 0111 2222245888899999988888 Q ss_pred hhhHHHHHHcCceeEEEEEe----cCceeEEEecCceecccccC-----cceeEEEeecCCCccc----e---------- Q lcl|NC_020883. 150 WSNIVQHQVDGGIVAAPVID----ELGPRIVFKARDVYFPHDDE-----KGADLAYYIDHGQYGQ----F---------- 206 (589) Q Consensus 150 ~~~l~~~~v~Gg~~~~~~~~----~~~~~i~f~~~d~~~P~~d~-----~~~div~~~e~~~~~~----~---------- 206 (589) -.++.+++..|=-+.+++++ +..++|..+++.-+|..-+. .-|.++++.++--.+. | T Consensus 129 s~af~~~~~~G~G~~~~~~d~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~ 208 (714) T protein:vir:10 129 SDAYAEQIKAGLSWVEVRRNSEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYA 208 (714) T ss_pred HHHHHHhhhcccceEEeeeccCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhcc Confidence 88888877777666777887 33488888888888863222 2344444332111000 0 Q ss_pred -----------------EEEE------Eeeecc---------ccceeehhhhccccccchhheeeccccccccccccccc Q lcl|NC_020883. 207 -----------------LHIY------RERVEK---------DGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDL 254 (589) Q Consensus 207 -----------------l~~~------~~~~~~---------~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~ 254 (589) -..+ +.++.. ...++...-||...+.... . -..|..+..-.. ... T Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~-~-~~~g~~~~~d~~-~~~ 285 (714) T protein:vir:10 209 IDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVI-E-LSNGRVVAFDKN-NLM 285 (714) T ss_pred chhhcCcccchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEee-c-CCCCCeeeeCcc-CHH Confidence 0000 000000 0000000001100000000 0 001111000000 000 Q ss_pred chhhhhhcccCC----------------ccccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhHHHHHHHHHHh Q lcl|NC_020883. 255 EGEELIREVLNI----------------PDDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLESKQDEINWTIT 317 (589) Q Consensus 255 e~e~~i~~~i~i----------------p~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~l~DeLd~t~S 317 (589) +....+.-.+.+ .-+.+...-+++-..|+|.+|..... .+.|+| -+.++.+.++.+|.+.| T Consensus 286 ~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s 363 (714) T protein:vir:10 286 QAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQDEVNFRRI 363 (714) T ss_pred HHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccce--ehhhhhhHHHHHHHHHH Confidence 000000000000 00112222344555666666655322 223444 45788999999999999 Q ss_pred HHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccccccccc----CccceeeecccHHHHH Q lcl|NC_020883. 318 RSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENG----RSMEIHQIDISKIGDM 393 (589) Q Consensus 318 ~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g----~~~~~iq~Dirveeh~ 393 (589) +...++ .++ ++++..+.+.. .+.+..-.. .++... ....+...+| ..++..+.---..+++ T Consensus 364 ~~~~~l--~~~-~~~~~~gav~~-------~d~~~~e~~---~rp~~v--i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (714) T protein:vir:10 364 KLTWLL--QAK-RVIMDEDATQL-------SDNDLMEQL---ERPDGI--IKLNPVRKNQKSVADVFRVEQDFQVASQQF 428 (714) T ss_pred HHHHHH--hCC-ceeeccccccc-------cHHHHHHhc---cCCCCe--EEecccccccCCccccccccCCCCCcHHHH Confidence 888775 355 45554433321 111110000 000000 0111000111 1122222122345677 Q ss_pred HHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc--- Q lcl|NC_020883. 394 DHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSS--- 470 (589) Q Consensus 394 ~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~--- 470 (589) +.+......|-.+++.+..++|... .+.||+|+..+..+........=..+..+++++.+.++.|-..+... T Consensus 429 ~llq~~~~~i~~~tGv~~~~lG~~~-----na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv 503 (714) T protein:vir:10 429 QVMQESEKLIQDTMGVYSAFLGQDS-----GATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRN 503 (714) T ss_pred HHHHHHHHHHHHhhCCCHHHcCCCc-----chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcE Confidence 7777777777778999999998632 24588887655544433333344445556666666665555432110 Q ss_pred cCc--c----------------------------cceeeeCCcCCC-CCCHHHHHHHHHHHhccc----hhhHHHHHHHh Q lcl|NC_020883. 471 IRI--E----------------------------EPNIETQDMILK-PRAELVAENMAAYAASKQ----GQSLETTVRRM 515 (589) Q Consensus 471 ~~~--e----------------------------~p~I~f~D~lPv-de~El~~A~t~~~l~~a~----~~S~etaVr~L 515 (589) +.+ + +-+|...++... ...+...+.+++++.... .+-....|..+ T Consensus 504 ~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~ 583 (714) T protein:vir:10 504 HAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL 583 (714) T ss_pred EEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc Confidence 000 0 001221121101 112222334444443211 11122233333 Q ss_pred CCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcc----------- Q lcl|NC_020883. 516 NPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKE----------- 584 (589) Q Consensus 516 hpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~----------- 584 (589) + |. -.++-++||.+-....++ .+.-..++.+-...-.++++. T Consensus 584 d--~p--~~~ei~~~ir~~~~~~~~-----------------------~~~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a 636 (714) T protein:vir:10 584 D--VP--QKQEFVERIRAALGTPKS-----------------------PDEMTPEEQEVAAQQQALQQQQAELQMREMAG 636 (714) T ss_pred C--Cc--CHHHHHHHHHHHcCCCCC-----------------------ccccCcchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 32 233456666544321100 000001111101111111110 Q ss_pred ----cccCC Q lcl|NC_020883. 585 ----GEPIA 589 (589) Q Consensus 585 ----~~~~~ 589 (589) .+.-+ T Consensus 637 ~~~k~eaea 645 (714) T protein:vir:10 637 RVAKLEADA 645 (714) T ss_pred HHHHHHHHH Confidence 00000 No 99 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=96.29 E-value=0.00078 Score=37.63 Aligned_cols=505 Identities=10% Similarity=0.005 Sum_probs=204.5 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee------ecCcceEEEEcchh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR------ETQTPYVIFNLPKV 74 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~------~~~~~y~~~n~~~~ 74 (589) |-| +++.+..+|+-|.++..-- .+ -|+ +-.++-.|. +|+||.. ..+.+ .++|+.+. T Consensus 1 m~d------~~~~~~~~~~~~~~~~~~~----~~--~R~-~a~~d~~fy----~G~QW~~~~~~~l~~q~r-p~~N~i~~ 62 (725) T protein:vir:10 1 MAD------NENRLESILSRFDADWTAS----DE--ARR-EAKNDLFFS----RVSQWDDWLSQYTTLQYR-GQFDVVRP 62 (725) T ss_pred CCc------hHHHHHHHHHHHHHHHHhh----HH--HHH-HHHHHHHhh----cCCCCCHHHHHHHHhcCC-CcccchHH Confidence 554 3444444444444433210 00 011 111222222 3555532 11222 26799888 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) +++. . +|.-..+-+ + .-+.|.+. ...+.+++ =..++..+..+|+.....-+++. T Consensus 63 ~v~~--v-----~g~e~~nr~------------d-~~v~p~~~---~d~~~Ae~---l~~~~~~~~~~~~~~~~~s~Af~ 116 (725) T protein:vir:10 63 VVRK--L-----VSEMRQNPI------------D-VLYRPKDG---ASPDAADV---LMGMYRTDMRHNTAKIAVNIAVR 116 (725) T ss_pred HHHH--H-----HhhHHhCCc------------c-eEEecCCc---chHHHHHH---HHHHHHHHHHhcCcchHHhHHHH Confidence 7776 2 333222111 1 11223221 12222333 24478888899999988888888 Q ss_pred HHHHcCceeEEEEEe---cC----ceeEEEe----cCceecccccC-----cceeEEEeecCCCcc-------ceE---- Q lcl|NC_020883. 155 QHQVDGGIVAAPVID---EL----GPRIVFK----ARDVYFPHDDE-----KGADLAYYIDHGQYG-------QFL---- 207 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~---~~----~~~i~f~----~~d~~~P~~d~-----~~~div~~~e~~~~~-------~~l---- 207 (589) ++++.|=.+.++..| ++ .+.|..+ ++.++|.--+. .-|-+++..++--.+ +|- T Consensus 117 ~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~ 196 (725) T protein:vir:10 117 EQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDAD 196 (725) T ss_pred HHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCccc Confidence 888888777777544 22 2444433 23334431111 123333333321100 110 Q ss_pred EE--E-------Eeeeccccceeehhhhccccccchhheee--cccccccccccccccch--hhhhhccc---------- Q lcl|NC_020883. 208 HI--Y-------RERVEKDGLRTTNMLYPVVKAKGDVKKEI--KKGELVTNVEGAEDLEG--EELIREVL---------- 264 (589) Q Consensus 208 ~~--~-------~~~~~~~~~~~~~~~y~~~~~~~~~~~~~--~~gd~~~~~~e~~d~e~--e~~i~~~i---------- 264 (589) ++ + -.+.+.+..++. +.|......-...... ..|+.++. ...+++. .......+ T Consensus 197 ~~~~~~~~~~~~~~~~~~~~vrv~-E~~~r~~~~~~~~~~~d~~~g~~~~~--~~~~~~~~~~~~~~~g~~~~~~r~~~~ 273 (725) T protein:vir:10 197 NIPSFQNPNDWVFPWLTQDTIQIA-EFYEVVEKKETAFIYQDPVTGEPVSY--FKRDIKDVIDDLADSGFIKIAERQIKR 273 (725) T ss_pred ccccccccccccccccCCCeEEEE-EEEEEEEEeeEEEEeccCCCCceeec--chhhhHHHHHHhhcccchhhhhcccee Confidence 00 0 000111111111 1111000000000000 01211110 0000000 00000000 Q ss_pred -------CCccc-cccccccCCCCcceEEEecCCC-CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEech Q lcl|NC_020883. 265 -------NIPDD-RPLENFYPGRNRPFISYWANNE-TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITK 335 (589) Q Consensus 265 -------~ip~~-~e~~~i~TGv~~plvvyvPN~~-~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~ 335 (589) -.+.+ -+....++|...|+|.+|.... ..+.+++-+-+.++.+.++.+|.+.|....++-..++.+..++. T Consensus 274 ~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~ 353 (725) T protein:vir:10 274 RRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWP 353 (725) T ss_pred eEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccH Confidence 00111 1112234566667776665532 34456666788999999999999999999888777888888888 Q ss_pred hhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcc Q lcl|NC_020883. 336 EMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVD 415 (589) Q Consensus 336 ~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg 415 (589) +.++......-..++..+...+ +.......-....+.+++.---..++++.+......|-.+++.+..+.| T Consensus 354 ~~i~~~e~~~~~~~~~~~~~~~---------~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG 424 (725) T protein:vir:10 354 EQIAGFEHMYDGNDDYPYYLLN---------RTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA 424 (725) T ss_pred hhhhHHHHHHhccCCceeeecc---------cccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhC Confidence 8875421110001111111000 0000000000112222222223446777777777777778899988888 Q ss_pred cccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC---cccCcc------------------ Q lcl|NC_020883. 416 FYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD---SSIRIE------------------ 474 (589) Q Consensus 416 ~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~---~~~~~e------------------ 474 (589) .. +.+.||+|+..+..+........-..+..+++++.+.++.|-..+. +.+-+. T Consensus 425 ~~-----~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~ 499 (725) T protein:vir:10 425 VN-----GGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDL 499 (725) T ss_pred cC-----chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccc Confidence 52 2346888876555444433333444555666666666655543211 000000 Q ss_pred -------------cceeeeCCcCCC-CCCHHHHHHHHHHHhccchh-h-HHHHHHHhCCCCCHHHHHHHHHHHHhhcccc Q lcl|NC_020883. 475 -------------EPNIETQDMILK-PRAELVAENMAAYAASKQGQ-S-LETTVRRMNPDASEDWIQEEIARIEEEQAGS 538 (589) Q Consensus 475 -------------~p~I~f~D~lPv-de~El~~A~t~~~l~~a~~~-S-~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~ 538 (589) +=+|...++... ...+...+.+++++...... . ....+..+-+..+-+.+++.++||+...... T Consensus 500 ~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~ 579 (725) T protein:vir:10 500 ATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM 579 (725) T ss_pred cccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhh Confidence 001222221100 11222223444444332211 1 1122222223344455566677776543221 Q ss_pred ccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc--ccCC Q lcl|NC_020883. 539 DTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG--EPIA 589 (589) Q Consensus 539 ~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~--~~~~ 589 (589) .. + ++ . .+++++...+..+.++.- ..++ T Consensus 580 ~~---~-------------~~----~---~~e~~q~~~e~qq~~~~q~~~e~~ 609 (725) T protein:vir:10 580 GV---K-------------KP----E---TPEEQQWLVEAQQAKQGQQDPAMV 609 (725) T ss_pred cc---C-------------Cc----c---ccchhHHHHHHHHHHHhhhHHHHH Confidence 00 0 00 0 011111111111111110 0000 No 100 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=96.24 E-value=0.00083 Score=37.47 Aligned_cols=458 Identities=10% Similarity=-0.001 Sum_probs=188.0 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) --++...||.-.. .+....+...+ ..+=-||+.|.+.-+++.. +++.-. T Consensus 27 ~~~~~~~~w~~~~-~s~~~~i~~~~-------~~lr~RaRdl~rNn~~a~~-----------------------av~~~~ 75 (530) T protein:vir:38 27 GFGGQLRGWNPPS-ESADAALLPNY-------SRGNARADDLVRNNGYAAN-----------------------AVQLHQ 75 (530) T ss_pred CCCCcccccccCC-CCHHHHHHHHH-------HHHHHHHHHHHhcChHHHH-----------------------HHHHHH Confidence 1233444442210 11111111111 2233456666552222211 112112 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh----cccccc------ch Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN----SKLERR------HW 150 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn----~~~~~~------~~ 150 (589) ..+.|+ |=....-|+. ...+-+. +.-..+++++.++ +..-..+ |.+..+ -. T Consensus 76 ~nvVG~-Gi~~~~~p~~----~~l~~~~----------~~~~~~~~~ie~~----w~~W~~~~~~~~D~~g~~~f~~~q~ 136 (530) T protein:vir:38 76 DHIVGS-FFRLSYRPSW----RYLGINE----------EDSRAFSRDVEAA----WNEYAEDDFCGIDAERKRTFTMMIR 136 (530) T ss_pred HHhhCC-Cceeeeccch----hhcCCCH----------hHHHHHHHHHHHH----HHHhhcCCCcEEeeeccCCHHHHHH Confidence 223222 2222111110 0011111 0011233333222 3332222 233222 22 Q ss_pred hhHHHHHHcCceeEEEEEecC-----ceeEEEecCceec-ccccCcceeEEEeecCCCccceEEEEEeeeccccceeehh Q lcl|NC_020883. 151 SNIVQHQVDGGIVAAPVIDEL-----GPRIVFKARDVYF-PHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNM 224 (589) Q Consensus 151 ~~l~~~~v~Gg~~~~~~~~~~-----~~~i~f~~~d~~~-P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~ 224 (589) ..+-..+++|-|.++..+... ..+|+..++|..= |.+..-+-.|..=.|..+.++ .+.|. T Consensus 137 l~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr--------------~~aY~ 202 (530) T protein:vir:38 137 EGVAMHAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGA--------------ALGYY 202 (530) T ss_pred HHHHHHhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCc--------------eEEEE Confidence 344566789999999887743 2678888876532 111111111222112112222 22222 Q ss_pred hhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhh Q lcl|NC_020883. 225 LYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDN 304 (589) Q Consensus 225 ~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ 304 (589) +|+. ...|+... ....+| ....++++-|+|+-+........|+|++.- T Consensus 203 i~~~----------~~~~~~~~---------------~~~~~~-------~~~~v~a~~vlH~f~~~r~gQ~RGis~lap 250 (530) T protein:vir:38 203 VSDD----------GYPGWMAQ---------------NWTYIP-------RELPGGRPSFIHVFEPMEDGQTRGANAFYS 250 (530) T ss_pred Eeec----------cCCCcccc---------------ccceee-------eeeccChhHeEeeccccCCCcccCCchHHH Confidence 2210 00111110 000011 123457778999988888888899999999 Q ss_pred hhHHHHHHHHHHh-HHHHHHHHhCCC-cEEechh-----hhhcccccccccccccccccccc---c---ccccccccccc Q lcl|NC_020883. 305 LESKQDEINWTIT-RSAVIYEQNGKP-RISITKE-----MMDTLLNIAYERDGHSAKEASMM---T---PRIDHRDMEIT 371 (589) Q Consensus 305 ie~l~DeLd~t~S-~~srildk~gkp-RI~VP~~-----~L~t~~g~~~d~dge~~~~~~~~---~---~~~d~~dlev~ 371 (589) +...+..|++-.. -+.+. +.+.- ..+|-.. ......+...+.+.......... . ..+.-.+..+ T Consensus 251 vl~~l~~l~~y~dael~~a--~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i- 327 (530) T protein:vir:38 251 VMEQMKMLDTLQNTQLQSA--IVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARV- 327 (530) T ss_pred HHHHHHHHhHHHHHHHHHH--HHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCcee- Confidence 8888888886421 11111 11111 1111000 00000000000000000000000 0 0000001011 Q ss_pred ccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_020883. 372 TFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYID 451 (589) Q Consensus 372 ~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~ 451 (589) ..-..|..+.+++.+.-...+..++..+++.|-+..++|+..+.- +-+++ +=+|.|..++..++.++.+|..+.. T Consensus 328 ~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~-D~s~~----nYSS~R~~~~e~~r~~~~~q~~~~~ 402 (530) T protein:vir:38 328 PHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSR-NYSQM----SYSTARASANESWAYFMGRRKFVAS 402 (530) T ss_pred eecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhc-ccccc----cHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112334556666666666777788899999999999999998863 11121 4567777777777778888876655 Q ss_pred HH-HHHHHHHHHHHhhc-CcccCc------c-------cceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC Q lcl|NC_020883. 452 FL-KELYESCLWLLNDQ-DSSIRI------E-------EPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN 516 (589) Q Consensus 452 aL-k~li~~~l~L~~~~-~~~~~~------e-------~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh 516 (589) .+ +.+.+. ||+... .+.+.+ . -..+.|-..-...-+++..++.......+++.|++..++... T Consensus 403 ~~~~pi~~~--wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G 480 (530) T protein:vir:38 403 RQACQMFLC--WLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG 480 (530) T ss_pred HHhhHHHHH--HHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC Confidence 43 333222 333221 111110 0 013455221111113444455666677999999999999986 Q ss_pred CCCC--HHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcc Q lcl|NC_020883. 517 PDAS--EDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAE 576 (589) Q Consensus 517 pdw~--dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~ 576 (589) -||+ .++..+|.+++++- +. +.+.....++.. +.+.++.++++.-.+. T Consensus 481 ~D~~~v~~q~a~e~~~~~~~-Gl--~~~~~~~~~~~~---------~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 481 DDYQEIFAQQVRESMERRAA-GL--NPPAWAAAAFEA---------GVKKSNEEEQDGARAA 530 (530) T ss_pred CCHHHHHHHHHHHHHHHHHc-CC--CCCCCcccccCC---------CCCCCCCCCCCCCCCC Confidence 5554 22333333333221 11 111111111111 1111111222222222 No 101 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.07 E-value=0.001 Score=36.93 Aligned_cols=450 Identities=12% Similarity=0.036 Sum_probs=164.7 Q ss_pred CccceeccchhHH------HHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccC-cceeeecCcceEEEEcch Q lcl|NC_020883. 1 MIDWTVRGWTDKT------TKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDS-SQTARETQTPYVIFNLPK 73 (589) Q Consensus 1 ~~~~~~~~~~~~~------~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~-~~~~~~~~~~y~~~n~~~ 73 (589) |+. -.|=...| ....+-.+...|.+|.|. ..++-|..-|.. . .+.-+. .|-+.-.|.. ..|+++ T Consensus 1 ~~~--~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~-~~~~~r~~yl~~-~---~~~~~e~~Y~~rl~rA~--~~n~~~ 71 (489) T protein:vir:78 1 MLT--ENGQGSGVKTKHREWLHYAPKWQKVRHALAGE-LVSYLRNVGLNE-P---DKAYGEARQAEYEAGGI--VYNFTR 71 (489) T ss_pred Ccc--CCCccCCCCccCHHHHHHHHHHHHHHHHhcCc-ccccccCCCCCC-C---CCCCChHHHHHHHhccc--cCChHH Confidence 331 11211111 112233445566778884 333444321111 0 000000 1111111221 125544 Q ss_pred hhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhH Q lcl|NC_020883. 74 VIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNI 153 (589) Q Consensus 74 ~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l 153 (589) -+ ++..+|.+-. .++.-+.+..++ .-.+=+|++.--+++++++ .+ T Consensus 72 ~t-------l~~l~G~vfr--------k~p~~~~p~~l~-----------~l~~d~D~~G~~L~~f~~~---------~~ 116 (489) T protein:vir:78 72 RT-------LSGMVGSVMR--------KEPEINIPKELE-----------YLLKNADGSGVGLIQHAQD---------TL 116 (489) T ss_pred HH-------HHHHhchhhc--------CCcceeccHHHH-----------HHHhccCCCCCCHHHHHHH---------HH Confidence 33 3334454433 222222222221 1112233333334444443 33 Q ss_pred HHHHHcCceeEEEEEecC-------------ceeEEEecCceecc----cccCc--ceeEEEeecC------CC-ccceE Q lcl|NC_020883. 154 VQHQVDGGIVAAPVIDEL-------------GPRIVFKARDVYFP----HDDEK--GADLAYYIDH------GQ-YGQFL 207 (589) Q Consensus 154 ~~~~v~Gg~~~~~~~~~~-------------~~~i~f~~~d~~~P----~~d~~--~~div~~~e~------~~-~~~~l 207 (589) ..++..|++-.-+=.... -+++.++.|.+.+= +.+|+ -+-+++.+.. ++ ..+.+ T Consensus 117 ~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~ 196 (489) T protein:vir:78 117 MEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYG 196 (489) T ss_pred HHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeE Confidence 445566665443322211 15666666655532 23343 2444543310 01 12233 Q ss_pred EEEEeeeccccceeehhhhccccccchhheeecccccc-cccccccccchhhhhhcccCCccccccccccCCCCcceEEE Q lcl|NC_020883. 208 HIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELV-TNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISY 286 (589) Q Consensus 208 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~-~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvy 286 (589) ..||.+.-..-.....++|+ ...+|... ...+. +|+ .-.+.+....|++ T Consensus 197 ~q~RvL~~~~~g~~~~~~~r----------~~~~g~~~~~~~~~---------------~~~-----~g~~~l~~IPfv~ 246 (489) T protein:vir:78 197 EQYRVLDIDSDGNYRQRLFR----------FDAEGGAQEDVVEI---------------YPD-----LGESLRGVIPFTF 246 (489) T ss_pred EEEEEEecCCCcceEEEEEE----------eecCCcccceeeEE---------------ecc-----CCCCccCeeeEEE Confidence 33443321100111112332 12223211 11111 011 1113455555555 Q ss_pred ecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccc Q lcl|NC_020883. 287 WANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHR 366 (589) Q Consensus 287 vPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~ 366 (589) +-.....-. -|.+-+-+|..+--+.=..-+....++-..+-|.+.+. |.. +.+.++....+. .++-.. T Consensus 247 ~~~~~~~~~-~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~--------G~d-~~~~~~~~~~~~--~~i~~g 314 (489) T protein:vir:78 247 IGATNNDAT-IDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY--------PGE-NLTPQAFKEANP--NGIKFG 314 (489) T ss_pred EecCCCCCC-CCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee--------cCc-cCCcccccccCc--cceeeC Confidence 532221111 24444444333311100011122233334477766642 100 000011110000 001001 Q ss_pred cccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_020883. 367 DMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ 446 (589) Q Consensus 367 dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R 446 (589) +-..+..+. +.+..|+++..+-. ..+.++.|.+++. .++...+.. ++ +.|+++.+.+.....+-...+- T Consensus 315 ~~~~~~lp~-~~~~~~ie~~~~~~-~r~~l~~le~qm~---~lGa~l~~~----~~--~~Ta~~~~~~~~~~~S~L~~~a 383 (489) T protein:vir:78 315 SRRGHNLGY-GGSAQLIQAGENNL-ARQNMLDKEQQAI---QIGAQLITP----TQ--QITAQSARIQRGADTSVMATIA 383 (489) T ss_pred CcccccCCC-CCCcceeccCcchH-HHHHHHHHHHHHH---HHhhhhccC----Cc--chhHHHHHHHHHHhhHHHHHHH Confidence 111222223 34567888886554 4677777877765 344444432 11 2344444433333333344455 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCcccCcc-cceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh----CCCCCH Q lcl|NC_020883. 447 KEYIDFLKELYESCLWLLNDQDSSIRIE-EPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM----NPDASE 521 (589) Q Consensus 447 ~~~~~aLk~li~~~l~L~~~~~~~~~~e-~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L----hpdw~d 521 (589) ..+.+++..++..+...... ..+..++ ..+.+|... +.+..+ .+.+++ +..+|.+|.+|..+.| -.++++ T Consensus 384 ~~~e~al~~~l~~~a~w~G~-~~~~~~~i~~n~dF~~~-~~d~~~--~~al~~-~~~~G~is~~t~~~~L~~~gv~d~~~ 458 (489) T protein:vir:78 384 RNVSQAYTDALRWVAVMLGK-PEDTEVEFRLNMDFFLE-PMTAQD--RAAWMA-DINAGLLPATAYYAALRKAGVTDWTD 458 (489) T ss_pred HHHHHHHHHHHHHHHHHcCC-CCCCceEEEeecccCcc-cCCHHH--HHHHHH-HHhcCCCCHHHHHHHHHhCCCCCccH Confidence 55667777776444332211 1111111 234456442 233222 222333 3467899999888766 244566 Q ss_pred HHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCC Q lcl|NC_020883. 522 DWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEE 572 (589) Q Consensus 522 E~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ee 572 (589) +++++ +|.++ |.+.|..++.. ..++++..|- T Consensus 459 e~~~~---ei~~~-----~~~~~~~~~g~------------~~~~~q~~~~ 489 (489) T protein:vir:78 459 ADIKD---AVADQ-----PLPVATEVQGE------------IPQSAQQQEK 489 (489) T ss_pred HHHHH---HHhhc-----CCCcccCCccc------------CCCCcccccC Confidence 65444 55443 22233221111 1111111111 No 102 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=96.01 E-value=0.0011 Score=36.73 Aligned_cols=495 Identities=11% Similarity=0.009 Sum_probs=192.4 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeec-----------CcceEEE Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARET-----------QTPYVIF 69 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~-----------~~~y~~~ 69 (589) |-| =+++.++..++. |+..-+..+ +-..+ -+++.+|. +.+|+||.... .-|-+++ T Consensus 1 m~~-----~~~~~~~~~~~~---~~~~~~~~~-~~r~~---~~~D~~f~--~~~G~QW~~~~~~~l~~~~q~~grP~~~~ 66 (708) T protein:vir:10 1 MAE-----TLEKKHERIMLR---FDRAYSPQK-EVREK---CIEATRFA--RVPGGQWEGATAAGTKLDEQFEKYPKFEI 66 (708) T ss_pred Cch-----hHHHHHHHHHHH---HHHHHHhhH-HHHHH---HHHHHHhh--cCCCCCCCHHHHHHHHHhhhhcCCCceEE Confidence 222 122233333332 222222111 11111 12222221 34577775321 2378999 Q ss_pred EcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccc Q lcl|NC_020883. 70 NLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRH 149 (589) Q Consensus 70 n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~ 149 (589) |+.+.++.. . +|....+-+ +.-+.|.+... .. +.++.=..++..+...|+..... T Consensus 67 N~i~~~v~~--v-----~g~~~~nr~-------------d~~v~P~~~~~--d~---~~Ae~l~~~~~~~~~~~~~~~~~ 121 (708) T protein:vir:10 67 NKVATELNR--I-----IAEYRNNRI-------------TVKFRPGDREA--SE---ELANKLNGLFRADYEETDGGEAC 121 (708) T ss_pred cchHHHHHH--H-----HHHHHhCCc-------------ceEEEcCCCCc--hH---HHHHHHHHHHHHHHHhcCchHHH Confidence 999988776 2 333322222 11122322111 11 22222344888888899988777 Q ss_pred hhhHHHHHHcCceeEEEEEe----------cCceeEEEe--cCceecc--c---ccCcceeEEEeecCCCc--------- Q lcl|NC_020883. 150 WSNIVQHQVDGGIVAAPVID----------ELGPRIVFK--ARDVYFP--H---DDEKGADLAYYIDHGQY--------- 203 (589) Q Consensus 150 ~~~l~~~~v~Gg~~~~~~~~----------~~~~~i~f~--~~d~~~P--~---~d~~~~div~~~e~~~~--------- 203 (589) -+.+.++++.|=-..++.-| ..++.|.-+ +...+|. + -|..-|.+++...+--. T Consensus 122 s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~ 201 (708) T protein:vir:10 122 DNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGK 201 (708) T ss_pred HHHHHhhhhcccceeeeeeccccccCCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCC Confidence 77777766665444444322 122333222 2223332 1 12222333332221110 Q ss_pred ---------------------c-ceEEEE-Eeeeccccceeehhhhccccccchhheeecccccccccccccccchhhh- Q lcl|NC_020883. 204 ---------------------G-QFLHIY-RERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEEL- 259 (589) Q Consensus 204 ---------------------~-~~l~~~-~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~- 259 (589) + -+|-.| +.+.....+. .|.. +. .|+.+. +.++...... T Consensus 202 ~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~~~~~~~----~~~~-~~---------tg~~~~---~~~~~~~~~~~ 264 (708) T protein:vir:10 202 KPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVI----SYRH-PI---------TGEIAT---YDSDQVEDIED 264 (708) T ss_pred CcccccccccCCCccccccCCCceEEEEeeeEEEEEEEEE----EEec-CC---------CCceee---ecchhhhhHHH Confidence 0 011111 1111100000 0000 00 111111 1111000000 Q ss_pred -hh------------cc------cCCcc-ccccccccCCCCcceEEEecCC---CCCCCcccCcchhhhhHHHHHHHHHH Q lcl|NC_020883. 260 -IR------------EV------LNIPD-DRPLENFYPGRNRPFISYWANN---ETFMNPYGISALDNLESKQDEINWTI 316 (589) Q Consensus 260 -i~------------~~------i~ip~-~~e~~~i~TGv~~plvvyvPN~---~~~~~~lG~SD~~~ie~l~DeLd~t~ 316 (589) .. .+ +-.+. ..+...-.+|...|+|.+|-.. .....||| -+.++.+.++.+|.+. T Consensus 265 ~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG--~vr~~kd~Q~~~N~~~ 342 (708) T protein:vir:10 265 ELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEG--HIAKAMDPQRLYNLQV 342 (708) T ss_pred HHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccCCCcccce--eecccchhHHHHHHHH Confidence 00 00 00000 1122233455566666665332 22223343 4677899999999999 Q ss_pred hHHHHHHHHhCCCcEEechhhhhccccc--cccccccccccccccccccccccccccccccccCccceeeecccHHHHHH Q lcl|NC_020883. 317 TRSAVIYEQNGKPRISITKEMMDTLLNI--AYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMD 394 (589) Q Consensus 317 S~~srildk~gkpRI~VP~~~L~t~~g~--~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~ 394 (589) |+....+-+.++...+++...+.....- ..+.+...+...+.. ...+..+. ..+..++.++.---..++++ T Consensus 343 S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~----~~~~G~~~---~~~~~~~~~q~~~~~~~~~~ 415 (708) T protein:vir:10 343 SMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREV----RDKSGNII---AGATPAGYTQPAVMNQALAA 415 (708) T ss_pred HHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccc----cccccccc---cccCCccccCCccchHHHHH Confidence 9988887666666656655555321000 011111111100000 00000111 12223444554445666788 Q ss_pred HHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC---ccc Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD---SSI 471 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~---~~~ 471 (589) .+......|-.+++.+..+.|.. ++ .||+|+..+..+........=..+..+++++.+.++.|-..+. +.+ T Consensus 416 l~q~~~~~i~~vsG~~~~~lG~~--sn----~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~ 489 (708) T protein:vir:10 416 LLQQTSADIQEVTGGSQAMQQMP--SN----IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREV 489 (708) T ss_pred HHHHHHHHHHHHhCcChhHccCc--cc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEE Confidence 88888888878899999888842 21 3788876555444444444445555666666666665554321 000 Q ss_pred -------------------Ccc-------------cceeeeCCcCCC-CCCHHHHHHHHHHHhccchhhHHH------HH Q lcl|NC_020883. 472 -------------------RIE-------------EPNIETQDMILK-PRAELVAENMAAYAASKQGQSLET------TV 512 (589) Q Consensus 472 -------------------~~e-------------~p~I~f~D~lPv-de~El~~A~t~~~l~~a~~~S~et------aV 512 (589) ++. +-+|...++-.. ...+...+.+++++......-..+ .+ T Consensus 490 RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l 569 (708) T protein:vir:10 490 RIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIIL 569 (708) T ss_pred EEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHH Confidence 000 012333332111 113333444555544433221111 12 Q ss_pred HHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCC-CCCCCcchhhhh--hc---ccc Q lcl|NC_020883. 513 RRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT-EEEPSAEENEEI--EK---EGE 586 (589) Q Consensus 513 r~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~-~eep~~~~~e~~--~~---~~~ 586 (589) ..+ |+. ..++-++||+.-.... +.. .+ +..|.+. .......+.++. +. ... T Consensus 570 ~~~--D~p--~~~ei~erir~~~~~~-----~~~--------~~------~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~ 626 (708) T protein:vir:10 570 DNI--DGE--GLDDFKEYNRNQLLIS-----GIA--------KP------RNEKEQQIVQQAQMAAQSQPNPEMVLAQAQ 626 (708) T ss_pred Hhc--CCc--ChHHHHHHHHHhhccc-----ccc--------cc------cchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 222 2344566665544321 100 00 0000000 000000000000 00 000 Q ss_pred cC---C Q lcl|NC_020883. 587 PI---A 589 (589) Q Consensus 587 ~~---~ 589 (589) -. | T Consensus 627 ~~~~qA 632 (708) T protein:vir:10 627 MVAAQA 632 (708) T ss_pred HHHHHH Confidence 00 0 No 103 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=96.00 E-value=0.0011 Score=36.73 Aligned_cols=497 Identities=13% Similarity=0.123 Sum_probs=195.8 Q ss_pred Cccce-----------eccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee-------ec Q lcl|NC_020883. 1 MIDWT-----------VRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR-------ET 62 (589) Q Consensus 1 ~~~~~-----------~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~-------~~ 62 (589) |-|=+ .+-+..+....+--+++.+++- |+ .-.+.-. +.+|+||.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------R~-~a~~d~~----fy~G~Qw~~~~~~~l~~~ 65 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKW----------RD-AANKACA----YYDGDQLPPEVLQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHH----------HH-HHHHHHH----hhcCCCCCHHHHHHHHhc Confidence 11100 0001111211222222222211 11 1111111 334666632 34 Q ss_pred CcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh Q lcl|NC_020883. 63 QTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN 142 (589) Q Consensus 63 ~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn 142 (589) ..|-+++|+.+.+++. . +|.-..+-+ +.-+.|.+. ..+... +.+.=..++..+..+ T Consensus 66 g~p~~~~N~i~~~v~~--v-----~g~~~~nr~-------------~~~v~p~~~-~~~~~~---~Ae~l~~~~~~~~~~ 121 (714) T protein:vir:81 66 GQPMTIHNLIAPTVDG--V-----LGMEAKTRT-------------DLVVMSDEP-DDETEK---LAEAINAEFADACRL 121 (714) T ss_pred CCCcEEeccHHHHHHH--H-----HhHHHhCCc-------------ceEEecCCC-CchhHH---HHHHHHHHHHHHHHh Confidence 5688999999988876 2 333333222 111223211 001112 222234588889999 Q ss_pred ccccccchhhHHHHHHcCceeEEEEEe----cCceeEEEecCceecccc-----cCcceeEEEeecCCCccceEEE---- Q lcl|NC_020883. 143 SKLERRHWSNIVQHQVDGGIVAAPVID----ELGPRIVFKARDVYFPHD-----DEKGADLAYYIDHGQYGQFLHI---- 209 (589) Q Consensus 143 ~~~~~~~~~~l~~~~v~Gg~~~~~~~~----~~~~~i~f~~~d~~~P~~-----d~~~~div~~~e~~~~~~~l~~---- 209 (589) |+.....-+++.+++..|=-+...+++ +..++|..+++.-+|..- |..-|.++++..+--.+.+--. T Consensus 122 ~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~ 201 (714) T protein:vir:81 122 GNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGM 201 (714) T ss_pred hchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCc Confidence 999988888888877777666777777 334888888888888732 2234555654443221110000 Q ss_pred ---------------------------------EEeee-------ccccceee-hhh-hccccccchhheeecccccccc Q lcl|NC_020883. 210 ---------------------------------YRERV-------EKDGLRTT-NML-YPVVKAKGDVKKEIKKGELVTN 247 (589) Q Consensus 210 ---------------------------------~~~~~-------~~~~~~~~-~~~-y~~~~~~~~~~~~~~~gd~~~~ 247 (589) ++.+. ..+.-.+. ... ||...+..... ...|..+.. T Consensus 202 a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~--~~~g~~~~~ 279 (714) T protein:vir:81 202 AQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE--LSNGRVVAF 279 (714) T ss_pred hhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec--cCCCceEEe Confidence 00000 00000000 000 11000000000 001111111 Q ss_pred cccccccchhhhhhcc----------------cCCccccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhHHHH Q lcl|NC_020883. 248 VEGAEDLEGEELIREV----------------LNIPDDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLESKQD 310 (589) Q Consensus 248 ~~e~~d~e~e~~i~~~----------------i~ip~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~l~D 310 (589) ... ...+..-.+... +|=.-+.+...-+++-..|+|.+|..... .+.|+| -+.++.+.++ T Consensus 280 d~~-~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr 356 (714) T protein:vir:81 280 DKN-NLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQD 356 (714) T ss_pred Ccc-CHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHH Confidence 000 000000000000 00001112222334445666666655322 234554 4578899999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC----ccceeeec Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR----SMEIHQID 386 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~----~~~~iq~D 386 (589) .+|.+.|+...++ .+|..+ +..+.+. +.+.+..-. ..++.....+ .+...+|. .++..+.- T Consensus 357 ~~N~~~s~~~~~l--~~~~~~-~~~~a~~-------~~d~~~~e~---~arp~~vi~~--~p~~~~~~~~~~~~~~~~~~ 421 (714) T protein:vir:81 357 EVNFRRIKLTWLL--QAKRVI-MDEDATQ-------LSDNDLMEQ---IERPDGIIKL--NPVRKNQKSVADVFRVEQDF 421 (714) T ss_pred HHHHHHHHHHHhh--cCCcee-eecCccc-------ccHHHHHHh---ccCCCCceee--cccccccCCCCccccccCCC Confidence 9999998887775 455444 4333321 111110000 0000000001 11111111 12222222 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND 466 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~ 466 (589) --..++++.+......|-.+++.+..++|... .+.||+|+..+..+........=..+..+++++.+.++.|-.. T Consensus 422 ~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-----na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~ 496 (714) T protein:vir:81 422 QVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-----GATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD 496 (714) T ss_pred CccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23566777777777777678899999998532 2458887765544433222333334445566665555544432 Q ss_pred cCc---ccCc--c----------------------------cceeeeCCcCCCCC--CHHHHHHHHHHHhcc----chhh Q lcl|NC_020883. 467 QDS---SIRI--E----------------------------EPNIETQDMILKPR--AELVAENMAAYAASK----QGQS 507 (589) Q Consensus 467 ~~~---~~~~--e----------------------------~p~I~f~D~lPvde--~El~~A~t~~~l~~a----~~~S 507 (589) +.. .+-+ + +-+|...++ |.-. .+...+.+++++..- +... T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~-p~~~t~r~~~~~~l~~l~~~~~p~~~~~~ 575 (714) T protein:vir:81 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV-QQTPAFKAQLAQRMSEVIQGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec-cCchHHHHHHHHHHHHHHhhcCchhhhhH Confidence 210 0000 0 012222222 1111 122333444443321 1122 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc-- Q lcl|NC_020883. 508 LETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG-- 585 (589) Q Consensus 508 ~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~-- 585 (589) ....|..++ +. ..++-++||++-+...+ +.++-.+++.+....-+++++.. T Consensus 576 ~~~~l~~~d--~p--~~~el~~~ir~~~~~~~-----------------------~~~~~~~e~q~~~~~~q~~~~~q~~ 628 (714) T protein:vir:81 576 LDLWVNLLD--VP--QKQEFVERIRAALGTPK-----------------------SPDEMTPEEQEVAAQQQALQQQQAE 628 (714) T ss_pred HHHHHHhcC--CC--CHHHHHHHHHHHcCCCC-----------------------CccccchhhHHHHHHHHHHHHHHHH Confidence 333343332 32 23345667765332110 00000111111111111111100 Q ss_pred -------------ccCC Q lcl|NC_020883. 586 -------------EPIA 589 (589) Q Consensus 586 -------------~~~~ 589 (589) +.-+ T Consensus 629 lq~~~~~a~~~k~eae~ 645 (714) T protein:vir:81 629 LQMREMAGRVAKLEADA 645 (714) T ss_pred HHHHHHHHHHHHHHHHH Confidence 0000 No 104 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=96.00 E-value=0.0011 Score=36.73 Aligned_cols=497 Identities=13% Similarity=0.123 Sum_probs=195.8 Q ss_pred Cccce-----------eccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee-------ec Q lcl|NC_020883. 1 MIDWT-----------VRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR-------ET 62 (589) Q Consensus 1 ~~~~~-----------~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~-------~~ 62 (589) |-|=+ .+-+..+....+--+++.+++- |+ .-.+.-. +.+|+||.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------R~-~a~~d~~----fy~G~Qw~~~~~~~l~~~ 65 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKW----------RD-AANKACA----YYDGDQLPPEVLQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHH----------HH-HHHHHHH----hhcCCCCCHHHHHHHHhc Confidence 11100 0001111211222222222211 11 1111111 334666632 34 Q ss_pred CcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh Q lcl|NC_020883. 63 QTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN 142 (589) Q Consensus 63 ~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn 142 (589) ..|-+++|+.+.+++. . +|.-..+-+ +.-+.|.+. ..+... +.+.=..++..+..+ T Consensus 66 g~p~~~~N~i~~~v~~--v-----~g~~~~nr~-------------~~~v~p~~~-~~~~~~---~Ae~l~~~~~~~~~~ 121 (714) T protein:vir:32 66 GQPMTIHNLIAPTVDG--V-----LGMEAKTRT-------------DLVVMSDEP-DDETEK---LAEAINAEFADACRL 121 (714) T ss_pred CCCcEEeccHHHHHHH--H-----HhHHHhCCc-------------ceEEecCCC-CchhHH---HHHHHHHHHHHHHHh Confidence 5688999999988876 2 333333222 111223211 001112 222234588889999 Q ss_pred ccccccchhhHHHHHHcCceeEEEEEe----cCceeEEEecCceecccc-----cCcceeEEEeecCCCccceEEE---- Q lcl|NC_020883. 143 SKLERRHWSNIVQHQVDGGIVAAPVID----ELGPRIVFKARDVYFPHD-----DEKGADLAYYIDHGQYGQFLHI---- 209 (589) Q Consensus 143 ~~~~~~~~~~l~~~~v~Gg~~~~~~~~----~~~~~i~f~~~d~~~P~~-----d~~~~div~~~e~~~~~~~l~~---- 209 (589) |+.....-+++.+++..|=-+...+++ +..++|..+++.-+|..- |..-|.++++..+--.+.+--. T Consensus 122 ~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~ 201 (714) T protein:vir:32 122 GNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGM 201 (714) T ss_pred hchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCc Confidence 999988888888877777666777777 334888888888888732 2234555654443221110000 Q ss_pred ---------------------------------EEeee-------ccccceee-hhh-hccccccchhheeecccccccc Q lcl|NC_020883. 210 ---------------------------------YRERV-------EKDGLRTT-NML-YPVVKAKGDVKKEIKKGELVTN 247 (589) Q Consensus 210 ---------------------------------~~~~~-------~~~~~~~~-~~~-y~~~~~~~~~~~~~~~gd~~~~ 247 (589) ++.+. ..+.-.+. ... ||...+..... ...|..+.. T Consensus 202 a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~--~~~g~~~~~ 279 (714) T protein:vir:32 202 AQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE--LSNGRVVAF 279 (714) T ss_pred hhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec--cCCCceEEe Confidence 00000 00000000 000 11000000000 001111111 Q ss_pred cccccccchhhhhhcc----------------cCCccccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhHHHH Q lcl|NC_020883. 248 VEGAEDLEGEELIREV----------------LNIPDDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLESKQD 310 (589) Q Consensus 248 ~~e~~d~e~e~~i~~~----------------i~ip~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~l~D 310 (589) ... ...+..-.+... +|=.-+.+...-+++-..|+|.+|..... .+.|+| -+.++.+.++ T Consensus 280 d~~-~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr 356 (714) T protein:vir:32 280 DKN-NLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQD 356 (714) T ss_pred Ccc-CHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHH Confidence 000 000000000000 00001112222334445666666655322 234554 4578899999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC----ccceeeec Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR----SMEIHQID 386 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~----~~~~iq~D 386 (589) .+|.+.|+...++ .+|..+ +..+.+. +.+.+..-. ..++.....+ .+...+|. .++..+.- T Consensus 357 ~~N~~~s~~~~~l--~~~~~~-~~~~a~~-------~~d~~~~e~---~arp~~vi~~--~p~~~~~~~~~~~~~~~~~~ 421 (714) T protein:vir:32 357 EVNFRRIKLTWLL--QAKRVI-MDEDATQ-------LSDNDLMEQ---IERPDGIIKL--NPVRKNQKSVADVFRVEQDF 421 (714) T ss_pred HHHHHHHHHHHhh--cCCcee-eecCccc-------ccHHHHHHh---ccCCCCceee--cccccccCCCCccccccCCC Confidence 9999998887775 455444 4333321 111110000 0000000001 11111111 12222222 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND 466 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~ 466 (589) --..++++.+......|-.+++.+..++|... .+.||+|+..+..+........=..+..+++++.+.++.|-.. T Consensus 422 ~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-----na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~ 496 (714) T protein:vir:32 422 QVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-----GATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD 496 (714) T ss_pred CccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23566777777777777678899999998532 2458887765544433222333334445566665555544432 Q ss_pred cCc---ccCc--c----------------------------cceeeeCCcCCCCC--CHHHHHHHHHHHhcc----chhh Q lcl|NC_020883. 467 QDS---SIRI--E----------------------------EPNIETQDMILKPR--AELVAENMAAYAASK----QGQS 507 (589) Q Consensus 467 ~~~---~~~~--e----------------------------~p~I~f~D~lPvde--~El~~A~t~~~l~~a----~~~S 507 (589) +.. .+-+ + +-+|...++ |.-. .+...+.+++++..- +... T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~-p~~~t~r~~~~~~l~~l~~~~~p~~~~~~ 575 (714) T protein:vir:32 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV-QQTPAFKAQLAQRMSEVIQGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec-cCchHHHHHHHHHHHHHHhhcCchhhhhH Confidence 210 0000 0 012222222 1111 122333444443321 1122 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc-- Q lcl|NC_020883. 508 LETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG-- 585 (589) Q Consensus 508 ~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~-- 585 (589) ....|..++ +. ..++-++||++-+...+ +.++-.+++.+....-+++++.. T Consensus 576 ~~~~l~~~d--~p--~~~el~~~ir~~~~~~~-----------------------~~~~~~~e~q~~~~~~q~~~~~q~~ 628 (714) T protein:vir:32 576 LDLWVNLLD--VP--QKQEFVERIRAALGTPK-----------------------SPDEMTPEEQEVAAQQQALQQQQAE 628 (714) T ss_pred HHHHHHhcC--CC--CHHHHHHHHHHHcCCCC-----------------------CccccchhhHHHHHHHHHHHHHHHH Confidence 333343332 32 23345667765332110 00000111111111111111100 Q ss_pred -------------ccCC Q lcl|NC_020883. 586 -------------EPIA 589 (589) Q Consensus 586 -------------~~~~ 589 (589) +.-+ T Consensus 629 lq~~~~~a~~~k~eae~ 645 (714) T protein:vir:32 629 LQMREMAGRVAKLEADA 645 (714) T ss_pred HHHHHHHHHHHHHHHHH Confidence 0000 No 105 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=96.00 E-value=0.0011 Score=36.73 Aligned_cols=497 Identities=13% Similarity=0.123 Sum_probs=195.8 Q ss_pred Cccce-----------eccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee-------ec Q lcl|NC_020883. 1 MIDWT-----------VRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR-------ET 62 (589) Q Consensus 1 ~~~~~-----------~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~-------~~ 62 (589) |-|=+ .+-+..+....+--+++.+++- |+ .-.+.-. +.+|+||.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------R~-~a~~d~~----fy~G~Qw~~~~~~~l~~~ 65 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKW----------RD-AANKACA----YYDGDQLPPEVLQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHH----------HH-HHHHHHH----hhcCCCCCHHHHHHHHhc Confidence 11100 0001111211222222222211 11 1111111 334666632 34 Q ss_pred CcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh Q lcl|NC_020883. 63 QTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN 142 (589) Q Consensus 63 ~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn 142 (589) ..|-+++|+.+.+++. . +|.-..+-+ +.-+.|.+. ..+... +.+.=..++..+..+ T Consensus 66 g~p~~~~N~i~~~v~~--v-----~g~~~~nr~-------------~~~v~p~~~-~~~~~~---~Ae~l~~~~~~~~~~ 121 (714) T protein:vir:10 66 GQPMTIHNLIAPTVDG--V-----LGMEAKTRT-------------DLVVMSDEP-DDETEK---LAEAINAEFADACRL 121 (714) T ss_pred CCCcEEeccHHHHHHH--H-----HhHHHhCCc-------------ceEEecCCC-CchhHH---HHHHHHHHHHHHHHh Confidence 5688999999988876 2 333333222 111223211 001112 222234588889999 Q ss_pred ccccccchhhHHHHHHcCceeEEEEEe----cCceeEEEecCceecccc-----cCcceeEEEeecCCCccceEEE---- Q lcl|NC_020883. 143 SKLERRHWSNIVQHQVDGGIVAAPVID----ELGPRIVFKARDVYFPHD-----DEKGADLAYYIDHGQYGQFLHI---- 209 (589) Q Consensus 143 ~~~~~~~~~~l~~~~v~Gg~~~~~~~~----~~~~~i~f~~~d~~~P~~-----d~~~~div~~~e~~~~~~~l~~---- 209 (589) |+.....-+++.+++..|=-+...+++ +..++|..+++.-+|..- |..-|.++++..+--.+.+--. T Consensus 122 ~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~ 201 (714) T protein:vir:10 122 GNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGM 201 (714) T ss_pred hchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCc Confidence 999988888888877777666777777 334888888888888732 2234555654443221110000 Q ss_pred ---------------------------------EEeee-------ccccceee-hhh-hccccccchhheeecccccccc Q lcl|NC_020883. 210 ---------------------------------YRERV-------EKDGLRTT-NML-YPVVKAKGDVKKEIKKGELVTN 247 (589) Q Consensus 210 ---------------------------------~~~~~-------~~~~~~~~-~~~-y~~~~~~~~~~~~~~~gd~~~~ 247 (589) ++.+. ..+.-.+. ... ||...+..... ...|..+.. T Consensus 202 a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~--~~~g~~~~~ 279 (714) T protein:vir:10 202 AQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE--LSNGRVVAF 279 (714) T ss_pred hhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec--cCCCceEEe Confidence 00000 00000000 000 11000000000 001111111 Q ss_pred cccccccchhhhhhcc----------------cCCccccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhHHHH Q lcl|NC_020883. 248 VEGAEDLEGEELIREV----------------LNIPDDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLESKQD 310 (589) Q Consensus 248 ~~e~~d~e~e~~i~~~----------------i~ip~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~l~D 310 (589) ... ...+..-.+... +|=.-+.+...-+++-..|+|.+|..... .+.|+| -+.++.+.++ T Consensus 280 d~~-~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr 356 (714) T protein:vir:10 280 DKN-NLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQD 356 (714) T ss_pred Ccc-CHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHH Confidence 000 000000000000 00001112222334445666666655322 234554 4578899999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC----ccceeeec Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR----SMEIHQID 386 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~----~~~~iq~D 386 (589) .+|.+.|+...++ .+|..+ +..+.+. +.+.+..-. ..++.....+ .+...+|. .++..+.- T Consensus 357 ~~N~~~s~~~~~l--~~~~~~-~~~~a~~-------~~d~~~~e~---~arp~~vi~~--~p~~~~~~~~~~~~~~~~~~ 421 (714) T protein:vir:10 357 EVNFRRIKLTWLL--QAKRVI-MDEDATQ-------LSDNDLMEQ---IERPDGIIKL--NPVRKNQKSVADVFRVEQDF 421 (714) T ss_pred HHHHHHHHHHHhh--cCCcee-eecCccc-------ccHHHHHHh---ccCCCCceee--cccccccCCCCccccccCCC Confidence 9999998887775 455444 4333321 111110000 0000000001 11111111 12222222 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND 466 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~ 466 (589) --..++++.+......|-.+++.+..++|... .+.||+|+..+..+........=..+..+++++.+.++.|-.. T Consensus 422 ~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-----na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~ 496 (714) T protein:vir:10 422 QVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-----GATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD 496 (714) T ss_pred CccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23566777777777777678899999998532 2458887765544433222333334445566665555544432 Q ss_pred cCc---ccCc--c----------------------------cceeeeCCcCCCCC--CHHHHHHHHHHHhcc----chhh Q lcl|NC_020883. 467 QDS---SIRI--E----------------------------EPNIETQDMILKPR--AELVAENMAAYAASK----QGQS 507 (589) Q Consensus 467 ~~~---~~~~--e----------------------------~p~I~f~D~lPvde--~El~~A~t~~~l~~a----~~~S 507 (589) +.. .+-+ + +-+|...++ |.-. .+...+.+++++..- +... T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~-p~~~t~r~~~~~~l~~l~~~~~p~~~~~~ 575 (714) T protein:vir:10 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV-QQTPAFKAQLAQRMSEVIQGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec-cCchHHHHHHHHHHHHHHhhcCchhhhhH Confidence 210 0000 0 012222222 1111 122333444443321 1122 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc-- Q lcl|NC_020883. 508 LETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG-- 585 (589) Q Consensus 508 ~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~-- 585 (589) ....|..++ +. ..++-++||++-+...+ +.++-.+++.+....-+++++.. T Consensus 576 ~~~~l~~~d--~p--~~~el~~~ir~~~~~~~-----------------------~~~~~~~e~q~~~~~~q~~~~~q~~ 628 (714) T protein:vir:10 576 LDLWVNLLD--VP--QKQEFVERIRAALGTPK-----------------------SPDEMTPEEQEVAAQQQALQQQQAE 628 (714) T ss_pred HHHHHHhcC--CC--CHHHHHHHHHHHcCCCC-----------------------CccccchhhHHHHHHHHHHHHHHHH Confidence 333343332 32 23345667765332110 00000111111111111111100 Q ss_pred -------------ccCC Q lcl|NC_020883. 586 -------------EPIA 589 (589) Q Consensus 586 -------------~~~~ 589 (589) +.-+ T Consensus 629 lq~~~~~a~~~k~eae~ 645 (714) T protein:vir:10 629 LQMREMAGRVAKLEADA 645 (714) T ss_pred HHHHHHHHHHHHHHHHH Confidence 0000 No 106 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=96.00 E-value=0.0011 Score=36.73 Aligned_cols=497 Identities=13% Similarity=0.123 Sum_probs=195.8 Q ss_pred Cccce-----------eccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee-------ec Q lcl|NC_020883. 1 MIDWT-----------VRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR-------ET 62 (589) Q Consensus 1 ~~~~~-----------~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~-------~~ 62 (589) |-|=+ .+-+..+....+--+++.+++- |+ .-.+.-. +.+|+||.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------R~-~a~~d~~----fy~G~Qw~~~~~~~l~~~ 65 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKW----------RD-AANKACA----YYDGDQLPPEVLQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHH----------HH-HHHHHHH----hhcCCCCCHHHHHHHHhc Confidence 11100 0001111211222222222211 11 1111111 334666632 34 Q ss_pred CcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh Q lcl|NC_020883. 63 QTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN 142 (589) Q Consensus 63 ~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn 142 (589) ..|-+++|+.+.+++. . +|.-..+-+ +.-+.|.+. ..+... +.+.=..++..+..+ T Consensus 66 g~p~~~~N~i~~~v~~--v-----~g~~~~nr~-------------~~~v~p~~~-~~~~~~---~Ae~l~~~~~~~~~~ 121 (714) T protein:vir:99 66 GQPMTIHNLIAPTVDG--V-----LGMEAKTRT-------------DLVVMSDEP-DDETEK---LAEAINAEFADACRL 121 (714) T ss_pred CCCcEEeccHHHHHHH--H-----HhHHHhCCc-------------ceEEecCCC-CchhHH---HHHHHHHHHHHHHHh Confidence 5688999999988876 2 333333222 111223211 001112 222234588889999 Q ss_pred ccccccchhhHHHHHHcCceeEEEEEe----cCceeEEEecCceecccc-----cCcceeEEEeecCCCccceEEE---- Q lcl|NC_020883. 143 SKLERRHWSNIVQHQVDGGIVAAPVID----ELGPRIVFKARDVYFPHD-----DEKGADLAYYIDHGQYGQFLHI---- 209 (589) Q Consensus 143 ~~~~~~~~~~l~~~~v~Gg~~~~~~~~----~~~~~i~f~~~d~~~P~~-----d~~~~div~~~e~~~~~~~l~~---- 209 (589) |+.....-+++.+++..|=-+...+++ +..++|..+++.-+|..- |..-|.++++..+--.+.+--. T Consensus 122 ~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~ 201 (714) T protein:vir:99 122 GNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGM 201 (714) T ss_pred hchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCc Confidence 999988888888877777666777777 334888888888888732 2234555654443221110000 Q ss_pred ---------------------------------EEeee-------ccccceee-hhh-hccccccchhheeecccccccc Q lcl|NC_020883. 210 ---------------------------------YRERV-------EKDGLRTT-NML-YPVVKAKGDVKKEIKKGELVTN 247 (589) Q Consensus 210 ---------------------------------~~~~~-------~~~~~~~~-~~~-y~~~~~~~~~~~~~~~gd~~~~ 247 (589) ++.+. ..+.-.+. ... ||...+..... ...|..+.. T Consensus 202 a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~--~~~g~~~~~ 279 (714) T protein:vir:99 202 AQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE--LSNGRVVAF 279 (714) T ss_pred hhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec--cCCCceEEe Confidence 00000 00000000 000 11000000000 001111111 Q ss_pred cccccccchhhhhhcc----------------cCCccccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhHHHH Q lcl|NC_020883. 248 VEGAEDLEGEELIREV----------------LNIPDDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLESKQD 310 (589) Q Consensus 248 ~~e~~d~e~e~~i~~~----------------i~ip~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~l~D 310 (589) ... ...+..-.+... +|=.-+.+...-+++-..|+|.+|..... .+.|+| -+.++.+.++ T Consensus 280 d~~-~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr 356 (714) T protein:vir:99 280 DKN-NLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQD 356 (714) T ss_pred Ccc-CHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHH Confidence 000 000000000000 00001112222334445666666655322 234554 4578899999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC----ccceeeec Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR----SMEIHQID 386 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~----~~~~iq~D 386 (589) .+|.+.|+...++ .+|..+ +..+.+. +.+.+..-. ..++.....+ .+...+|. .++..+.- T Consensus 357 ~~N~~~s~~~~~l--~~~~~~-~~~~a~~-------~~d~~~~e~---~arp~~vi~~--~p~~~~~~~~~~~~~~~~~~ 421 (714) T protein:vir:99 357 EVNFRRIKLTWLL--QAKRVI-MDEDATQ-------LSDNDLMEQ---IERPDGIIKL--NPVRKNQKSVADVFRVEQDF 421 (714) T ss_pred HHHHHHHHHHHhh--cCCcee-eecCccc-------ccHHHHHHh---ccCCCCceee--cccccccCCCCccccccCCC Confidence 9999998887775 455444 4333321 111110000 0000000001 11111111 12222222 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND 466 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~ 466 (589) --..++++.+......|-.+++.+..++|... .+.||+|+..+..+........=..+..+++++.+.++.|-.. T Consensus 422 ~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-----na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~ 496 (714) T protein:vir:99 422 QVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-----GATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD 496 (714) T ss_pred CccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23566777777777777678899999998532 2458887765544433222333334445566665555544432 Q ss_pred cCc---ccCc--c----------------------------cceeeeCCcCCCCC--CHHHHHHHHHHHhcc----chhh Q lcl|NC_020883. 467 QDS---SIRI--E----------------------------EPNIETQDMILKPR--AELVAENMAAYAASK----QGQS 507 (589) Q Consensus 467 ~~~---~~~~--e----------------------------~p~I~f~D~lPvde--~El~~A~t~~~l~~a----~~~S 507 (589) +.. .+-+ + +-+|...++ |.-. .+...+.+++++..- +... T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~-p~~~t~r~~~~~~l~~l~~~~~p~~~~~~ 575 (714) T protein:vir:99 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV-QQTPAFKAQLAQRMSEVIQGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec-cCchHHHHHHHHHHHHHHhhcCchhhhhH Confidence 210 0000 0 012222222 1111 122333444443321 1122 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc-- Q lcl|NC_020883. 508 LETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG-- 585 (589) Q Consensus 508 ~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~-- 585 (589) ....|..++ +. ..++-++||++-+...+ +.++-.+++.+....-+++++.. T Consensus 576 ~~~~l~~~d--~p--~~~el~~~ir~~~~~~~-----------------------~~~~~~~e~q~~~~~~q~~~~~q~~ 628 (714) T protein:vir:99 576 LDLWVNLLD--VP--QKQEFVERIRAALGTPK-----------------------SPDEMTPEEQEVAAQQQALQQQQAE 628 (714) T ss_pred HHHHHHhcC--CC--CHHHHHHHHHHHcCCCC-----------------------CccccchhhHHHHHHHHHHHHHHHH Confidence 333343332 32 23345667765332110 00000111111111111111100 Q ss_pred -------------ccCC Q lcl|NC_020883. 586 -------------EPIA 589 (589) Q Consensus 586 -------------~~~~ 589 (589) +.-+ T Consensus 629 lq~~~~~a~~~k~eae~ 645 (714) T protein:vir:99 629 LQMREMAGRVAKLEADA 645 (714) T ss_pred HHHHHHHHHHHHHHHHH Confidence 0000 No 107 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=96.00 E-value=0.0011 Score=36.73 Aligned_cols=497 Identities=13% Similarity=0.123 Sum_probs=195.8 Q ss_pred Cccce-----------eccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee-------ec Q lcl|NC_020883. 1 MIDWT-----------VRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR-------ET 62 (589) Q Consensus 1 ~~~~~-----------~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~-------~~ 62 (589) |-|=+ .+-+..+....+--+++.+++- |+ .-.+.-. +.+|+||.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------R~-~a~~d~~----fy~G~Qw~~~~~~~l~~~ 65 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKW----------RD-AANKACA----YYDGDQLPPEVLQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHH----------HH-HHHHHHH----hhcCCCCCHHHHHHHHhc Confidence 11100 0001111211222222222211 11 1111111 334666632 34 Q ss_pred CcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh Q lcl|NC_020883. 63 QTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN 142 (589) Q Consensus 63 ~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn 142 (589) ..|-+++|+.+.+++. . +|.-..+-+ +.-+.|.+. ..+... +.+.=..++..+..+ T Consensus 66 g~p~~~~N~i~~~v~~--v-----~g~~~~nr~-------------~~~v~p~~~-~~~~~~---~Ae~l~~~~~~~~~~ 121 (714) T protein:vir:27 66 GQPMTIHNLIAPTVDG--V-----LGMEAKTRT-------------DLVVMSDEP-DDETEK---LAEAINAEFADACRL 121 (714) T ss_pred CCCcEEeccHHHHHHH--H-----HhHHHhCCc-------------ceEEecCCC-CchhHH---HHHHHHHHHHHHHHh Confidence 5688999999988876 2 333333222 111223211 001112 222234588889999 Q ss_pred ccccccchhhHHHHHHcCceeEEEEEe----cCceeEEEecCceecccc-----cCcceeEEEeecCCCccceEEE---- Q lcl|NC_020883. 143 SKLERRHWSNIVQHQVDGGIVAAPVID----ELGPRIVFKARDVYFPHD-----DEKGADLAYYIDHGQYGQFLHI---- 209 (589) Q Consensus 143 ~~~~~~~~~~l~~~~v~Gg~~~~~~~~----~~~~~i~f~~~d~~~P~~-----d~~~~div~~~e~~~~~~~l~~---- 209 (589) |+.....-+++.+++..|=-+...+++ +..++|..+++.-+|..- |..-|.++++..+--.+.+--. T Consensus 122 ~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~ 201 (714) T protein:vir:27 122 GNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGM 201 (714) T ss_pred hchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCc Confidence 999988888888877777666777777 334888888888888732 2234555654443221110000 Q ss_pred ---------------------------------EEeee-------ccccceee-hhh-hccccccchhheeecccccccc Q lcl|NC_020883. 210 ---------------------------------YRERV-------EKDGLRTT-NML-YPVVKAKGDVKKEIKKGELVTN 247 (589) Q Consensus 210 ---------------------------------~~~~~-------~~~~~~~~-~~~-y~~~~~~~~~~~~~~~gd~~~~ 247 (589) ++.+. ..+.-.+. ... ||...+..... ...|..+.. T Consensus 202 a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~--~~~g~~~~~ 279 (714) T protein:vir:27 202 AQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE--LSNGRVVAF 279 (714) T ss_pred hhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeec--cCCCceEEe Confidence 00000 00000000 000 11000000000 001111111 Q ss_pred cccccccchhhhhhcc----------------cCCccccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhHHHH Q lcl|NC_020883. 248 VEGAEDLEGEELIREV----------------LNIPDDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLESKQD 310 (589) Q Consensus 248 ~~e~~d~e~e~~i~~~----------------i~ip~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~l~D 310 (589) ... ...+..-.+... +|=.-+.+...-+++-..|+|.+|..... .+.|+| -+.++.+.++ T Consensus 280 d~~-~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr 356 (714) T protein:vir:27 280 DKN-NLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQD 356 (714) T ss_pred Ccc-CHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHH Confidence 000 000000000000 00001112222334445666666655322 234554 4578899999 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccC----ccceeeec Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGR----SMEIHQID 386 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~----~~~~iq~D 386 (589) .+|.+.|+...++ .+|..+ +..+.+. +.+.+..-. ..++.....+ .+...+|. .++..+.- T Consensus 357 ~~N~~~s~~~~~l--~~~~~~-~~~~a~~-------~~d~~~~e~---~arp~~vi~~--~p~~~~~~~~~~~~~~~~~~ 421 (714) T protein:vir:27 357 EVNFRRIKLTWLL--QAKRVI-MDEDATQ-------LSDNDLMEQ---IERPDGIIKL--NPVRKNQKSVADVFRVEQDF 421 (714) T ss_pred HHHHHHHHHHHhh--cCCcee-eecCccc-------ccHHHHHHh---ccCCCCceee--cccccccCCCCccccccCCC Confidence 9999998887775 455444 4333321 111110000 0000000001 11111111 12222222 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLND 466 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~ 466 (589) --..++++.+......|-.+++.+..++|... .+.||+|+..+..+........=..+..+++++.+.++.|-.. T Consensus 422 ~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-----na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~ 496 (714) T protein:vir:27 422 QVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-----GATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD 496 (714) T ss_pred CccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23566777777777777678899999998532 2458887765544433222333334445566665555544432 Q ss_pred cCc---ccCc--c----------------------------cceeeeCCcCCCCC--CHHHHHHHHHHHhcc----chhh Q lcl|NC_020883. 467 QDS---SIRI--E----------------------------EPNIETQDMILKPR--AELVAENMAAYAASK----QGQS 507 (589) Q Consensus 467 ~~~---~~~~--e----------------------------~p~I~f~D~lPvde--~El~~A~t~~~l~~a----~~~S 507 (589) +.. .+-+ + +-+|...++ |.-. .+...+.+++++..- +... T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~-p~~~t~r~~~~~~l~~l~~~~~p~~~~~~ 575 (714) T protein:vir:27 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV-QQTPAFKAQLAQRMSEVIQGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec-cCchHHHHHHHHHHHHHHhhcCchhhhhH Confidence 210 0000 0 012222222 1111 122333444443321 1122 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhccc-- Q lcl|NC_020883. 508 LETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG-- 585 (589) Q Consensus 508 ~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~-- 585 (589) ....|..++ +. ..++-++||++-+...+ +.++-.+++.+....-+++++.. T Consensus 576 ~~~~l~~~d--~p--~~~el~~~ir~~~~~~~-----------------------~~~~~~~e~q~~~~~~q~~~~~q~~ 628 (714) T protein:vir:27 576 LDLWVNLLD--VP--QKQEFVERIRAALGTPK-----------------------SPDEMTPEEQEVAAQQQALQQQQAE 628 (714) T ss_pred HHHHHHhcC--CC--CHHHHHHHHHHHcCCCC-----------------------CccccchhhHHHHHHHHHHHHHHHH Confidence 333343332 32 23345667765332110 00000111111111111111100 Q ss_pred -------------ccCC Q lcl|NC_020883. 586 -------------EPIA 589 (589) Q Consensus 586 -------------~~~~ 589 (589) +.-+ T Consensus 629 lq~~~~~a~~~k~eae~ 645 (714) T protein:vir:27 629 LQMREMAGRVAKLEADA 645 (714) T ss_pred HHHHHHHHHHHHHHHHH Confidence 0000 No 108 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=95.86 E-value=0.0013 Score=36.32 Aligned_cols=502 Identities=10% Similarity=0.011 Sum_probs=202.5 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee------ecCcceEEEEcchh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR------ETQTPYVIFNLPKV 74 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~------~~~~~y~~~n~~~~ 74 (589) |-| +++.+..+|+-|.++..-- .+ -|+ +-+++-.|. +|+||.. ..+.+ .++|+.+. T Consensus 1 m~d------~~~~~~~~~~~~~~~~~~~----~~--~r~-~a~~d~~fy----~G~Qw~~~~~~~l~~q~r-p~~N~i~~ 62 (725) T protein:vir:92 1 MAD------NENRLESILSRFDADWTAS----DE--ARR-EAKNDLFFS----RISQWDDWLSQYTTLQYR-GQFDVVRP 62 (725) T ss_pred CCc------hHHHHHHHHHHHHHHHHhh----HH--HHH-HHHHHHHhh----cCCCCCHHHHHHHHhcCC-CcccchHH Confidence 555 3455555555544443210 00 011 111222222 3555532 11222 26788887 Q ss_pred hhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHH Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIV 154 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~ 154 (589) +++. . +|.-..+-+ +-. +.|.+. ...+.+++ =..++..+...|+.....-+++. T Consensus 63 ~i~~--v-----~g~e~~nr~------------d~~-v~P~~~---~d~~~Ae~---l~~~~~~~~~~~~~~~a~s~Af~ 116 (725) T protein:vir:92 63 VVRK--L-----VSEMRQNPI------------DVL-YRPKDG---ASPDAADV---LMGMYRTDMRHNTAKIAVNVAVR 116 (725) T ss_pred HHHH--H-----HhhHHhCCc------------ceE-EecCCc---cHHHHHHH---HHHHHHHHHHhhCchHHHHHHHH Confidence 7665 2 333222111 111 223221 11222222 24478888889999988888888 Q ss_pred HHHHcCceeEEEEEe---c----CceeEEEec----CceecccccCc-----ceeEEEeecCCC----------ccc--- Q lcl|NC_020883. 155 QHQVDGGIVAAPVID---E----LGPRIVFKA----RDVYFPHDDEK-----GADLAYYIDHGQ----------YGQ--- 205 (589) Q Consensus 155 ~~~v~Gg~~~~~~~~---~----~~~~i~f~~----~d~~~P~~d~~-----~~div~~~e~~~----------~~~--- 205 (589) ++++.|=..+++..| . ..++|..+. ..++|.--+.+ -|.++|+.++-- .+. T Consensus 117 ~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~ 196 (725) T protein:vir:92 117 EQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDAD 196 (725) T ss_pred HHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchh Confidence 888777667666544 1 224444432 22233311111 122232222111 000 Q ss_pred -------eEEEEEeeeccccceeehhhhccccccchhheeec--ccccccccccccccchhhhhh----cc--------- Q lcl|NC_020883. 206 -------FLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK--KGELVTNVEGAEDLEGEELIR----EV--------- 263 (589) Q Consensus 206 -------~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~--~gd~~~~~~e~~d~e~e~~i~----~~--------- 263 (589) +...+..+.+.+.+++. +.|......-......+ .|+.+.. ..++++ .... .. T Consensus 197 ~~~~~~~~~~~~~~~~~~d~vrv~-e~~~r~~~~~~~~~~~d~~~g~~~~~--~~~~~~--~~~~~~~~~g~~~~~~r~~ 271 (725) T protein:vir:92 197 DIPSFQNPNDWVFPWLTQDTIQIA-EFYEVVEKKETAFIYQDPVTGEPVSY--FKRDIK--DVIDDLADSGFIKIAERQI 271 (725) T ss_pred hhhhcccCCcccccccCCCeEEEE-EEEEEEEEeeeEEeecCCCCCceeec--ChhhHH--HHHHHHhccCchhhhhccc Confidence 00001111111222211 11111000000000000 1221110 000000 0000 00 Q ss_pred --------cCCccc-cccccccCCCCcceEEEecCCC-CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEe Q lcl|NC_020883. 264 --------LNIPDD-RPLENFYPGRNRPFISYWANNE-TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISI 333 (589) Q Consensus 264 --------i~ip~~-~e~~~i~TGv~~plvvyvPN~~-~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~V 333 (589) +-.+.+ -+....++|...|+|.+|.... ..+.+++-+-+.++.+.++.+|.+.|....++-..++.+..+ T Consensus 272 ~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~ 351 (725) T protein:vir:92 272 KRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFF 351 (725) T ss_pred eeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCccccc Confidence 000111 1112234566667776665543 344566667889999999999999999888887778888888 Q ss_pred chhhhhccccccccc-cccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCch Q lcl|NC_020883. 334 TKEMMDTLLNIAYER-DGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEK 412 (589) Q Consensus 334 P~~~L~t~~g~~~d~-dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~ 412 (589) +.+.++.... .|+. +...+...+ +.......-....+.+++.---..++++.+......|-.+++.+.. T Consensus 352 ~~~~i~~~~~-~~~~~~~~~~~~~~---------~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~ 421 (725) T protein:vir:92 352 WPEQIAGFEH-MYDGNDDYPYYLLN---------RTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVD 421 (725) T ss_pred chhhhhHHHH-HHhccCccceeecc---------ccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHH Confidence 8888754211 1110 111110000 0000000000112223333233456777777777777778899988 Q ss_pred hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC---cccCc-----c---------- Q lcl|NC_020883. 413 AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD---SSIRI-----E---------- 474 (589) Q Consensus 413 AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~---~~~~~-----e---------- 474 (589) +.|.. +.+.||+|+..+...........-..+..+.+++.+.++.|-..+. +.+-+ . T Consensus 422 ~lG~~-----~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~ 496 (725) T protein:vir:92 422 AEAVN-----GGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEV 496 (725) T ss_pred HhccC-----chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEecccc Confidence 88862 2346888876555444433344444555566666666555543221 00000 0 Q ss_pred ----------------cceeeeCCcCC-CCCCHHHHHHHHHHHhccchh--hHHHHHHHhCCCCCHHHHHHHHHHHHhhc Q lcl|NC_020883. 475 ----------------EPNIETQDMIL-KPRAELVAENMAAYAASKQGQ--SLETTVRRMNPDASEDWIQEEIARIEEEQ 535 (589) Q Consensus 475 ----------------~p~I~f~D~lP-vde~El~~A~t~~~l~~a~~~--S~etaVr~Lhpdw~dE~v~eEv~RI~~E~ 535 (589) +-+|...++.. ....+...+.+++++.+.... .....+...-+..+-+.+++.++||+... T Consensus 497 ~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~ 576 (725) T protein:vir:92 497 VDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQL 576 (725) T ss_pred ccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhh Confidence 00111111100 011222223333443322111 11122222222233345566667776433 Q ss_pred cccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhh--cccccCC Q lcl|NC_020883. 536 AGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIE--KEGEPIA 589 (589) Q Consensus 536 a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~--~~~~~~~ 589 (589) ..... .++ .+ +++.+...+..+.+ +...-++ T Consensus 577 ~~~~~----------------~~~----~~---~e~~q~~~~~qqa~~~q~~~e~~ 609 (725) T protein:vir:92 577 IQMGV----------------KKP----ET---PEEQQWLVEAQQAKQGQQDPAMV 609 (725) T ss_pred chhcc----------------CCc----cc---hhhhHHHHHHHHHHHhhhHHHHH Confidence 22100 000 00 01111111111111 1000011 No 109 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=95.58 E-value=0.0018 Score=35.62 Aligned_cols=505 Identities=12% Similarity=0.009 Sum_probs=195.6 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeec-----------CcceEEE Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARET-----------QTPYVIF 69 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~-----------~~~y~~~ 69 (589) |- ++.-|-.-.-+.||+.-++. ..+...++++=. ++ .+.+|+||.... ..|-+++ T Consensus 1 ma--------~~~~~~~~~~~~r~~~~~~~-~~~~r~~~~~d~---~f--~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~ 66 (708) T protein:vir:17 1 MA--------ETLEKKHERIMLRFDRAYSP-QQEVREKCIEAT---RF--ARVPGGQWEGATAAGTKLDEQFEKYPKFEI 66 (708) T ss_pred Cc--------hhHHHHHHHHHHHHHHHHhh-hHHHHHHHHHHH---Hh--hccCCCCCCHHHHHHHHhhhhhcCCCceEE Confidence 32 22222222234556655442 222222222211 11 234688887421 1378999 Q ss_pred EcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccc Q lcl|NC_020883. 70 NLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRH 149 (589) Q Consensus 70 n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~ 149 (589) |+.+.+++. . +|.-..+-+ +-. +.|.+.. +..+.+ +.=..++..+...|+..... T Consensus 67 N~i~~~i~~--v-----~g~e~~nr~------------d~~-v~p~~~~--~d~~~A---e~l~~l~~~~~~~~~~~~~~ 121 (708) T protein:vir:17 67 NKVATELNR--I-----IAEYRNNRI------------TVK-FRPGDRE--ASEELA---NKLNGLFRADYEETDGGEAC 121 (708) T ss_pred cchHHHHHH--H-----HhhHhhCCc------------ceE-EecCCCc--chHHHH---HHHHHHHHHHHHhcCchhHH Confidence 999988776 3 343333222 111 2222110 011222 22244888888899988888 Q ss_pred hhhHHHHHHcCceeEEEEEe----------cCceeEEEe--cCceecccc-----cCcceeEEEeecCCCccc----e-- Q lcl|NC_020883. 150 WSNIVQHQVDGGIVAAPVID----------ELGPRIVFK--ARDVYFPHD-----DEKGADLAYYIDHGQYGQ----F-- 206 (589) Q Consensus 150 ~~~l~~~~v~Gg~~~~~~~~----------~~~~~i~f~--~~d~~~P~~-----d~~~~div~~~e~~~~~~----~-- 206 (589) -+++.++++.|=-+++++-+ ..++.|.-+ ++..+|.-- |..-|-+++..++--.+. | T Consensus 122 s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~ 201 (708) T protein:vir:17 122 DNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGK 201 (708) T ss_pred hHHHHHhhhcccceeeeeecccccCCCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCcc Confidence 88888877777555555321 223333322 333343311 112233333222111000 0 Q ss_pred -----E------EEEEeeeccccceeehhhhccccccchhheeec--ccccccccccccccchh---hhh---------- Q lcl|NC_020883. 207 -----L------HIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK--KGELVTNVEGAEDLEGE---ELI---------- 260 (589) Q Consensus 207 -----l------~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~--~gd~~~~~~e~~d~e~e---~~i---------- 260 (589) . +-...+-..+.+++. ..|..-...-......+ .|+.+. +.++-... ... T Consensus 202 ~a~~~~~~~~~~~~~~~~~~~d~vrv~-e~~~r~~~~~~~~~~~~~~~g~~~~---~~~~~~~~~~~~~~~~g~~~~~~r 277 (708) T protein:vir:17 202 KPPASLDVTSMTSWEYDWFDADVIYIA-KYYEVRKESVDVISYRHPITGEIAT---YDSDQVEDIEDELAIAGFQEVARR 277 (708) T ss_pred ccchhhhhhhhccccccccCCCeEEEE-EEEEEeeeeeEEEEEecCccCceee---eCccchhhHHHHHHhcccccceee Confidence 0 000000011111111 01100000000000000 121111 11110000 000 Q ss_pred -hccc------CCcc-ccccccccCCCCcceEEEecCC---CCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCC Q lcl|NC_020883. 261 -REVL------NIPD-DRPLENFYPGRNRPFISYWANN---ETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKP 329 (589) Q Consensus 261 -~~~i------~ip~-~~e~~~i~TGv~~plvvyvPN~---~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkp 329 (589) ..+. -.++ +.+...-.+|...|+|.||... .....|||. +.++.+.++.+|.++|.....+-+.++. T Consensus 278 ~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~--vr~~kd~Q~~~N~~~S~~~~~~a~~~~~ 355 (708) T protein:vir:17 278 SVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGH--IAKAMDPQRLYNLQVSMLADTAAQDPGQ 355 (708) T ss_pred eeeEEEEEEEeecccccccCCCCCCCCccceEEEecccccccCCCcccch--hhhchhHHHHHHHHHHHHHHHHHhcCCc Confidence 0000 0011 1122233455566777666432 223334543 5678999999999999888887777888 Q ss_pred cEEechhhhhccccccccc---cccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 330 RISITKEMMDTLLNIAYER---DGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIE 406 (589) Q Consensus 330 RI~VP~~~L~t~~g~~~d~---dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~ 406 (589) ..+++.++++.... .|+. +...+...+ +.......+. .++..+..++.---..++++.+......|=.+ T Consensus 356 ~~i~~~~a~~g~~~-~~~~~~~~~~~~~~~~----~~~~~~g~v~---~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~ 427 (708) T protein:vir:17 356 IPIVGMEQIRGLEK-HWEARNKKRPAFLPLR----EVRDKYGNII---AGATPAGYTQPAVMNQALAALLQQTSADIQEV 427 (708) T ss_pred ceeechhhhhhhHH-hhhhcccchhhhhhhh----ccCCcccccc---cccCCcccCCCccccHHHHHHHHHHHHHHHHh Confidence 88899888754311 0111 111111110 0111111111 11122233333333467888888888888788 Q ss_pred hcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC---cccCc---------- Q lcl|NC_020883. 407 TQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD---SSIRI---------- 473 (589) Q Consensus 407 a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~---~~~~~---------- 473 (589) ++.+..+.|.. + .+||+|+..+-.+-.......=..+..+.+++.+.++.|-.... +.+-+ T Consensus 428 tGi~d~~~G~~----s--n~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v 501 (708) T protein:vir:17 428 TGGSQAMQQMP----S--NIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIA 501 (708) T ss_pred cCCChHHccCc----c--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccee Confidence 89888888741 1 23788776544443333333334444555555555544443211 00000 Q ss_pred ----------------------ccceeeeCCcCCC-CCCHHHHHHHHHHHhccchhhHHH------HHHHhCCCCCHHHH Q lcl|NC_020883. 474 ----------------------EEPNIETQDMILK-PRAELVAENMAAYAASKQGQSLET------TVRRMNPDASEDWI 524 (589) Q Consensus 474 ----------------------e~p~I~f~D~lPv-de~El~~A~t~~~l~~a~~~S~et------aVr~Lhpdw~dE~v 524 (589) .+-+|...++... ...+...+.+++++......-..+ .+..+ |+. .. T Consensus 502 ~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~--D~p--~~ 577 (708) T protein:vir:17 502 VLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNI--DGE--GL 577 (708) T ss_pred eecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhc--CCC--Ch Confidence 0012222222111 112222333444443322211111 12211 122 22 Q ss_pred HHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 525 QEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 525 ~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) ++-++||+.-.... +... + +..+ +...+......+.++..-... T Consensus 578 ~ei~e~ir~~~~~~-----~~~~-----------~---~~~e--~~q~~~q~qq~~q~q~~~~~~ 621 (708) T protein:vir:17 578 DDFKEYNRNQLLIS-----GIAK-----------P---RNEK--EQQIVQQAQMAAQSQPNPEMV 621 (708) T ss_pred HHHHHHHHHHhhcc-----cccc-----------C---cchh--hHHHHHHHHHHHHHHHHHHHH Confidence 23344554333211 1000 0 0000 000000000000000000000 No 110 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=95.44 E-value=0.0021 Score=35.31 Aligned_cols=438 Identities=11% Similarity=0.007 Sum_probs=151.1 Q ss_pred Cccc---eeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhc-ccccee--------ccCccee--eecCcce Q lcl|NC_020883. 1 MIDW---TVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEG-DAVGRF--------LDSSQTA--RETQTPY 66 (589) Q Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~-~~~~~~--------~~~~~~~--~~~~~~y 66 (589) |+.. .-||+.=- +...|-.|..+..-.+ .-.+-.-.+ +..+| .|.-+. .+..+-+ .....-| T Consensus 1 ~~~~~~~~~~~~~m~-V~~~hp~y~a~~~~W~-~~~d~g~~~--~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y 76 (488) T protein:vir:96 1 MLKCLYIKHRGFFML-TPIYHPDYLVNAPQWL-RNLDCVMDN--IKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDW 76 (488) T ss_pred CceeEEEeecceeec-ccccCHHHHHHhhhhh-HhhhhhhHH--HHHhhhhcCCCCCCccccccCcchhhhhhccchhhh Confidence 2111 11333221 3444554444331111 000000001 11111 122111 1111111 0111111 Q ss_pred EEEEcch-hhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccc Q lcl|NC_020883. 67 VIFNLPK-VIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKL 145 (589) Q Consensus 67 ~~~n~~~-~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~ 145 (589) -.-.+.+ .+.-.++.-++...|.+-..-| .....++..+ ..-.+=+|++.--+++++++ T Consensus 77 ~~~~~~rA~~~n~~~~tl~~l~G~vfrk~p------~~~~~~~~~l-----------~~l~~d~D~~G~~L~~f~~~--- 136 (488) T protein:vir:96 77 EDLTWRLANYVNIVNPTMNAITGAVMRREP------EFDTMDNPVL-----------IGLRDNIDGKGNGIDQECKQ--- 136 (488) T ss_pred HhhhhhccccCchhHHHHHHhcchhhccCc------eeccCCcHHH-----------HHHHhccCCCCCCHHHHHHH--- Confidence 1111100 1112233445555565554222 1111111111 01112223333334444433 Q ss_pred cccchhhHHHHHHcCceeEEEEEecC------------ceeEEEecCceecc----cccCc--ceeEEEeec----CCC- Q lcl|NC_020883. 146 ERRHWSNIVQHQVDGGIVAAPVIDEL------------GPRIVFKARDVYFP----HDDEK--GADLAYYID----HGQ- 202 (589) Q Consensus 146 ~~~~~~~l~~~~v~Gg~~~~~~~~~~------------~~~i~f~~~d~~~P----~~d~~--~~div~~~e----~~~- 202 (589) .+..++..|++-.-|=.... -|++.++.|.+.+= +.+|+ -+-+++.+. ++. T Consensus 137 ------~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~ 210 (488) T protein:vir:96 137 ------ALNALQWGSRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGT 210 (488) T ss_pred ------HHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCC Confidence 33344566665443322211 15566665555443 12232 223444321 111 Q ss_pred -ccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCc Q lcl|NC_020883. 203 -YGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNR 281 (589) Q Consensus 203 -~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~ 281 (589) ..+-.+.++.++ +..+.+......+ +.. +. +|.. .-.+.+.. T Consensus 211 ~~~~~~~~~~~l~---------------~g~~~v~~~~~~~-~~~--e~---------------~~~~----~g~~~l~~ 253 (488) T protein:vir:96 211 YVSKQRLINHRLV---------------DGLCEFQEVTDDE-YSD--EW---------------TPVL----INSKQSDT 253 (488) T ss_pred cccceEEEEEEEE---------------CcEEEEEEEecCC-ccc--ce---------------Eeec----CCCcccCe Confidence 011111111111 1111111111111 110 00 1111 11234555 Q ss_pred ceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccc Q lcl|NC_020883. 282 PFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTP 361 (589) Q Consensus 282 plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~ 361 (589) ..|+++-.... ....|.+-+-+|..+--+.=..-|....++-..+-|..+.+- .+...+.+.+..+ ++... T Consensus 254 IP~v~~~~~~~-~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~------~~~~~~~~~~~~~--~g~~~ 324 (488) T protein:vir:96 254 IPFFLASSQSN-EWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDM------GDMNKTMASEMNP--LGFTL 324 (488) T ss_pred eEEEEEecCCC-CCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeecc------CCCCccccccccc--ceeee Confidence 55555532211 111244444443333111101111112222122444443321 1111111111111 00000 Q ss_pred ccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHH Q lcl|NC_020883. 362 RIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILK 441 (589) Q Consensus 362 ~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~K 441 (589) . .. ......+| ...|++..++.. .++.++.|.+++.. ++...+.. ++ +.|+++.+.+.....+- T Consensus 325 ~-~~----~~~~~~~g-~~~~~e~~~~~l-~~~~l~~l~~qm~~---~Ga~l~~~----~~--~~Ta~~~~~~~~~~~S~ 388 (488) T protein:vir:96 325 A-GR----MPYYVKNG-DVKVIQAQFSPE-TENKVEKLFEQAVK---VGASLFTQ----QS--NETATGAAIRSGSSTAS 388 (488) T ss_pred c-cc----ccccccCC-ceeecCCchhHH-HHHHHHHHHHHHHH---HhHhhccC----CC--cchHHHHHHHHHHhhHH Confidence 0 00 00011122 355666666544 36678888888743 33333331 11 23444444333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHhhcCcccCccccee----eeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh- Q lcl|NC_020883. 442 SRRLQKEYIDFLKELYESCL-WLLNDQDSSIRIEEPNI----ETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM- 515 (589) Q Consensus 442 v~~~R~~~~~aLk~li~~~l-~L~~~~~~~~~~e~p~I----~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L- 515 (589) ...+-..+.++++.++..+. |+-...+ ......+.| +|... +.+..+ ...+. .+..+|.+|.+|..+.| T Consensus 389 L~~~a~~le~al~~~l~~~A~w~g~~~~-~~~~~~~~~~in~dF~~~-~ld~~~--~~al~-~~~~~G~Is~~t~~~~L~ 463 (488) T protein:vir:96 389 MATLGNNVEDTVRNMLRFIMRYFEGTNL-YVNPDELVFKLNRDYFDV-EVNPQM--LQVAY-AAMMEGNLPQVSWFELLK 463 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCC-CcCccceEEEeccCCCCc-cCCHHH--HHHHH-HHHhcCCCCHHHHHHHHH Confidence 44455556688877775443 4432211 112223333 33221 122222 22333 34467889999887766 Q ss_pred -----CCCCCHHHHHHHHHHHHhhcccc Q lcl|NC_020883. 516 -----NPDASEDWIQEEIARIEEEQAGS 538 (589) Q Consensus 516 -----hpdw~dE~v~eEv~RI~~E~a~~ 538 (589) .||++.|+ |.+||+++.-.. T Consensus 464 ~~gvl~~d~~~e~---~~~~ie~~g~~~ 488 (488) T protein:vir:96 464 RARVVRGDMSKEE---FDEHIAELGFGM 488 (488) T ss_pred hCCcCCccCCHHH---HHHHHhhcCCCC Confidence 58888874 667776543322 No 111 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=94.94 E-value=0.0031 Score=34.32 Aligned_cols=438 Identities=12% Similarity=0.049 Sum_probs=147.1 Q ss_pred Cccceeccch-hHHHHhhcch-----h-hhh--hhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEc Q lcl|NC_020883. 1 MIDWTVRGWT-DKTTKNVHGD-----Y-ERY--RQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNL 71 (589) Q Consensus 1 ~~~~~~~~~~-~~~~~~~~~~-----~-~~~--r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~ 71 (589) ++- .+.|+. |..-..+-+. . .++ +-++.-=-.-.|++-+ |+ ..|--==+ T Consensus 91 ~~~-~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyq-l~--------------------alY~~~~l 148 (862) T protein:vir:99 91 AIK-AITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQ-AC--------------------ALIAQHWL 148 (862) T ss_pred hhh-hhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHH-HH--------------------HHHHhCch Confidence 110 011111 1000000000 0 000 1111000000122210 00 01111114 Q ss_pred chhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchh Q lcl|NC_020883. 72 PKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWS 151 (589) Q Consensus 72 ~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~ 151 (589) .+-+|++||+-+-|---.|++.-- +.+..+.+.+.++...+..++ +. T Consensus 149 arkiVd~pAeDatR~g~~I~~~~d-----------------------------~~e~~~e~~~~ie~~~~rL~v----~~ 195 (862) T protein:vir:99 149 VDKACSLAGEDAIRNGWHLKSLGE-----------------------------GEEIDEESLEKFKAIDVEFKV----KE 195 (862) T ss_pred hhhhhhhhhHHHhhCCceEeecCc-----------------------------ccccCHHHHHHHHHHHHHhhH----HH Confidence 456788888877555555554100 000011112234444444433 34 Q ss_pred hHHHHHHcCceeE----EEEEecCceeEEEecCceecccccCcceeEEEeecCCCcc--ceEEEEEeeeccccceeehhh Q lcl|NC_020883. 152 NIVQHQVDGGIVA----APVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYG--QFLHIYRERVEKDGLRTTNML 225 (589) Q Consensus 152 ~l~~~~v~Gg~~~----~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~--~~l~~~~~~~~~~~~~~~~~~ 225 (589) .|.++..-.++++ ...++....+ ... -| +.. +....+ ++++++.-.+-. .+.+.+ . T Consensus 196 ~l~eair~~RLyGga~ililv~~~D~~-~Ls-----qP----Ln~------e~I~kG~lkgl~vlDp~w~~-p~~v~~-~ 257 (862) T protein:vir:99 196 NLIEFNRFKNVFGIRVAIFVVDSEDPD-YYE-----KP----FNP------DGITPGSYRGISQIDPYWMM-PMLTAE-S 257 (862) T ss_pred HHHHHHHhcccccceEEEEEecCcCch-hhh-----cC----cCc------ccccccceeEEEEechhhhc-cccccc-c Confidence 4444444333332 2222211110 000 11 000 001111 233333221111 111000 0 Q ss_pred hccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCC-CCCCCcccCcchhh Q lcl|NC_020883. 226 YPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANN-ETFMNPYGISALDN 304 (589) Q Consensus 226 y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~-~~~~~~lG~SD~~~ 304 (589) +. +. ....+..... +. +.+.++ |..--.++.|.+ +|+. +...++||+|.++. T Consensus 258 ~~--Dp---------~sp~yGkP~~---y~---I~g~~I----H~SRliif~g~~------vpd~lk~ay~f~G~SvLe~ 310 (862) T protein:vir:99 258 TA--DP---------SSQFFYEPEF---WI---ISGQKY----HRSHLIIARGPQ------PADILKPTYIFGGIPLVQR 310 (862) T ss_pred cc--cc---------cccccCCcee---ee---ecCeee----ccceeEEecCCC------chhhhhccCCccCccHHHH Confidence 00 00 0000000000 00 000000 111011111211 1221 23455789999998 Q ss_pred hhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceee Q lcl|NC_020883. 305 LESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQ 384 (589) Q Consensus 305 ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq 384 (589) +.+.+...+.+......++-+..-..+.+ ..+..+.+ .+ .....+...-..-+..-+.++..+ -+|-+ T Consensus 311 iyd~L~~~d~t~~saa~Ll~ka~l~v~kt--d~l~~l~~----ed-~l~~r~~~~~~~rdN~Gi~liD~e-----Ee~e~ 378 (862) T protein:vir:99 311 IYERVYAAERTANEAPLLAMNKRTTAIHT--DTAKAIAN----ED-KFIQRLMFWVRYRDNHAVKVLGTD-----ETMEQ 378 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeec--hhHhhhcc----HH-HHHHHHHHHHhccCcceeEEecCC-----CceeE Confidence 88888888877665666654433333322 12211110 00 000000000000010001122111 12445 Q ss_pred ecccHHHHHHHHHHHHHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_020883. 385 IDISKIGDMDHVKNLIKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ-KEYIDFLKELYESCLW 462 (589) Q Consensus 385 ~Dirveeh~~~ie~L~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R-~~~~~aLk~li~~~l~ 462 (589) .++.+..-...++....+|-+.+++|.. .||.- . .+.-+||....+.+..++ +..| ..+...|.+++.++++ T Consensus 379 ls~slSGL~dll~~~~q~IAaas~IP~tiLfGqs-p--aGlnATGE~D~~nYyD~I---~s~QE~~L~P~LerL~~li~~ 452 (862) T protein:vir:99 379 FDTSLADFDAVIMGQYQLVASIAKTPATKLLGTA-P--KGFNSTGEFETISYHEEL---ESIQEHVYMPFLQRHYLISRL 452 (862) T ss_pred EecccCChHHHHHHHHHHHHhhhCCCceeecccC-c--ccccCchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 5566666677888888888888888877 56641 0 111124444444444443 3333 2355666666544322 Q ss_pred HHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHH-------HHhccchhhHHHHHHHh-------CCCCCHHHHHHHH Q lcl|NC_020883. 463 LLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAA-------YAASKQGQSLETTVRRM-------NPDASEDWIQEEI 528 (589) Q Consensus 463 L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~-------~l~~a~~~S~etaVr~L-------hpdw~dE~v~eEv 528 (589) . . + .+.+..|.|++....+++|+ |++.. ++.+++++|.+.+..+| .+..+++.+++ T Consensus 453 -~-l-g---~~~d~~ieFnpL~~~sekEk--AEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~-- 522 (862) T protein:vir:99 453 -S-L-G---IQHEIDVVMEPVASMTAQQQ--ADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEE-- 522 (862) T ss_pred -h-c-C---CCCcceEEeCCCCCCCHHHH--HHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccc-- Confidence 2 1 1 12356799999877665555 55543 44444444444333332 11122221110 Q ss_pred HHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhh--------hhcccccCC Q lcl|NC_020883. 529 ARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEE--------IEKEGEPIA 589 (589) Q Consensus 529 ~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~--------~~~~~~~~~ 589 (589) +|.... + .+.. +...|+.+..+++++.+. ..++.+|.+ T Consensus 523 ----------d~~~~~-e--~~~~----------~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~ 568 (862) T protein:vir:99 523 ----------TPGASP-E--NLAA----------YQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMV 568 (862) T ss_pred ----------cCCCCc-c--cccc----------cccCCcccccccccccccccCCccccCCccccccc Confidence 000000 0 0000 001111111111111110 011111111 No 112 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=94.73 E-value=0.0036 Score=33.95 Aligned_cols=493 Identities=12% Similarity=0.079 Sum_probs=192.9 Q ss_pred ccchhHHHHhhc------------chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCccee-------eecCcceE Q lcl|NC_020883. 7 RGWTDKTTKNVH------------GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTA-------RETQTPYV 67 (589) Q Consensus 7 ~~~~~~~~~~~~------------~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~-------~~~~~~y~ 67 (589) ---|..--+..| ..+.+|+.=-++. .+ + |+. -.++-. +.+|+||. +.-..|-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q-~~-~-r~~-a~~d~~----fy~G~QW~~~~~~~l~~~g~p~~ 72 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADINYEIEDQ-PA-W-RAV-ADKEMD----YADGNQLDTELLRRQQALGIPPA 72 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHHHHHhcc-HH-H-HHH-HHHHHH----hhcCCCCCHHHHHHHHhcCCCcE Confidence 000000000000 0011111100110 00 1 100 001111 22345553 23466789 Q ss_pred EEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccc Q lcl|NC_020883. 68 IFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER 147 (589) Q Consensus 68 ~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~ 147 (589) ++|+.+.+++. . +|....+-+ +.-+.|.++ .+..+ +.+.=..++..+...|++.. T Consensus 73 ~~N~i~~~v~~--v-----~g~~~~nr~-------------d~~v~Pr~~--~~d~~---~Ae~l~~~~~~~~~~~~~~~ 127 (772) T protein:vir:10 73 VEDLIGPALLS--L-----QGYEAVTRT-------------DWRVTPNGD--VGGQE---VADALNYRLNTAERQSGADR 127 (772) T ss_pred EEcchHHHHHH--H-----HHHHHhcCc-------------ceEEecCCC--chHHH---HHHHHHHHHHHHHHhcChHH Confidence 99999998876 3 333333222 111223211 11122 22223448888999999998 Q ss_pred cchhhHHHHHHcCceeEEEEEec----CceeEEEecCceeccc----ccCcceeEEEeecCCCccc-------------- Q lcl|NC_020883. 148 RHWSNIVQHQVDGGIVAAPVIDE----LGPRIVFKARDVYFPH----DDEKGADLAYYIDHGQYGQ-------------- 205 (589) Q Consensus 148 ~~~~~l~~~~v~Gg~~~~~~~~~----~~~~i~f~~~d~~~P~----~d~~~~div~~~e~~~~~~-------------- 205 (589) ..-+.+.+++..|=-+..++++. ..|+|..+++.-+|.- .|..-|.++++..+--.+. T Consensus 128 ~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~ 207 (772) T protein:vir:10 128 ACSEAFRPQIACGIGWVEVSRESDPFKFPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGM 207 (772) T ss_pred HHHHHHHHhhhcCceeEEeccccCCCCCCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHh Confidence 88888877777765566667662 2377888887777762 1222333333221110000 Q ss_pred -------------------------eEEEEEe---------eec--cccceeehhhhccccccchhheeecccccccccc Q lcl|NC_020883. 206 -------------------------FLHIYRE---------RVE--KDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVE 249 (589) Q Consensus 206 -------------------------~l~~~~~---------~~~--~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~ 249 (589) |...+.. +.. .+.+++...-||.-........ -.|..+.... T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~--~~g~~~~~~~ 285 (772) T protein:vir:10 208 VGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKS--PDGRVVEYDP 285 (772) T ss_pred hhhhcccccCcccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeecc--CCCceEeeCc Confidence 0000000 000 0001111001111000000000 0011111000 Q ss_pred cccccchh---------------------hhhhcccCCccccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhH Q lcl|NC_020883. 250 GAEDLEGE---------------------ELIREVLNIPDDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLES 307 (589) Q Consensus 250 e~~d~e~e---------------------~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~ 307 (589) .+.+.. .+++.. -+.+...-+++-..|+|.+|..... ...|+| -+.++.+ T Consensus 286 --~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~----~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G--~vr~~kd 357 (772) T protein:vir:10 286 --NNLAHNIALASGRISPKKVTVSRVRRSYWLGPH----CLHDGPTPYTHRHFPYVPFFGFREDATGIPYG--YVRGMKY 357 (772) T ss_pred --ccHHHHHHHhhcccchheeeeeEEEEEEEecce----eeccCCCCCCCCccceEEEeeeEeccCCcccc--hhhhhhh Confidence 000000 011111 0122223345556788877766433 334564 5678899 Q ss_pred HHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeecc Q lcl|NC_020883. 308 KQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDI 387 (589) Q Consensus 308 l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Di 387 (589) .++.+|.+.|+...++ +..++....+++. +.+.++... ..++.....+.-....+.|..+++.+.-- T Consensus 358 ~Qr~~N~~~S~~~~~l---~~~~~~~~~gav~-------~~d~~~~e~---~arp~~vi~~~~~~~~~~~~~~~~~~~~~ 424 (772) T protein:vir:10 358 AQDSLNSGVSKLRWGM---SVARVERTKGAVA-------MTDAQFRRQ---IARPDADIVLDENHMAKPGARFDVKRDYT 424 (772) T ss_pred HHHHHHHHHHHHHHHH---hcccccccCCCcc-------chhHHHHHh---ccCCCCeEEeCCccccCCCCCccccCCcc Confidence 9999999999888876 3334544443332 212111000 00010000011011112344455444433 Q ss_pred cHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020883. 388 SKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQ 467 (589) Q Consensus 388 rveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~ 467 (589) --.++++.+......|-.+++.+..++|.. +.+.||+|+..+-..........=..+..+++++.+.++.|-..+ T Consensus 425 ~~~~~~~llq~~~~~i~~vsGv~~~~lG~~-----~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~ 499 (772) T protein:vir:10 425 LTDQHFQMLQDNRATIERVSNITAGFQGRK-----GTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVED 499 (772) T ss_pred ccHHHHHHHHHHHHHHHHHhCCCHHHcCCC-----cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 456788888888888888899999998852 224588887644433332233333444456666666655555432 Q ss_pred Cc---ccCc--------c--------------------------cceeeeCCcCCC--CCCHHHHHHHHHHHhccchhhH Q lcl|NC_020883. 468 DS---SIRI--------E--------------------------EPNIETQDMILK--PRAELVAENMAAYAASKQGQSL 508 (589) Q Consensus 468 ~~---~~~~--------e--------------------------~p~I~f~D~lPv--de~El~~A~t~~~l~~a~~~S~ 508 (589) .. .+.+ . +-+|...++ |- ...+...+.+++++......-. T Consensus 500 y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~-p~~~t~r~~~~~~m~ql~~~~~P~~~ 578 (772) T protein:vir:10 500 IGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDV-PSTNSYRGQQLNAMSEAVKSMPPQYQ 578 (772) T ss_pred cCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEeecc-ccchHHHHHHHHHHHHHHhccChhHH Confidence 10 0000 0 001111111 11 1123333445555432111111 Q ss_pred HHHHHHh--CCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhc--- Q lcl|NC_020883. 509 ETTVRRM--NPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEK--- 583 (589) Q Consensus 509 etaVr~L--hpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~--- 583 (589) ...+-.+ .-||. ..++-++||++-+.+.+|.... ..+....-.+.++ T Consensus 579 ~~~~~~~le~~D~p--~~~ei~~~ir~~~~~~~peq~~--------------------------~~~~q~~qq~~~~~~~ 630 (772) T protein:vir:10 579 AAVLPFLVSLMDVP--FKRDVVEAIRAVDQQQTPEQIQ--------------------------QQIDQAVQDALAKAGN 630 (772) T ss_pred HHHHHHHHhhcCCC--ChHHHHHHHHHHhccCChHHHH--------------------------HHHHHHHHHHHHHHHH Confidence 1111111 12343 2234555565433221111100 0000000000000 Q ss_pred c----------------cccCC Q lcl|NC_020883. 584 E----------------GEPIA 589 (589) Q Consensus 584 ~----------------~~~~~ 589 (589) + ...+. T Consensus 631 el~~~q~~a~~~~~~A~a~~~~ 652 (772) T protein:vir:10 631 DIKLRELEIKERKADSEISGLN 652 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHH Confidence 0 00000 No 113 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=94.61 E-value=0.004 Score=33.76 Aligned_cols=454 Identities=10% Similarity=-0.010 Sum_probs=181.6 Q ss_pred Cccceeccchh-HHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccc Q lcl|NC_020883. 1 MIDWTVRGWTD-KTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~p 79 (589) |-.|....++. ..+ ..+ ...|=-||+.|.+.-+++... ++.- T Consensus 35 ~~~w~p~~~s~~~~~-------~~~-------~~~lr~RaRdl~rNn~~a~~a-----------------------v~~~ 77 (533) T protein:vir:34 35 LRSWNPPSESVDAAL-------LPN-------FTRGNARADDLVRNNGYAANA-----------------------IQLH 77 (533) T ss_pred ccccccCCCCHHHHH-------HHH-------HHHHHHHHHHHHhcChHHHHH-----------------------HHHH Confidence 33343322221 111 111 112234555555522222111 1211 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccc------hhhH Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRH------WSNI 153 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~------~~~l 153 (589) ...++|+ |=....-|+ ....+-++ .+ -..+++.+-++=....+.....|.+..++ .-.+ T Consensus 78 ~~nvVG~-Gi~~~~~p~----~~~lg~~~-----~~-----~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~ 142 (533) T protein:vir:34 78 QDHIVGS-FFRLSHRPS----WRYLGIGE-----EE-----ARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGV 142 (533) T ss_pred HHHhhCC-Cceeeeccc----hhhcCCCh-----hH-----HHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHH Confidence 1222222 211111110 00010001 00 01233333233222222222223333332 3345 Q ss_pred HHHHHcCceeEEEEEecC-----ceeEEEecCceec-ccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhc Q lcl|NC_020883. 154 VQHQVDGGIVAAPVIDEL-----GPRIVFKARDVYF-PHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYP 227 (589) Q Consensus 154 ~~~~v~Gg~~~~~~~~~~-----~~~i~f~~~d~~~-P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~ 227 (589) -..+++|=|.++..+... ..+|+..++|..= |.+..-+-.|..=.|..+ .|=.+.|.+|+ T Consensus 143 r~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~--------------~Gr~~aY~i~~ 208 (533) T protein:vir:34 143 AMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQIND--------------SGAALGYYVSE 208 (533) T ss_pred HHHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECC--------------CCCeEEEEEee Confidence 566889999999887733 3577888776532 111111112222112111 22222222321 Q ss_pred cccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhH Q lcl|NC_020883. 228 VVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLES 307 (589) Q Consensus 228 ~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~ 307 (589) . -..|+... . -+-......+++.-|+|+-+........|+|+|.-+.. T Consensus 209 ~----------~~~~~~~~---~-------------------~~~~~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~ 256 (533) T protein:vir:34 209 D----------GYPGWMPQ---K-------------------WTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVME 256 (533) T ss_pred c----------CCCCcccc---c-------------------cceeeeeeccChhHeeeeccccCCCcccCCchHHHHHH Confidence 0 01111100 0 00001123456777999988888888899999998888 Q ss_pred HHHHHHHHHh-HHHHHHHHhCC-CcEEechh-----hhhccccccccccccccccccccc------cccccccccccccc Q lcl|NC_020883. 308 KQDEINWTIT-RSAVIYEQNGK-PRISITKE-----MMDTLLNIAYERDGHSAKEASMMT------PRIDHRDMEITTFD 374 (589) Q Consensus 308 l~DeLd~t~S-~~srildk~gk-pRI~VP~~-----~L~t~~g~~~d~dge~~~~~~~~~------~~~d~~dlev~~~d 374 (589) .+..|++-.. -+.+. +.+. =..+|-.. .++...+..-+...+......... .++...+..+ ..- T Consensus 257 ~l~~l~~y~dael~~a--~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L 333 (533) T protein:vir:34 257 QMKMLDTLQNTQLQSA--IVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKV-PHL 333 (533) T ss_pred HHHHHHHHHHHHHHHH--HHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCcee-eec Confidence 8888886421 11111 1111 11111100 000000000000000000000000 0000001111 111 Q ss_pred cccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHH- Q lcl|NC_020883. 375 ENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFL- 453 (589) Q Consensus 375 e~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aL- 453 (589) ..|..+.+++..--...+..++..+++.|-+..++|+..+.-.. +++ +=+|.|..++...+.++.+|..+...+ T Consensus 334 ~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~-s~~----nYSS~R~~~~e~~r~~~~~q~~~~~~~~ 408 (533) T protein:vir:34 334 MPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNY-AQM----SYSTARASANESWAYFMGRRKFVASRQA 408 (533) T ss_pred CCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhc-ccc----cHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33444555565555566777888899999988899999886321 121 455677777777777777776666544 Q ss_pred HHHHHHHHHHHhhc-CcccCc---------c----cceeeeCCcCC--CCCCHHHHHHHHHHHhccchhhHHHHHHHhCC Q lcl|NC_020883. 454 KELYESCLWLLNDQ-DSSIRI---------E----EPNIETQDMIL--KPRAELVAENMAAYAASKQGQSLETTVRRMNP 517 (589) Q Consensus 454 k~li~~~l~L~~~~-~~~~~~---------e----~p~I~f~D~lP--vde~El~~A~t~~~l~~a~~~S~etaVr~Lhp 517 (589) +.+.+. ||+... .+.+.+ . -..+.|-..-. +| ++..++.......+|++|++..++...- T Consensus 409 ~pi~~~--wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iD--P~Ke~~a~~~~i~~G~~s~~~~~a~~G~ 484 (533) T protein:vir:34 409 SQMFLC--WLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAID--GLKEVQEAVMLIEAGLSTYEKECAKRGD 484 (533) T ss_pred HHHHHH--HHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccC--hHHHHHHHHHHHHcCCCCHHHHHHHcCC Confidence 222222 222211 111110 0 01345532111 34 4344555556679999999999999865 Q ss_pred CCCHHHHHHHHHHHHhhcccccc--ccccccccccccccCcccCCCCCCCCCCCCCCCCcc Q lcl|NC_020883. 518 DASEDWIQEEIARIEEEQAGSDT--SSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAE 576 (589) Q Consensus 518 dw~dE~v~eEv~RI~~E~a~~~p--~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~ 576 (589) ||+ ++.+|+++ |...... +++... +..+ ...+.+.++.++.+...+. T Consensus 485 D~~--ev~~q~a~---e~~~~~~~gl~~~~~-~~~~------~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 485 DYQ--EIFAQQVR---ETMERRAAGLKPPAW-AAAA------FESGLRQSTEEEKSDSRAA 533 (533) T ss_pred CHH--HHHHHHHH---HHHHHHhcCCCCCCC-CCcC------ccCCCCCCCCCCcccCCCC Confidence 554 33343332 2222111 111111 0000 0111111111111111111 No 114 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=93.86 E-value=0.0061 Score=32.71 Aligned_cols=199 Identities=11% Similarity=0.062 Sum_probs=82.2 Q ss_pred cccCcchhhhhHH-HHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 296 PYGISALDNLESK-QDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFD 374 (589) Q Consensus 296 ~lG~SD~~~ie~l-~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~d 374 (589) .|..+++.++..- ..++..++....+ .+. + .. + +.+... T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~-----~~~-~------~~---~------------------------~~ld~~- 40 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDN-----NSG-V------GQ---A------------------------IGIDAD- 40 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHH-----hhh-h------hh---h------------------------heeecC- Confidence 2232332211100 0112222111110 000 0 00 0 001100 Q ss_pred cccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCccc-chhHHHHHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_020883. 375 ENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGA-SGAQSGVAKFYDLLTTILKSRRLQKEYIDFL 453 (589) Q Consensus 375 e~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~-~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aL 453 (589) .-+|-|.++.+.+-...+.....+|-+.++.|. ....|.+. +-..||-...+....++.-.+. ..+.++| T Consensus 41 ----~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~---t~LfG~sp~Glnatge~d~~nyyd~i~~~Qe--~~l~p~l 111 (201) T protein:vir:10 41 ----SEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHE---IILKGKNVGGVSASQNTALETFYGYVDRKRK--AELLPLL 111 (201) T ss_pred ----CcceeeeecCcCChHHHHHHHHHHHHhHhcCch---hhhcCCCCccccccchhHHHHHHHHHHHHHH--HHHHHHH Confidence 122445666666666778888888888787774 32222222 2223566555555555322221 2345666 Q ss_pred HHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHh Q lcl|NC_020883. 454 KELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEE 533 (589) Q Consensus 454 k~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~ 533 (589) .+++.+.. .+..-.|.|++....+++|+ |++...... +..+++. ..-.+.+++.+++++.-. T Consensus 112 e~l~~~~~----------~~~~~~~~f~pL~~~s~kek--Aei~~~~a~----a~~~~~~--~g~i~~~e~r~~L~~~~~ 173 (201) T protein:vir:10 112 EFLLPFIV----------TEQEWSVEFNPLSQVSDKDK--SEILEKNVN----SVAALIA--AGIIDADEARDTLRAIST 173 (201) T ss_pred HHHHHhhc----------CCCCceEeeCCCCCCCHHHH--HHHHHHHHH----HHHHHHH--cCCCCHHHHHHHHHhcCC Confidence 66654321 12345789999877665554 777766332 2222332 244677777777755322 Q ss_pred hccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcc Q lcl|NC_020883. 534 EQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAE 576 (589) Q Consensus 534 E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~ 576 (589) ... .+.+ .-+++-++.++.++++.|..+ T Consensus 174 ~~~------~~~~---------~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 174 EVK------IGEG---------SIQTEVVINESEDPLDVSANN 201 (201) T ss_pred cCC------CCCC---------CCCccccccccCCCCCCCCCC Confidence 111 1100 001111111111222222211 No 115 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=93.12 E-value=0.0087 Score=31.88 Aligned_cols=484 Identities=11% Similarity=0.024 Sum_probs=148.2 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCH---HHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFP---RAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAE 77 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~---ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~ 77 (589) |-+|.+-.|--+.+++. ..+|+.+|=+..|++. ++.+....-.-..-...++..+.. ++- | +..-+.+ T Consensus 20 ~~~~~~~~~l~~~~~~~----~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-r~k-i---~~~~~~~ 90 (641) T protein:vir:94 20 LSTDRIGGVVISKWQES----RDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADW-RHR-I---NTGHTFE 90 (641) T ss_pred CCchhHHHHHHHHHHHH----HHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcc-ccc-c---cchhHHH Confidence 66665444433333322 2233333322222211 001100000000001111211110 110 0 1111111 Q ss_pred cchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHH Q lcl|NC_020883. 78 IPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQ 157 (589) Q Consensus 78 ~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~ 157 (589) ..-.|.+ +|..+.+ |+. .-.+-.. .++++ .+.+.+-++.++..+..|+|..-|...+-+++ T Consensus 91 ~~~~l~s-~Lm~~~~--p~~-----~wf~~~p--~~~ed---------~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~ 151 (641) T protein:vir:94 91 VVETLVA-YFKGATF--PSD-----DWFDLKG--MVPEL---------ADAARVVKQLTKTKLEAASIRDIFETYVRNLV 151 (641) T ss_pred HHHHHhh-HHhhhhc--CCC-----ceEEEec--CCCCh---------HHHHHHHHHHHHHHHhhcchHHHHHHHHHHHh Confidence 1111222 2222211 211 0111100 01111 11122223455555555666666666666766 Q ss_pred HcCceeEEEEEec-----------------------------CceeEEEecCceecccccCcceeEEEeecCCC--ccce Q lcl|NC_020883. 158 VDGGIVAAPVIDE-----------------------------LGPRIVFKARDVYFPHDDEKGADLAYYIDHGQ--YGQF 206 (589) Q Consensus 158 v~Gg~~~~~~~~~-----------------------------~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~--~~~~ 206 (589) ..|=.+.+++|+- .++++.-+.+..+|. . .-+. ...| T Consensus 152 ~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~-----------d-ps~~~~~~~f 219 (641) T protein:vir:94 152 LYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWL-----------D-TSGGKNTGTF 219 (641) T ss_pred hcCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheee-----------c-CCCCcccccc Confidence 6776667776651 111222222211121 1 1111 1112 Q ss_pred EEEEEeeeccccceeehhh----hccccc--cchhheeecccc-----------cccccccccccchhhh---------h Q lcl|NC_020883. 207 LHIYRERVEKDGLRTTNML----YPVVKA--KGDVKKEIKKGE-----------LVTNVEGAEDLEGEEL---------I 260 (589) Q Consensus 207 l~~~~~~~~~~~~~~~~~~----y~~~~~--~~~~~~~~~~gd-----------~~~~~~e~~d~e~e~~---------i 260 (589) + +++..+..+ ..++ |...++ .....+..+..+ .+..+++.-|+..+.. . T Consensus 220 ~---~~r~t~~t~--~~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~ 294 (641) T protein:vir:94 220 V---RLRHTREEL--HELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFY 294 (641) T ss_pred e---ehhhhHHHH--HHHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEe Confidence 1 111111100 0000 000000 000000000000 0111111111111110 0 Q ss_pred hcccCCcccccccc-ccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhh Q lcl|NC_020883. 261 REVLNIPDDRPLEN-FYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMD 339 (589) Q Consensus 261 ~~~i~ip~~~e~~~-i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~ 339 (589) ++. | +...+. .+.+ .|+++. +-.+..++.||+|....+.+.+..||...-....-+.++.+|.+.++....- T Consensus 295 g~~--i--l~~~~~~~~d~--~Pf~~~-r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~ 367 (641) T protein:vir:94 295 GKQ--L--IRLSDSKYWCG--SPFVTT-TLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGIL 367 (641) T ss_pred CCE--E--eecccccccCc--CCeEEe-cceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccc Confidence 111 0 011011 1122 355444 4556677889999999999999999987665566666889999887654432 Q ss_pred ccccccccccccccccccccccccccccccccccccccCcccee---eecccHHHHHHHHHHHHHHHHHHhcCCchhccc Q lcl|NC_020883. 340 TLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIH---QIDISKIGDMDHVKNLIKLMLIETQTSEKAVDF 416 (589) Q Consensus 340 t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~i---q~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~ 416 (589) +. ++-+..|+ ... .....+. ++.+ +.+..+ ....++.+-..+-....++...++. T Consensus 368 ~~------~~l~~~PG--~ii-----------~~~~~~~-v~pl~~~~~~~~~--~~~~~~~~~~~i~~~~~~~~~~~~~ 425 (641) T protein:vir:94 368 KR------EDVKAKPG--AVF-----------KVAQHGS-LQPIDMGRQDFVV--TYQEAQVQESSVYRNTSTGPLIGNA 425 (641) T ss_pred cc------ceeeccCC--cce-----------eeCCCCc-ceeecCCccccch--hHHHHHHHHHHHHHhhhhhhhhccc Confidence 21 11111111 110 0011111 1111 122222 2333444433333333344444443 Q ss_pred ccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcC----------------cccCccccee- Q lcl|NC_020883. 417 YLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYI-DFLKELYESCLWLLNDQD----------------SSIRIEEPNI- 478 (589) Q Consensus 417 ~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~-~aLk~li~~~l~L~~~~~----------------~~~~~e~p~I- 478 (589) ...+|...|+++++.+..- ...+...+-.+|. ++|+.++..++.+..... ..+.+.+..+ T Consensus 426 ~~~~~~~~TAtEV~~~~~e--~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~ 503 (641) T protein:vir:94 426 APRGGERVTAAEIQGVRDA--GGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLH 503 (641) T ss_pred ccccchhccHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCcccee Confidence 2223333355666654221 1122222223333 355555544333332210 1111222222 Q ss_pred -eeCCcCCCCCCHH-HHHHHHHHHhccchhhHHHHHHHhCCCC----CHHHHHHHHHHHHhhcccccccccccccccccc Q lcl|NC_020883. 479 -ETQDMILKPRAEL-VAENMAAYAASKQGQSLETTVRRMNPDA----SEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ 552 (589) Q Consensus 479 -~f~D~lPvde~El-~~A~t~~~l~~a~~~S~etaVr~Lhpdw----~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~ 552 (589) .++ .+|.....+ ..++..+-+.+ ..+ ++.. +|.+ +-.++.+++.+.. |-+ .+.- T Consensus 504 ~~~~-iv~l~~~q~~~~~~~i~~l~~--~~~---~~a~-~P~v~d~~d~~~~~~~~~~~~-----------g~~--~p~~ 563 (641) T protein:vir:94 504 YPYK-FLALGANYVVERERMVTDLLQ--LLD---ISGR-VPQIGQSLDYALILEDLLRQM-----------RFT--DPMR 563 (641) T ss_pred eeee-EeecchhHHHHHHHHHHHHHH--HHH---Hhhc-ChhhhhcCCHHHHHHHHHHHh-----------CCC--Cchh Confidence 221 133332111 11111111110 111 1111 2321 1122222222111 111 1111 Q ss_pred ccCcccCCCCCCCCCCCCCCCCcchhhhhhccc---ccCC Q lcl|NC_020883. 553 MNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEG---EPIA 589 (589) Q Consensus 553 ~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~---~~~~ 589 (589) ++-..+ .+....+.+..|..+..+ .+++ T Consensus 564 ~ir~~~---------~~~~~~~~~~~~~q~~~~~~a~~~~ 594 (641) T protein:vir:94 564 YIKKAE---------APPAAPPIAPAEPGALPPEMMNSVG 594 (641) T ss_pred hccCcc---------CchhHHHHHHHHHHHHHHHHHHHHH Confidence 221111 110111111111111111 1111 No 116 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=92.82 E-value=0.0098 Score=31.59 Aligned_cols=372 Identities=9% Similarity=-0.026 Sum_probs=130.6 Q ss_pred EEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhc-------ccccccchhhhhhhh-hhhhh----hhh Q lcl|NC_020883. 67 VIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMI-------EGPQDEEEAGKNENN-TVIDL----QNE 134 (589) Q Consensus 67 ~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i-------~~~~~~~~~~~~~~~-~~~~~----~~e 134 (589) .+-=|.+.+....+..-++.............+-..........+ .+++ +.-..-.+ .++-+ .+. T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v---~~~v~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDL---FSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHH---HHHHHHHHHhhccCceeeccc Confidence 000011111111110000000000000000000000000000000 0000 00000000 00000 000 Q ss_pred HHHHHHhhcccc---ccchhhHHHH-HHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEE Q lcl|NC_020883. 135 IIEQITKNSKLE---RRHWSNIVQH-QVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLH 208 (589) Q Consensus 135 ~i~~v~kn~~~~---~~~~~~l~~~-~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~ 208 (589) ..+.++..-+.+ ..|+..++.+ +..|-.++.+..+..+ ..+....+++.-+ + .....++ . T Consensus 78 ~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v---------~---~~~~~~~--~ 143 (392) T protein:vir:74 78 KNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNT---------Y---YFEYENG--M 143 (392) T ss_pred hhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEE---------E---EcCCCce--E Confidence 011122222222 2244445533 3444444444444333 2333334433322 1 0000001 0 Q ss_pred EEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEec Q lcl|NC_020883. 209 IYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWA 288 (589) Q Consensus 209 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvP 288 (589) .|+...... ..+.+.. + +.--|+|++ T Consensus 144 ~y~~~~~~~----------------------~~~~~~~-------------------~-------------~~~evih~~ 169 (392) T protein:vir:74 144 YYNITFDDP----------------------KIEPILQ-------------------A-------------PQSDLIHMK 169 (392) T ss_pred EEEEEecCC----------------------ccceeEE-------------------E-------------cCccEEEec Confidence 110000000 0000000 0 001188999 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEE--echhhhhccccccccccccccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRIS--ITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHR 366 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~--VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~ 366 (589) +......++|.|-+.-+...++....+-....+.|...+.|+.+ +|.....+-.. ...-...+.+. ...+. T Consensus 170 ~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~--~~~~~~~~~~~-----~n~g~ 242 (392) T protein:vir:74 170 LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD--KASRSRSFMKR-----SRSGG 242 (392) T ss_pred CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH--HHHHHHHHhcc-----ccCCC Confidence 87777778999998777776655444433344556555777644 44332211100 00000001000 00111 Q ss_pred cccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_020883. 367 DMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQ 446 (589) Q Consensus 367 dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R 446 (589) .+ +. ++|...+-++......+.++..+...++|..+=+.|+.-+|... +++ .+..+ ...+.+. .+.-|. T Consensus 243 ~~-vl---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~---~~~e~-~~~~~~~--~l~p~~ 311 (392) T protein:vir:74 243 PV-VL---DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG-DQQ---SSIQQ-ISGMYAS--ALNRYL 311 (392) T ss_pred ee-ec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-Ccc---cHHHH-HHHHHHH--HHHHHH Confidence 11 11 23333333344445556677777788888887799998887422 111 11222 1222111 122222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHH Q lcl|NC_020883. 447 KEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQE 526 (589) Q Consensus 447 ~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~e 526 (589) ..+.+.|.+.+ .. .++|+-....+.+.+..+.....+.+++++|+..+-+++ T Consensus 312 ~~ie~~l~~~l----------~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~----------- 363 (392) T protein:vir:74 312 RPAISELEYKL----------SD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVL----------- 363 (392) T ss_pred HHHHHHHHHhc----------cc-------hhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHH----------- Confidence 22223332211 11 122221112222344456666677777877776655432 Q ss_pred HHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCC Q lcl|NC_020883. 527 EIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPS 574 (589) Q Consensus 527 Ev~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~ 574 (589) .+.+. .+|+. .+.++.+|...||+++.+| T Consensus 364 ------~~~g~------------~pne~-r~~enl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 364 ------QEAGY------------IPKDL-PAPENTNKKTTGQSNEPVP 392 (392) T ss_pred ------HhCCC------------Ccccc-chhcCCCCCCCCCCCCCCC Confidence 11111 11111 2334566778887766655 No 117 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=92.76 E-value=0.01 Score=31.53 Aligned_cols=443 Identities=12% Similarity=0.036 Sum_probs=142.2 Q ss_pred hhhhhhhhcCCccccCHHHHHHHhhccccce--eccCcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCc Q lcl|NC_020883. 21 YERYRQLYEGKHELLFPRAKRLIEEGDAVGR--FLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGE 98 (589) Q Consensus 21 ~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~--~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~ 98 (589) |-....||..++.-....+....- ..+.-. ...+..++--.-|+--+..++.+..-+ .++++.+.+++=..-. T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i--~~ia~~iA~lp~~~~~-- 75 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAW-EPYDPSIYNLGATASSGERVTPHDALQVSAVFASV--RLLSETIATLPLSTYS-- 75 (457) T ss_pred Cchhhhhhcccccccccccccccc-ccchhhhhhccccccCCceechHHhhccHHHHHHH--HHHHHhHhhCceEEEE-- Confidence 332333332111111110000000 000000 000000100000111122223333333 4455555444321110 Q ss_pred ccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc-CceeEEEEEecCce-eEE Q lcl|NC_020883. 99 IDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD-GGIVAAPVIDELGP-RIV 176 (589) Q Consensus 99 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~-Gg~~~~~~~~~~~~-~i~ 176 (589) -.....+.+..+.-. .--+..|+.+. ...|+..++.+... |-.++.+..+++++ .+. T Consensus 76 ----~~~~~~~~~~~~~~~-~ll~~pn~~~t----------------~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~ 134 (457) T protein:vir:62 76 ----KRGGTRKEIDTPEWL-DFPNAEPGGMG----------------RIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLD 134 (457) T ss_pred ----ecCCccccccchHHH-HhccccCCCCC----------------HHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEE Confidence 000111111000000 00001111110 11244444444333 43444443333332 333 Q ss_pred EecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccch Q lcl|NC_020883. 177 FKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEG 256 (589) Q Consensus 177 f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~ 256 (589) ...++++.+ .-....+. .++ .++ .| .....|.......+.. T Consensus 135 ~l~p~~v~v---------~~~~~~~~-~~~-~~~--------------~y----------~~~~~g~~~~~~~~~~---- 175 (457) T protein:vir:62 135 VLDPTKIHV---------HMVMVDGL-RRK-VFE--------------AY----------DIDADGNEVLLGWFTP---- 175 (457) T ss_pred EEcCcceEE---------EEeccCCc-cce-eEE--------------EE----------EEccCCceeEEEeeCc---- Confidence 333333221 10000000 000 000 01 0011111111000000 Q ss_pred hhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEech- Q lcl|NC_020883. 257 EELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITK- 335 (589) Q Consensus 257 e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~- 335 (589) . -|.|+.+......++|.|-+.-+...+......-....+.|...+.|+.++-- T Consensus 176 -------------~------------eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 230 (457) T protein:vir:62 176 -------------R------------DVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP 230 (457) T ss_pred -------------c------------ceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC Confidence 1 18898887777778999998766666655444433445556555677644321 Q ss_pred hhhhccccccccccccccccccccccccc--cccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchh Q lcl|NC_020883. 336 EMMDTLLNIAYERDGHSAKEASMMTPRID--HRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKA 413 (589) Q Consensus 336 ~~L~t~~g~~~d~dge~~~~~~~~~~~~d--~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~A 413 (589) +.|+.- ..++-.+... ..+.+.+ +..+ +. +.|...+-++....-.+.++..+....+|..+=+.|+.- T Consensus 231 ~~ls~e---~~~~~~~~~~---~~~~G~~nag~~~-vl---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 300 (457) T protein:vir:62 231 GTMSEE---GLARAREAWR---AANSGVDNAHRVA-LL---TEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHL 300 (457) T ss_pred CCCCHH---HHHHHHHHHH---HHhcCccccCcce-ec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 111110 0000000000 0000000 0001 11 222222222332333345566666788888888999998 Q ss_pred cccccCcccchhHHHHHH-HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHH Q lcl|NC_020883. 414 VDFYLDGGASGAQSGVAK-FYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELV 492 (589) Q Consensus 414 Fg~~~~~g~~~A~Sg~A~-r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~ 492 (589) .|...+++.. .|.++. .+.+..- .+.-|...++..|.+.+ +..... ........+...+. .+.++ T Consensus 301 lg~~~~~~~~--~sn~eq~~~~f~~~--~l~P~~~~ie~~ln~~L-----~~~~~~---~~~~i~fd~~~l~~--~d~~~ 366 (457) T protein:vir:62 301 ISDATNSTSW--GSGLAEQNIAFTMF--SLRPWLERIEAGFNRLL-----FAETAD---RFRFVKFNLDEIKR--GAPKE 366 (457) T ss_pred cCCCCCcccc--cchHHHHHHHHHHH--HHHHHHHHHHHHHHhhh-----cCcccc---CceEEEeechhhhc--cCHHH Confidence 8864433221 122221 1122111 12223323333333221 111101 11112233334333 34556 Q ss_pred HHHHHHHHhccchhhHHHHHHHhC-CCCCHHHHHHHH-----HHHHhhccccccccccccccccccccCcccCCCCCCCC Q lcl|NC_020883. 493 AENMAAYAASKQGQSLETTVRRMN-PDASEDWIQEEI-----ARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEE 566 (589) Q Consensus 493 ~A~t~~~l~~a~~~S~etaVr~Lh-pdw~dE~v~eEv-----~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~de 566 (589) +++..+.+.++++++.-.+-+++. |-..+-+.++-+ ..+.... ...+.+.+...++-. .++. ++.+. T Consensus 367 r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~-~~~~~~~~~~~~~~~--~~~~----~~~~~ 439 (457) T protein:vir:62 367 RMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEP-EPEPAPAPPAIDPPA--EEPA----DDEEP 439 (457) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccc-cccccCCCccCCCCc--cCCC----CCCCC Confidence 688888889999999887777664 112211111111 1110000 000001010000000 0000 00111 Q ss_pred CCCCCCCCcchhhhhhccccc Q lcl|NC_020883. 567 GDTEEEPSAEENEEIEKEGEP 587 (589) Q Consensus 567 g~~~eep~~~~~e~~~~~~~~ 587 (589) .+.+..|+.+|.|+ .-|+ T Consensus 440 ~~~~~~~d~~~~~~---~~~~ 457 (457) T protein:vir:62 440 DNAEGDPDEGETED---DDDA 457 (457) T ss_pred CCCCCCCccccccc---cccC Confidence 11222222222222 2222 No 118 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=92.52 E-value=0.011 Score=31.31 Aligned_cols=469 Identities=12% Similarity=0.060 Sum_probs=151.6 Q ss_pred HHhhcchhhhhhhhhcCCccccC---------------HHHHHHHhhccccc-----eecc-Ccceeee------cCcce Q lcl|NC_020883. 14 TKNVHGDYERYRQLYEGKHELLF---------------PRAKRLIEEGDAVG-----RFLD-SSQTARE------TQTPY 66 (589) Q Consensus 14 ~~~~~~~~~~~r~l~~g~~~~~f---------------~ra~~~~~~~~~~~-----~~~~-~~~~~~~------~~~~y 66 (589) .++--|-..|.| ++|...+.| +|....|++.-.-. +... +-++... .+++| T Consensus 1 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~ 78 (551) T protein:vir:80 1 MKNKLGLFESIR--LVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQ 78 (551) T ss_pred CchhhhhHHHhh--hccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChh Confidence 444455555666 334433333 22222222211100 0000 0010000 00000 Q ss_pred EEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccc Q lcl|NC_020883. 67 VIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLE 146 (589) Q Consensus 67 ~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~ 146 (589) =+.++.+.+++- ..+.+.+-.+...++.-+ .- ...+..-+--.+ .......+.......+.+.+..++.++++. T Consensus 79 ~l~~~~~~~~~n--piv~~~I~~ia~~IA~~~--~~-~~~~~~g~~~~i-~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~ 152 (551) T protein:vir:80 79 DLHGVLKKFGGN--IILNAIINTRSNQVSMYC--KP-ARHSEKGVGFEV-RLKDLDKKPTSHDEATIKRIESFIEKTGVD 152 (551) T ss_pred HHHHHHHHhhcC--HHHHHHHHHHHHHHhhhh--hh-hhhhcCCCCceE-EecccCcccChhHHHHHHHHHHHHHhcCCC Confidence 000111111111 111111111221111000 00 000000000000 000001111122223344566677777665 Q ss_pred c--------cchhhHHH-HHHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeec Q lcl|NC_020883. 147 R--------RHWSNIVQ-HQVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVE 215 (589) Q Consensus 147 ~--------~~~~~l~~-~~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~ 215 (589) + .|...++. .+.-|..++.+..+.++ ..+....+++..++.+.-+. .....+.+++ T Consensus 153 ~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~--------~~~~~~~y~~----- 219 (551) T protein:vir:80 153 NDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGK--------IPDNGNRFVQ----- 219 (551) T ss_pred CCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccc--------cccCceEEEE----- Confidence 3 23334443 34456677777776555 44555566555442111100 0000000000 Q ss_pred cccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCC--- Q lcl|NC_020883. 216 KDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNET--- 292 (589) Q Consensus 216 ~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~--- 292 (589) ...++..+. + . ..+ |.|+..++. T Consensus 220 ----------------------~~~g~~~~~---~----~-------------~~e------------iiH~~~n~~~~~ 245 (551) T protein:vir:80 220 ----------------------VIDQKIVAT---F----N-------------ARE------------MAFAVRNPRSDI 245 (551) T ss_pred ----------------------EeCCcEEEE---E----c-------------ccc------------eEEecccCCCCc Confidence 001111000 0 0 001 677764433 Q ss_pred CCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcE--Eechhh-hhcccccccccccccccccccccccc--cccc Q lcl|NC_020883. 293 FMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRI--SITKEM-MDTLLNIAYERDGHSAKEASMMTPRI--DHRD 367 (589) Q Consensus 293 ~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI--~VP~~~-L~t~~g~~~d~dge~~~~~~~~~~~~--d~~d 367 (589) ...++|.|-+.-+...+.....+-....+.|...+.|+- .+|... |+.- ..++-.+.+.. .+.+. .+.. T Consensus 246 ~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e---~~~~lk~~~~~---~~~G~~nag~~ 319 (551) T protein:vir:80 246 YATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQH---ALEIFKREWKN---SLSGINGSWQI 319 (551) T ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHH---HHHHHHHHHHH---HhcCccccCcc Confidence 335689998876666665444333333455544467774 334221 2110 00000000000 00000 0100 Q ss_pred ccccccccccCccceeee--cccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccch------hHHHHHHH-HHhhhH Q lcl|NC_020883. 368 MEITTFDENGRSMEIHQI--DISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASG------AQSGVAKF-YDLLTT 438 (589) Q Consensus 368 lev~~~de~g~~~~~iq~--Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~------A~Sg~A~r-~~~~~~ 438 (589) . +. .++| +.+.+. ...-.+..+..+...+.|..+=+.|+.-.|+....+..+ +.|.+... ..+.+ T Consensus 320 ~-vl--~~~g--~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~- 393 (551) T protein:vir:80 320 P-VV--SAED--VKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKN- 393 (551) T ss_pred c-cc--cCCC--ceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHH- Confidence 0 11 1122 333333 333344555666677888777789999888643322111 11111111 11100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC-- Q lcl|NC_020883. 439 ILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN-- 516 (589) Q Consensus 439 ~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh-- 516 (589) ..+.-|...++..|.+.+ ....+ ....+.|......+ +...+++.+.. .+++++.-.+-++++ T Consensus 394 -~tL~P~~~~ie~~ln~~L----------~~~~~-~~~~f~f~~~~~~~--~~~~~~~~~~~-~~g~lT~NE~R~~~gl~ 458 (551) T protein:vir:80 394 -KGLQPLLGFIEDFINKHI----------VAEFG-DKYTFQFVGGDIKS--ELESVKILAEK-AKVAMTVNEVRKELNLP 458 (551) T ss_pred -HHHHHHHHHHHHHHHhhh----------ccccC-CceEEEeeccChhh--HHHHHHHHHHH-hcCCcCHHHHHHHhCCC Confidence 012222222223332211 11111 12345676544333 33445555443 456777776666664 Q ss_pred CCCCHHHHHHHHHHHHhhccccccccccccccc-ccc--ccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 517 PDASEDWIQEEIARIEEEQAGSDTSSLMGINQT-FEQ--MNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 517 pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~-l~~--~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) |.. +.-+.-+.-. ..+..........+. ..+ -+++-.+.+.+.+..+++.+|.+.|.....++-..+- T Consensus 459 P~~--egGD~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 529 (551) T protein:vir:80 459 GDV--IGGDIPLNGV---IVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRK 529 (551) T ss_pred CCC--CCCceeeccc---ccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCcccccccc Confidence 311 1100000000 000000000000000 000 0111111111222223333444433322221111111 No 119 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=92.01 E-value=0.013 Score=30.88 Aligned_cols=449 Identities=11% Similarity=0.031 Sum_probs=162.1 Q ss_pred CccceeccchhHH------HHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccC-cceeeecCcceEEEEcch Q lcl|NC_020883. 1 MIDWTVRGWTDKT------TKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDS-SQTARETQTPYVIFNLPK 73 (589) Q Consensus 1 ~~~~~~~~~~~~~------~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~-~~~~~~~~~~y~~~n~~~ 73 (589) |+. ..|=...| ....+-.+...|.+|.|.. ..+-|..-|.. -.+.-+. .|-+.-.|..| .|+++ T Consensus 1 ~~~--~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~-~~~~r~~yl~~----~~~~~~e~~Y~~rl~rA~~--~n~~~ 71 (491) T protein:vir:95 1 MLT--ANGQGSGVKTKHREWLHYAPKWQKVRHALAGDL-VGYLRNVGLNE----PDKAYGEARQAEYEAGGIV--YNFTR 71 (491) T ss_pred Ccc--cCCccCCCCccCHHHHHHHHHHHHHHHHhcCcc-hhhcccCCCcC----CCCCCCHHHHHHHHhcccC--CChHH Confidence 331 12211111 1122334455567777742 22222111110 0000000 01111222222 25544 Q ss_pred hhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhH Q lcl|NC_020883. 74 VIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNI 153 (589) Q Consensus 74 ~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l 153 (589) -++. ..+|.+-. .++.-+.+..++ .-.+=+|++.--+++++++ .+ T Consensus 72 ~tl~-------~l~G~vfr--------k~p~~~~p~~l~-----------~l~~d~D~~G~~L~~f~~~---------~~ 116 (491) T protein:vir:95 72 RTLS-------GMVGSVMR--------KEPEINIPKELE-----------YLLKNADGSGVGLIQHAQD---------TL 116 (491) T ss_pred HHHH-------HHhchhhc--------CCceeeccHHHH-----------HHHhccCCCCCCHHHHHHH---------HH Confidence 4333 33444433 122212222221 1112223333334444433 33 Q ss_pred HHHHHcCceeEEEEEecC-------------ceeEEEecCceecc----cccCc--ceeEEEeecC------CC-ccceE Q lcl|NC_020883. 154 VQHQVDGGIVAAPVIDEL-------------GPRIVFKARDVYFP----HDDEK--GADLAYYIDH------GQ-YGQFL 207 (589) Q Consensus 154 ~~~~v~Gg~~~~~~~~~~-------------~~~i~f~~~d~~~P----~~d~~--~~div~~~e~------~~-~~~~l 207 (589) ..++..|++-.-+=.... -|++.++.|.+.+= +.+|+ -+-+++.+.. ++ ..+.+ T Consensus 117 ~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~ 196 (491) T protein:vir:95 117 MEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYG 196 (491) T ss_pred HHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceE Confidence 444566665443322111 15566666655533 12332 1334443310 00 12233 Q ss_pred EEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEe Q lcl|NC_020883. 208 HIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYW 287 (589) Q Consensus 208 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyv 287 (589) ..||.+.-..-.....++|+. ..+|.......+. +|+.. .+.+....|+++ T Consensus 197 ~qyRvL~l~~~g~~~~~v~r~----------~~~g~~~~~~~~~--------------~~~~g-----~~~l~~IPfv~~ 247 (491) T protein:vir:95 197 EQYRVLDIDTDGNYRQRLFRF----------DAEGGAQEEVVEI--------------YPDLG-----ESLRGVIPFTFI 247 (491) T ss_pred EEEEEEeecCCCceEEEEEEE----------cCCCcceeeeeee--------------eecCC-----CcccCeeEEEEE Confidence 444433111001111234321 1112111111110 11111 134555555554 Q ss_pred cCCCCCCCcccCcchhhhhHHHHHHHHHHh----HHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccc Q lcl|NC_020883. 288 ANNETFMNPYGISALDNLESKQDEINWTIT----RSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRI 363 (589) Q Consensus 288 PN~~~~~~~lG~SD~~~ie~l~DeLd~t~S----~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~ 363 (589) -.....-. -|.+-+-+|..+ |-..- ....++-..+-|.+.+. |. -+...++....+.....+ T Consensus 248 ~~~~~~~~-~~~pPLl~LA~l----ni~Hy~~ssd~~~~l~~~~~P~l~~~--------G~-d~~~~~~~~~~~~~~i~~ 313 (491) T protein:vir:95 248 GATNNDAT-IDDAPLLPLAEL----NIGHYRNSADNEESSFVVGQPTLFIY--------PG-DNLTPQSFKEANPNGIKF 313 (491) T ss_pred ecCCCCCC-CCcCchHHHHHH----HHHHhhhhhHHHHHHHHcccceeeee--------cC-cccCcchhhccCcceeEe Confidence 32211111 234444433332 32211 12222323477766542 00 000011111111000111 Q ss_pred ccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHH Q lcl|NC_020883. 364 DHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSR 443 (589) Q Consensus 364 d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~ 443 (589) ..+. .+..+. +....++++..+-. ..+.++.+..++.. .+...+.. ++ +.|+++.+.+.....+-.. T Consensus 314 g~~~--~~~lP~-~~~~~~ie~~~~~~-~~~~l~~~e~qm~~---~Ga~l~~~---~~---~~Ta~~~~~~~~~~~S~L~ 380 (491) T protein:vir:95 314 GSRC--GHNLGY-GGSAQLIQAGENNL-ARQNMLDKEQQAIQ---IGAQLITP---SQ---QITAESARIQRGADTSVMA 380 (491) T ss_pred cCcC--CcCCCC-CCccceeecCcchH-HHHHHHHHHHHHHH---HHHHhccC---Cc---chhHHHHHHHHHHhhHHHH Confidence 1111 122222 45567888876554 46777777777643 33333331 11 2444444443333333444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCcccCc-ccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh----CCC Q lcl|NC_020883. 444 RLQKEYIDFLKELYESCLWLLNDQDSSIRI-EEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM----NPD 518 (589) Q Consensus 444 ~~R~~~~~aLk~li~~~l~L~~~~~~~~~~-e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L----hpd 518 (589) .+-..+.+++..++.++...... +.+..+ -..+.+|... +.+ .+.. +.++ .+..++.+|.+|..+.| -++ T Consensus 381 ~~a~~~e~al~~~l~~~a~w~G~-~~~~~v~i~~n~dF~~~-~~~-~~~~-~all-~~~~~G~is~~t~~~~L~~~~vl~ 455 (491) T protein:vir:95 381 TIARNVSQAYTDALRWVAMMLGK-PEDSEVEFQLNMDFFLQ-PMT-AQDR-AAWM-ADINAGLLPATAYYAALRKAGVTD 455 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHcCC-CCCCceEEEeecccccc-cCC-HHHH-HHHH-HHHhcCCCCHHHHHHHHHhCCCCC Confidence 45555667777776443332211 111111 1124456442 233 2222 2233 34456889999888766 245 Q ss_pred CCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCC Q lcl|NC_020883. 519 ASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEE 571 (589) Q Consensus 519 w~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~e 571 (589) ++.| +|.++|+++... .|+..++-.-+++-.|. .+| T Consensus 456 ~~~e---~~~~~ie~~~~~-----~~~~~~~~~~~~~~~~~---------~~~ 491 (491) T protein:vir:95 456 WTDE---DILNAIEDAPLP-----SGAVTQVAGEIPQAAQQ---------QQE 491 (491) T ss_pred ccHH---HHHHHHHhcCCC-----CCccccccccchhhhhh---------ccC Confidence 5554 577788766521 11111111111111111 111 No 120 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=91.31 E-value=0.016 Score=30.36 Aligned_cols=408 Identities=11% Similarity=0.086 Sum_probs=144.1 Q ss_pred hhhhhhhhcCCcc------ccCHHHHHHHhhccccceeccCcceeeecCcceEEE------Ecchhhhccchhhhccccc Q lcl|NC_020883. 21 YERYRQLYEGKHE------LLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIF------NLPKVIAEIPATMVSGSIG 88 (589) Q Consensus 21 ~~~~r~l~~g~~~------~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~------n~~~~i~~~pa~~~~~~~~ 88 (589) |....++|-.+-+ .+-..+..++. +.+ .+ ..++.+ ..+-+..-+ .++++.+. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~---~~g----~~-------~~~~~v~~~~al~~~~v~~~i--~~ia~~ia 64 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLE---WLG----IS-------PSTISVKGKNALKVATVFACI--KILSESVS 64 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHH---Hhc----CC-------CCcceechhhhhccHHHHHHH--HHHHHhhc Confidence 5555555531110 11111111111 010 00 011111 122222222 34444443 Q ss_pred cccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHh-hcccc---ccchhhHHHH-HHcCcee Q lcl|NC_020883. 89 QIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITK-NSKLE---RRHWSNIVQH-QVDGGIV 163 (589) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~k-n~~~~---~~~~~~l~~~-~v~Gg~~ 163 (589) .++=..-. .-.....+..+.+ +..++. .-+.+ ..|+..++.+ +..|-.+ T Consensus 65 ~l~~~~~~-----~~~~~~~~~~~~~---------------------l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay 118 (429) T protein:vir:10 65 KLPLKIYQ-----EDEYGIQRGTKHY---------------------LNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSY 118 (429) T ss_pred cCceEEEE-----ecCCceeeccccH---------------------HHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeE Confidence 33211100 0000000000000 111111 11111 1344555544 4455555 Q ss_pred EEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccce-eehhhhccccccchhheeec Q lcl|NC_020883. 164 AAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLR-TTNMLYPVVKAKGDVKKEIK 240 (589) Q Consensus 164 ~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~-~~~~~y~~~~~~~~~~~~~~ 240 (589) +.+..+..+ ..+....++++-+. ..+. +.. .....| +.... T Consensus 119 ~~i~r~~~G~~~~L~~i~~~~v~v~----------~~~~-----------------~~~~~~~~~~---------~~~~~ 162 (429) T protein:vir:10 119 ANIEFDRKGKVQALWPIDASKVTVY----------IDDV-----------------GLLNSKTKMW---------YVVNT 162 (429) T ss_pred EEEEECCCCcEEEEEEEcCceeEEE----------EcCc-----------------ccccccceEE---------EEEcc Confidence 555555443 34445555444321 1110 000 000000 01112 Q ss_pred ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHH Q lcl|NC_020883. 241 KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSA 320 (589) Q Consensus 241 ~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~s 320 (589) .|.+... | ..+ |+|+++....+.++|.|-+..+...++.....-.... T Consensus 163 ~g~~~~~-------------------~-~~e------------vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 210 (429) T protein:vir:10 163 GGQQRVL-------------------K-PEE------------ILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFIN 210 (429) T ss_pred CCeEEEE-------------------c-ccc------------EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 2321110 0 011 8999887777788999999887777776655444445 Q ss_pred HHHHHhCCCcEEechhhhhccccccccccccccccccccccccc-cccccccccccccCccceeeecccHHHHHHHHHHH Q lcl|NC_020883. 321 VIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRID-HRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNL 399 (589) Q Consensus 321 rildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d-~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L 399 (589) +.+...+.|+.++-- .+.+..+........+...+.+.+ ....-+. +.|...+-++....-.+.++..+.. T Consensus 211 ~~~~ng~~~~~il~~-----~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl---~~g~~~~~l~~~~~d~q~~e~~~~~ 282 (429) T protein:vir:10 211 NFYKQGLQVKGLVQY-----VGDLNEDAKKVFRENFESMSSGLQNSHRIALM---PVGYQFQPISLNMSDAQFLENTELT 282 (429) T ss_pred HHHhccCCccEEEEc-----CCCCCHHHHHHHHHHHHHHhccccccCceeec---CCCceEEEccCChhHHHHHHHHHHH Confidence 556555677765421 111111100000000000000000 0001121 2222222222223333445566677 Q ss_pred HHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceee Q lcl|NC_020883. 400 IKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIE 479 (589) Q Consensus 400 ~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~ 479 (589) .++|..+-+.|+..+|....+..+ +.......+.+- .+.-|...+...|.+- ++........+ ..... T Consensus 283 ~~~Ia~~fgVP~~~lg~~~~~~~s---n~e~~~~~f~~~--~l~P~~~~ie~~ln~k----l~~~~~~~~g~---~~~fd 350 (429) T protein:vir:10 283 IRQIATAFGIKMHQLNDLSKATLN---NIEQQQQQFYTD--TLQATLTMYEQEMTYK----LFLDSELDKGF---YSKFN 350 (429) T ss_pred HHHHHHHhCCCHHHhCCCCCCCcc---cHHHHHHHHHHH--HHHHHHHHHHHHHHHh----hcChhhcCCCc---EEEee Confidence 888888899999999853322111 112222222111 1122222222222211 11111111111 11233 Q ss_pred eCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccC Q lcl|NC_020883. 480 TQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDE 559 (589) Q Consensus 480 f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~ 559 (589) ++..+..| ..+.++..+.+.++++++.-.+-+.+. +.+- +.-++... +.-..|....+. . -+.+-++ T Consensus 351 ~~~ll~~d--~~~~~~~~~~~~~~G~~T~NE~R~~~g--l~p~---~ggD~~~~-~~n~~~~d~~~~-~----~~k~g~~ 417 (429) T protein:vir:10 351 VDAILRAD--IKTRYEAYRTGIQGGFLKPNEARSKED--LPPE---AGGDRLLV-NGNMLPIDMAGQ-A----YLKGGDT 417 (429) T ss_pred chhhhcCC--HHHHHHHHHHHHhCCCcCHHHHHHHhC--CCCC---CCcCeeee-cccccchhhccc-c----ccCCCCC Confidence 33443334 445577888888888888776655552 1110 00000000 000000000000 0 0111222 Q ss_pred CCCCCCCCCCCC Q lcl|NC_020883. 560 DGNIIEEGDTEE 571 (589) Q Consensus 560 ~~~p~deg~~~e 571 (589) ++...++|+|+. T Consensus 418 ~~~~~~~~~e~~ 429 (429) T protein:vir:10 418 NGEVSKEGNEGN 429 (429) T ss_pred CCCCCCCCCCCC Confidence 333334444443 No 121 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=91.13 E-value=0.017 Score=30.24 Aligned_cols=430 Identities=11% Similarity=0.105 Sum_probs=144.5 Q ss_pred CccceeccchhHHHHhh--c------chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNV--H------GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLP 72 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~--~------~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~ 72 (589) |.|+--|+ .+.-+.- . ..+.....+|-|.. ..+-. -|+=-+.+++ T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--------------------~~g~~-----v~~~~al~~~ 53 (454) T protein:vir:93 1 MWNLLRRT--RKNQKSGRDVREAGWTSLFQAVAEPFAGAW--------------------QQGVK-----ADPEAVLSFH 53 (454) T ss_pred CCCccccC--cccccccccccchhhhhhhhhhhhhhcchh--------------------hcCcc-----cChHHhhccH Confidence 55554331 1111100 0 00000111111100 00000 0111122233 Q ss_pred hhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccc---cc Q lcl|NC_020883. 73 KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER---RH 149 (589) Q Consensus 73 ~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~---~~ 149 (589) .+.+-+ .++++.+..++-.+-. .+. .|.. .... ...+..+...-+.++ .| T Consensus 54 ~V~~~v--~~Ia~~iA~lp~~~~~---------~~~---~g~~-----~~~~--------~~~~~~L~~~PN~~~t~~~f 106 (454) T protein:vir:93 54 AVFACI--SLISQDIAKMRLRLMQ---------TDA---QGIR-----RETR--------RGDIARLCRRPNAQQNRIQF 106 (454) T ss_pred HHHHHH--HHHHHhhccCceEEEE---------ecc---CCcc-----chhh--------hHHHHHHHhcCCCCCCHHHH Confidence 333333 4454444444321110 000 0000 0000 012223333434443 56 Q ss_pred hhhHHHH-HHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhh Q lcl|NC_020883. 150 WSNIVQH-QVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLY 226 (589) Q Consensus 150 ~~~l~~~-~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y 226 (589) +..++.+ +..|-.++.+..+..+ ..+....+++. .++.. .. +.. .|+........ T Consensus 107 ~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v---------~v~~~-~~---g~~--~y~~~~~~~~~------- 164 (454) T protein:vir:93 107 FELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRV---------EPLVA-DD---GEV--FYRITPDRNCG------- 164 (454) T ss_pred HHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcce---------EEEEc-CC---CcE--EEEEEeccccc------- Confidence 6666644 5556666666665433 23333333322 22211 01 111 11111000000 Q ss_pred ccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhh Q lcl|NC_020883. 227 PVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLE 306 (589) Q Consensus 227 ~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie 306 (589) .+..+. + +.--|.|++.....+.++|.|-+.-+. T Consensus 165 --------------~~~~~~-------------------~-------------~~~eViH~k~~~~~~~~~G~sp~~~~~ 198 (454) T protein:vir:93 165 --------------ITEAVT-------------------V-------------PAREVIHDRFNCFFHPLIGLPPVYAAG 198 (454) T ss_pred --------------cceeEE-------------------e-------------cCcceEEeccCCCCCCceeccHHHHHH Confidence 000000 0 011188998766677889999987666 Q ss_pred HHHHHHHHHHhHHHHHHHHhCCCcEEech-hhhhcccccccccccccccccccccccc-ccccccccccccccCccceee Q lcl|NC_020883. 307 SKQDEINWTITRSAVIYEQNGKPRISITK-EMMDTLLNIAYERDGHSAKEASMMTPRI-DHRDMEITTFDENGRSMEIHQ 384 (589) Q Consensus 307 ~l~DeLd~t~S~~srildk~gkpRI~VP~-~~L~t~~g~~~d~dge~~~~~~~~~~~~-d~~dlev~~~de~g~~~~~iq 384 (589) ..+......-....+.|...+.|+.++-- +.|+. +..-.....+...+.+. .+..+ +. +.|...+-++ T Consensus 199 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~------e~~~~~~~~~~~~~~g~n~g~~~-vl---~~g~~~~~l~ 268 (454) T protein:vir:93 199 LAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITE------ENAKKLKSNWDSGYTGENAGKTA-IL---SNGAKYNPTT 268 (454) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCH------HHHHHHHHHHHHHhcccccCCce-ec---cCCceEEEcc Confidence 65554433333334556554667655421 11211 10000000000000000 00011 11 2222222233 Q ss_pred ecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 385 IDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLL 464 (589) Q Consensus 385 ~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~ 464 (589) ....-.+.++......++|..+=+.|+.-+|...+... +.+ ....+.+.+. .+.-|...++..|.+.+ +. T Consensus 269 ~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~--sn~-e~~~~~f~~~--~l~P~~~~ie~~ln~~L-----~~ 338 (454) T protein:vir:93 269 FSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSS--DNV-EALEQQYYSQ--CLQTLIESIELLLDEAL-----ET 338 (454) T ss_pred cChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcc--hhH-HHHHHHHHHH--HHHHHHHHHHHHHHHhh-----cC Confidence 33333445556667778888888999998885432211 111 1111122111 22233333333333221 11 Q ss_pred hhcCcccCcccceeee--CCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccc Q lcl|NC_020883. 465 NDQDSSIRIEEPNIET--QDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTS 541 (589) Q Consensus 465 ~~~~~~~~~e~p~I~f--~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~ 541 (589) . ....|+| .+.+..| ..++++....+.++++++.-.+-++++ |-...- .++ .+.... .+. T Consensus 339 --~------~~~~~~f~~~~ll~~D--~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg---D~~-~~~~~~---~~~ 401 (454) T protein:vir:93 339 --G------ENESTEFDVTTLLRMD--SERRMKTLGDAVKNTLLTPNEARKRENLPPLAGG---DAL-YLQQQN---YSL 401 (454) T ss_pred --C------CCcEEEeechhhhccC--HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC---Cee-eeccCc---cch Confidence 1 1113444 3443434 455677777888888888776666553 101100 000 000000 000 Q ss_pred -cccccccccc--cccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 542 -SLMGINQTFE--QMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 542 -~~g~~~~~l~--~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) ..+...+... +-.++......+..+++.+..++..+.........-|. T Consensus 402 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~~~~~ 452 (454) T protein:vir:93 402 EALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAVKAMFRGIL 452 (454) T ss_pred HhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhhhhhh Confidence 0000000000 00000001111111112222111111111111111111 No 122 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=90.97 E-value=0.018 Score=30.13 Aligned_cols=506 Identities=10% Similarity=0.013 Sum_probs=187.5 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeec-----------CcceEEE Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARET-----------QTPYVIF 69 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~-----------~~~y~~~ 69 (589) |-+ .-+.|.+++. ..|.+-..- ..+-..+| +.+.+|. +.+|+||...+ .-|-+++ T Consensus 1 m~e-~~~~~~~~~~----~~~~~~~~~----~~~~r~~~---~~d~~f~--~~~G~QW~~~~~~~l~~~~q~~grP~~~~ 66 (706) T protein:vir:10 1 MAE-SRQKQHERVM----LRFDRAWSP----QQVVREKC---IEATRFV--RVPGGQWEGATVAGTKLDEQFEKYPKFEI 66 (706) T ss_pred CCc-chHHHHHHHH----HHHHHHHHH----HHHHHHHH---HHHHHhh--ccCCccCCHHHHHHHHhhhhhcCCCceEe Confidence 444 2233332222 221111100 00111111 1111111 23466665422 2368999 Q ss_pred EcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccc Q lcl|NC_020883. 70 NLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRH 149 (589) Q Consensus 70 n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~ 149 (589) |+.+.+++. . +|.-..+-++- . +.|.+..- ..+.+++. ..++..+...|+..... T Consensus 67 N~i~~~v~~--v-----~g~~~~nr~~~------------~-v~P~~~~~--d~~~Ae~l---~~l~~~~~~~~~~~~a~ 121 (706) T protein:vir:10 67 NKVATELNR--I-----ISEYRNNRISV------------K-FRPGDNAA--SEELANKL---NGLFRADYEETDGGEAC 121 (706) T ss_pred cchHHHHHH--H-----hhHHHhCCCce------------E-EecCCCCc--hHHHHHHH---HHHHHHHHHhcCchHHH Confidence 999988876 2 34433322211 1 12211100 11122222 33778888888888888 Q ss_pred hhhHHHHHHcCceeEEEEEe---c-----C--ceeEEEe--cCceecc-c----ccCcceeEEEeecCCCccc------- Q lcl|NC_020883. 150 WSNIVQHQVDGGIVAAPVID---E-----L--GPRIVFK--ARDVYFP-H----DDEKGADLAYYIDHGQYGQ------- 205 (589) Q Consensus 150 ~~~l~~~~v~Gg~~~~~~~~---~-----~--~~~i~f~--~~d~~~P-~----~d~~~~div~~~e~~~~~~------- 205 (589) .+++.+++..|=.+.++..| + . .+.|.-+ ..+.+|. . -|..-|-++++..+--.+. T Consensus 122 s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~ 201 (706) T protein:vir:10 122 DNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDK 201 (706) T ss_pred HHHHHHHhhcCcceEEeeeccccccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCC Confidence 88888888888566666543 1 1 2333222 2233332 1 1222233444332111000 Q ss_pred -----------eEEEEEeeeccccceeeh-----------hhhccccccchhheeecccccccccccccccchhhhhhcc Q lcl|NC_020883. 206 -----------FLHIYRERVEKDGLRTTN-----------MLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREV 263 (589) Q Consensus 206 -----------~l~~~~~~~~~~~~~~~~-----------~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~ 263 (589) +++ ..+...+++.+.. ..|+...+ +.. .....++.......+....... +..+ T Consensus 202 ~~~~~~~~~~~~~~--~d~~~~d~~~~~eyy~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~l~~~~~~~-~~~~ 276 (706) T protein:vir:10 202 APTSLDRVGSVSWQ--YDWFTPDVVYIAKYYEVRKESVDVISYRQPLT-QEI-ATYDSEQIADIQDELEQAGFEE-IGRR 276 (706) T ss_pred Chhhhhhhcccccc--ccccCCCcceecccccccceeEEEEEeecccc-CCc-eeeccchhhhhHHHHhhCCchh-hhhc Confidence 000 0000011111110 01110000 000 0000010000000000000000 0000 Q ss_pred ------cCC----c-cccccccccCCCCcceEEEecCCCC-CCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcE Q lcl|NC_020883. 264 ------LNI----P-DDRPLENFYPGRNRPFISYWANNET-FMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRI 331 (589) Q Consensus 264 ------i~i----p-~~~e~~~i~TGv~~plvvyvPN~~~-~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI 331 (589) +-+ + ...+...-++|-..|+|.+|..... ......-+-+.++.+.++.+|.+.|+...++-+.++-.- T Consensus 277 ~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~ 356 (706) T protein:vir:10 277 SVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTP 356 (706) T ss_pred ccceeeEEEEeeccccccccCCCCCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCccc Confidence 000 0 0112233456677788877765432 222222344677899999999999888777644443222 Q ss_pred Eechhhhhcccccccccc-ccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020883. 332 SITKEMMDTLLNIAYERD-GHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTS 410 (589) Q Consensus 332 ~VP~~~L~t~~g~~~d~d-ge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts 410 (589) .++-+-++.. ...|... .+..+.+ ...+....+..+. .....+.+++.---..++++.+......|-.+++.+ T Consensus 357 ~~~~~~i~~~-~~~~~~~~~~~~~~l--~~~~~~~~~g~i~---~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~ 430 (706) T protein:vir:10 357 IVDMEQIRGL-EQHWEGRNRKRPAFL--PLRTVTDKTGNVV---APANVAGYTQAPVLNQALAALLQQTSADIQEVTGSS 430 (706) T ss_pred ccchhHHHHH-HHHhhhcccccccch--hcccccCCCCccc---ccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCC Confidence 2322222111 0011100 0000000 0000000000010 011222333332234567777777777777889999 Q ss_pred chhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC---cccCc-------------- Q lcl|NC_020883. 411 EKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQD---SSIRI-------------- 473 (589) Q Consensus 411 ~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~---~~~~~-------------- 473 (589) ..++|.. ++ .||+|+..+-.+........=..+..+++++.+.++.|-..+. +.+.+ T Consensus 431 ~~~lG~~--sn----~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~ 504 (706) T protein:vir:10 431 QAMQQMP--SN----VARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNA 504 (706) T ss_pred HHHcCCc--cc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeecc Confidence 9988852 11 3888876554444333333444455566666666555544211 00000 Q ss_pred ------------------ccceeeeCCcCCC-CCCHHHHHHHHHHHhccchhhHHH------HHHHhCCCCCHHHHHHHH Q lcl|NC_020883. 474 ------------------EEPNIETQDMILK-PRAELVAENMAAYAASKQGQSLET------TVRRMNPDASEDWIQEEI 528 (589) Q Consensus 474 ------------------e~p~I~f~D~lPv-de~El~~A~t~~~l~~a~~~S~et------aVr~Lhpdw~dE~v~eEv 528 (589) .+-+|...++-.. ...+...+.+++++.+....--.+ .+..+ |+ ...++-+ T Consensus 505 ~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~--d~--p~~~e~~ 580 (706) T protein:vir:10 505 AVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNM--EG--EGLDDFK 580 (706) T ss_pred ceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhc--Cc--cchHHHH Confidence 0012222222111 113333445555554332221111 12222 22 2334455 Q ss_pred HHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 529 ARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 529 ~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) +||+.-.....+ +..+ .+++.....+..+..+..--++ T Consensus 581 e~irk~~~~q~~----------------------~~~~-~~~eq~~~~q~qq~q~~q~~~~ 618 (706) T protein:vir:10 581 AFNRRQLLTQGI----------------------VKPR-NQQEQAIVQQAQQAQATQPDPN 618 (706) T ss_pred HHHHHhhcccCC----------------------cccc-chhHHHHHHHHHHHHHHHHHHH Confidence 666543321100 0000 0111111111111111000000 No 123 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=90.09 E-value=0.023 Score=29.60 Aligned_cols=358 Identities=10% Similarity=-0.002 Sum_probs=127.4 Q ss_pred hhccccccccccccC---CcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHH--------------------- Q lcl|NC_020883. 82 MVSGSIGQIKSSITT---GEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIE--------------------- 137 (589) Q Consensus 82 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~--------------------- 137 (589) |+.+.+..+...-+. .+...-........+.+.-.+ ..+.. -+...++++-.|. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLG-DNNEW-VSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcC-CCCce-echHHhhccHHHHHHHHHHHHhhccCceeeccch Confidence 222222222110000 000000000000000000000 00000 0011111111111 Q ss_pred --HHHhhcccc---ccchhhHHHH-HHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEE Q lcl|NC_020883. 138 --QITKNSKLE---RRHWSNIVQH-QVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHI 209 (589) Q Consensus 138 --~v~kn~~~~---~~~~~~l~~~-~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~ 209 (589) .++..-+.+ ..|+..++.+ +..|-.++.+..+..+ +.+....++..-+ + .....+. .. T Consensus 79 ~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~---------~---~~~~~~~--~~ 144 (392) T protein:vir:39 79 NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNT---------Y---YFEYENG--MY 144 (392) T ss_pred hhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEE---------E---EcCCCce--EE Confidence 111112222 2344445533 3444444444444333 2333333322221 1 0000000 00 Q ss_pred EEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecC Q lcl|NC_020883. 210 YRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWAN 289 (589) Q Consensus 210 ~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN 289 (589) |+...... ..+.+.. + +.--|+|+++ T Consensus 145 y~~~~~~~----------------------~~~~~~~-------------------~-------------~~~eiih~~~ 170 (392) T protein:vir:39 145 YNITFDDP----------------------KIEPILQ-------------------A-------------PQSDLIHMKL 170 (392) T ss_pred EEEEecCc----------------------ccceeEE-------------------E-------------ccccEEEecC Confidence 10000000 0000000 0 0011889998 Q ss_pred CCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEe--chhhhhcccccccccccccccccccccccccccc Q lcl|NC_020883. 290 NETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISI--TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRD 367 (589) Q Consensus 290 ~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~V--P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~d 367 (589) ......++|.|-+.-+...++....+-....+.|...+.|+.++ |.....+. +..+..... .......+.. T Consensus 171 ~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~------~~~~~~~~~-~~~~~~~g~~ 243 (392) T protein:vir:39 171 LSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD------KDKASRSRS-FMKRSRSGGP 243 (392) T ss_pred CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH------HHHHHHHHH-HhccccCCCe Confidence 88788889999998777777655544334455565667777553 33222111 000000000 0000001111 Q ss_pred ccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHH Q lcl|NC_020883. 368 MEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQK 447 (589) Q Consensus 368 lev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~ 447 (589) + +. +.|...+-++......+..+..+...++|..+=+.|+.-+|... .+. |.......+++. .+.-+-. T Consensus 244 ~-vl---~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~-~~~----~~~~~~~~f~~~--~l~P~~~ 312 (392) T protein:vir:39 244 V-VL---DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG-DQQ----SSIQQISGMYAS--ALNRYLR 312 (392) T ss_pred e-ec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-Ccc----cHHHHHHHHHHH--HHHHHHH Confidence 1 11 22322222333334445566777778888877799999987422 111 111111222111 1111221 Q ss_pred HHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHH Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEE 527 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eE 527 (589) .+...|.+.+ .. .++|+-....+.+....+.....+.+++++|+..+-+ T Consensus 313 ~ie~~l~~~L----------~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~-------------- 361 (392) T protein:vir:39 313 PAISELEYKL----------SD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATF-------------- 361 (392) T ss_pred HHHHHHHHhc----------cc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-------------- Confidence 2222222211 11 1222211112223344455666666777777665443 Q ss_pred HHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCC Q lcl|NC_020883. 528 IARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPS 574 (589) Q Consensus 528 v~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~ 574 (589) |..+.+- .| ++. .+.++.+|...|++++.+| T Consensus 362 ---~l~~~g~-~p-----------~e~-r~~e~l~~~~~Gd~~~p~p 392 (392) T protein:vir:39 362 ---VLQEAGY-IP-----------KDL-PAPENTNKKTTGQSNEPVP 392 (392) T ss_pred ---HHHhcCC-Cc-----------ccc-chhcCCCCCCCCCCCCCCC Confidence 2222221 11 111 2223456777777766666 No 124 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=90.09 E-value=0.023 Score=29.60 Aligned_cols=358 Identities=10% Similarity=-0.002 Sum_probs=127.4 Q ss_pred hhccccccccccccC---CcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHH--------------------- Q lcl|NC_020883. 82 MVSGSIGQIKSSITT---GEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIE--------------------- 137 (589) Q Consensus 82 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~--------------------- 137 (589) |+.+.+..+...-+. .+...-........+.+.-.+ ..+.. -+...++++-.|. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLG-DNNEW-VSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcC-CCCce-echHHhhccHHHHHHHHHHHHhhccCceeeccch Confidence 222222222110000 000000000000000000000 00000 0011111111111 Q ss_pred --HHHhhcccc---ccchhhHHHH-HHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEE Q lcl|NC_020883. 138 --QITKNSKLE---RRHWSNIVQH-QVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHI 209 (589) Q Consensus 138 --~v~kn~~~~---~~~~~~l~~~-~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~ 209 (589) .++..-+.+ ..|+..++.+ +..|-.++.+..+..+ +.+....++..-+ + .....+. .. T Consensus 79 ~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~---------~---~~~~~~~--~~ 144 (392) T protein:vir:10 79 NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNT---------Y---YFEYENG--MY 144 (392) T ss_pred hhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEE---------E---EcCCCce--EE Confidence 111112222 2344445533 3444444444444333 2333333322221 1 0000000 00 Q ss_pred EEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecC Q lcl|NC_020883. 210 YRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWAN 289 (589) Q Consensus 210 ~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN 289 (589) |+...... ..+.+.. + +.--|+|+++ T Consensus 145 y~~~~~~~----------------------~~~~~~~-------------------~-------------~~~eiih~~~ 170 (392) T protein:vir:10 145 YNITFDDP----------------------KIEPILQ-------------------A-------------PQSDLIHMKL 170 (392) T ss_pred EEEEecCc----------------------ccceeEE-------------------E-------------ccccEEEecC Confidence 10000000 0000000 0 0011889998 Q ss_pred CCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEe--chhhhhcccccccccccccccccccccccccccc Q lcl|NC_020883. 290 NETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISI--TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRD 367 (589) Q Consensus 290 ~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~V--P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~d 367 (589) ......++|.|-+.-+...++....+-....+.|...+.|+.++ |.....+. +..+..... .......+.. T Consensus 171 ~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~------~~~~~~~~~-~~~~~~~g~~ 243 (392) T protein:vir:10 171 LSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD------KDKASRSRS-FMKRSRSGGP 243 (392) T ss_pred CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH------HHHHHHHHH-HhccccCCCe Confidence 88788889999998777777655544334455565667777553 33222111 000000000 0000001111 Q ss_pred ccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHH Q lcl|NC_020883. 368 MEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQK 447 (589) Q Consensus 368 lev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~ 447 (589) + +. +.|...+-++......+..+..+...++|..+=+.|+.-+|... .+. |.......+++. .+.-+-. T Consensus 244 ~-vl---~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~-~~~----~~~~~~~~f~~~--~l~P~~~ 312 (392) T protein:vir:10 244 V-VL---DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG-DQQ----SSIQQISGMYAS--ALNRYLR 312 (392) T ss_pred e-ec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-Ccc----cHHHHHHHHHHH--HHHHHHH Confidence 1 11 22322222333334445566777778888877799999987422 111 111111222111 1111221 Q ss_pred HHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHH Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEE 527 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eE 527 (589) .+...|.+.+ .. .++|+-....+.+....+.....+.+++++|+..+-+ T Consensus 313 ~ie~~l~~~L----------~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~-------------- 361 (392) T protein:vir:10 313 PAISELEYKL----------SD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATF-------------- 361 (392) T ss_pred HHHHHHHHhc----------cc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-------------- Confidence 2222222211 11 1222211112223344455666666777777665443 Q ss_pred HHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCC Q lcl|NC_020883. 528 IARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPS 574 (589) Q Consensus 528 v~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~ 574 (589) |..+.+- .| ++. .+.++.+|...|++++.+| T Consensus 362 ---~l~~~g~-~p-----------~e~-r~~e~l~~~~~Gd~~~p~p 392 (392) T protein:vir:10 362 ---VLQEAGY-IP-----------KDL-PAPENTNKKTTGQSNEPVP 392 (392) T ss_pred ---HHHhcCC-Cc-----------ccc-chhcCCCCCCCCCCCCCCC Confidence 2222221 11 111 2223456777777766666 No 125 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=86.94 E-value=0.042 Score=28.13 Aligned_cols=443 Identities=11% Similarity=0.010 Sum_probs=142.5 Q ss_pred ccchhHHHHhhcchhhhhhhhhcCCc-cccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhcc Q lcl|NC_020883. 7 RGWTDKTTKNVHGDYERYRQLYEGKH-ELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSG 85 (589) Q Consensus 7 ~~~~~~~~~~~~~~~~~~r~l~~g~~-~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~ 85 (589) -||-+...+..+..- ..=++++. .-+++..-.+.. .-..+..+ |.--+..++.+..-+ .++++ T Consensus 1 Mg~~~~l~~r~~~~~---~~~~~~~~~~~~~~~~~~~~~------~~~~g~~V-----~~~~al~~~~V~~~v--~~Ia~ 64 (457) T protein:vir:13 1 MGFWSALFGRGHSPA---LDGIEARAWEPYDPSIYNLGA------VAASGETV-----TPHDALQVSAVFASV--RLLSE 64 (457) T ss_pred Cchhhhhhccccccc---ccccccccccccchHHHhhcc------cccCCcee-----chHHhhccHHHHHHH--HHHHH Confidence 232222222111100 00011111 001111000000 00000000 111122233333333 45555 Q ss_pred ccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccc---ccchhhHHHHHH-cCc Q lcl|NC_020883. 86 SIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLE---RRHWSNIVQHQV-DGG 161 (589) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~---~~~~~~l~~~~v-~Gg 161 (589) .+.+++=..-.. ..+..+.+ ... .+...+.....+ ..|+..++.+.. .|- T Consensus 65 ~iA~lp~~~~~~------~~~~~~~~------------~~~--------~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gn 118 (457) T protein:vir:13 65 TIATLPLSTYSK------RGGSRKEI------------VTP--------EWLDYPNAEPGGMGRIDILSQTVLSLLLQGN 118 (457) T ss_pred hhccCceEEEEe------cCCccccc------------ccc--------hHHHhccccCCCCCHHHHHHHHHHHHhhcCC Confidence 555544221110 00000000 000 011111110001 124444544333 343 Q ss_pred eeEEEEEecCc-eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeec Q lcl|NC_020883. 162 IVAAPVIDELG-PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK 240 (589) Q Consensus 162 ~~~~~~~~~~~-~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 240 (589) .++.+..++++ +.+....++++.+ +-....+...+....|+. .. T Consensus 119 a~~~i~~~~g~~~~l~~l~p~~v~v---------~~~~~~~~~~~~~~~y~~--------------------------~~ 163 (457) T protein:vir:13 119 AFLAVRWQGPNIVGLDVLDPTKIHV---------HMVMVDGLRRKVFEAYDI--------------------------DA 163 (457) T ss_pred eEEEEEecCCcEEEEEEEccCceEE---------EEecCCCccceeEEEEEE--------------------------ec Confidence 44444444444 3444444443332 100000000000001110 01 Q ss_pred ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHH Q lcl|NC_020883. 241 KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSA 320 (589) Q Consensus 241 ~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~s 320 (589) .|.......+.. . -|.|+++......++|.|-+.-+...+.-....-.... T Consensus 164 ~~~~~~~~~~~~-----------------~------------diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 214 (457) T protein:vir:13 164 DGNEVLLGWFTP-----------------R------------DVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGS 214 (457) T ss_pred CCceeeEEeeCc-----------------c------------ceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 111110000000 1 18888887777778999998766665554443322334 Q ss_pred HHHHHhCCCcEEechhhhhccccccccccccccccccccccccc--cccccccccccccCccceeeecccHHHHHHHHHH Q lcl|NC_020883. 321 VIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRID--HRDMEITTFDENGRSMEIHQIDISKIGDMDHVKN 398 (589) Q Consensus 321 rildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d--~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~ 398 (589) +.|...+.|..++-- .+.+..+.-...-......+.+.+ +..+ +. +.|...+-++....-.+..+..+. T Consensus 215 ~~f~ng~~p~gil~~-----~~~ls~e~~~~~~~~~~~~~~g~~nag~~~-vl---~~g~~~~~l~~~~~d~q~~e~~~~ 285 (457) T protein:vir:13 215 KFFANGAMPGAVVEV-----PGTMSEEGLARAREAWRAANSGVDNAHRVA-LL---TEGAKFSKVAMSPDEAQFLQTRQF 285 (457) T ss_pred HHHhcCCCcceEEEc-----CCCCCHHHHHHHHHHHHHHhcCccccCcce-ec---CCCceEEEccCChhHHHHHHHHHH Confidence 555555777654421 111110000000000000000000 0011 11 222222223333333344555556 Q ss_pred HHHHHHHHhcCCchhcccccCcccchhHHHHHH-HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccce Q lcl|NC_020883. 399 LIKLMLIETQTSEKAVDFYLDGGASGAQSGVAK-FYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPN 477 (589) Q Consensus 399 L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~-r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~ 477 (589) ..++|..+=+.|+.-.|...+++.. .|.++. ...+..- .+.-|...+...|.+.+ + ..... ...... T Consensus 286 ~~~~Ia~~fgVPp~~lg~~~~~~~~--~sn~eq~~~~f~~~--tl~P~~~~ie~~ln~~L----~-~~~~~---~~~~i~ 353 (457) T protein:vir:13 286 QVPEIARIFGVPPHLISDATNSTSW--GSGLAEQNIAFTMF--SLRPWLERIEAGFNRLL----F-AETAD---RFRFVK 353 (457) T ss_pred HHHHHHHHhCCCHHHcCCCCCcccc--cchHHHHHHHHHHH--HHHHHHHHHHHHHHHhh----c-Ccccc---CceeEE Confidence 7788888889999988764433221 122221 1122111 12223333333333221 1 11100 111123 Q ss_pred eeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccccCc Q lcl|NC_020883. 478 IETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDN 556 (589) Q Consensus 478 I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~ 556 (589) ..+.+.+..| .+++++..+.+.++++++.-.+-+++. |-..+-..++- .+..... +++......+ .-.+ T Consensus 354 fd~~~l~~~D--~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~--~~~~n~~-----~~~~~~~~~~-~~~~ 423 (457) T protein:vir:13 354 FNLDEIKRGA--PKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKY--RVPLNLG-----EVGEEPEPEP-APAP 423 (457) T ss_pred eechhhhccC--HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccce--eeccccc-----cccccccccc-cCCC Confidence 3444543334 456678888888999998876655542 11111110100 0110000 0000000000 0000 Q ss_pred ccCCCCCCCCCCCCCCCCcchhhhhh----ccccc Q lcl|NC_020883. 557 RDEDGNIIEEGDTEEEPSAEENEEIE----KEGEP 587 (589) Q Consensus 557 ~~~~~~p~deg~~~eep~~~~~e~~~----~~~~~ 587 (589) ...++|.++.+++.++.+..+++.+ ++-++ T Consensus 424 -~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 424 -PAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred -CCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 0011122222222222222222221 11112 No 126 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=86.03 E-value=0.048 Score=27.79 Aligned_cols=466 Identities=12% Similarity=0.051 Sum_probs=170.1 Q ss_pred Cccce-eccc--hhHHHHhh------cchhhhhh---------------hhhcCCccccCHHHHHHHhhccccceeccCc Q lcl|NC_020883. 1 MIDWT-VRGW--TDKTTKNV------HGDYERYR---------------QLYEGKHELLFPRAKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~~~~-~~~~--~~~~~~~~------~~~~~~~r---------------~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~ 56 (589) .+|-. |.-= .-...++| .+++++.- -+|+|--===||.-..|-+-. T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~---------- 123 (698) T protein:vir:10 54 ALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLP---------- 123 (698) T ss_pred ccccccccCCCccccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhcc---------- Confidence 00000 0000 00000000 00111100 012221111122222222211 Q ss_pred ceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHH Q lcl|NC_020883. 57 QTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEII 136 (589) Q Consensus 57 ~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i 136 (589) .++-.+.+.|..+.|.-+.++.+.. +.+....-.+.|.... ..+-- |-+.| T Consensus 124 --------------eyr~~~~~ia~e~~R~w~~~~~~~~------e~~~~~g~~~~~~~~~----~~d~d-----qi~~L 174 (698) T protein:vir:10 124 --------------EYRAMHEVLADECIRTWGEAIGGTK------EKADTSGLAAGGNAAS----TSDGD-----QLKQI 174 (698) T ss_pred --------------chhhHHHHHHHHhhcccceeccccc------hhhhhhcccccccccc----cccHH-----HHHHH Confidence 1244566778888888777765322 0000000011100000 00000 11233 Q ss_pred HHHHhhccccccchhhHHHHHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeecc Q lcl|NC_020883. 137 EQITKNSKLERRHWSNIVQHQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEK 216 (589) Q Consensus 137 ~~v~kn~~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~ 216 (589) ++-.+..+.+.++-..+.=.-.=||-++-..++++.-.++ .+ ++|- .+....+....++.+.+.++.- T Consensus 175 ~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~--~P--L~~~--------~~~I~kGslKGL~ViDp~~vtP 242 (698) T protein:vir:10 175 NDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMD--TP--LVPR--------PYTVPKGSFQGLRVVEPYWVTP 242 (698) T ss_pred HHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccc--cc--cccc--------cccccCccceeeeeeccccccc Confidence 3333443333333333333334455555555554331111 11 0110 0001112222344444444333 Q ss_pred ccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCc Q lcl|NC_020883. 217 DGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNP 296 (589) Q Consensus 217 ~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~ 296 (589) ..+...+-+ +.-+-...+.. ..+.++ |..---++.|++-|.. .++..++ T Consensus 243 ~~~n~~dP~----------------spdfgkP~~y~------V~G~~I----H~SRL~~~vg~pvpd~-----LKp~y~f 291 (698) T protein:vir:10 243 NNYNSINPV----------------ADDFYKPSTWW------MIGSEV----HATRLHTIVSRPVGDM-----LKPTYSF 291 (698) T ss_pred chhhhccch----------------hhccCCCceEE------Eeccee----cceeEEEecCCCchhh-----hcchhcc Confidence 222111110 00011111111 111111 1111111222221111 2455667 Q ss_pred ccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccc Q lcl|NC_020883. 297 YGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDEN 376 (589) Q Consensus 297 lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~ 376 (589) +|.|-...+.+.+++-++|-...+.++.+..-..+ -..|..-+++.. +-+......+.-..-+.+-+.++ | T Consensus 292 ~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l--~~dla~aL~~g~---~~~l~~R~eli~~~Rsn~G~~ll--D-- 362 (698) T protein:vir:10 292 AGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI--LMDLAQALTPGA---NVDLSMRAELINRYRDNRNILFL--D-- 362 (698) T ss_pred CCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHH--HHHHHHhcCChh---hHHHHHHHHHHHHhcCccceEEE--e-- Confidence 89999988888888888775555555422111111 011111111110 00011111110000011111122 1 Q ss_pred cCccceeeecccHHHHHHHHHHHHHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 377 GRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKE 455 (589) Q Consensus 377 g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~ 455 (589) +..-+|.|+++.+.+--..+...+.+|-..+++|.- .||. +-.+-..||.+..+.....+.-.+ ...+..+|++ T Consensus 363 k~~Eefeq~st~lSGLddVi~qf~q~VAgaa~IPltkLfGq---SPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~r 437 (698) T protein:vir:10 363 KATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGI---TPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMND 437 (698) T ss_pred cCCcceEEEecCcCCHHHHHHHHHHHHHhhhcCchhhhhcc---CCcccCccchhhHHHHHHHHHHHH--HHHHHHHHHH Confidence 123567889988888888888888888777777743 3443 111112356666555555543333 2335577888 Q ss_pred HHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhc Q lcl|NC_020883. 456 LYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQ 535 (589) Q Consensus 456 li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~ 535 (589) ++.+++.-. . | ...+ ...+.|++-..-+ +...|++.+....+-.+=..++ -.+..+|. +|+.++. T Consensus 438 l~~ii~rS~-~-G-~idp-~i~~~fnPL~qmt--d~EkAeI~~k~A~~d~~~~~~g------vI~~~evr---~rL~~d~ 502 (698) T protein:vir:10 438 VIVMIQLSL-F-G-AVDP-SIKWQWNALRELD--DLEVAEARYKQAQSDVLYVQEQ------VIRPDQVA---ARLNTEP 502 (698) T ss_pred HHHHHHHHh-c-C-CCCC-cceEEeCCCCCcC--HHHHHHHHhhhhHHHHHHHHhc------CCCHHHHH---HHHhccC Confidence 776653322 2 1 2222 3556887765544 4455777666333222222222 24555433 4554443 Q ss_pred cccccccccccccccccccCcccCCCCCCCCCC-CCCCC---CcchhhhhhcccccCC Q lcl|NC_020883. 536 AGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGD-TEEEP---SAEENEEIEKEGEPIA 589 (589) Q Consensus 536 a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~-~~eep---~~~~~e~~~~~~~~~~ 589 (589) .. ++.+++|-.+ ++.+.+|.+ +++-+ .+.+..+...+++|-. T Consensus 503 ~s-----------~Y~~~~d~~d-~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (698) T protein:vir:10 503 DG-----------PYAGKLDAND-DPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGG 548 (698) T ss_pred CC-----------ccccccCCcc-cCCCCCCCcchHHHhhhcCCcCCCCccccccccc Confidence 21 1222233222 111222222 11111 2233333333333222 No 127 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=85.62 E-value=0.051 Score=27.65 Aligned_cols=463 Identities=12% Similarity=0.070 Sum_probs=151.1 Q ss_pred cchhhhhhhhhc-------------CCccccCHHHHHHHhhccccceeccCcceee------------ecCcceEEEEcc Q lcl|NC_020883. 18 HGDYERYRQLYE-------------GKHELLFPRAKRLIEEGDAVGRFLDSSQTAR------------ETQTPYVIFNLP 72 (589) Q Consensus 18 ~~~~~~~r~l~~-------------g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~------------~~~~~y~~~n~~ 72 (589) -|-.+|.||-+- +-...++++..+.++++-.-+..-..++.-. ..+++|=+-++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 333344444331 1224456666666665522211111110000 011111110111 Q ss_pred hhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccc----- Q lcl|NC_020883. 73 KVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER----- 147 (589) Q Consensus 73 ~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~----- 147 (589) +...+- ..+.+.+-.+...++. +.. +...+...+ |-+-.......+.......+-+.+..++.+.++++ T Consensus 81 ~~~~~n--piv~~~I~~~a~~ia~--~~~-~~~~~~~~~-~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~ 154 (547) T protein:vir:63 81 KKFGGN--IILNAIINTRSNQVSM--YCK-PARHSEKGV-GFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRD 154 (547) T ss_pred HHhhcC--HHHHHHHHHHHHHHhh--hhh-hhhhhccCC-CceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccc Confidence 111111 1111112122211110 000 000000000 00000000000111111122234555666665543 Q ss_pred ---cchhhHHH-HHHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeecccccee Q lcl|NC_020883. 148 ---RHWSNIVQ-HQVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRT 221 (589) Q Consensus 148 ---~~~~~l~~-~~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~ 221 (589) .|+..++. .+.-|..++-+..+.++ ..+....+++..++.+.-+ ...+..+.+++ T Consensus 155 s~~~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g--------~~~~~~~~y~~----------- 215 (547) T protein:vir:63 155 SFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADG--------KIPDNGNRFVQ----------- 215 (547) T ss_pred hHHHHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCcc--------ccccCceEEEE----------- Confidence 23444443 34456555566666544 4455555554443111000 00000000000 Q ss_pred ehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCC---CCCccc Q lcl|NC_020883. 222 TNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNET---FMNPYG 298 (589) Q Consensus 222 ~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~---~~~~lG 298 (589) ..+++.... ++ ..+ |.|+..++. ...++| T Consensus 216 ----------------~~~~~~~~~-------------------~~-~~e------------iih~r~n~~~~~~~~~~G 247 (547) T protein:vir:63 216 ----------------VIDQKIVAT-------------------FN-ARE------------MAFAVRNPRSDIYATGYG 247 (547) T ss_pred ----------------EcCCcEEEE-------------------ec-ccc------------EEEecccCCCCccccccc Confidence 001111000 00 001 677765443 335689 Q ss_pred CcchhhhhHHHHHHHHHHhHHHHHHHHhCCCc--EEechhh-hhcccccccccccccc-ccccccccccccccccccccc Q lcl|NC_020883. 299 ISALDNLESKQDEINWTITRSAVIYEQNGKPR--ISITKEM-MDTLLNIAYERDGHSA-KEASMMTPRIDHRDMEITTFD 374 (589) Q Consensus 299 ~SD~~~ie~l~DeLd~t~S~~srildk~gkpR--I~VP~~~-L~t~~g~~~d~dge~~-~~~~~~~~~~d~~dlev~~~d 374 (589) .|-+..+...+.....+-....+.|...+.|+ |.+|... |+.-+ .++-.+.+ ...++. ...+... +. . T Consensus 248 ~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~---~~~lk~~~~~~~~G~--~nagk~~-vl--~ 319 (547) T protein:vir:63 248 YPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHA---LEIFKREWKNSLSGI--NGSWQIP-VV--S 319 (547) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHH---HHHHHHHHHHHhcCc--ccccccc-cc--c Confidence 99987666666554444333345554445677 4444321 22110 00000000 000000 0001000 11 1 Q ss_pred cccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccch------hHHHHHH-HHHhhhHHHHHHHHHH Q lcl|NC_020883. 375 ENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASG------AQSGVAK-FYDLLTTILKSRRLQK 447 (589) Q Consensus 375 e~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~------A~Sg~A~-r~~~~~~~~Kv~~~R~ 447 (589) ++|....-++....-.+..+..+...+.|..+=+.|+.-.|+....+..+ +.|.+.. ...+.+ ..+.-|.. T Consensus 320 ~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~--~tL~P~~~ 397 (547) T protein:vir:63 320 AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKN--KGLQPLLG 397 (547) T ss_pred CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHH--HHHHHHHH Confidence 12222222233334444555566677888888899999998643321111 1111111 111111 11222222 Q ss_pred HHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC--CC--CCHHH Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN--PD--ASEDW 523 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh--pd--w~dE~ 523 (589) .+...|.+.+ + ...+ ....+.|......+ +...+.+.+. ..+++++.-.+-++++ |. +-|. T Consensus 398 ~ie~~ln~~L-----~-----~~~~-~~~~~~f~~~~~~~--~~~~~~~~~~-~~~g~lT~NE~R~~~gl~P~~egGD~- 462 (547) T protein:vir:63 398 FIEDFINKHI-----V-----AEFG-DKYTFQFVGGDIKS--ELESVKILAE-KAKVAMTVNEVRKELNLPGDVIGGDI- 462 (547) T ss_pred HHHHHHHhhc-----c-----cccC-CceEEEeecccccc--HHHHHHHHHH-HhCCCcCHHHHHHHhCCCCCCCCCce- Confidence 2223332211 1 1111 12345676544444 4444555443 3456777766666553 31 1111 Q ss_pred HHHHHHHHHhhccccccccccccccccccc--------cCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 524 IQEEIARIEEEQAGSDTSSLMGINQTFEQM--------NDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 524 v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~--------~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) +..... ..+ .|.. ....+. ++.-.+.+...+.++++++|++.+......+-..+- T Consensus 463 -------~~~~~~-~~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 525 (547) T protein:vir:63 463 -------PLNGVI-VQR--IGQL-MQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRK 525 (547) T ss_pred -------eecccc-ccc--cccc-ccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCCcCcccccc Confidence 000000 000 0000 000000 000011111123334444444433322111111111 No 128 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=84.66 E-value=0.059 Score=27.33 Aligned_cols=440 Identities=12% Similarity=0.070 Sum_probs=132.8 Q ss_pred cceeccCcceeeecCcceEEEEcchhhhccchhhhccccccccccc-cCCcccchhhccchhhcccccccchhhh-hhhh Q lcl|NC_020883. 49 VGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSI-TTGEIDPDIEEDTDEMIEGPQDEEEAGK-NENN 126 (589) Q Consensus 49 ~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~ 126 (589) ...|+..| -+.|-++ .++++.+..++=.+ +.... .........++-. ..+.-. .-|. T Consensus 1 l~~l~~~n------~~v~~ci-----------~~ia~~ia~~p~~i~~~~~~--~~~~~~~~~~~~~--~~~l~~~~pn~ 59 (467) T protein:vir:31 1 MAELLEHN------ETHAKCV-----------HAKSRYVAGFGINIIPHPEA--EDPDRDGEQYERV--WDFWFGDDSNW 59 (467) T ss_pred ChhhhhcC------HHHHHHH-----------HHHHHhhhcCCeEEEEccCc--ccccchhhhhhhH--HHHhhccCCCc Confidence 11111111 1111111 12222222211100 00000 0000000000000 000000 0000 Q ss_pred hhhhhhhhHHHHHHhhccccccchhhHHH-HHHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCc Q lcl|NC_020883. 127 TVIDLQNEIIEQITKNSKLERRHWSNIVQ-HQVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQY 203 (589) Q Consensus 127 ~~~~~~~e~i~~v~kn~~~~~~~~~~l~~-~~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~ 203 (589) .+... .+.++... .++..++. ....|-+.+.+..+..+ +.+....++..-+..+.+.+ T Consensus 60 ~~~~~--~~~~~t~~------~~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~----------- 120 (467) T protein:vir:31 60 QVGPM--ESERATAT------NVLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGF----------- 120 (467) T ss_pred cccch--hhHhhHHH------HHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeeccee----------- Confidence 00000 00011111 12333332 33446565556656433 45555555555443332221 Q ss_pred cceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcce Q lcl|NC_020883. 204 GQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPF 283 (589) Q Consensus 204 ~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~pl 283 (589) +..+... ..+.. .| +..+.....|+......+..+.. .+... .++.-- T Consensus 121 ---~~~~~~~---~~~~~---~~------~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-------------~~~~~d 168 (467) T protein:vir:31 121 ---VQLLEEK---EKYFG---VA------GDRYQTNGNGDLDPVFVDADDGS----TGTSV-------------SNPANE 168 (467) T ss_pred ---EeecCCc---eeeEE---ec------cccceeecccceeeeeeeecccc----cccee-------------Eecccc Confidence 1111000 00000 00 00000001111111000100000 00000 012223 Q ss_pred EEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEE--echhhhhcccc----ccccccccc----- Q lcl|NC_020883. 284 ISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRIS--ITKEMMDTLLN----IAYERDGHS----- 352 (589) Q Consensus 284 vvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~--VP~~~L~t~~g----~~~d~dge~----- 352 (589) |.|+.+......++|.|.+..+...+..-..+-..-.+.|...+.|..+ +|..+|+.-+. ..|...... T Consensus 169 iih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~ 248 (467) T protein:vir:31 169 LIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTA 248 (467) T ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhh Confidence 8899877777888999988765544432222211223334344566643 35444422110 000000000 Q ss_pred cccccccccccccccccccccccccC--ccceeeec--ccH-HHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHH Q lcl|NC_020883. 353 AKEASMMTPRIDHRDMEITTFDENGR--SMEIHQID--ISK-IGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQS 427 (589) Q Consensus 353 ~~~~~~~~~~~d~~dlev~~~de~g~--~~~~iq~D--irv-eeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~S 427 (589) .....+.. ......+...+.... .+.+.... ... .+..+..+...++|..+=+.|+.-.|...+++.+ .+ T Consensus 249 ~~~~~g~~---n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~--s~ 323 (467) T protein:vir:31 249 FIETEKIV---QNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFS--TD 323 (467) T ss_pred hhhhcccc---cccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcc--cC Confidence 00000000 000001111111100 11111111 111 2334555566777888888999888764332221 11 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhh Q lcl|NC_020883. 428 GVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQS 507 (589) Q Consensus 428 g~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S 507 (589) ..+....+..- .+.-+...+..+|.+.+ +.. ........|.|+-.-....+.++.++..+.+.++++++ T Consensus 324 ~e~~~~~f~~~--~l~P~~~~ie~~ln~~l-----~~~----~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T 392 (467) T protein:vir:31 324 AEEQRKEFAEE--TIQPKQHDFGELLYELV-----HKQ----GLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLT 392 (467) T ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHHhh-----cch----hhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcC Confidence 11222222111 12223333333333221 111 11111223444322223344666788888889999999 Q ss_pred HHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCC----CCcchhhhhh Q lcl|NC_020883. 508 LETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEE----PSAEENEEIE 582 (589) Q Consensus 508 ~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ee----p~~~~~e~~~ 582 (589) .-.+.+++. |-..++. + .. .....+..-|+..+. ..-++.+.+..|++.++. +++-|.|++. T Consensus 393 ~NE~R~~~Gl~pi~d~~----~---~~-~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (467) T protein:vir:31 393 VNELRDEFGFEPFPEEH----V---YG-GETLVAEVTGGSGPG-----GGIGDQIEQLVEDRADEIIDSYQADLETEQLI 459 (467) T ss_pred HHHHHHHhCCCCCCccc----c---cC-CcccccccccccCCC-----CcccCcCCCCCCCcccchHhhhhhccccchhh Confidence 988887763 2122211 0 00 000000011111000 000011111111111111 1111222221 Q ss_pred cccccCC Q lcl|NC_020883. 583 KEGEPIA 589 (589) Q Consensus 583 ~~~~~~~ 589 (589) ..+..-- T Consensus 460 ~~~~~~~ 466 (467) T protein:vir:31 460 EIGANAD 466 (467) T ss_pred hhccccC Confidence 1111111 No 129 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=84.61 E-value=0.059 Score=27.32 Aligned_cols=453 Identities=12% Similarity=0.059 Sum_probs=168.2 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) +.||.- .+-+.+ -+|+|--===||.-..|-+-.+ ++-.+.+.| T Consensus 90 ~~~~~~---------~~~~~l----~~~~~~~F~Gy~~la~laQ~~e------------------------yr~~~~~ia 132 (694) T protein:vir:10 90 ALDFNG---------TSMDAL----SFVTSSGFPGFPTLVLLAQLPE------------------------YRAMHEVLA 132 (694) T ss_pred hhccCc---------ccccch----hhhhccCcchHHHHHHHhhccc------------------------hhhHHHHHH Confidence 333300 000111 0122221111333222222111 244566778 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcC Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDG 160 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~G 160 (589) ..+.|.-+.++.+.. +.+....-.+.|.... ..+-- |-+.|++-.+..+++.++-..+.=.-.=| T Consensus 133 ~e~~R~w~~~~~~~~------e~~~~~g~~~~~~~~~----~~d~d-----qi~~L~~e~erl~V~~~l~eaik~aRlfG 197 (694) T protein:vir:10 133 DECIRTWGEAIGGTK------EKADTSGLAAGGNAAS----TSDGD-----QLKQINDEIERLRIRDAVRTTVIHDQAFG 197 (694) T ss_pred HHhhcccceeccccc------hhhhhhcccccccccc----cccHH-----HHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 888888777765322 0000000011100000 00000 11233333444433333333443333445 Q ss_pred ceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeec Q lcl|NC_020883. 161 GIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK 240 (589) Q Consensus 161 g~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 240 (589) |-++-..++++.-.++ .+ ++|- .+....+....++.+.+.+..-..+...+-+ T Consensus 198 Ga~~~i~I~gdd~~l~--~P--L~~~--------~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~--------------- 250 (694) T protein:vir:10 198 RAHPYFKIKGDDQIMD--TP--LVPR--------PYTVPKGSFQGLRVVEPYWVTPNNYNSINPV--------------- 250 (694) T ss_pred ceEEEEEeecCccccc--cc--cccc--------cccccCcceeeeEeecccccccchhhhccch--------------- Confidence 5555555554331111 11 0110 0001112222334444444333222111110 Q ss_pred ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHH Q lcl|NC_020883. 241 KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSA 320 (589) Q Consensus 241 ~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~s 320 (589) +.-+-..++.. .++.++ |..---++.|++-|.. .++..+++|.|-...+.+.+++-++|-...+ T Consensus 251 -spdfgkP~~y~------V~G~~I----H~SRL~~f~g~plPd~-----LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~ 314 (694) T protein:vir:10 251 -ADDFYKPSTWW------MIGTEV----HATRLHTIVSRPVGDM-----LKPTYSFAGISMTQLAMPYIDNWLRTRQSVS 314 (694) T ss_pred -hhccCCCceEE------EeceEE----eeeeEEEecCCCchhh-----hhcccccCcccHHHHHHHHHHHHHHHHhHHH Confidence 00011111100 111111 1111112223322211 2445567899999888888888887755455 Q ss_pred HHHHHhCCCcEEe-chhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHH Q lcl|NC_020883. 321 VIYEQNGKPRISI-TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNL 399 (589) Q Consensus 321 rildk~gkpRI~V-P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L 399 (589) .++-. .++.+ -..|...+.+.. +-+......+....-+.+-+.++ | +..-+|.|+++.+.+--..+... T Consensus 315 ~Li~~---~~v~~lk~dla~~L~~g~---~~~l~~R~eli~~~Rsn~G~~ll--D--k~~Eefeq~stslSGLddVi~qf 384 (694) T protein:vir:10 315 DIVKQ---FSVSGILMDLAQALMPGA---NVDLSMRAELINRYRDNRNILFL--D--KATEEFFQFNTPLSGLDALQAQA 384 (694) T ss_pred HHHHh---hhhHHHHHHHHHhhcChh---HHHHHHHHHHHHHhcCccceEEE--e--cCCcceEEEecccCCHHHHHHHH Confidence 44421 11111 011111111110 00011111110000011111122 1 12256788999888888888888 Q ss_pred HHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCccccee Q lcl|NC_020883. 400 IKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNI 478 (589) Q Consensus 400 ~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I 478 (589) +.+|-..+++|.- .||. . -.+-..||.+..+.....+.-.+ ...+..+|++++.+++.-. .+ ...+ ...+ T Consensus 385 ~q~VAgaa~IPltkLfGq-S--PkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~rS~-~G--~idp-~i~~ 455 (694) T protein:vir:10 385 QEQMSAVSHIPLIKLLGI-T--PTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQLSL-FG--AVDP-SIKW 455 (694) T ss_pred HHHHHhhhcCchhhhhcc-C--cccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHh-cC--CCCC-cceE Confidence 8888888887743 3443 1 11112356655555555543333 2335577888776653322 11 1222 3456 Q ss_pred eeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCccc Q lcl|NC_020883. 479 ETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRD 558 (589) Q Consensus 479 ~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~ 558 (589) .|++-..-+ ++..|++.+....+-.+=..+++ .+..+|. +|+..+-.. +. .+.+|-.+ T Consensus 456 ~fnPL~qmt--d~EkAeI~~k~A~~d~~~~~~gv------I~~~evr---~rL~~d~~s----~Y-------~~~~D~~d 513 (694) T protein:vir:10 456 QWNALRELD--DLEVAESRYKQAQSDVLYVQEQV------IRPDQVA---ARLNTEPDG----PY-------AGKLDAND 513 (694) T ss_pred EeCCCCCcC--HHHHHHHHhhhhHHHHHHHHhcC------CCHHHHH---HHHhcCCCc----cc-------cccccccc Confidence 787765444 44557777663332222222222 4444433 455554322 11 11122111 Q ss_pred CCCCCCCCCCCCCCC---CcchhhhhhcccccCC Q lcl|NC_020883. 559 EDGNIIEEGDTEEEP---SAEENEEIEKEGEPIA 589 (589) Q Consensus 559 ~~~~p~deg~~~eep---~~~~~e~~~~~~~~~~ 589 (589) +-+.|.|.-.+++.+ ...|-++...++.+=+ T Consensus 514 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 547 (694) T protein:vir:10 514 DPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARA 547 (694) T ss_pred CCCcCccchhhhhHhhhcCcccccccCCCCcccc Confidence 101111110000000 1111122222333222 No 130 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=80.26 E-value=0.097 Score=26.15 Aligned_cols=488 Identities=11% Similarity=0.010 Sum_probs=156.3 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhh-ccccceeccCcceeeecCcceEEEEcchhhhccc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEE-GDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~-~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~p 79 (589) |-- --|.+-+..+. ||.+|-.-. ...-.+++++-.= .+..... ++....+...++| =-=-.+.+-.+- T Consensus 1 ~~~--~~~~~~~~~~~------r~~~l~~~R-~~~e~~w~e~~~y~lP~~~~~-~~~~~~~~~~~~~-dst~~~a~~~La 69 (522) T protein:vir:94 1 MAE--REGFAAEGAKA------VYDRLKNGR-QPYETRAQNCAAVTIPSLFPK-ESDNSSTEYTTPW-QAVGARCLNNLA 69 (522) T ss_pred Ccc--cchhhHHHHHH------HHHHHHHHh-hHHHHHHHHHHHHhcccccCC-CCCcccccccccc-cccHHHHHHHHH Confidence 433 22444444443 333322211 0111222222110 0111111 1111111122221 000112233333 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) |-|++...+.- ..|- -+........+...-......+.|..++ .+.+...+-.|+|+......+.++.+- T Consensus 70 s~l~~~ltP~~-~WFr-----l~~~d~~~~~~~~~~~~~~~v~~~L~~v----e~~~~~~~~~snf~~~~~~~~~~L~~~ 139 (522) T protein:vir:94 70 AKLMLALFPQS-PWMR-----LTVSEYEAKTLSQDSEAAARVDEGLAMV----ERVLMAYMETNSFRVPLFEALKQLIVS 139 (522) T ss_pred HHHHhhcCCCC-cccc-----cccchhhhhccCcccchhHHHHHHHHHH----HHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 34443332211 1111 1111100000000000111123344444 345667777888999999999988776 Q ss_pred CceeEEEEEecC---c-eeEEEecCceecccccCcc-eeEEEe-ecCCCcc------ceEEEEEeeeccccceeehhhhc Q lcl|NC_020883. 160 GGIVAAPVIDEL---G-PRIVFKARDVYFPHDDEKG-ADLAYY-IDHGQYG------QFLHIYRERVEKDGLRTTNMLYP 227 (589) Q Consensus 160 Gg~~~~~~~~~~---~-~~i~f~~~d~~~P~~d~~~-~div~~-~e~~~~~------~~l~~~~~~~~~~~~~~~~~~y~ 227 (589) |=.+ -|++.+ + .++.+++-..|+=..|+.| +|=+|. .+.+-.. +.+.... ....+...+-+.+|+ T Consensus 140 G~a~--l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~-~~p~~~v~v~~~v~~ 216 (522) T protein:vir:94 140 GNCL--LYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADD-YEPDTELEVYTHIYR 216 (522) T ss_pred CcEe--EeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhccc-CCccceEEEEEEEEe Confidence 6322 344432 2 3455555555444333333 332321 1111000 0000000 000000000011111 Q ss_pred cccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCC-CcceEEEecCCCCCCCcccCcchhhhh Q lcl|NC_020883. 228 VVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGR-NRPFISYWANNETFMNPYGISALDNLE 306 (589) Q Consensus 228 ~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv-~~plvvyvPN~~~~~~~lG~SD~~~ie 306 (589) .++.+....+..+.. ++. .+.. .|- ..|+++. -=++.....||||=..+.. T Consensus 217 -------------~~~~~~~~~~~~g~~----------~~~-~~~~---~~~~e~P~~~~-Rw~~~~ge~YGrgp~~~~l 268 (522) T protein:vir:94 217 -------------QDDEYLRYEEVEGIE----------VTG-TDGS---YPLTACPYIPV-RMVRLDGEDYGRSYCEEYL 268 (522) T ss_pred -------------eCCceeEEeeccCce----------ecc-cCCC---CccccCCceee-eeeecCCCccccchHHHHH Confidence 011111111100000 000 0000 011 2343333 2235566679999889999 Q ss_pred HHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec Q lcl|NC_020883. 307 SKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID 386 (589) Q Consensus 307 ~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D 386 (589) +-+..||..--....-.++..+|.+.||......... ..+...+.+.+.+..+ ++.+++. T Consensus 269 ~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~--------~~~~~~g~~v~g~~~~------------v~~~~~~ 328 (522) T protein:vir:94 269 GDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRR--------LNKAATGEFVAGRVED------------INFLQLT 328 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCceeecccccccchh--------eeccCCceeecCCccc------------ceeeecc Confidence 9999999754344444557899999997654432211 1111111111111111 1112221 Q ss_pred --ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHh-hhHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 387 --ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDL-LTTIL-KSRRLQKEYIDFLKELYESCLW 462 (589) Q Consensus 387 --irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~-~~~~~-Kv~~~R~~~~~aLk~li~~~l~ 462 (589) .++....+.++.+...|-..- -...++.- ++...|++++..+... ...+. -..+. -.+.|..+++.+.. T Consensus 329 ~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~--~~~r~TAtEV~~r~~E~~~~LG~v~~rl---~~E~l~Pli~r~~~ 401 (522) T protein:vir:94 329 KGQDFTIAKSVADAIEQRLGWAF--LLNSAVQR--NAERVTAEEIRYVAGELEATLGGVYSVQ---SQELQLPIVRVLMN 401 (522) T ss_pred cccchhHHHHHHHHHHHHHHHHH--hhhhhccC--CCccccHHHHHHHHHHHHHHHhHHHHHH---HHHHHHHHHHHHHH Confidence 133334444444444332111 11112221 2222344666655422 22221 22222 23445555544443 Q ss_pred HHhhcCc--ccCcccceeeeCCcCCCCCCHHHHHHHHHHHh---ccc---h---hhHHHHHHHhC--CCCCHHHH---HH Q lcl|NC_020883. 463 LLNDQDS--SIRIEEPNIETQDMILKPRAELVAENMAAYAA---SKQ---G---QSLETTVRRMN--PDASEDWI---QE 526 (589) Q Consensus 463 L~~~~~~--~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~---~a~---~---~S~etaVr~Lh--pdw~dE~v---~e 526 (589) +....+. .+..+..+|++--+|..-.+-.....+.+.+. +-+ + +....+++.+- -..+...+ ++ T Consensus 402 il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~e 481 (522) T protein:vir:94 402 QLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQD 481 (522) T ss_pred HHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHH Confidence 3322221 11112234555333221111101111111111 000 0 11112222220 00111111 22 Q ss_pred HHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCC Q lcl|NC_020883. 527 EIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT 569 (589) Q Consensus 527 Ev~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~ 569 (589) |++.+.+++.+......++. ...+-+.--.+.+.+.|.++- T Consensus 482 e~~~~~~q~~~~~~~~~~~~--~~~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 482 EKIQRMAEQSSQQAVVQGAS--AAGANMGAAVGQGAGEDMAQA 522 (522) T ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHHhhhhhhcccchhhhcC Confidence 33333222221111111110 111111111122222232222 No 131 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=79.06 E-value=0.11 Score=25.88 Aligned_cols=383 Identities=11% Similarity=0.062 Sum_probs=125.5 Q ss_pred cchhhhccccccccccccCCcccchh-------hccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH---------- Q lcl|NC_020883. 78 IPATMVSGSIGQIKSSITTGEIDPDI-------EEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT---------- 140 (589) Q Consensus 78 ~pa~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~---------- 140 (589) .|-.=.-+.+|.+++.|...++.... .+.+...+.+... ..+.. -+...++|+-.|...+ T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~g~~-v~~~~a~~~~aV~~~v~~Ia~~ia~l 77 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIIS--DTGAA-VNADAIMRLDAVAACVKLVSQAVAAM 77 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhccccc--ccCcc-cchHhhhcchHHHHHHHHHHHhhccC Confidence 22222223355555555432211000 0000000000000 00000 0111122222221111 Q ss_pred -------------------------hhccccc---cchhhHHH-HHHcCceeEEEEEecCc-eeEEEecCceecccccCc Q lcl|NC_020883. 141 -------------------------KNSKLER---RHWSNIVQ-HQVDGGIVAAPVIDELG-PRIVFKARDVYFPHDDEK 190 (589) Q Consensus 141 -------------------------kn~~~~~---~~~~~l~~-~~v~Gg~~~~~~~~~~~-~~i~f~~~d~~~P~~d~~ 190 (589) ...|.++ .|+..++. .+..|-+++.+..++++ ..+.+..++++-+..+ T Consensus 78 p~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L~~l~p~~v~v~~~-- 155 (432) T protein:vir:97 78 PLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLANDRLTITTD-- 155 (432) T ss_pred ceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEcCcceEEEEc-- Confidence 1111111 23333332 22333333333333322 1222222222221000 Q ss_pred ceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCcccc Q lcl|NC_020883. 191 GADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDR 270 (589) Q Consensus 191 ~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~ 270 (589) ..+ . ..|+ .....|..+. + T Consensus 156 -------------------------~~g-~---~~y~---------~~~~~g~~~~-------------------~---- 174 (432) T protein:vir:97 156 -------------------------TKG-N---TAYR---------YRRTDGQMID-------------------I---- 174 (432) T ss_pred -------------------------CCC-c---EEEE---------EEecCceEEE-------------------E---- Confidence 000 0 0111 0001122111 0 Q ss_pred ccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccc Q lcl|NC_020883. 271 PLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDG 350 (589) Q Consensus 271 e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dg 350 (589) +.--|.|+.+... +.++|.|-+.-+...+.--...-..-.+.|...+.|..++ ...+.+..+.-. T Consensus 175 ---------~~~~iih~r~~~~-dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil-----~~~~~l~~e~~~ 239 (432) T protein:vir:97 175 ---------PRQQIWKIMGYSL-DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYY-----QIDRFLTDDQYD 239 (432) T ss_pred ---------ccccEEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeE-----ecCCCCCHHHHH Confidence 1111788887644 4578999776554444322222112234454445554433 221111111111 Q ss_pred cccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHH Q lcl|NC_020883. 351 HSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVA 430 (589) Q Consensus 351 e~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A 430 (589) .+.....+.. ..+.. -+. ++|...+-++....-.+.++..+...++|..+=+.|+.-+|....+..+ ..|++. T Consensus 240 ~~~~~~~~~~--nag~~-~vl---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~-~~s~~e 312 (432) T protein:vir:97 240 SFSKKVSGSV--EAGRA-PLL---EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTS-WGSGIE 312 (432) T ss_pred HHHHHHhhhh--cCCCc-eec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccc-cchhHH Confidence 1111110000 00111 111 2333332233333444555666777888888889999999864432221 112222 Q ss_pred -HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHH Q lcl|NC_020883. 431 -KFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLE 509 (589) Q Consensus 431 -~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~e 509 (589) .+..+.+- .+.-|-..++..|.+-+ +.... . ..-.++|+..-....+.+++++..+.+.+++++|.- T Consensus 313 ~~~~~f~~~--tl~P~~~~ie~~ln~kL----l~~~e-~-----~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~N 380 (432) T protein:vir:97 313 SQQLGFLTM--TLSPWLRRIEQSIALNL----LTPAE-R-----RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRD 380 (432) T ss_pred HHHHHHHHH--HHHHHHHHHHHHHhhhc----cCccc-c-----CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHH Confidence 22223211 12223323333333211 10100 0 011244543222233455678888888888888877 Q ss_pred HHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCC-CCCCCCCCCCcchhhhhhc Q lcl|NC_020883. 510 TTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNII-EEGDTEEEPSAEENEEIEK 583 (589) Q Consensus 510 taVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~-deg~~~eep~~~~~e~~~~ 583 (589) .+-++++ +.. |..+ .+-..+...--++ ++-.+.+.|. .+|.++ +++.+..+ T Consensus 381 E~R~~~g--lpp---------~~g~---~~~~~~~~~~~pl----~~~~~~~~~~~~~~~~~-----~~~~~~~~ 432 (432) T protein:vir:97 381 EAREIEG--LPK---------LGGN---AAVLTVQSAMVPL----DSIGLQASPEPASGLGN-----QQQDKVSK 432 (432) T ss_pred HHHHHhC--CCC---------CCCC---cceEeecccccch----hhhcccCCCCCCCCCCC-----cccccccC Confidence 6655442 111 0000 0000000000011 1111111111 111111 11111111 No 132 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=78.83 E-value=0.11 Score=25.83 Aligned_cols=453 Identities=12% Similarity=0.055 Sum_probs=167.7 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) -+||.- .+-+.+ -+|+|--===||.-..|-+-. .++-.+.+.| T Consensus 91 ~~~~~~---------~~~~~l----~~~~~~~F~Gy~~la~laQ~~------------------------eyr~~~~~ia 133 (695) T protein:vir:78 91 ALDFNG---------TSMDAL----SFVTSSGFPGFPTLVLLAQLP------------------------EYRAMHEVLA 133 (695) T ss_pred hhcccc---------cccccc----hhhhccCcchHHHHHHHhhcc------------------------chhhHHHHHH Confidence 222200 000111 012221111122222222211 1244566778 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcC Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDG 160 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~G 160 (589) ..+.|.-+.++.+.. +.+....-.+.|.... ..+-- |-+.|++-.+..+++.++-..+.=.-.=| T Consensus 134 ~e~~R~w~~~~~~~~------e~~~~~g~~~~~~~~~----~~d~d-----qi~~L~~e~erL~V~~~l~eaik~aRlfG 198 (695) T protein:vir:78 134 DECIRTWGEAIGGTK------EKADTSGLAAGGNAAS----TSDGD-----QLKQINDEIERLRIRDAVRTTVIHDQAFG 198 (695) T ss_pred HHhhcccceeccccc------hhhhhhcccccccccc----cccHH-----HHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 888888777765322 0000000011100000 00000 11233333444433333333443333445 Q ss_pred ceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeec Q lcl|NC_020883. 161 GIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK 240 (589) Q Consensus 161 g~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 240 (589) |-++-..++++.-.++ .+ ++|- .+....+....++.+.+.+..-..+...+-+ T Consensus 199 Ga~~~i~i~gdd~~l~--~P--L~~~--------~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~--------------- 251 (695) T protein:vir:78 199 RAHPYFKIKGDDQIMD--TP--LVPR--------PYTVPKGSFQGLRVVEPYWVTPNNYNSINPV--------------- 251 (695) T ss_pred ceEEEEEeccCccccc--cc--cccc--------cccccCcceeeeEeecccccccchhhhccch--------------- Confidence 6555555554331111 11 0110 0001112222334444444333222111110 Q ss_pred ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHH Q lcl|NC_020883. 241 KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSA 320 (589) Q Consensus 241 ~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~s 320 (589) +.-+-..++.. .++.++ |..---++.|++-|.. .++..+++|.|-...+.+.+++-++|-...+ T Consensus 252 -spdfgkP~~y~------V~G~kI----H~SRL~~f~g~plPd~-----LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~ 315 (695) T protein:vir:78 252 -ADDFYKPSTWW------MIGTEV----HATRLHTIVSRPVGDM-----LKPTYSFAGISMTQLAMPYIDNWLRTRQSVS 315 (695) T ss_pred -hhccCCCceEE------EeceEE----eeeeEEEecCCCchhh-----hhcccccCcccHHHHHHHHHHHHHHHHhHHH Confidence 00011111100 111111 1111112223322211 2445567899999888888888887755555 Q ss_pred HHHHHhCCCcEEe-chhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHH Q lcl|NC_020883. 321 VIYEQNGKPRISI-TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNL 399 (589) Q Consensus 321 rildk~gkpRI~V-P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L 399 (589) .++-. .++.+ -..|...+.+.. +-+......+....-+.+-+.++ | +..-+|.|+++.+.+--..+... T Consensus 316 ~Li~~---~~v~~lk~dla~~L~~g~---~~~l~~R~eli~~~Rsn~G~~ll--D--k~~Eefeq~stslSGLddVi~qf 385 (695) T protein:vir:78 316 DIVKQ---FSVSGILMDLAQALMPGA---NVDLSMRAELINRYRDNRNILFL--D--KATEEFFQFNTPLSGLDALQAQA 385 (695) T ss_pred HHHHh---hhhHHHHHHHHHhhcChh---HHHHHHHHHHHHHhcCccceEEE--e--cCCcceEEEecccCCHHHHHHHH Confidence 44421 11111 011111111110 00011111110000011111122 1 12256788999888888888888 Q ss_pred HHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCccccee Q lcl|NC_020883. 400 IKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNI 478 (589) Q Consensus 400 ~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I 478 (589) +.+|-..+++|.- .||. . -.+-..||.+..+.....+.-.+ ...+..+|++++.+++.-. .+ ...+ ...+ T Consensus 386 ~q~VAgaa~IPltkLfGq-S--PkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~rS~-~G--~idp-di~~ 456 (695) T protein:vir:78 386 QEQMSAVSHIPLIKLLGI-T--PTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQLSL-FG--AVDP-SIKW 456 (695) T ss_pred HHHHHhhhcCchhhhhcc-C--CccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHh-cC--CCCC-cceE Confidence 8888888887743 3443 1 11112356655555555543333 2335577888776653322 11 1222 3456 Q ss_pred eeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCccc Q lcl|NC_020883. 479 ETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRD 558 (589) Q Consensus 479 ~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~ 558 (589) .|++-..-+ ++..|++.+....+-.+=..+++ .+..+|. +|+..+-.. +. .+.+|-.+ T Consensus 457 ~fnPL~qmt--d~EkAeI~~k~A~~d~~~~~~gv------I~~~evr---~rL~~d~~s----~Y-------~~~~D~~d 514 (695) T protein:vir:78 457 QWNALRELD--DLEVAESRYKQAQSDVLYVQEQV------IRPDQVA---ARLNTEPDG----PY-------AGKLDAND 514 (695) T ss_pred EeCCCCCcC--HHHHHHHHhhhhHHHHHHHHhcC------CCHHHHH---HHHhcCCCc----cc-------cccccccc Confidence 787765444 44557777663332222222222 4444433 455554322 11 11122111 Q ss_pred CCCCCCCCCCCCCCC---CcchhhhhhcccccCC Q lcl|NC_020883. 559 EDGNIIEEGDTEEEP---SAEENEEIEKEGEPIA 589 (589) Q Consensus 559 ~~~~p~deg~~~eep---~~~~~e~~~~~~~~~~ 589 (589) +-+.|.|.-.+++.+ ...|-++...++.+=+ T Consensus 515 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (695) T protein:vir:78 515 DPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARA 548 (695) T ss_pred CCCcCccchhhhhHhhhcCcccccccCCCCCCCC Confidence 101111110000000 1111122233333333 No 133 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=76.65 E-value=0.13 Score=25.38 Aligned_cols=453 Identities=12% Similarity=0.055 Sum_probs=166.5 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) -+||.- .+-+.+ -+|+|--===||.-..|-+-. .++-.+.+.| T Consensus 91 ~~~~~~---------~~~~~l----~~~~~~~F~Gy~~la~laQ~~------------------------eyr~~~~~ia 133 (695) T protein:vir:36 91 ALDFNG---------TSMDAL----SFVTSSGFPGFPTLVLLAQLP------------------------EYRAMHEVLA 133 (695) T ss_pred hhcccc---------cccccc----hhhhccCcchHHHHHHHhhcc------------------------chhhHHHHHH Confidence 222200 000111 012221111123322222211 1244566778 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcC Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDG 160 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~G 160 (589) ..+.|.-+.++.+.. +.+.+..-.+.|... +..+-- |-+.|++-.+..+++.++-..+.=.-.=| T Consensus 134 ~e~~R~w~~~~~~~~------e~~~~~g~~~~~~~~----~~~d~d-----qik~L~~e~erL~V~~~l~eaik~aRlfG 198 (695) T protein:vir:36 134 DECIRTWGEAIGGTK------EKADTSGLAAGGNAA----STSDGD-----QLKQINDEIERLRIRDAVRTTVIHDQAFG 198 (695) T ss_pred HHhhcccceecccch------hhhhhcccccccccc----ccCchH-----HHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 888888777765322 100010001111000 000000 11233333343333333333333333445 Q ss_pred ceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeec Q lcl|NC_020883. 161 GIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK 240 (589) Q Consensus 161 g~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 240 (589) |-++-..++++.-.++ .+ ++|- .+....+....++.+.+.+..-..+...+-+ T Consensus 199 Ga~~~i~i~gdd~~l~--~P--L~~~--------~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~--------------- 251 (695) T protein:vir:36 199 RAHPYFKIKGDDQIMD--TP--LVPR--------PYTVPKGSFQGLRVVEPYWVTPNNYNSINPV--------------- 251 (695) T ss_pred ceEEEEEeccCccccc--cc--cccc--------cccccCcceeeeEeecccccccchhhhccch--------------- Confidence 5555555554331111 11 0110 0001112222334444444333222111110 Q ss_pred ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHH Q lcl|NC_020883. 241 KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSA 320 (589) Q Consensus 241 ~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~s 320 (589) +.-+-..++.. .++.++ |..---++.|++-|.. .++..+++|.|-...+.+.+++-++|-...+ T Consensus 252 -spdfgkP~~y~------V~G~kI----H~SRL~~f~g~plPd~-----LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~ 315 (695) T protein:vir:36 252 -ADDFYKPSTWW------MIGTEV----HATRLHTIVSRPVGDM-----LKPTYSFAGISMTQLAMPYIDNWLRTRQSVS 315 (695) T ss_pred -hhccCCCceEE------EeceEE----eeeeEEEecCCCchhh-----hhcccccCcccHHHHHHHHHHHHHHHHhHHH Confidence 00011111100 111111 1111112223322211 2445567899999888888888887755444 Q ss_pred HHHHHhCCCcEEe-chhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHH Q lcl|NC_020883. 321 VIYEQNGKPRISI-TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNL 399 (589) Q Consensus 321 rildk~gkpRI~V-P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L 399 (589) .++.+ .++.+ -..|..-+.+.. +-+......+....-+.+-+.++ | +..-+|.|+++.+.+--..+... T Consensus 316 ~Li~~---~~v~~lk~dla~aL~~g~---~~~l~~R~eli~~~Rsn~G~~ll--D--k~~Eefeq~stslSGLddVi~qf 385 (695) T protein:vir:36 316 DIVKQ---FSVSGILMDLAQALMPGA---NVDLSMRAELINRYRDNRNILFL--D--KATEEFFQFNTPLSGLDALQAQA 385 (695) T ss_pred HHHHh---hhHHHHHHHHHHhhcChh---HHHHHHHHHHHHHhcCccceEEE--e--cCCcceEEEecccCCHHHHHHHH Confidence 44421 11111 011111111100 00011111110000011111122 1 12256788999888888888888 Q ss_pred HHHHHHHhcCCch-hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCccccee Q lcl|NC_020883. 400 IKLMLIETQTSEK-AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNI 478 (589) Q Consensus 400 ~~~Il~~a~ts~~-AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I 478 (589) +.+|-..+++|.- .||. . -.+-..||.+..+.....+.-.+ ...+..+|++++.+++.-. .+ ...+ ...+ T Consensus 386 ~q~VAgaa~IPltkLfGq-S--PkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~rS~-~G--~idp-di~~ 456 (695) T protein:vir:36 386 QEQMSAVSHIPLIKLLGI-T--PTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQLSL-FG--AVDP-SIKW 456 (695) T ss_pred HHHHHhhhcCchhhhhcc-C--cccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHh-cC--CCCC-cceE Confidence 8888888887743 3443 1 11112356655555555543333 2335577888776653322 11 1222 3456 Q ss_pred eeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCccc Q lcl|NC_020883. 479 ETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRD 558 (589) Q Consensus 479 ~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~ 558 (589) .|++-..-+ ++..|++.+....+-.+=..+++ .+..+|. +|+..+-.. +. .+.+|-.+ T Consensus 457 ~fnPL~qmt--d~EkAeI~~k~A~~d~~~~~~gv------I~~~evr---~rL~~d~~s----~Y-------~~~~D~~d 514 (695) T protein:vir:36 457 QWNALRELD--DLEVAESRYKQAQSDVLYVQEQV------IRPDQVA---ARLNTEPDG----PY-------AGKLDAND 514 (695) T ss_pred EeCCCCCcC--HHHHHHHHhhhhHHHHHHHHhcC------CCHHHHH---HHHhcCCCc----cc-------cccccccc Confidence 787765444 44557777663332222222222 4444433 455554322 11 11122111 Q ss_pred CCCCCCCCCCCCCCC---CcchhhhhhcccccCC Q lcl|NC_020883. 559 EDGNIIEEGDTEEEP---SAEENEEIEKEGEPIA 589 (589) Q Consensus 559 ~~~~p~deg~~~eep---~~~~~e~~~~~~~~~~ 589 (589) +-+.|.|.-.+++.+ ...|-++...++.+=+ T Consensus 515 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (695) T protein:vir:36 515 DPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARA 548 (695) T ss_pred CCCcCccchhhhhHhhhcCcccccccCCCCcccc Confidence 101111110000000 1111122222333222 No 134 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=75.74 E-value=0.14 Score=25.21 Aligned_cols=382 Identities=10% Similarity=0.058 Sum_probs=127.5 Q ss_pred hhccchhhhccccccccccccCCcccchh-------hccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH------- Q lcl|NC_020883. 75 IAEIPATMVSGSIGQIKSSITTGEIDPDI-------EEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT------- 140 (589) Q Consensus 75 i~~~pa~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~------- 140 (589) ...=+++ +.++.+++.|...++.... .+.+...+.+... ..+..- +.-.++|+-.|...+ T Consensus 1 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s--~~g~~v-~~~~al~~~~V~~~i~~Ia~~i 74 (432) T protein:vir:10 1 MPDEKKL---GLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIIS--DTGAAV-NADAIMRLDAVAACVKLVSQAI 74 (432) T ss_pred CCCCccc---chhhhhHhhcCCccccccccccccccCcchhhhhccccc--ccCccc-chhhhhcchHHHHHHHHHHHhh Confidence 1122222 2255555555432211000 0000000000000 000000 011122222221111 Q ss_pred ----------------------------hhccccc---cchhhHHHH-HHcCceeEEEEEecCc-eeEEEecCceecccc Q lcl|NC_020883. 141 ----------------------------KNSKLER---RHWSNIVQH-QVDGGIVAAPVIDELG-PRIVFKARDVYFPHD 187 (589) Q Consensus 141 ----------------------------kn~~~~~---~~~~~l~~~-~v~Gg~~~~~~~~~~~-~~i~f~~~d~~~P~~ 187 (589) ..-|.++ .|+..++.+ +..|-.++.+..++++ ..+.+..++++.+.- T Consensus 75 a~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L~~l~~~~v~v~~ 154 (432) T protein:vir:10 75 AAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLANDRLTITT 154 (432) T ss_pred hhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEcCCceEEEE Confidence 1111111 223333322 2233333333332222 222333332222210 Q ss_pred cCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCc Q lcl|NC_020883. 188 DEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIP 267 (589) Q Consensus 188 d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip 267 (589) +. .+ . ..|+ +....|..+.+ T Consensus 155 ~~---------------------------~g-~---~~y~---------~~~~~g~~~~~-------------------- 174 (432) T protein:vir:10 155 DT---------------------------KG-N---TAYR---------YRRTDGQMIDI-------------------- 174 (432) T ss_pred cC---------------------------CC-c---EEEE---------EEecCceEEEE-------------------- Confidence 00 00 0 0110 00112221110 Q ss_pred cccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccc Q lcl|NC_020883. 268 DDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYE 347 (589) Q Consensus 268 ~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d 347 (589) +.--|.|+++.. .+.++|.|-+.-+...+.--...-..-.+.|...+.|..++- ..+.+..+ T Consensus 175 ------------~~~~iih~~~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~-----~~~~l~~e 236 (432) T protein:vir:10 175 ------------PKQQIWKIMGYS-LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQ-----IDRFLTDD 236 (432) T ss_pred ------------cCccEEEecCCC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEe-----cCCCCCHH Confidence 111278888764 345789987765554443322111112344544456655442 21111111 Q ss_pred ccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHH Q lcl|NC_020883. 348 RDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQS 427 (589) Q Consensus 348 ~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~S 427 (589) .-..+....++.. ..+..+ +. +.|...+-++....-.+.++..+...++|..+=+.|+.-+|....+..+. .| T Consensus 237 ~~~~~~~~~~~~~--nag~~~-vl---~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~-~s 309 (432) T protein:vir:10 237 QYDSFAKKVSGSV--EAGRAP-LL---EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSW-GS 309 (432) T ss_pred HHHHHHHHHhhhh--hCCCce-ec---CCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccc-cc Confidence 1101111111100 001111 11 23333332334344445566667788888888899999998644322221 12 Q ss_pred HH-HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchh Q lcl|NC_020883. 428 GV-AKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQ 506 (589) Q Consensus 428 g~-A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~ 506 (589) .+ ..+..+.+- .+.-|-..++..|.+-+ +..... ..-.|+|+-.-....+.+++++..+.+.+++++ T Consensus 310 n~e~~~~~f~~~--tl~P~~~~ie~~ln~kL-----~~~~~~-----~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~ 377 (432) T protein:vir:10 310 GIESQQLGFLSM--TLSPWLRRIEQSIALNL-----LSPAER-----RRYFADFDTSALLRADSAARSSYYSQLVNNGLM 377 (432) T ss_pred hHHHHHHHHHHH--HHHHHHHHHHHHHHhhh-----cCcccc-----CceEEEeechhhhccCHHHHHHHHHHHHhCCCC Confidence 22 222233221 22333333334343311 111000 011345543222233455668888888888888 Q ss_pred hHHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCC-CCCCCCCCCcchhhhhhc Q lcl|NC_020883. 507 SLETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIE-EGDTEEEPSAEENEEIEK 583 (589) Q Consensus 507 S~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~d-eg~~~eep~~~~~e~~~~ 583 (589) +.-.+-++++ |-.... . +-..+...--+ +|+-.+.+.|.+ +|.+++ ++.+..+ T Consensus 378 T~NE~R~~~glppi~g~------------~---~~~~~~~~~~p----l~~~~~~~~~~~~~~~~~~-----~~~~~~~ 432 (432) T protein:vir:10 378 TRDEAREIEGLPKLGGN------------A---AVLTVQSAMVP----LDSIGLQASPEPASGLGNQ-----QQDKVSK 432 (432) T ss_pred CHHHHHHHhCCCCCCCC------------c---ceEeecCcccc----hhhhcccCCCCCCCCCCCc-----ccccccC Confidence 8876666653 111110 0 00000100001 111111111211 111111 1111111 No 135 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=74.28 E-value=0.16 Score=24.95 Aligned_cols=214 Identities=10% Similarity=-0.024 Sum_probs=85.1 Q ss_pred ecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccC Q lcl|NC_020883. 198 IDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYP 277 (589) Q Consensus 198 ~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~T 277 (589) .+..+++.|+.+++.. .| ...|+... + T Consensus 1 ~r~~~dg~~~y~~~~~-----------~~------------~~~g~~~~-------------------~----------- 27 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKS-----------LY------------DTKSEIYE-------------------Y----------- 27 (219) T ss_pred CceeecCeEEEEEecc-----------ee------------cCCceeEE-------------------e----------- Confidence 3344444433333211 00 00111110 0 Q ss_pred CCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHH-HHHHHhCCCcEE--echhhhhccccccccccccccc Q lcl|NC_020883. 278 GRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSA-VIYEQNGKPRIS--ITKEMMDTLLNIAYERDGHSAK 354 (589) Q Consensus 278 Gv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~s-rildk~gkpRI~--VP~~~L~t~~g~~~d~dge~~~ 354 (589) +.--|.|+++..+...++|.|.+..+...+. ++....++. +.|+-.+.|..+ +|...|+.-+ .+.-.+... T Consensus 28 --~~~eilH~r~~~~~~~~~Glspi~~a~~~i~-~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~---~~~~~~~~~ 101 (219) T protein:vir:98 28 --NKNDVIFIKLYDPMQQVYGSPDYVGGITSAL-LNSDATIFRRRYYSNGAHMGFILYSTDPDMTEEM---EDEIAERIR 101 (219) T ss_pred --ccccEEEecCCCCCCCcceecHHHHHHHHHH-HHHHHHHHHHHHHhcCCCCceEEEeCCCCCCHHH---HHHHHHHHH Confidence 0011789988777778899998865443333 222222222 345455777643 4544443211 000000000 Q ss_pred cccccccccccccccccc--cccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHH Q lcl|NC_020883. 355 EASMMTPRIDHRDMEITT--FDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKF 432 (589) Q Consensus 355 ~~~~~~~~~d~~dlev~~--~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r 432 (589) ...+ ..+...+-+.. .+++|...+-++....-.+.++.......+|.++=+.|+.-.|...+++++.+..+ ..+ T Consensus 102 ~~~g---~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~e-q~~ 177 (219) T protein:vir:98 102 DSKG---VGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPL-KIR 177 (219) T ss_pred HhcC---cccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHH-HHH Confidence 0000 00111111211 11233222223333333445555555677788888999999987543332222111 122 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCC Q lcl|NC_020883. 433 YDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRA 489 (589) Q Consensus 433 ~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~ 489 (589) +.+.+- ...-|...++.+|.+- -.-+....+.|.+-.+.|.. T Consensus 178 ~~f~~~--tL~P~~~~ie~~ln~~-------------~~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 178 EAYQAD--EVLPLQEIIAESINSD-------------YEIKSALKVNFKQPEKRDKN 219 (219) T ss_pred HHHHHH--HHHHHHHHHHHHhhhh-------------hcCCCccEEeecCcccccCC Confidence 222111 1122222222322210 01133346788765544433 No 136 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=71.55 E-value=0.19 Score=24.49 Aligned_cols=377 Identities=8% Similarity=0.004 Sum_probs=131.1 Q ss_pred hhhhhhhhcCCccccCHHHH-HHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcc Q lcl|NC_020883. 21 YERYRQLYEGKHELLFPRAK-RLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEI 99 (589) Q Consensus 21 ~~~~r~l~~g~~~~~f~ra~-~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~ 99 (589) |-..++++..+-.-...+.. .-+...........+..+ ++.-+...+.+..-+ .++++.+++++-..-.. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-----~~~~al~~~~v~~~i--~~ia~~ia~~p~~~~~~-- 71 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWV-----SAENALKNSDLFSII--SQLSNDLATAKITTSRK-- 71 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCcee-----chhhhhccHHHHHHH--HHHHHHhhhCceeeccc-- Confidence 44455554332211111100 000000000000000000 011111222222222 33433333332211100 Q ss_pred cchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccc---ccchhhHHHHHHc-CceeEEEEEecCc--e Q lcl|NC_020883. 100 DPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLE---RRHWSNIVQHQVD-GGIVAAPVIDELG--P 173 (589) Q Consensus 100 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~---~~~~~~l~~~~v~-Gg~~~~~~~~~~~--~ 173 (589) .. +.++...+.+ ..|+..++.++.. |-.+..+..+..+ + T Consensus 72 -------~~----------------------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~ 116 (386) T protein:vir:49 72 -------QL----------------------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDM 116 (386) T ss_pred -------hh----------------------------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEE Confidence 00 0011111111 1355555544444 4444444445433 3 Q ss_pred eEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccc Q lcl|NC_020883. 174 RIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAED 253 (589) Q Consensus 174 ~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d 253 (589) .+....+++.-++-+ ..+ .. ..|+.... ..+.|.+.. T Consensus 117 ~l~~i~~~~v~v~~~----------~~~--~~--~~y~~~~~----------------------~~~~~~~~~------- 153 (386) T protein:vir:49 117 KWEYLRPSQVSFNRL----------DNQ--NG--LYYNITFD----------------------DPHIAPKQH------- 153 (386) T ss_pred EEEEecCceeEEEEc----------CCC--ce--EEEEEEEc----------------------CccccceeE------- Confidence 444445544332100 000 00 01100000 001111110 Q ss_pred cchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEe Q lcl|NC_020883. 254 LEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISI 333 (589) Q Consensus 254 ~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~V 333 (589) +| ..+ |+|+.+......++|.|-+.-+...++-....-....+.|...+.|+.++ T Consensus 154 ------------~~-~~e------------vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il 208 (386) T protein:vir:49 154 ------------VP-QND------------ILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGIL 208 (386) T ss_pred ------------Ec-ccc------------EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEE Confidence 00 011 89998877777789999988777777655544333445565567787543 Q ss_pred --chhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_020883. 334 --TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSE 411 (589) Q Consensus 334 --P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~ 411 (589) |..+.+... ..+.+... . + . ...+..+ +. +.|...+-++....-.+.++..+.+.++|..+-+.|+ T Consensus 209 ~~~~~~~~~~~-~~~~~~~~---~--~-~-~n~g~~~-vl---~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp 276 (386) T protein:vir:49 209 KIKGGGLLDFK-TKVSRSRQ---A--M-K-QMQGGPL-VL---DDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPE 276 (386) T ss_pred EeCCCCChHHH-HHHHHHHH---H--h-c-cCCCCce-ec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 333222110 00000000 0 0 0 0111111 11 2232232233344555667778888999998889999 Q ss_pred hhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHH Q lcl|NC_020883. 412 KAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAEL 491 (589) Q Consensus 412 ~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El 491 (589) ..+|. .+++.+ ++...+..+...+ ..+-..+...| +...+. .+.|+-.-....+.. T Consensus 277 ~~lg~-~~~~~~---~~~~~~~~~~~~i---~~~l~~i~~~~----------~~~l~~-------~~~~~~~~~~~~d~~ 332 (386) T protein:vir:49 277 SIVGG-DGDQQS---SLEMIYNIYFKSV---SRYLRPFVSEM----------SKKLSC-------EVDVDISPAVDPTGS 332 (386) T ss_pred HHhCC-CCCccc---hHHHHHHHHHHHH---HHHHHHHHHHH----------HHHhcc-------hhcccchhhhccCHH Confidence 99984 222211 1111111111111 11111111111 111111 122221111112233 Q ss_pred HHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCC Q lcl|NC_020883. 492 VAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEE 571 (589) Q Consensus 492 ~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~e 571 (589) ..+.....+.++++++.-.+-.++ ..... .|..+..+ . +...++...|++++ T Consensus 333 ~~~~~~~~l~~~g~~t~nE~r~~l-----------------~~~~~-~~~~~~~~-------~---~~~~~~~~gGd~~~ 384 (386) T protein:vir:49 333 NYISLINSMVKSGTLAQNQGLYIL-----------------QQAEI-LPKELPDG-------K---NPNRTSLKGGEINE 384 (386) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHH-----------------hhCCC-CCCcCcch-------h---ccCCCCCCCCCCCC Confidence 345555556666666554333222 11111 11111111 1 11112333333322 Q ss_pred CC Q lcl|NC_020883. 572 EP 573 (589) Q Consensus 572 ep 573 (589) +- T Consensus 385 ~~ 386 (386) T protein:vir:49 385 QD 386 (386) T ss_pred CC Confidence 22 No 137 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=68.65 E-value=0.23 Score=24.04 Aligned_cols=382 Identities=12% Similarity=0.041 Sum_probs=132.1 Q ss_pred cccCHHHHHHHhhccccceeccCcceeeecC--cceEEEEcchhhhccchh-----hhccccccccccccCCcccchhhc Q lcl|NC_020883. 33 ELLFPRAKRLIEEGDAVGRFLDSSQTARETQ--TPYVIFNLPKVIAEIPAT-----MVSGSIGQIKSSITTGEIDPDIEE 105 (589) Q Consensus 33 ~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~--~~y~~~n~~~~i~~~pa~-----~~~~~~~~~~~~~~~~~~~~~~~~ 105 (589) =-+|.|.+.--+..... ...|..... ..+.-+|--. ....|++ ++++.+.+++ + T Consensus 1 M~~f~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~v~~~~-al~~~~V~~~v~~ia~~ia~~p-------~------ 61 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLN-----DPDWVNFLTGGEAQKYVSADT-ALKNSDIFSLIMQLSGDLAMVR-------Y------ 61 (397) T ss_pred CcchhhhhcccCcccCC-----chhhhhhhcCCcCCceechHH-hhccHHHHHHHHHHHHHHhhCc-------c------ Confidence 22222210000000000 000000000 0011111111 1112221 1111111111 0 Q ss_pred cchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccc---cchhhHHHHH-HcCceeEEEEEecCc--eeEEEec Q lcl|NC_020883. 106 DTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER---RHWSNIVQHQ-VDGGIVAAPVIDELG--PRIVFKA 179 (589) Q Consensus 106 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~---~~~~~l~~~~-v~Gg~~~~~~~~~~~--~~i~f~~ 179 (589) +.+. ..+..+..+.+.++ .|+..++.+. ..|-+++.+..+..+ +.+.... T Consensus 62 ----------------~~~~--------~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~ 117 (397) T protein:vir:38 62 ----------------TSES--------DRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLR 117 (397) T ss_pred ----------------cccc--------cHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEc Confidence 0000 01222333333332 3566666444 445555555555443 3444444 Q ss_pred CceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhh Q lcl|NC_020883. 180 RDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEEL 259 (589) Q Consensus 180 ~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~ 259 (589) ++.+-+ . ....+. . + .|+.. +. ....|.+.. T Consensus 118 ~~~v~i---------~-~~~~~~--~-~-~y~~~------------~~----------~~~~~~~~~------------- 148 (397) T protein:vir:38 118 PSQVQP---------M-LLQDGS--G-L-IYNIN------------FD----------EPAIGYMEN------------- 148 (397) T ss_pred CceeEE---------E-EcCCCc--e-E-EEEEE------------ec----------cccccceeE------------- Confidence 443332 1 111100 0 0 00000 00 000111000 Q ss_pred hhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEec--hhh Q lcl|NC_020883. 260 IREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISIT--KEM 337 (589) Q Consensus 260 i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP--~~~ 337 (589) + +.--|+|+++......++|.|.+..+...+......-....+.|...+.|..++- ..+ T Consensus 149 ------~-------------~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~ 209 (397) T protein:vir:38 149 ------V-------------PAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGG 209 (397) T ss_pred ------e-------------cCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC Confidence 0 1111889998888888899999987777776655544444555655577765542 211 Q ss_pred hhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccc Q lcl|NC_020883. 338 MDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFY 417 (589) Q Consensus 338 L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~ 417 (589) .+.. .. +-.+........ ...+..+ +. +.|...+-++......+..+..+.+.++|..+=+.|+.-+|.. T Consensus 210 ~~e~--~~--~~~~~~~~~~~~--~n~~~~~-vl---~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~ 279 (397) T protein:vir:38 210 LLDA--ET--RIARSKEISKQI--HNSDGPV-VI---DALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQ 279 (397) T ss_pred CHHH--HH--HHHHHHHHHhcc--cccCCce-ec---CCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 1100 00 000000000000 0001111 11 1222222233334555667777888899888889999999853 Q ss_pred cCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHH Q lcl|NC_020883. 418 LDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMA 497 (589) Q Consensus 418 ~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~ 497 (589) .+.. +..+....++. ..+.-|...+...|.+. ..... +.++.| - .+.+.+..++.. T Consensus 280 ~~~~-----~~~e~~~~~~~--~~l~P~~~~ie~~ln~~----------l~~~~---~~~~~~--~--~~~d~~~~~~~~ 335 (397) T protein:vir:38 280 GDQQ-----SSITQISGQYA--KSLNRYVQAIVGELNDK----------LHANI---SANIRF--A--IDAMGDQYASTI 335 (397) T ss_pred CCcc-----cHHHHHHHHHH--HHHHHHHHHHHHHHHHh----------ccChh---cccccc--c--ccCCHHHHHHHH Confidence 2111 11111111111 11222332333333221 11111 122333 1 223455667777 Q ss_pred HHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCCCCcch Q lcl|NC_020883. 498 AYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEE 577 (589) Q Consensus 498 ~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~ 577 (589) +.+.++++++.-.+-+.+. +. |.. +++.+...+...+. ...++..+|+.++++..+. T Consensus 336 ~~~~~~G~~t~nE~R~~lg--~~-------------------p~~-~~d~~~~~~~~~~~-~~~~~~~~g~~~~~~~~e~ 392 (397) T protein:vir:38 336 SSSVKGGTIAGNQARFILQ--NS-------------------GYL-AKDLPDPEKEPQQA-IQLIQQEGGENDGNNSDER 392 (397) T ss_pred HHHHhCCCcCHHHHHHHhC--CC-------------------CCC-CCcccccccccccc-ccccccccCCCCCCCCCCC Confidence 7888888877765555442 00 000 11111111000000 0001111111111111111 Q ss_pred hhhhh Q lcl|NC_020883. 578 NEEIE 582 (589) Q Consensus 578 ~e~~~ 582 (589) .++-| T Consensus 393 ~~~~~ 397 (397) T protein:vir:38 393 GSDPE 397 (397) T ss_pred CCCCC Confidence 11100 No 138 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=63.75 E-value=0.31 Score=23.36 Aligned_cols=377 Identities=8% Similarity=0.005 Sum_probs=132.8 Q ss_pred hhhhhhhhcCCccccCHHHHH--HHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCc Q lcl|NC_020883. 21 YERYRQLYEGKHELLFPRAKR--LIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGE 98 (589) Q Consensus 21 ~~~~r~l~~g~~~~~f~ra~~--~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~ 98 (589) |-..+++...+.+ ...++.. .+....+......+..+. +--+...+-+..-+ .++++.+++++=..-.. T Consensus 1 M~~f~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~~~~~~~~v~~~i--~~ia~~ia~~p~~~~~~- 71 (386) T protein:vir:48 1 MPIFNITNLATES-PPISQGGFFDITDPDFLSTLNGSEWVS-----AESALRNSDLFSII--NQLSNDLATVKLTASRK- 71 (386) T ss_pred Ccccccccccccc-cccccccccccccchhcccccCCceec-----hhhhhcchHHHHHH--HHHHHhhccCceeeccc- Confidence 6556655544432 1111000 011111122211111110 00011112221112 33444444332211100 Q ss_pred ccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccc---cchhhHHHH-HHcCceeEEEEEecCc-- Q lcl|NC_020883. 99 IDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER---RHWSNIVQH-QVDGGIVAAPVIDELG-- 172 (589) Q Consensus 99 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~---~~~~~l~~~-~v~Gg~~~~~~~~~~~-- 172 (589) -.+.++...+.++ .|+..++.. +..|-.+..+..+..+ T Consensus 72 ------------------------------------~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~ 115 (386) T protein:vir:48 72 ------------------------------------QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRD 115 (386) T ss_pred ------------------------------------hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcE Confidence 0111222233322 233444433 3444444445555433 Q ss_pred eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeeccccccccccccc Q lcl|NC_020883. 173 PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAE 252 (589) Q Consensus 173 ~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~ 252 (589) +.+....++.+-+ + ......+ ..|+.. ..|....... T Consensus 116 ~~L~~l~~~~v~v---------~---~~~~~~~--~~y~~~--------------------------~~~~~~~~~~--- 152 (386) T protein:vir:48 116 MKWEYLRPSQVSF---------N---RLDNKDG--IYYNIT--------------------------FDDPRIPPKQ--- 152 (386) T ss_pred EEEEEecCceeEE---------E---EcCCCce--EEEEEE--------------------------ecCcccccee--- Confidence 3333344433322 1 0000000 011000 0000000000 Q ss_pred ccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEE Q lcl|NC_020883. 253 DLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRIS 332 (589) Q Consensus 253 d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~ 332 (589) .+ +.--|.|+++..+...++|.|.+.-+...+......-....+.|...+.|+.+ T Consensus 153 ------------~~-------------~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i 207 (386) T protein:vir:48 153 ------------HV-------------PQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGI 207 (386) T ss_pred ------------Ee-------------cCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE Confidence 00 00118999988888889999998776666665554444456666666777766 Q ss_pred echhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCch Q lcl|NC_020883. 333 ITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEK 412 (589) Q Consensus 333 VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~ 412 (589) +.. ......+...........+. ...+..+ +. +.|....-++....-.+.++..+...++|..+=+.|+. T Consensus 208 i~~-----~~~~~~e~~~~~~~~~~~~~-~n~g~~~-vl---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 277 (386) T protein:vir:48 208 LKI-----KGGGLLDFKTKLSRSRQAMK-QMQGGPL-VL---DDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPEN 277 (386) T ss_pred EEe-----CCCCCHHHHHHHHHHHHHhh-cCCCCce-ec---CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 532 11111111000000000000 0111111 11 12222222233334445667777888899888899999 Q ss_pred hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHH Q lcl|NC_020883. 413 AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELV 492 (589) Q Consensus 413 AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~ 492 (589) -+|.. ++++ ..++..+ .+++. .+.-|...+...|.+.+ ...++ ..+ ....+.+... T Consensus 278 ~lg~~-~~~~--~~e~~~~--~~~~~--~l~P~~~~ie~~l~~~l----------~~~~~-----~~~--~~~~~~d~~~ 333 (386) T protein:vir:48 278 VVGGQ-GDQQ--SSLEMSL--DLYNK--AVSRYLRPFLSELSQKL----------SCDVD-----ADI--LPAVDPTGSN 333 (386) T ss_pred HhCCC-CCcc--cHHHHHH--HHHHH--HHHHHHHHHHHHHHHhh----------cchhh-----cch--hhhhccChHH Confidence 99842 1211 1112211 11111 11112222222222211 11111 111 1112223333 Q ss_pred HHHHHHHHhccchhhHHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCC Q lcl|NC_020883. 493 AENMAAYAASKQGQSLETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEE 571 (589) Q Consensus 493 ~A~t~~~l~~a~~~S~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~e 571 (589) .+.....+.+++++++-.+-+.+. +-+.+- |+ +... ..+.+|...|+..+ T Consensus 334 ~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~----~~-------------------~~~~------~~~~~~~~gGd~~~ 384 (386) T protein:vir:48 334 SVSRINSMVKSGTLAQNQGLYILQQAEILPK----EL-------------------PEGE------NPNKTTLKGGEING 384 (386) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhhcCCCCCc----cc-------------------hhhc------CCCCCccCCCCCCC Confidence 444555566667776655544332 101110 00 0000 01112333333211 Q ss_pred CC Q lcl|NC_020883. 572 EP 573 (589) Q Consensus 572 ep 573 (589) +. T Consensus 385 ~~ 386 (386) T protein:vir:48 385 ED 386 (386) T ss_pred CC Confidence 11 No 139 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=58.41 E-value=0.41 Score=22.69 Aligned_cols=375 Identities=8% Similarity=-0.028 Sum_probs=129.7 Q ss_pred hhhhhhhhcCCccccCHHHHHH-HhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCcc Q lcl|NC_020883. 21 YERYRQLYEGKHELLFPRAKRL-IEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEI 99 (589) Q Consensus 21 ~~~~r~l~~g~~~~~f~ra~~~-~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~ 99 (589) |--+.+++.++..-.-.++.-. +......+....+.++. .--+...+.+..-+ +++++.+..++=..-+.. T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~-----~~~al~~~~V~~~i--~~Ia~~ia~l~~~~~~~~- 72 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVS-----AETALKNSDLFSII--SQLSNDLATAKITTSRKQ- 72 (384) T ss_pred CccccccccCcccccccchhhccccchhhcccccCCceec-----hhhhhccHHHHHHH--HHHHHHHhhCceeeecch- Confidence 5445555555443322221111 11111111111111110 00011222222222 334333333322111000 Q ss_pred cchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHH-HHcCceeEEEEEecCc--eeEE Q lcl|NC_020883. 100 DPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQH-QVDGGIVAAPVIDELG--PRIV 176 (589) Q Consensus 100 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~-~v~Gg~~~~~~~~~~~--~~i~ 176 (589) ....... -|.... ...|+..++.+ +..|-.+..+..+..+ +.+. T Consensus 73 -------~~~l~~~----------PN~~~t----------------~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~ 119 (384) T protein:vir:49 73 -------LQGIVDN----------PSNNAN----------------RFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWE 119 (384) T ss_pred -------hhhhhhc----------cCCCCC----------------HHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEE Confidence 0000000 000000 01233334433 3445545555555433 3343 Q ss_pred EecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccch Q lcl|NC_020883. 177 FKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEG 256 (589) Q Consensus 177 f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~ 256 (589) ...++.+-+ +.. ....++.+.++.. .. ..|.+.. T Consensus 120 ~l~~~~v~v---------~~~---~~~~~~~y~~~~~----~~--------------------~~~~~~~---------- 153 (384) T protein:vir:49 120 YLRPSQVSF---------NRL---DNQNGLYYNITFD----DP--------------------RIPPKQH---------- 153 (384) T ss_pred EEcCceeEE---------EEc---CCCceEEEEEEec----Cc--------------------cccceeE---------- Confidence 444433332 100 0001100000000 00 0011000 Q ss_pred hhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEe--c Q lcl|NC_020883. 257 EELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISI--T 334 (589) Q Consensus 257 e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~V--P 334 (589) + +.--|+|+.+......++|.|-+.-+...++.....-....+.|...+.|+.++ | T Consensus 154 ---------~-------------~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~ 211 (384) T protein:vir:49 154 ---------V-------------PQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIK 211 (384) T ss_pred ---------e-------------cCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Confidence 0 001189998877777889999887666666644444333445565667777543 3 Q ss_pred hhhhhccccccccccccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhc Q lcl|NC_020883. 335 KEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAV 414 (589) Q Consensus 335 ~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AF 414 (589) ..+........+ ...+ .+.. ..+..+ +. +.|....-++......+..+..+.+.++|..+-+.|+.-+ T Consensus 212 ~~~~~~~~~~~~---~~~~---~~~~--n~~~~~-vl---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~l 279 (384) T protein:vir:49 212 GGGLLDFKTKQS---RSRQ---AMKQ--MQGGPL-VL---DDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVV 279 (384) T ss_pred CCCChHHHHHHH---HHHH---hccc--CCccce-ec---CCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 333221100000 0000 0000 011111 11 2232222233344555667777888899999999999999 Q ss_pred ccccCcccchhHHHHHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHH Q lcl|NC_020883. 415 DFYLDGGASGAQSGVAKFYDLLTTI-LKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVA 493 (589) Q Consensus 415 g~~~~~g~~~A~Sg~A~r~~~~~~~-~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~ 493 (589) |... ++.+ +....+..+...+ ..++-|...+...|.+-+.. .. .+.+.. + ..... T Consensus 280 g~~~-~~~~---~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~------------~~-~~~~~~------~-~~~~~ 335 (384) T protein:vir:49 280 GGEG-DKQS---SLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVDA------------DI-LPAVDP------T-GSNYI 335 (384) T ss_pred CCCC-Cccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhh------------hh-hhhhhc------c-chHHH Confidence 8532 2222 1112221111111 12222222222222211100 00 000000 0 01111 Q ss_pred HHHHHHHhccchhhHHHHHHHh--CCCCCHHHHHHHHHHHHhhccccccccccccccccccccCccc Q lcl|NC_020883. 494 ENMAAYAASKQGQSLETTVRRM--NPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRD 558 (589) Q Consensus 494 A~t~~~l~~a~~~S~etaVr~L--hpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~ 558 (589) .++. -+..+++.++-.+...+ .| +... |+.++..- .|.. |++ -|++= T Consensus 336 ~~~~-~l~~~~~~t~~e~~~~l~~~g-~~~n----e~r~~~~~----~p~~-gGd-------~~~~~ 384 (384) T protein:vir:49 336 GLIN-SMVKTGTLAQNQGLYVLQQAE-ILPK----DLPEGETD----STLK-GGE-------TNEQY 384 (384) T ss_pred HHHH-HHhhcCcccHHHHHHHHhhCC-CCCh----hHHHHcCC----CCCC-CCC-------CCCCC Confidence 2222 23344555555554443 23 3322 33333321 1222 222 11111 No 140 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=56.21 E-value=0.46 Score=22.42 Aligned_cols=405 Identities=11% Similarity=0.072 Sum_probs=139.9 Q ss_pred cchhhhhhhhhcCCcc------ccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch-----hhhccc Q lcl|NC_020883. 18 HGDYERYRQLYEGKHE------LLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA-----TMVSGS 86 (589) Q Consensus 18 ~~~~~~~r~l~~g~~~------~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa-----~~~~~~ 86 (589) =|-++|-++.|-.+.. .+-.-+..++. +.+ .+ ...+-+|--. ....|+ .++++. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~---~~g----~~-------~~~~~v~~~~-al~~~~v~~~i~~ia~~ 65 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLE---WLG----IS-------PSTISVKGKN-ALKVATVFACIKILSES 65 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHH---HhC----CC-------cCccccchhh-hhccHHHHHHHHHHHHh Confidence 3444444444421111 11111111111 000 00 0111111111 111222 233333 Q ss_pred cccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHh-hcccc---ccchhhHHHH-HHcCc Q lcl|NC_020883. 87 IGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITK-NSKLE---RRHWSNIVQH-QVDGG 161 (589) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~k-n~~~~---~~~~~~l~~~-~v~Gg 161 (589) +..++=..-.. . .....+..+.+ + ..++. .-+.+ ..|+..++.+ +..|- T Consensus 66 ia~lp~~~~~~----~-~~~~~~~~~~~-------------l--------~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 119 (432) T protein:vir:10 66 VSKLPLKIYQE----D-EYGIQRGTKHY-------------L--------NNLLRLRPNPYMSSMNFFGSLEAQKNLYGN 119 (432) T ss_pred hccCceEEEEe----c-CCceeeccccH-------------H--------HHHHHhhccCCCCHHHHHHHHHHHHhhcCC Confidence 33222110000 0 00000000000 0 00110 01111 1123333332 44566 Q ss_pred eeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccce-eehhhhccccccchhhee Q lcl|NC_020883. 162 IVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLR-TTNMLYPVVKAKGDVKKE 238 (589) Q Consensus 162 ~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~-~~~~~y~~~~~~~~~~~~ 238 (589) .++.+..+..+ ..+....+++.-++. .+ .++. .....| +.. T Consensus 120 ay~~i~r~~~G~~~~L~~i~~~~v~v~~----------d~-----------------~~~~~~~~~~~---------y~~ 163 (432) T protein:vir:10 120 SYANIEFDRKGKVQALWPIDASKVTVYI----------DD-----------------VGLLNSKTKMW---------YVV 163 (432) T ss_pred eEEEEEECCCCcEEEEEEEcCceeEEEE----------cC-----------------cccccccceEE---------EEE Confidence 66666666444 334444444333211 00 0000 000000 001 Q ss_pred ecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhH Q lcl|NC_020883. 239 IKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITR 318 (589) Q Consensus 239 ~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~ 318 (589) ...|.+.. +| ---|+|+++....+.++|.|-+.-+...++.....-.. T Consensus 164 ~~~g~~~~-------------------~~-------------~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 211 (432) T protein:vir:10 164 NTGGQQRV-------------------LK-------------PEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKF 211 (432) T ss_pred ecCCeEEE-------------------Ec-------------cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 11222111 00 00189998776777889999998777777766554444 Q ss_pred HHHHHHHhCCCcEEech-hhhhccccccccccccccccccccccccc-cccccccccccccCccceeeecc--cHHHHHH Q lcl|NC_020883. 319 SAVIYEQNGKPRISITK-EMMDTLLNIAYERDGHSAKEASMMTPRID-HRDMEITTFDENGRSMEIHQIDI--SKIGDMD 394 (589) Q Consensus 319 ~srildk~gkpRI~VP~-~~L~t~~g~~~d~dge~~~~~~~~~~~~d-~~dlev~~~de~g~~~~~iq~Di--rveeh~~ 394 (589) ..+.|...+.|+.++-. +.|+..+ .++-.+.+.. .+.+.+ ....-+. +.| +.+.+... .-.+..+ T Consensus 212 ~~~~~~ng~~p~gil~~~~~l~~e~---~~~~~~~~~~---~~~g~~n~~~~~vl---~~g--~~~~~l~~~~~d~q~~e 280 (432) T protein:vir:10 212 INNFYKQGLQVKGLVQYVGDLNEDA---KKVFRENFES---MSSGLQNSHRIALM---PVG--YQFQPISLNMSDAQFLE 280 (432) T ss_pred HHHHHhccCCccEEEEcCCCCCHHH---HHHHHHHHHH---HhcccccCCcceec---CCC--ceEEEccCChhHHHHHH Confidence 45556555677755421 1111100 0000000000 000000 0001111 122 23333332 2233445 Q ss_pred HHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcc Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIE 474 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e 474 (589) ..+...++|..+=+.|+.-+|....+. . + +.......+.+- .+.-|...+...|.+ .++........+ T Consensus 281 ~~~~~~~~Ia~~fgVP~~~lg~~~~~~-~-s-~~e~~~~~~~~~--~l~P~~~~ie~~ln~----kLl~~~~~~~g~--- 348 (432) T protein:vir:10 281 NTELTIRQIATAFGIKMHQLNDLSKAT-L-N-NIEQQQQQFYTD--TLQATLTMYEQEMTY----KLFLDSELDKGF--- 348 (432) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCC-c-c-cHHHHHHHHHHH--HHHHHHHHHHHHHHH----hhcChhhcCCCc--- Confidence 566677888888899999998533221 1 1 111122222111 122222222232322 111111111111 Q ss_pred cceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc-- Q lcl|NC_020883. 475 EPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ-- 552 (589) Q Consensus 475 ~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~-- 552 (589) ...+.+.+.+..| ..+.++..+.+.++++++.-.+-+.+. +. |.+ | ++..+-+ T Consensus 349 ~~~fd~~~l~~~d--~~~~~~~~~~~~~~G~~t~NE~R~~~g--~~-------------------pi~-g-gD~~~~~~n 403 (432) T protein:vir:10 349 YSKFNVDAILRAD--IKTRYEAYRTGIQGGFLKPNEARSKED--LP-------------------PEA-G-GDRLLVNGN 403 (432) T ss_pred EEEeechhhhcCC--HHHHHHHHHHHHhCCCcCHHHHHHHhC--CC-------------------CCC-C-CCeEeeccc Confidence 1233444544444 445577888888888888776655542 11 111 0 0000000 Q ss_pred c--cCcccCCCCCCCCCCCCCCCCcchhhhh Q lcl|NC_020883. 553 M--NDNRDEDGNIIEEGDTEEEPSAEENEEI 581 (589) Q Consensus 553 ~--~~~~~~~~~p~deg~~~eep~~~~~e~~ 581 (589) . ++..++ .....|++.++...+.+|+- T Consensus 404 ~~~~~~~~~--~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 404 MLPIDMAGQ--AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ccchhhccc--cccCCCCCCCCCCCCCCCCC Confidence 0 000000 01112222222222222222 No 141 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=56.21 E-value=0.46 Score=22.42 Aligned_cols=405 Identities=11% Similarity=0.072 Sum_probs=139.9 Q ss_pred cchhhhhhhhhcCCcc------ccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch-----hhhccc Q lcl|NC_020883. 18 HGDYERYRQLYEGKHE------LLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA-----TMVSGS 86 (589) Q Consensus 18 ~~~~~~~r~l~~g~~~------~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa-----~~~~~~ 86 (589) =|-++|-++.|-.+.. .+-.-+..++. +.+ .+ ...+-+|--. ....|+ .++++. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~---~~g----~~-------~~~~~v~~~~-al~~~~v~~~i~~ia~~ 65 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLE---WLG----IS-------PSTISVKGKN-ALKVATVFACIKILSES 65 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHH---HhC----CC-------cCccccchhh-hhccHHHHHHHHHHHHh Confidence 3444444444421111 11111111111 000 00 0111111111 111222 233333 Q ss_pred cccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHh-hcccc---ccchhhHHHH-HHcCc Q lcl|NC_020883. 87 IGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITK-NSKLE---RRHWSNIVQH-QVDGG 161 (589) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~k-n~~~~---~~~~~~l~~~-~v~Gg 161 (589) +..++=..-.. . .....+..+.+ + ..++. .-+.+ ..|+..++.+ +..|- T Consensus 66 ia~lp~~~~~~----~-~~~~~~~~~~~-------------l--------~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 119 (432) T protein:vir:10 66 VSKLPLKIYQE----D-EYGIQRGTKHY-------------L--------NNLLRLRPNPYMSSMNFFGSLEAQKNLYGN 119 (432) T ss_pred hccCceEEEEe----c-CCceeeccccH-------------H--------HHHHHhhccCCCCHHHHHHHHHHHHhhcCC Confidence 33222110000 0 00000000000 0 00110 01111 1123333332 44566 Q ss_pred eeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccce-eehhhhccccccchhhee Q lcl|NC_020883. 162 IVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLR-TTNMLYPVVKAKGDVKKE 238 (589) Q Consensus 162 ~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~-~~~~~y~~~~~~~~~~~~ 238 (589) .++.+..+..+ ..+....+++.-++. .+ .++. .....| +.. T Consensus 120 ay~~i~r~~~G~~~~L~~i~~~~v~v~~----------d~-----------------~~~~~~~~~~~---------y~~ 163 (432) T protein:vir:10 120 SYANIEFDRKGKVQALWPIDASKVTVYI----------DD-----------------VGLLNSKTKMW---------YVV 163 (432) T ss_pred eEEEEEECCCCcEEEEEEEcCceeEEEE----------cC-----------------cccccccceEE---------EEE Confidence 66666666444 334444444333211 00 0000 000000 001 Q ss_pred ecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhH Q lcl|NC_020883. 239 IKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITR 318 (589) Q Consensus 239 ~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~ 318 (589) ...|.+.. +| ---|+|+++....+.++|.|-+.-+...++.....-.. T Consensus 164 ~~~g~~~~-------------------~~-------------~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 211 (432) T protein:vir:10 164 NTGGQQRV-------------------LK-------------PEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKF 211 (432) T ss_pred ecCCeEEE-------------------Ec-------------cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 11222111 00 00189998776777889999998777777766554444 Q ss_pred HHHHHHHhCCCcEEech-hhhhccccccccccccccccccccccccc-cccccccccccccCccceeeecc--cHHHHHH Q lcl|NC_020883. 319 SAVIYEQNGKPRISITK-EMMDTLLNIAYERDGHSAKEASMMTPRID-HRDMEITTFDENGRSMEIHQIDI--SKIGDMD 394 (589) Q Consensus 319 ~srildk~gkpRI~VP~-~~L~t~~g~~~d~dge~~~~~~~~~~~~d-~~dlev~~~de~g~~~~~iq~Di--rveeh~~ 394 (589) ..+.|...+.|+.++-. +.|+..+ .++-.+.+.. .+.+.+ ....-+. +.| +.+.+... .-.+..+ T Consensus 212 ~~~~~~ng~~p~gil~~~~~l~~e~---~~~~~~~~~~---~~~g~~n~~~~~vl---~~g--~~~~~l~~~~~d~q~~e 280 (432) T protein:vir:10 212 INNFYKQGLQVKGLVQYVGDLNEDA---KKVFRENFES---MSSGLQNSHRIALM---PVG--YQFQPISLNMSDAQFLE 280 (432) T ss_pred HHHHHhccCCccEEEEcCCCCCHHH---HHHHHHHHHH---HhcccccCCcceec---CCC--ceEEEccCChhHHHHHH Confidence 45556555677755421 1111100 0000000000 000000 0001111 122 23333332 2233445 Q ss_pred HHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcc Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIE 474 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e 474 (589) ..+...++|..+=+.|+.-+|....+. . + +.......+.+- .+.-|...+...|.+ .++........+ T Consensus 281 ~~~~~~~~Ia~~fgVP~~~lg~~~~~~-~-s-~~e~~~~~~~~~--~l~P~~~~ie~~ln~----kLl~~~~~~~g~--- 348 (432) T protein:vir:10 281 NTELTIRQIATAFGIKMHQLNDLSKAT-L-N-NIEQQQQQFYTD--TLQATLTMYEQEMTY----KLFLDSELDKGF--- 348 (432) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCC-c-c-cHHHHHHHHHHH--HHHHHHHHHHHHHHH----hhcChhhcCCCc--- Confidence 566677888888899999998533221 1 1 111122222111 122222222232322 111111111111 Q ss_pred cceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc-- Q lcl|NC_020883. 475 EPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ-- 552 (589) Q Consensus 475 ~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~-- 552 (589) ...+.+.+.+..| ..+.++..+.+.++++++.-.+-+.+. +. |.+ | ++..+-+ T Consensus 349 ~~~fd~~~l~~~d--~~~~~~~~~~~~~~G~~t~NE~R~~~g--~~-------------------pi~-g-gD~~~~~~n 403 (432) T protein:vir:10 349 YSKFNVDAILRAD--IKTRYEAYRTGIQGGFLKPNEARSKED--LP-------------------PEA-G-GDRLLVNGN 403 (432) T ss_pred EEEeechhhhcCC--HHHHHHHHHHHHhCCCcCHHHHHHHhC--CC-------------------CCC-C-CCeEeeccc Confidence 1233444544444 445577888888888888776655542 11 111 0 0000000 Q ss_pred c--cCcccCCCCCCCCCCCCCCCCcchhhhh Q lcl|NC_020883. 553 M--NDNRDEDGNIIEEGDTEEEPSAEENEEI 581 (589) Q Consensus 553 ~--~~~~~~~~~p~deg~~~eep~~~~~e~~ 581 (589) . ++..++ .....|++.++...+.+|+- T Consensus 404 ~~~~~~~~~--~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 404 MLPIDMAGQ--AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ccchhhccc--cccCCCCCCCCCCCCCCCCC Confidence 0 000000 01112222222222222222 No 142 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=56.21 E-value=0.46 Score=22.42 Aligned_cols=405 Identities=11% Similarity=0.072 Sum_probs=139.9 Q ss_pred cchhhhhhhhhcCCcc------ccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch-----hhhccc Q lcl|NC_020883. 18 HGDYERYRQLYEGKHE------LLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA-----TMVSGS 86 (589) Q Consensus 18 ~~~~~~~r~l~~g~~~------~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa-----~~~~~~ 86 (589) =|-++|-++.|-.+.. .+-.-+..++. +.+ .+ ...+-+|--. ....|+ .++++. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~---~~g----~~-------~~~~~v~~~~-al~~~~v~~~i~~ia~~ 65 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLE---WLG----IS-------PSTISVKGKN-ALKVATVFACIKILSES 65 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHH---HhC----CC-------cCccccchhh-hhccHHHHHHHHHHHHh Confidence 3444444444421111 11111111111 000 00 0111111111 111222 233333 Q ss_pred cccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHh-hcccc---ccchhhHHHH-HHcCc Q lcl|NC_020883. 87 IGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITK-NSKLE---RRHWSNIVQH-QVDGG 161 (589) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~k-n~~~~---~~~~~~l~~~-~v~Gg 161 (589) +..++=..-.. . .....+..+.+ + ..++. .-+.+ ..|+..++.+ +..|- T Consensus 66 ia~lp~~~~~~----~-~~~~~~~~~~~-------------l--------~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 119 (432) T protein:vir:10 66 VSKLPLKIYQE----D-EYGIQRGTKHY-------------L--------NNLLRLRPNPYMSSMNFFGSLEAQKNLYGN 119 (432) T ss_pred hccCceEEEEe----c-CCceeeccccH-------------H--------HHHHHhhccCCCCHHHHHHHHHHHHhhcCC Confidence 33222110000 0 00000000000 0 00110 01111 1123333332 44566 Q ss_pred eeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccce-eehhhhccccccchhhee Q lcl|NC_020883. 162 IVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLR-TTNMLYPVVKAKGDVKKE 238 (589) Q Consensus 162 ~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~-~~~~~y~~~~~~~~~~~~ 238 (589) .++.+..+..+ ..+....+++.-++. .+ .++. .....| +.. T Consensus 120 ay~~i~r~~~G~~~~L~~i~~~~v~v~~----------d~-----------------~~~~~~~~~~~---------y~~ 163 (432) T protein:vir:10 120 SYANIEFDRKGKVQALWPIDASKVTVYI----------DD-----------------VGLLNSKTKMW---------YVV 163 (432) T ss_pred eEEEEEECCCCcEEEEEEEcCceeEEEE----------cC-----------------cccccccceEE---------EEE Confidence 66666666444 334444444333211 00 0000 000000 001 Q ss_pred ecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhH Q lcl|NC_020883. 239 IKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITR 318 (589) Q Consensus 239 ~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~ 318 (589) ...|.+.. +| ---|+|+++....+.++|.|-+.-+...++.....-.. T Consensus 164 ~~~g~~~~-------------------~~-------------~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 211 (432) T protein:vir:10 164 NTGGQQRV-------------------LK-------------PEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKF 211 (432) T ss_pred ecCCeEEE-------------------Ec-------------cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 11222111 00 00189998776777889999998777777766554444 Q ss_pred HHHHHHHhCCCcEEech-hhhhccccccccccccccccccccccccc-cccccccccccccCccceeeecc--cHHHHHH Q lcl|NC_020883. 319 SAVIYEQNGKPRISITK-EMMDTLLNIAYERDGHSAKEASMMTPRID-HRDMEITTFDENGRSMEIHQIDI--SKIGDMD 394 (589) Q Consensus 319 ~srildk~gkpRI~VP~-~~L~t~~g~~~d~dge~~~~~~~~~~~~d-~~dlev~~~de~g~~~~~iq~Di--rveeh~~ 394 (589) ..+.|...+.|+.++-. +.|+..+ .++-.+.+.. .+.+.+ ....-+. +.| +.+.+... .-.+..+ T Consensus 212 ~~~~~~ng~~p~gil~~~~~l~~e~---~~~~~~~~~~---~~~g~~n~~~~~vl---~~g--~~~~~l~~~~~d~q~~e 280 (432) T protein:vir:10 212 INNFYKQGLQVKGLVQYVGDLNEDA---KKVFRENFES---MSSGLQNSHRIALM---PVG--YQFQPISLNMSDAQFLE 280 (432) T ss_pred HHHHHhccCCccEEEEcCCCCCHHH---HHHHHHHHHH---HhcccccCCcceec---CCC--ceEEEccCChhHHHHHH Confidence 45556555677755421 1111100 0000000000 000000 0001111 122 23333332 2233445 Q ss_pred HHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcc Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIE 474 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e 474 (589) ..+...++|..+=+.|+.-+|....+. . + +.......+.+- .+.-|...+...|.+ .++........+ T Consensus 281 ~~~~~~~~Ia~~fgVP~~~lg~~~~~~-~-s-~~e~~~~~~~~~--~l~P~~~~ie~~ln~----kLl~~~~~~~g~--- 348 (432) T protein:vir:10 281 NTELTIRQIATAFGIKMHQLNDLSKAT-L-N-NIEQQQQQFYTD--TLQATLTMYEQEMTY----KLFLDSELDKGF--- 348 (432) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCC-c-c-cHHHHHHHHHHH--HHHHHHHHHHHHHHH----hhcChhhcCCCc--- Confidence 566677888888899999998533221 1 1 111122222111 122222222232322 111111111111 Q ss_pred cceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc-- Q lcl|NC_020883. 475 EPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ-- 552 (589) Q Consensus 475 ~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~-- 552 (589) ...+.+.+.+..| ..+.++..+.+.++++++.-.+-+.+. +. |.+ | ++..+-+ T Consensus 349 ~~~fd~~~l~~~d--~~~~~~~~~~~~~~G~~t~NE~R~~~g--~~-------------------pi~-g-gD~~~~~~n 403 (432) T protein:vir:10 349 YSKFNVDAILRAD--IKTRYEAYRTGIQGGFLKPNEARSKED--LP-------------------PEA-G-GDRLLVNGN 403 (432) T ss_pred EEEeechhhhcCC--HHHHHHHHHHHHhCCCcCHHHHHHHhC--CC-------------------CCC-C-CCeEeeccc Confidence 1233444544444 445577888888888888776655542 11 111 0 0000000 Q ss_pred c--cCcccCCCCCCCCCCCCCCCCcchhhhh Q lcl|NC_020883. 553 M--NDNRDEDGNIIEEGDTEEEPSAEENEEI 581 (589) Q Consensus 553 ~--~~~~~~~~~p~deg~~~eep~~~~~e~~ 581 (589) . ++..++ .....|++.++...+.+|+- T Consensus 404 ~~~~~~~~~--~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 404 MLPIDMAGQ--AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ccchhhccc--cccCCCCCCCCCCCCCCCCC Confidence 0 000000 01112222222222222222 No 143 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=54.42 E-value=0.5 Score=22.22 Aligned_cols=327 Identities=11% Similarity=0.025 Sum_probs=107.4 Q ss_pred ceeEEEeecCCCccceEEEEEeee-ccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccc Q lcl|NC_020883. 191 GADLAYYIDHGQYGQFLHIYRERV-EKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDD 269 (589) Q Consensus 191 ~~div~~~e~~~~~~~l~~~~~~~-~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~ 269 (589) =+.|+|..+.+ + ++..+-.+ ....+. + -.| ...|....+..... .. T Consensus 1 v~Eivw~~~~g---~-~~~~~l~~r~~~~~~-~-f~~------------~~~~~l~~~~~~~~---------------~g 47 (355) T protein:vir:78 1 MFEQVYRIENG---R-ARLGKLAWRPPRTIS-R-FDV------------APDGGLVAIEQWGV---------------FG 47 (355) T ss_pred CeEEEEEeeCC---e-EEEeeeeecCcccee-e-eee------------ccCCceeEEEecCC---------------CC Confidence 12334432221 1 11100000 000000 0 000 01111111110000 00 Q ss_pred cccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHh--CCCcEEechhhhhcc------ Q lcl|NC_020883. 270 RPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQN--GKPRISITKEMMDTL------ 341 (589) Q Consensus 270 ~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~--gkpRI~VP~~~L~t~------ 341 (589) .+...++ +.=+++|.. .....+|+|.+.+..+....-.=.-.+..+..-++|+ |=|-..+|...-.+- T Consensus 48 ~~~~~lp---~~kfi~~~~-~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~ 123 (355) T protein:vir:78 48 KATVRIP---VDRLVVFVN-EREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARA 123 (355) T ss_pred CCcceec---cCCEEEEEe-CCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhH Confidence 0100010 122566754 4678889999999887776655555566778888888 556666663211000 Q ss_pred -ccccccccc--cccccccccccccccccccccccccccCccceeeeccc---HHHHHHHHHHHHHHHHHHhcCCchhcc Q lcl|NC_020883. 342 -LNIAYERDG--HSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDIS---KIGDMDHVKNLIKLMLIETQTSEKAVD 415 (589) Q Consensus 342 -~g~~~d~dg--e~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dir---veeh~~~ie~L~~~Il~~a~ts~~AFg 415 (589) +....+++- .......+.... ..+.+. |..++++.-..+ ..+..+.|+.=+..++ ++.-... T Consensus 124 ~~~~~~~~~~l~~~~~~i~~g~~a-----~~iip~---g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~i----LGqtlTs 191 (355) T protein:vir:78 124 EQWLNDQKEEGLQLAKEFRAGEAA-----GGYIPH---GANFTLTGVQGKLPEMDGPIRYHDEQIARAV----LAHFLTL 191 (355) T ss_pred HHHHHHHHHHHHHHHHHhhCCcce-----eEeecC---CceEEEeecCCCcccHHHHHHHHHHHHHHHH----hhhhhcc Confidence 000000000 000000000000 001110 111222221111 1223344444222222 1111111 Q ss_pred cccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHH Q lcl|NC_020883. 416 FYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKE-LYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAE 494 (589) Q Consensus 416 ~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~-li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A 494 (589) ...++|++-|.+.+-. ....- .++.-+..+.+.|.+ ++..+..+ | .+. ....|.+.|.. .+ +.++..| T Consensus 192 ~~~~~gGS~Alg~vh~--~v~~~--~~~aD~~~i~~~ln~~li~~l~~l-N-~~~--~~~~P~~~~~~-~~--~~~~~~a 260 (355) T protein:vir:78 192 GGDKSTGSYALGDTFA--SFFTG--SLNAVMKHIADVTQQHVVEDLVDQ-N-WGP--EEPAPRLVPAQ-LG--KEQPVTA 260 (355) T ss_pred ccCCccchhhHHHHHH--HHHHH--HHHHHHHHHHHHHHHHHHHHHHHh-c-CCC--CCCCCEEEecC-cC--hhHHHHH Confidence 1111222222222221 11111 123333345555643 44433222 1 221 12247788865 23 3344557 Q ss_pred HHHHHHhccchhhH----HHHHHHhCCCCC-HHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCC Q lcl|NC_020883. 495 NMAAYAASKQGQSL----ETTVRRMNPDAS-EDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT 569 (589) Q Consensus 495 ~t~~~l~~a~~~S~----etaVr~Lhpdw~-dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~ 569 (589) +..+.+...|..-. ++.++.... .. .+.-++++. ..+.+.+......+.+ .+++. + ....-.. T Consensus 261 ~~~~~l~~~G~~~~~~~~~~~~~e~~g-ip~p~~~~~~~~------~~~~~~~~~~~~~~~~---~~~~~-~-~~~a~~~ 328 (355) T protein:vir:78 261 EAIRALVECGAFTADPELEKDLRARYG-LPAPAERDDGAD------AAAAKAAGRRRAKRLP---GQRQG-A-ALPSRSP 328 (355) T ss_pred HHHHHHHhCCCccccHHHHHHHHHHhC-CCCCCCCCcccC------CccccccccccccccC---Ccccc-c-cccccCC Confidence 77777777776533 334554321 11 000000110 0000000000000000 00000 0 0000012 Q ss_pred CCCCCcchhhhhhcccccCC Q lcl|NC_020883. 570 EEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 570 ~eep~~~~~e~~~~~~~~~~ 589 (589) ..+++.+.......+.+|-+ T Consensus 329 ~a~~~~~~~~~~~~~~~~~~ 348 (355) T protein:vir:78 329 RADPPRRRGPLRRRPRHPAH 348 (355) T ss_pred CCCChhhhHHHHHHhhcccc Confidence 33455555556666777766 No 144 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=54.00 E-value=0.51 Score=22.17 Aligned_cols=425 Identities=11% Similarity=0.006 Sum_probs=145.9 Q ss_pred CccceeccchhHHH-----HhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTT-----KNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVI 75 (589) Q Consensus 1 ~~~~~~~~~~~~~~-----~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i 75 (589) +-.|-.+|.|-.-. ..-.|++.+|..||+ ++-+|.-.+.. T Consensus 32 ~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~----~m~e~D~~i~s------------------------------- 76 (528) T protein:vir:10 32 FANHPAKGLTPAKLAHILIEAEQGHLQAQAELFM----DMEERDAHLFA------------------------------- 76 (528) T ss_pred hcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHH----HHHhhChHHHH------------------------------- Confidence 44566677775332 223577888888775 12112111111 Q ss_pred hccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHH Q lcl|NC_020883. 76 AEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQ 155 (589) Q Consensus 76 ~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~ 155 (589) .+++.+.... ..+=.|+.+.++. ... ..+-+++++.+.+-.-+..+...+.+ T Consensus 77 ----------~l~~Rk~av~----------~~~w~I~p~~~~~----~~~----~~~a~~v~~~l~~~~~f~~~i~~~ld 128 (528) T protein:vir:10 77 ----------EMSKRKRAVL----------GLDWTIEPPRNAS----AAE----KADAEYLHELLLDLEGIEDLMLDCMD 128 (528) T ss_pred ----------HHHHHHHHHh----------cCCceEecCCCCC----HHH----HHHHHHHHHHHhCCccHHHHHHHHHh Confidence 0111111000 0011111111100 000 11233555555442111122222222 Q ss_pred HHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchh Q lcl|NC_020883. 156 HQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDV 235 (589) Q Consensus 156 ~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~ 235 (589) +..=|=-+.-.+|.- ....+.| +.|.|+... + ..|. ...++. . +.+.+. T Consensus 129 a~~~G~s~~Ei~w~~--------~~g~~~~------~~~~~r~~~----~--f~~~---~~~~~~-l--~~~~~~----- 177 (528) T protein:vir:10 129 GVGHGYSAIELDWSL--------QGREWLP------QAFDHRPQS----W--FQLN---PDDQDE-L--RLRDNS----- 177 (528) T ss_pred hhhhcceeEEEEEee--------cCCceeE------EEeeeeccc----c--eeec---cCCCcE-E--eccCCC----- Confidence 222222222222321 0111222 222222110 1 1110 001110 0 000000 Q ss_pred heeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHH Q lcl|NC_020883. 236 KKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWT 315 (589) Q Consensus 236 ~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t 315 (589) -.| +.+| +.-+++|.+ .....+|+|.+-+..+....-.=+-. T Consensus 178 ----~~g---------------------~~l~------------~~k~iv~~~-~~~~g~p~g~gLlr~~~w~~~fK~~~ 219 (528) T protein:vir:10 178 ----IAG---------------------EVLQ------------PFGWIMHKP-RSRSGYVARSGLFRVLAWPYLFKHYS 219 (528) T ss_pred ----CCc---------------------eeec------------CCCeEEEee-cCCCCCccccchHHHHHHHHHHHHhh Confidence 001 0011 122466654 57788999999998877766666666 Q ss_pred HhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHH Q lcl|NC_020883. 316 ITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMD 394 (589) Q Consensus 316 ~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~ 394 (589) +..++.-++|+|-|-.++-- ..+.. +++...... .. ..+......+.+. |..+++++.. ...+-+.. T Consensus 220 ~~~w~~f~E~yG~P~~igky-----~~~a~-~~ek~~L~~--al-~~i~~~~~~iiP~---~~~ie~~ea~~~~~~~f~~ 287 (528) T protein:vir:10 220 TADLAEMLEIYGLPIRLGKY-----PPGTP-DEEKVTLLR--AV-TGLGHAAAGIIPE---SMSIDFQEASKGSAEPFMA 287 (528) T ss_pred HHHHHHHHHHcCCCeEEEec-----CCCCC-HHHHHHHHH--HH-HHHhhCcEEEecC---CceeEEeecCCCChhHHHH Confidence 77788888999999776521 11111 111111000 00 0000011122322 2335555532 33333444 Q ss_pred HHHHHHHHHHHHhcCCchhccc--ccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCccc Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDF--YLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKE-LYESCLWLLNDQDSSI 471 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~--~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~-li~~~l~L~~~~~~~~ 471 (589) .++..=++|..+- ++ ++.+. ..+.+++.|.+.+-.. . +...++.-+..+...|.+ ++..+..+. ++... T Consensus 288 li~~~d~~Isk~i-LG-qtlTs~~~~g~~gS~Alg~vh~~--v--~~di~~aDa~~i~~tln~~li~~l~~~N--~~~~~ 359 (528) T protein:vir:10 288 MMRWCDDSMSKAI-LG-GTLTSQTSESGGGAYALGQVHNE--V--RHDLLAADARQLAATLSRDLLWPLLVLN--RSGNL 359 (528) T ss_pred HHHHHHHHHHHHH-hh-hhhhccccccccchhhhHHHHHH--H--HHHHHHHHHHHHHHHHHHHHHHHHHHhC--CCCCC Confidence 4444444433221 11 11111 1111222222222211 1 111233334445566643 544432222 22222 Q ss_pred C-cccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccc---- Q lcl|NC_020883. 472 R-IEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGI---- 546 (589) Q Consensus 472 ~-~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~---- 546 (589) . ...|.+.|...=+.|. ...|+..+.+...|..-.+..++.... ....+-.+++ ........+.+.... T Consensus 360 ~~~~~p~~~~~~~e~eDl--~~~a~~~~~L~~~G~~i~~~~i~e~~g-ip~p~~~e~~---~~~~~~~~~~~~~~~~~~~ 433 (528) T protein:vir:10 360 DARRAPRLVFDLKDRADL--AAMATSLPPLVKLGVQVPVNWVQEQLG-IPLPANGEAV---LGDQAGAGIAQLSRRPGPR 433 (528) T ss_pred CccccceEEecCCCcccH--HHHHHHHHHHHhCCCCCCHHHHHHHhC-CCCCCCCccc---ccCCCcccccccCcccccc Confidence 2 2347788866544443 345777777887776433444444421 1110000111 111111111111111 Q ss_pred ccccccccCcccCCCCCCCCCCCCCC----CCcchhhhhhcc-cccCC Q lcl|NC_020883. 547 NQTFEQMNDNRDEDGNIIEEGDTEEE----PSAEENEEIEKE-GEPIA 589 (589) Q Consensus 547 ~~~l~~~~~~~~~~~~p~deg~~~ee----p~~~~~e~~~~~-~~~~~ 589 (589) ..+++....+.. ++.+..++ -.+++++..-++ +++|. T Consensus 434 ~~~~~~~~~~~~------~~~~~~d~~~~~~~~~~~~~~~~~~l~~i~ 475 (528) T protein:vir:10 434 IAALAQVIGPRY------RDQEALDQVLASLPAQDMQNQADSLVAPLL 475 (528) T ss_pred cccccccccccc------cccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 011110000000 00000000 011222222211 22332 No 145 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=52.11 E-value=0.56 Score=21.95 Aligned_cols=401 Identities=10% Similarity=0.008 Sum_probs=131.6 Q ss_pred hhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCccc Q lcl|NC_020883. 21 YERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEID 100 (589) Q Consensus 21 ~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~ 100 (589) |-...+||..+..........+..-...----..+..+ ++--+...+-+..-+ .++++.+.+++=..-.. T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v-----~~~~al~~~~v~~~i--~~Ia~~ia~~p~~~~~~--- 70 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQI-----SSQRAMRLTAVFSCV--RVLAESVGMLPCNLYHL--- 70 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCcee-----chhhhhccHHHHHHH--HHHHHHhccCceEEEEe--- Confidence 44444455443332221111111100000000000000 000001111111112 33444443333211100 Q ss_pred chhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHh-hcccc---ccchhhHHHH-HHcCceeEEEEEecCce-e Q lcl|NC_020883. 101 PDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITK-NSKLE---RRHWSNIVQH-QVDGGIVAAPVIDELGP-R 174 (589) Q Consensus 101 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~k-n~~~~---~~~~~~l~~~-~v~Gg~~~~~~~~~~~~-~ 174 (589) ..+..+.+ ..... ..++. ..|.+ ..|+..++.+ +..|..++.+..+++++ . T Consensus 71 ---~~~~~~~~------------~~~~~--------~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~g~~~~ 127 (414) T protein:vir:44 71 ---NGSLKQRA------------TGERL--------HKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAE 127 (414) T ss_pred ---cCCceeec------------ccchH--------HHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEE Confidence 00000000 00000 01110 11111 1234444433 34555555554444443 3 Q ss_pred EEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeeccccccccccccccc Q lcl|NC_020883. 175 IVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDL 254 (589) Q Consensus 175 i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~ 254 (589) +....++++.+ .+. . .+++ +|.. ....|.+.. T Consensus 128 L~~l~~~~v~~---------~~~-~---~~~~------------------~y~~---------~~~~g~~~~-------- 159 (414) T protein:vir:44 128 LLPVDPGCVVP---------KLN-S---SWEP------------------VYQV---------TFPDGSTDV-------- 159 (414) T ss_pred EEEEcCceEEE---------EEC-C---CCcE------------------EEEE---------EecCceEEE-------- Confidence 34444433322 110 0 0111 1110 001121111 Q ss_pred chhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEec Q lcl|NC_020883. 255 EGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISIT 334 (589) Q Consensus 255 e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP 334 (589) + +.--|+|+.+. ..+.++|.|-+.-+...++.....-....+.|...+.|+.++- T Consensus 160 -----------~-------------~~~evih~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 214 (414) T protein:vir:44 160 -----------L-------------SQEDIWHVRTL-TLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLR 214 (414) T ss_pred -----------E-------------ccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 0 00117888875 4556899999877666665444332233444545566765442 Q ss_pred hhhhhcccccccccccccccccccccccc--ccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCch Q lcl|NC_020883. 335 KEMMDTLLNIAYERDGHSAKEASMMTPRI--DHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEK 412 (589) Q Consensus 335 ~~~L~t~~g~~~d~dge~~~~~~~~~~~~--d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~ 412 (589) ..+.+..+....+...+...+.+. .+..+ +. +.|...+-++...+-.+..+..+...++|..+=+.|+. T Consensus 215 -----~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~-vl---~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~ 285 (414) T protein:vir:44 215 -----TEQTLSDQAYERLKKDFEERHTGLGNAHRPM-IL---EMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLH 285 (414) T ss_pred -----eCCCCCHHHHHHHHHHHHHHhcCccccCcce-ec---CCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHH Confidence 111111110000100010001110 01111 11 12222222222233334555566677888888899999 Q ss_pred hcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHH Q lcl|NC_020883. 413 AVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELV 492 (589) Q Consensus 413 AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~ 492 (589) -+|....++ .+ +.......+.+- .+.-|-..+...|.+. ++.... .. .-.|.|+..-....+.++ T Consensus 286 ~l~~~~~~t--~~-n~e~~~~~~~~~--~l~P~~~~ie~~ln~~----L~~~~~----~~--~~~i~fd~~~ll~~d~~~ 350 (414) T protein:vir:44 286 MVQNTDRAT--FN-NIEELGLGFINY--SLVPYLTRIEQRINTG----LVRKSK----QG--VFYAKFNAGALLRGDMKS 350 (414) T ss_pred HhCCCCCCC--cc-cHHHHHHHHHHH--HHHHHHHHHHHHHHhh----cCCccc----cC--ceEEEEechhhhccCHHH Confidence 887533221 11 111111222111 1111221222222221 110100 01 112444332222334556 Q ss_pred HHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccc--cc---cCcccCCCCCCCCC Q lcl|NC_020883. 493 AENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFE--QM---NDNRDEDGNIIEEG 567 (589) Q Consensus 493 ~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~--~~---~~~~~~~~~p~deg 567 (589) .++..+.+.++++++.-.+-+++. ++ |.+ | ++..+. |. .....+.+.+.|.+ T Consensus 351 ~~~~~~~~~~~G~~t~NE~R~~~g--l~-------------------p~~-g-gD~~~~~~n~~~~~~~~~~~~~~~~~~ 407 (414) T protein:vir:44 351 RFEAYATGINWGIYSPNDCRDLED--MN-------------------PRP-G-GDVYLTPMNMTTKPSDGSKAGKQKDNA 407 (414) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhC--CC-------------------CCC-C-cceecccccccccCCccccCCCCCCCC Confidence 688888888888877665554442 11 111 1 111111 00 00000111111111 Q ss_pred CCCCCCCc Q lcl|NC_020883. 568 DTEEEPSA 575 (589) Q Consensus 568 ~~~eep~~ 575 (589) .+|+++. T Consensus 408 -~~d~~~~ 414 (414) T protein:vir:44 408 -NADETTS 414 (414) T ss_pred -CCCCCCC Confidence 1222222 No 146 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=51.00 E-value=0.59 Score=21.83 Aligned_cols=494 Identities=11% Similarity=0.016 Sum_probs=176.8 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcce-eeecCcceEEEEcchhhhccc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQT-ARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~-~~~~~~~y~~~n~~~~i~~~p 79 (589) |-.=.-.|...+.+|+.-.-+..-|.=|+....|+. ..+- ++ ++.+.+.. .+...++| =-=-.+.+..+- T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~----~~~l--P~--~~~~~~~~~~~~~~~~~-dst~~~a~~~La 71 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCA----QYTI--PS--LFPKESDNESTDYTTPW-QAVGARGLNNLA 71 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHH----HHhc--cc--ccCCCCCcccccccccc-cccHHHHHHHHH Confidence 666556666676676533333333333333222221 1111 00 11111111 11112222 001134444555 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) |-|++..++. .+.|- -.+.+....+ . .++.......+.+..++ .+.+...+..|+|+......+.++.+. T Consensus 72 a~l~~~ltP~-~~WF~-l~~~d~~~~~---~-~~~~~~~~~v~~~L~~v----e~~~~~~l~~snf~~~~~~~~~~L~~~ 141 (535) T protein:vir:15 72 SKLMLALFPM-QSWMK-LTISEYEAKQ---L-VGDPDGLAKVDEGLSMV----ERIIMNYIESNSYRVTLFECLKQLIVA 141 (535) T ss_pred HHHHHhhcCC-Ccccc-cccChHHHhc---c-CCCcchHHHHHHHHHHH----HHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 5566555543 22222 1111111000 0 01111112223444444 445666677888999999999888776 Q ss_pred Cc-eeEEEEEecCceeEEEecCceecccccCcc-eeEEEeec-CCCc---cceEE-EEEe---eeccccceeehhhhccc Q lcl|NC_020883. 160 GG-IVAAPVIDELGPRIVFKARDVYFPHDDEKG-ADLAYYID-HGQY---GQFLH-IYRE---RVEKDGLRTTNMLYPVV 229 (589) Q Consensus 160 Gg-~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~-~div~~~e-~~~~---~~~l~-~~~~---~~~~~~~~~~~~~y~~~ 229 (589) |= |++.+--.+++++...++-..|+=..|+.| ++.+|... .+-. +.|-- +.+. ....+...+-+.+|+. T Consensus 142 G~a~l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~- 220 (535) T protein:vir:15 142 GNALLYLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLD- 220 (535) T ss_pred CceeEEeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEe- Confidence 53 333322223446666666665554444444 55555322 1100 00000 0000 0000000000001100 Q ss_pred cccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCC-CcceEEEecCCCCCCCcccCcchhhhhHH Q lcl|NC_020883. 230 KAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGR-NRPFISYWANNETFMNPYGISALDNLESK 308 (589) Q Consensus 230 ~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv-~~plvvyvPN~~~~~~~lG~SD~~~ie~l 308 (589) .++..+....+..+.. ++.... . .|- .-|+++. -=+......||||=..+..+- T Consensus 221 ----------~~~~~~~~~~e~~g~~----------~~~~~~-~---~~~~~~P~i~~-Rw~~~~ge~YGrgp~~~~l~D 275 (535) T protein:vir:15 221 ----------EESGDYLKYEEVEDVE----------IDGSDA-T---YPTDAMPYIPV-RMVRIDGESYGRSYCEEYLGD 275 (535) T ss_pred ----------cCCCcEEEEEEeeCcc----------cccccc-c---cccccCCceee-eeeecCCCccccchHHHHHHH Confidence 0011111111111000 000000 0 111 1233333 224556677999988999999 Q ss_pred HHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeee--c Q lcl|NC_020883. 309 QDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQI--D 386 (589) Q Consensus 309 ~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~--D 386 (589) +..||..--....-.++..+|-+.||........ +..+...+.+.+.+..+ ++.+++ . T Consensus 276 ~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~--------~l~~~~~g~~v~g~~~~------------v~~~~~~~~ 335 (535) T protein:vir:15 276 LRSLENLQEAIVKMSMISAKVIGLVNPAGITQPR--------RLTKAQTGDFVPGRRED------------IDFLQLEKQ 335 (535) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeecccccccch--------hcccCCceeeecCCccc------------ceeeecccc Confidence 9999975434444455779999888654443221 11111111111111111 111221 1 Q ss_pred ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHh-hhHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 387 ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDL-LTTI-LKSRRLQKEYIDFLKELYESCLWLL 464 (589) Q Consensus 387 irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~-~~~~-~Kv~~~R~~~~~aLk~li~~~l~L~ 464 (589) .++....+.++.+...|-. +=+.. .+.. . ++...|++++..+... ...+ --..+.- .+.|..+++.+..+. T Consensus 336 ~~~~~~~~~i~~~~~~I~~-af~~~-~~~~-~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~---~Ell~Pli~r~~~il 408 (535) T protein:vir:15 336 ADFTVAKAVSDQIEARLSY-AFMLN-SAVQ-R-TGERVTAEEIRYVASELEDTLGGVYSILS---QELQLPLVRVLLKQL 408 (535) T ss_pred cchhHHHHHHHHHHHHHHH-HHhhh-hccc-C-CCccccHHHHHHHHHHHHHHHhHHHHHHH---HHHHHHHHHHHHHHH Confidence 2344444455554443321 10110 1211 1 2222344666665432 2222 1223322 344455554443333 Q ss_pred hhcCcc--cCcccceeeeCCcCCCCCCHHHHHHHHH---HHhcc------chhhHHHHHHHhC-----C-C---CCHHHH Q lcl|NC_020883. 465 NDQDSS--IRIEEPNIETQDMILKPRAELVAENMAA---YAASK------QGQSLETTVRRMN-----P-D---ASEDWI 524 (589) Q Consensus 465 ~~~~~~--~~~e~p~I~f~D~lPvde~El~~A~t~~---~l~~a------~~~S~etaVr~Lh-----p-d---w~dE~v 524 (589) ...+.. ...+..+|++--+|..-.+-.....+.+ .+.+. ..+....+++.+- | . -++|++ T Consensus 409 ~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev 488 (535) T protein:vir:15 409 QATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQK 488 (535) T ss_pred HhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHH Confidence 222211 1112234555333321111111111111 11110 0112233333331 2 2 256666 Q ss_pred HHHHHHHHhhcccc-ccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 525 QEEIARIEEEQAGS-DTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 525 ~eEv~RI~~E~a~~-~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) ++..++-.+.+++. .-...|+...+.+ -.+|.+ ...-.+.|. T Consensus 489 ~~~~~q~~~~~~~~~~a~~~g~~~~~~~--------~~~p~~---------------~~~~~~~~g 531 (535) T protein:vir:15 489 QALMMQDAAQTGIENAAATGGAGVGALA--------TSSPEA---------------MQGAAAQAG 531 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccchh--------ccChHH---------------HHHHHhccC Confidence 66555443222111 1111222211111 001110 000011111 No 147 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=45.57 E-value=0.76 Score=21.22 Aligned_cols=490 Identities=11% Similarity=0.066 Sum_probs=164.9 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEE-cchhhhccc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFN-LPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n-~~~~i~~~p 79 (589) |.+ +.+-+ ++.--.....-|.=|+..-.|+.. ++ .++...+.+.+...-..++..+.=+ -.+.+-.+- T Consensus 1 m~~-~~~~~----l~~r~~~l~~~R~~~e~~w~e~~~----~~--lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~La 69 (556) T protein:vir:73 1 MAE-TEKER----LLKQLAQLKNERTSFESHWLDLSD----FI--NPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILS 69 (556) T ss_pred CCh-hhHHH----HHHHHHHHHHHhhHHHHHHHHHHH----Hh--ccccCCcCCCCCCcchhhcCccccchHHHHHHHHH Confidence 554 22211 222112222223333332222221 11 0112222111111111111111111 123333444 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) |-|++..++.-...|-=..-+.++. + ....+.|...+ .+.+...+..|+|+......+.++.+. T Consensus 70 s~l~~~ltpp~~~WF~l~~~d~~~~-------~-----~~~v~~~L~~v----e~~~~~~l~~snf~~~~~~~~~~L~~~ 133 (556) T protein:vir:73 70 SGMMSGITSPARPWFKLATPDPDMM-------D-----YGPVKIWLEVV----QRRMNEVFNKSNLYQSLPVMYASLGTF 133 (556) T ss_pred HHHHHhhcCCCCcccccccCccccc-------c-----hHHHHHHHHHH----HHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 4455444443333222110011111 1 11123444444 345667777899999999999988776 Q ss_pred Cc-eeEEEEEecCceeEEEecCceecccccCc-ceeEEEeec----------CCCcc---ceEEEEEeeeccccceeehh Q lcl|NC_020883. 160 GG-IVAAPVIDELGPRIVFKARDVYFPHDDEK-GADLAYYID----------HGQYG---QFLHIYRERVEKDGLRTTNM 224 (589) Q Consensus 160 Gg-~~~~~~~~~~~~~i~f~~~d~~~P~~d~~-~~div~~~e----------~~~~~---~~l~~~~~~~~~~~~~~~~~ 224 (589) |= +++...-....++..-+....|+=..|+. .+|.+|... ++++. .--..+.....+..+.+.+. T Consensus 134 G~a~l~~~~~~~~~~r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~ 213 (556) T protein:vir:73 134 GTGAMAVMEDDQDVIRTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHC 213 (556) T ss_pred CceeeeeeecCCceEEEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEE Confidence 64 43333211333554445555555444444 466666432 11100 00000111111122334444 Q ss_pred hhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCC-CcceEEEecCCCCCCCcccCcc-h Q lcl|NC_020883. 225 LYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGR-NRPFISYWANNETFMNPYGISA-L 302 (589) Q Consensus 225 ~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv-~~plvvyvPN~~~~~~~lG~SD-~ 302 (589) +|+..+-+.. +.+ -.-..+.+.+-+++.-.+++ +.+ -|- .-|+++. .=++.....||||- . T Consensus 214 V~pr~~~~~~------~~~-~~~~p~~s~~~~~~~~~~~v----l~e-----sg~~e~P~~~~-Rw~~~~ge~YGrg~P~ 276 (556) T protein:vir:73 214 ITPNVNRDSG------KMD-SKNKPYRSVYFESGGDSDKL----LRE-----SGFDEFPILAP-RWEVNGEDVYASSCPG 276 (556) T ss_pred Eecccccccc------ccC-cccceEEEEEEEecCCCcee----ccc-----CCcccCCceee-eeeecCCcccccCccH Confidence 4432111110 000 00011111111111011111 011 111 2233333 33456677799996 8 Q ss_pred hhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccce Q lcl|NC_020883. 303 DNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEI 382 (589) Q Consensus 303 ~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~ 382 (589) .+..+-+..||..--....-.++..+|.+.||..+... .. +..|+. . .+. .+ ..+.++..+ + T Consensus 277 ~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~--~~------~~~pgg--~----~~~--~~-~~~~~~i~p-~ 338 (556) T protein:vir:73 277 MLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQ--RV------SLLPGD--V----TYL--DV-ISGQDGFKP-A 338 (556) T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHhcCceecccccccc--ce------eeccCc--c----ccc--cC-CCCccceee-e Confidence 88889999999654344555668899999999865321 00 111110 0 000 00 011111111 1 Q ss_pred eeecccHHHHHHHHHHHHHHHHHHhcCC--chhcccccCcccchhHHHHHHHHHh-hhHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_020883. 383 HQIDISKIGDMDHVKNLIKLMLIETQTS--EKAVDFYLDGGASGAQSGVAKFYDL-LTTI-LKSRRLQKEYIDFLKELYE 458 (589) Q Consensus 383 iq~Dirveeh~~~ie~L~~~Il~~a~ts--~~AFg~~~~~g~~~A~Sg~A~r~~~-~~~~-~Kv~~~R~~~~~aLk~li~ 458 (589) .+....+....+.++.+...| ..+=+. ...++. .++...|++++..+... ...+ --..+.- .+.|..++. T Consensus 339 ~~~~~d~~~~~~~i~~~~~rI-~~af~~d~~~~l~~--~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~---~E~l~Pli~ 412 (556) T protein:vir:73 339 YLVNPNTADLLADIQDTRQTI-NSAYFVDLFMMLQN--INTRSMPVEAVIEMKEEKLLMLGPVLERLN---DEALNPLID 412 (556) T ss_pred ccccccHHHHHHHHHHHHHHH-HHHhhcchhhhhcc--CCCCCccHHHHHHHHHHHHHHhhHHHHHHH---HHHHHHHHH Confidence 122223333334444433322 111000 011121 12222345666665422 2222 2233332 344555554 Q ss_pred HHHHHHhhcCc------ccCcccceeeeCCcCCCCCCHHHHHHHHHHHh------ccch-----hhHHHHHHHhC----- Q lcl|NC_020883. 459 SCLWLLNDQDS------SIRIEEPNIETQDMILKPRAELVAENMAAYAA------SKQG-----QSLETTVRRMN----- 516 (589) Q Consensus 459 ~~l~L~~~~~~------~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~------~a~~-----~S~etaVr~Lh----- 516 (589) .+..+....+. .+.....+|++--.|-...+....+.+.+++. +.+. +....+++.+- T Consensus 413 r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gv 492 (556) T protein:vir:73 413 RVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGV 492 (556) T ss_pred HHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCC Confidence 44433332221 11111223444222211111111111111111 1111 22333333330 Q ss_pred C---CCCHHHHHHHHHHHHhhccc-----c---ccccccccc-----------cccccccCcccCCCCCCC Q lcl|NC_020883. 517 P---DASEDWIQEEIARIEEEQAG-----S---DTSSLMGIN-----------QTFEQMNDNRDEDGNIIE 565 (589) Q Consensus 517 p---dw~dE~v~eEv~RI~~E~a~-----~---~p~~~g~~~-----------~~l~~~~~~~~~~~~p~d 565 (589) | --++++ |+.|.+.+++ + .-.....+. ..+...++.- |-|.. T Consensus 493 p~~~irs~ee----v~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~---g~~~~ 556 (556) T protein:vir:73 493 SPTVIVPQEQ----VQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSALTAIANAA---GAPQQ 556 (556) T ss_pred ChhhcCCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhh---cCCCC Confidence 1 012333 3333221110 0 000000000 0111111111 11111 No 148 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=41.74 E-value=0.91 Score=20.80 Aligned_cols=433 Identities=9% Similarity=0.041 Sum_probs=143.5 Q ss_pred Cccceecc-----chh---HHHHhh----cchhhhhh-hhhcCCccccCHHHHH-HHhhccccceeccCcceeeecCcce Q lcl|NC_020883. 1 MIDWTVRG-----WTD---KTTKNV----HGDYERYR-QLYEGKHELLFPRAKR-LIEEGDAVGRFLDSSQTARETQTPY 66 (589) Q Consensus 1 ~~~~~~~~-----~~~---~~~~~~----~~~~~~~r-~l~~g~~~~~f~ra~~-~~~~~~~~~~~~~~~~~~~~~~~~y 66 (589) -+.|.++- -++ +-++.+ -.+..+|+ .....++.+=|+.-.+ ++ .|.+.+++-+.+- T Consensus 77 g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~-----~Dle~tGna~iei----- 146 (651) T protein:vir:99 77 GFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELAR-----QDYHGVGWLALEM----- 146 (651) T ss_pred ccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHH-----HHHHHHhhHhhhh----- Confidence 22333221 011 001111 11112222 1112222222222111 11 1233333222111 Q ss_pred EEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHH-hhccc Q lcl|NC_020883. 67 VIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQIT-KNSKL 145 (589) Q Consensus 67 ~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~-kn~~~ 145 (589) ++++.|++-..+.-....-+...+.. .+..+ .+.++ ..-+. T Consensus 147 ----------------Irn~~g~pv~L~~lp~~~~Rv~~~~~-~~~~~---------------------~~~ll~~~pn~ 188 (651) T protein:vir:99 147 ----------------LTDIEGRPVGLAYVPARTVRVRRPQN-RFDQP---------------------RHPEEGRYVDG 188 (651) T ss_pred ----------------hhcCccchhhhhhcChhheeeecccc-cccch---------------------hhhhhhccccc Confidence 11222221111100000000000000 00000 00001 00011 Q ss_pred c---ccchhhHHHHHHcCceeEEEEEecCc-e-eEEEecCceecccccCcceeEEEeecCCCc--cceEEEEEeeecccc Q lcl|NC_020883. 146 E---RRHWSNIVQHQVDGGIVAAPVIDELG-P-RIVFKARDVYFPHDDEKGADLAYYIDHGQY--GQFLHIYRERVEKDG 218 (589) Q Consensus 146 ~---~~~~~~l~~~~v~Gg~~~~~~~~~~~-~-~i~f~~~d~~~P~~d~~~~div~~~e~~~~--~~~l~~~~~~~~~~~ 218 (589) + ..+ ..+.|.+-.+...+..+.+..+ + .+.....++ +.+.+....... .++++.. . T Consensus 189 ~~~~~~~-~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~---------v~~~~~~d~~~~~~~~~~~~~-------~ 251 (651) T protein:vir:99 189 DVADIAS-RGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDE---------PTIRYREDEESEREPIFVDRE-------T 251 (651) T ss_pred ccchhHH-HHHHHHHhcCcceEEEeeccccceeeeeccCCcc---------eeEEeccCcceeeeeecccce-------e Confidence 1 111 1223433444445555443221 1 111111111 111111100000 0000000 0 Q ss_pred ceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCccc Q lcl|NC_020883. 219 LRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYG 298 (589) Q Consensus 219 ~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG 298 (589) +. | .....+... . ++.--|.|+++..+...++| T Consensus 252 g~-----~----------~~~~~~~~~---~-----------------------------~~~~eViHir~~~~~~g~~G 284 (651) T protein:vir:99 252 GD-----V----------TTGDANGLE---N-----------------------------RPANELIFIPNPSILEDDYG 284 (651) T ss_pred ee-----E----------EEcCCCcee---E-----------------------------ecccceEEecCCCCCCCccc Confidence 00 0 000000000 0 01112899998877888999 Q ss_pred CcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEE--echhhhhccccccccccccccccccccccccccccccccccc-- Q lcl|NC_020883. 299 ISALDNLESKQDEINWTITRSAVIYEQNGKPRIS--ITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFD-- 374 (589) Q Consensus 299 ~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~--VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~d-- 374 (589) .|.+..+...+.-....-....+.|...+.|+.+ +|...|+.-+- ++-.+...... ...+..+ +.+.+ T Consensus 285 ~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~---~~lr~~~~~~~----~nagk~~-vL~~~~~ 356 (651) T protein:vir:99 285 VPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESK---RDLRQMLNGLR----EESHRAV-VLEVEKF 356 (651) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHH---HHHHHHHHHHh----ccCCceE-Eeecccc Confidence 9999776666543332222223445444556655 35443432110 00000000000 0011111 11100 Q ss_pred ----cccCccceeeecccH---HHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHH Q lcl|NC_020883. 375 ----ENGRSMEIHQIDISK---IGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQK 447 (589) Q Consensus 375 ----e~g~~~~~iq~Dirv---eeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~ 447 (589) ..+..+.+...+... .+.++..+...++|..+=+.|+.-.|...+.+- +.+..+.+ .+.+ ..+.-|.. T Consensus 357 ~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~--sn~E~~~~-~f~~--~tL~P~~~ 431 (651) T protein:vir:99 357 QSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANR--SNSDQQDK-DFAL--EVIQPEQH 431 (651) T ss_pred cccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCc--ccHHHHHH-HHHH--HHHHHHHH Confidence 011234444444322 234555566777888888999998886433221 11222221 1111 12333443 Q ss_pred HHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC-CCCCHHHHHH Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN-PDASEDWIQE 526 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh-pdw~dE~v~e 526 (589) .++..|.+.+ |... .........+.|+..--...+.++.++..+.+.++++++.-.+-++++ |-.++++ T Consensus 432 ~ie~eln~kL-----l~~~--e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~--- 501 (651) T protein:vir:99 432 TFAEWLYQII-----HQQA--LGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPY--- 501 (651) T ss_pred HHHHHHHHhh-----cCcc--ccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc--- Confidence 4444443321 1110 000111223445432112234566788888888999998887766663 2122221 Q ss_pred HHHHHHhhccccccccccccccccccccCcccCCCCCCCCCC-CCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 527 EIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGD-TEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 527 Ev~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~-~~eep~~~~~e~~~~~~~~~~ 589 (589) +..+|.+....... .+...|+ +..+++++++...+++.+.+- T Consensus 502 -------------------gd~~l~~~~~~~~g--~~~~gge~~~~~~~~~~~~~~~~e~~~~~ 544 (651) T protein:vir:99 502 -------------------GEMTLSEFEAEVAG--DVAGGGETEAVHEPPEENKIGEREWDTVK 544 (651) T ss_pred -------------------cccccccccccccc--ccccCCCCcccccCccccccccchhhhhh Confidence 11222222111111 1111111 223334445555555554443 No 149 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=39.65 E-value=1 Score=20.57 Aligned_cols=393 Identities=14% Similarity=0.085 Sum_probs=131.9 Q ss_pred hccccccccccccCCcccc------hhhccchhhc----ccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhh Q lcl|NC_020883. 83 VSGSIGQIKSSITTGEIDP------DIEEDTDEMI----EGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSN 152 (589) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~------~~~~~~~~~i----~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~ 152 (589) |+++|+++-+.....+.+. ..-+-++..+ .|. ....+..- +.-.++|+-.|...+.- T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~g~~v-~~~~al~~~~V~~~i~~---------- 67 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGR--ESSSGKKV-TVDKAMKLSAVWACVRL---------- 67 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccCchHHHHHHhcC--CccCCcee-chhhhhccHHHHHHHHH---------- Confidence 7777777666433332210 1111111111 010 00011111 12234554444333211 Q ss_pred HHHHHHcCceeEEEEEe-cCceeEEEecCceec-----ccccCc---------------ceeEEEeecCCCccc---eEE Q lcl|NC_020883. 153 IVQHQVDGGIVAAPVID-ELGPRIVFKARDVYF-----PHDDEK---------------GADLAYYIDHGQYGQ---FLH 208 (589) Q Consensus 153 l~~~~v~Gg~~~~~~~~-~~~~~i~f~~~d~~~-----P~~d~~---------------~~div~~~e~~~~~~---~l~ 208 (589) |- ..-+.+.+++|-. .++-+..-++-+.++ |....- |-.++|.... .++ .++ T Consensus 68 ia--~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~--~G~~~~L~~ 143 (434) T protein:vir:43 68 IS--TSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA--AGRPAALDF 143 (434) T ss_pred HH--HhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CCcEEEEEE Confidence 11 1234566666643 223222222211111 211111 2223333221 121 111 Q ss_pred EEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEec Q lcl|NC_020883. 209 IYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWA 288 (589) Q Consensus 209 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvP 288 (589) +....+. +. ....+...+. .....|.+.. + +.--|.|++ T Consensus 144 l~p~~v~-----~~--~~~~g~~~y~--~~~~~g~~~~-------------------~-------------~~~eVih~~ 182 (434) T protein:vir:43 144 LLPSRVD-----LE--CDENGRLKYF--YTTKKGARRE-------------------I-------------ERTNMLHIP 182 (434) T ss_pred EcCcceE-----EE--EcCCCeEEEE--EEecCceEEE-------------------E-------------ccccEEEec Confidence 1111100 00 0000000000 0001111110 1 112288998 Q ss_pred CCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccc---cccccccccccccccc Q lcl|NC_020883. 289 NNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERD---GHSAKEASMMTPRIDH 365 (589) Q Consensus 289 N~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~d---ge~~~~~~~~~~~~d~ 365 (589) +.. .+.++|.|-+.-+...+......-..-.+.|...+.|..++ ...+.+..+.. .+......+. ...+ T Consensus 183 ~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil-----~~~~~l~~e~~~~~r~~~~~~~g~--~nag 254 (434) T protein:vir:43 183 AFT-LDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAF-----KVDRILQPAQREEFREYVKSVSGA--MNSG 254 (434) T ss_pred CcC-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEE-----ecCCCCCHHHHHHHHHHHHHhcCc--cccC Confidence 764 45578999886655555433322222234453445665443 11111111100 0110000000 0001 Q ss_pred ccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHH-HHHHhhhHHHHHHH Q lcl|NC_020883. 366 RDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVA-KFYDLLTTILKSRR 444 (589) Q Consensus 366 ~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A-~r~~~~~~~~Kv~~ 444 (589) ..+ +. +.|...+-++...+-.+..+..+...++|..+=+.|+.-.|...+++. +.|.+. ....+.+- .+.- T Consensus 255 ~~~-vl---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--~~s~~e~~~~~f~~~--~L~P 326 (434) T protein:vir:43 255 RSP-VL---EQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSN--WGTGLEQQMLAFLTF--SISS 326 (434) T ss_pred Ccc-cc---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCcc--ccchHHHHHHHHHHH--HHHH Confidence 111 11 223222223333344455666777888888888999988876443221 112222 11222111 1222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHH Q lcl|NC_020883. 445 LQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWI 524 (589) Q Consensus 445 ~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v 524 (589) |-..++..|.+-+ +... . .......+.+.+.+..| .++.++..+.+.+++.++.-.+-+.++ +.+ T Consensus 327 ~~~~ie~~ln~kL-----~~~~-~--~~~~~~~fd~~~llr~d--~~~r~~~~~~~~~~G~~T~NE~R~~~g--l~p--- 391 (434) T protein:vir:43 327 ITNQIQQCVNKRL-----LTAP-E--RIRYYAEFSLEGFLKAD--SAGRAAWYSTMAQNGFMTRNEGRRKEN--LPE--- 391 (434) T ss_pred HHHHHHHHHHhhc-----CChh-h--hcCceEEEechhhhccC--HHHHHHHHHHHHhCCCcCHHHHHHHhC--CCC--- Confidence 2222223332211 1000 0 00011223333443334 556688888888888887765555442 111 Q ss_pred HHHHHHHHhhccccccccc-cccccccccccCcccCCCCCCCCC--CCCCCCCcch Q lcl|NC_020883. 525 QEEIARIEEEQAGSDTSSL-MGINQTFEQMNDNRDEDGNIIEEG--DTEEEPSAEE 577 (589) Q Consensus 525 ~eEv~RI~~E~a~~~p~~~-g~~~~~l~~~~~~~~~~~~p~deg--~~~eep~~~~ 577 (589) | +.+ +-.-+ ++. -++.. ..+.++ ..+..++ +...+|.++| T Consensus 392 ------~--~gg--D~~~~~~n~-~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 392 ------L--PGG--DILTVQSNL-VPIDQ-LGQSNK-SQAVRAALMNWFSQPEPQE 434 (434) T ss_pred ------C--CCC--CeEeeccCc-cchhh-hhccCC-CcchhhhhhccCCCCCCCC Confidence 0 000 00000 000 01110 111111 0000111 1122222222 No 150 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=38.96 E-value=1 Score=20.49 Aligned_cols=470 Identities=14% Similarity=0.085 Sum_probs=185.4 Q ss_pred Cc--cc------eeccc--------hhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCc Q lcl|NC_020883. 1 MI--DW------TVRGW--------TDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQT 64 (589) Q Consensus 1 ~~--~~------~~~~~--------~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (589) ++ +- .-.|| .-...++.-..+.+||.| -.|.||=.- T Consensus 20 ~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~m--a~~pEVd~A-------------------------- 71 (564) T protein:vir:10 20 PVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDM--SLHPEVDSA-------------------------- 71 (564) T ss_pred cccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHH--hhccchhhH-------------------------- Confidence 00 00 00000 001112222333334433 123333221 Q ss_pred ceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcc Q lcl|NC_020883. 65 PYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSK 144 (589) Q Consensus 65 ~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~ 144 (589) |-++=-+.+ . .+.. ..|+.-..+......-+-+.=.|-|+.|++--+ T Consensus 72 ----------v~eIVneaI--------v------------~d~~---~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~ 118 (564) T protein:vir:10 72 ----------IDEIVNEFV--------V------------NDGD---DKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMN 118 (564) T ss_pred ----------HHHhhccee--------E------------ecCC---CceEEEEecccCcchHHHHHHHHHHHHHHHHhc Confidence 111100111 0 0111 011111111111122222222456888999889 Q ss_pred ccccchhhHHHHHHcCceeEEEEEecC----c-eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccc Q lcl|NC_020883. 145 LERRHWSNIVQHQVDGGIVAAPVIDEL----G-PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGL 219 (589) Q Consensus 145 ~~~~~~~~l~~~~v~Gg~~~~~~~~~~----~-~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~ 219 (589) |+++-.+-+-.--|||++.+-.+||.. | ..+...+| +-...|++.+...+....-+ T Consensus 119 F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDP---------r~i~~vr~i~~~~~~~~~~v---------- 179 (564) T protein:vir:10 119 FNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDS---------LKIRKVRQKLKDVDPNRKEI---------- 179 (564) T ss_pred cchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcc---------cceeeeeeecccccccccee---------- Confidence 999999999999999999999999832 3 22333333 33333543332111111122 Q ss_pred eeehhhhccccccchhh----eeecccccccccccccccchhhhhhcccCCccccccccccCCCC--cceEEEecC---- Q lcl|NC_020883. 220 RTTNMLYPVVKAKGDVK----KEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRN--RPFISYWAN---- 289 (589) Q Consensus 220 ~~~~~~y~~~~~~~~~~----~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~--~plvvyvPN---- 289 (589) +++++++.... +...++..+.-.. + ..++... ...-+|++ .-.|+|.+- T Consensus 180 ------~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~-----~--~~~~~~~--------~~~~~~ikI~~daI~y~hSGL~d 238 (564) T protein:vir:10 180 ------EKGTALQYDYGDFIEYYIYNPKGFAGNI-----P--MVTGSMD--------WSNQEGIKIASDAIAQSTSGLMD 238 (564) T ss_pred ------eeeeeeeccccccccceeeccccccCcc-----c--ccccccc--------cccccceeechhhcceeccccee Confidence 11111111100 0000110000000 0 0000000 00112232 223555421 Q ss_pred --CCCCCCcccCcchhhhhHHHHHHHHH--HhHHHHHHHHhCCCcEEe----------chhhhhcccccccccccccccc Q lcl|NC_020883. 290 --NETFMNPYGISALDNLESKQDEINWT--ITRSAVIYEQNGKPRISI----------TKEMMDTLLNIAYERDGHSAKE 355 (589) Q Consensus 290 --~~~~~~~lG~SD~~~ie~l~DeLd~t--~S~~srildk~gkpRI~V----------P~~~L~t~~g~~~d~dge~~~~ 355 (589) ....-+.|.++.= ..+-+..|..+ +-|++|. -..||+- .+.+|+..-+. -++.-++.. T Consensus 239 ~~~~~i~gyLhkAIK--p~NQLkmlEDAlVIYRitRA----PeRRvFYIDVGnLPk~KAeqYlr~iM~k--~KNklVYDa 310 (564) T protein:vir:10 239 LNKKMTLSFLHKAIK--SLNQLRMIEDSLVIYRLSRA----PERRIFYIDVGNLPKVKAEQYLRDVMSR--YRNKLVYDG 310 (564) T ss_pred CCCCceeccchhhhH--hHHhhHHHHhhHHHHhhhcc----ccceEEEEecCCCCchhHHHHHHHHHHh--cCceEEEec Confidence 1111111221111 12223333333 2355544 3334431 12333222000 011111111 Q ss_pred ccccccccccccccc-----cccccccCccceeeec-ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCc-ccchhHHH Q lcl|NC_020883. 356 ASMMTPRIDHRDMEI-----TTFDENGRSMEIHQID-ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDG-GASGAQSG 428 (589) Q Consensus 356 ~~~~~~~~d~~dlev-----~~~de~g~~~~~iq~D-irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~-g~~~A~Sg 428 (589) ..+... .|-.-+.+ ++--++|..-++-|.. ++--+-+.-++..-+.+|..-..|-+-++...++ ..+.+ | T Consensus 311 ~TGevr-ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~-~- 387 (564) T protein:vir:10 311 QTGEIR-DDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKS-T- 387 (564) T ss_pred cCceec-ccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccc-c- Confidence 111110 01111111 1112455555554433 3334455566677777787778887777642111 11111 2 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-ccCcccceeeeCCcCCCCCCHHHHHHHHHHHh------ Q lcl|NC_020883. 429 VAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS-SIRIEEPNIETQDMILKPRAELVAENMAAYAA------ 501 (589) Q Consensus 429 ~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~-~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~------ 501 (589) .+.+.-+.--+-+.|.|..|...+..+++.-|.|-.-.-. ...-...+|.|.=.-----+|+..++++.-+. T Consensus 388 -EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~ 466 (564) T protein:vir:10 388 -EILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQM 466 (564) T ss_pred -chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 2222222223458888888999888888776555432100 00001123333211001124555555543321 Q ss_pred ---ccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCC---CCCCCCCCCCCCCc Q lcl|NC_020883. 502 ---SKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDG---NIIEEGDTEEEPSA 575 (589) Q Consensus 502 ---~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~---~p~deg~~~eep~~ 575 (589) -|+..|+++..+.+-- .+|+++.+|-..|++|.... .. -+|...+++++.+..+ .|.|.|-..+-++. T Consensus 467 dpyvGky~S~dyi~k~ILr-~tDeei~~~~kqI~~E~k~~----~~-~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 540 (564) T protein:vir:10 467 DPFVGKYFSTEYIRRKILM-QTENEFKEIDKQMKSDIESG----LA-IDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAE 540 (564) T ss_pred hhhhccccchHHHHHHHhc-cCHHHHHHHHHHHHHHhhcC----CC-CCchhhhcCCCccCCCCcCCcchhhhccccccc Confidence 2345588988888753 88999999999999997641 11 1344555555544433 35555444444333 Q ss_pred chhhhhhcccccCC Q lcl|NC_020883. 576 EENEEIEKEGEPIA 589 (589) Q Consensus 576 ~~~e~~~~~~~~~~ 589 (589) -+-+-.+.-.++-. T Consensus 541 ~~~~~~~~a~~~~~ 554 (564) T protein:vir:10 541 REIKKLNSAPKPPP 554 (564) T ss_pred cChhhhccCCCCCC Confidence 33332222222111 No 151 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=37.88 E-value=1.1 Score=20.37 Aligned_cols=434 Identities=14% Similarity=0.081 Sum_probs=137.6 Q ss_pred CccceeccchhHHHHhhcch---hhhhh---------hhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEE Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGD---YERYR---------QLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVI 68 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~---~~~~r---------~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 68 (589) |. ++|-. +.||+ .+++++..+.+ ..||=. T Consensus 1 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------------~pp~~~ 41 (540) T protein:vir:41 1 MF-------------NYHLSIKSLEKYRAIKGDTDSQALKEDRFEEYV--------------------------EPKVHP 41 (540) T ss_pred CC-------------CcccChhhccchhhhhccccccccccCCCCccc--------------------------cCCCCH Confidence 11 11211 12221 22222222221 111100 Q ss_pred EEcchhhhccch-----hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhc Q lcl|NC_020883. 69 FNLPKVIAEIPA-----TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNS 143 (589) Q Consensus 69 ~n~~~~i~~~pa-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~ 143 (589) ..+.++...-|. .++++.+...+-... .+...... --.|.... ..+|+.+++.+ T Consensus 42 ~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~---------~~~~~~~~---------~lpN~~~t--~~~f~~~~v~d- 100 (540) T protein:vir:41 42 LVLLSLLQVNPYHASACSIKANDILRTGYLID---------GDDGGVEE---------LLRACRPS--FEFILLQALED- 100 (540) T ss_pred HHHHHHHHhcHHHHHHHHHHHHHHhcCCceEe---------cCccchhh---------hccCCCCC--HHHHHHHHHHH- Confidence 000011101000 122222221111000 00000000 00000110 01233333222 Q ss_pred cccccchhhHHHHHHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeecccccee Q lcl|NC_020883. 144 KLERRHWSNIVQHQVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRT 221 (589) Q Consensus 144 ~~~~~~~~~l~~~~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~ 221 (589) .+.-|-.++-+..+..| ..+....+++.-.+.++.+. +....+....|...|+.. . T Consensus 101 ------------lll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~---~~~~d~~~~~~~~~~~~~----~--- 158 (540) T protein:vir:41 101 ------------LQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSRY---MQTWDGIHVTYFKDYRYE----G--- 158 (540) T ss_pred ------------HHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCcee---EeeecCceeeeeeccccc----c--- Confidence 23445544444445333 45666666555443222211 011111000011111000 0 Q ss_pred ehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcc Q lcl|NC_020883. 222 TNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISA 301 (589) Q Consensus 222 ~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD 301 (589) . ..+ ..|. ....++.--|.|+.+..+...++|.|. T Consensus 159 ~-~~~-------------~~g~-------------------------------~~~~~~~~eViHir~~~~~~~~~G~Sp 193 (540) T protein:vir:41 159 E-VNP-------------DNGE-------------------------------DQDGVGANEIIFIHLPSPICSYYGVPR 193 (540) T ss_pred e-eec-------------cccc-------------------------------cceeecccceEEecCCCCCCCcccccH Confidence 0 000 0000 000011112889988878888999999 Q ss_pred hhhhhHHHHHHHHHHhHHHHHHHHhCCCcEE--echhhhhccccccccccc---c----ccc-cccccccccccccc--c Q lcl|NC_020883. 302 LDNLESKQDEINWTITRSAVIYEQNGKPRIS--ITKEMMDTLLNIAYERDG---H----SAK-EASMMTPRIDHRDM--E 369 (589) Q Consensus 302 ~~~ie~l~DeLd~t~S~~srildk~gkpRI~--VP~~~L~t~~g~~~d~dg---e----~~~-~~~~~~~~~d~~dl--e 369 (589) +..+...+.....+-..-.+.|.-.+.|..+ +|..+.+.. ....+..- + ... .+.+ .....+.++ + T Consensus 194 i~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~-~~~~~~~~~~~~~~~~~~~~~~~g-~~~nag~~~vLe 271 (540) T protein:vir:41 194 YLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEM-ELGSDGEPTGRTVLQGLIEDNFKY-LKEAPHTPLVFS 271 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchh-ccchHHHHHHHHHHHHHHHHHhcc-ccccccceEEEe Confidence 8765554444333322234445444556543 343322110 00000000 0 000 0000 000011111 1 Q ss_pred ccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020883. 370 ITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEY 449 (589) Q Consensus 370 v~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~ 449 (589) .....+.|....-++...+-.+..+..+...++|..+=+.|+.-.|...+++...+.... ....+... .+.-+...+ T Consensus 272 ~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq-~~~~f~~~--tL~P~~~~i 348 (540) T protein:vir:41 272 IPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEV-ARRTYYES--VVRPQQEIV 348 (540) T ss_pred cCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHH-HHHHHHHH--HHHHHHHHH Confidence 111112222222233344445566677778888888889999999865433222121111 12222111 223343334 Q ss_pred HHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC--CCCCHHHHHHH Q lcl|NC_020883. 450 IDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN--PDASEDWIQEE 527 (589) Q Consensus 450 ~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh--pdw~dE~v~eE 527 (589) ...|.+.+ +. ..+. ...+.|+..-..+ .+. +...+.+..+++++.-.+-..|. |..++.-+ T Consensus 349 e~~ln~~L-----~~-~~~~-----~~~i~f~~~~ll~-~D~--~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l--- 411 (540) T protein:vir:41 349 SSVLTDFI-----QL-KLDP-----GARFVFNEEILME-SEF--VHNYALLVQCGVLTPSEVREKLFGLDGGPDMFM--- 411 (540) T ss_pred HHHHHHhh-----hh-ccCC-----ceEEEecchhhcc-hHH--HHHHHHHHhCCCCCHHHHHHHhCcCcCCCcccc--- Confidence 44444322 11 1111 1235565432222 222 33344567777887766543342 11222110 Q ss_pred HHHHHhhccccccccc-cccc---cccccccCcccCCCCCCCCCCCCCCCCcch-hhhhhcc-cc-------------cC Q lcl|NC_020883. 528 IARIEEEQAGSDTSSL-MGIN---QTFEQMNDNRDEDGNIIEEGDTEEEPSAEE-NEEIEKE-GE-------------PI 588 (589) Q Consensus 528 v~RI~~E~a~~~p~~~-g~~~---~~l~~~~~~~~~~~~p~deg~~~eep~~~~-~e~~~~~-~~-------------~~ 588 (589) .-+ +. ....+ +..+ ..-++...+....+.|.-+++.++++++++ +.++... .+ +| T Consensus 412 -~p~---n~--~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (540) T protein:vir:41 412 -VPS---SI--GKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFRAEAYENGKKMLSI 485 (540) T ss_pred -ccc---cc--ccccccccccccCCCCccccccccchhcccccCccccccccccccccccccccccCCccccchhHHHHH Confidence 000 00 00000 0000 000111112222222222223333333333 1111100 00 01 Q ss_pred C Q lcl|NC_020883. 589 A 589 (589) Q Consensus 589 ~ 589 (589) . T Consensus 486 ~ 486 (540) T protein:vir:41 486 A 486 (540) T ss_pred h Confidence 0 No 152 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=36.93 E-value=1.1 Score=20.26 Aligned_cols=473 Identities=12% Similarity=0.081 Sum_probs=142.3 Q ss_pred Ccccee-----ccc-hhHHHHhhc--chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee------------ Q lcl|NC_020883. 1 MIDWTV-----RGW-TDKTTKNVH--GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR------------ 60 (589) Q Consensus 1 ~~~~~~-----~~~-~~~~~~~~~--~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~------------ 60 (589) |.|..- |.+ .+|+|++|. +++.. -=++.|.=|++.+.|+++..=...-...++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 75 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQA-----NIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKR 75 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhh-----hHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccc Confidence 332211 011 134444432 12211 11233444455555554321111111111111 Q ss_pred -ecCcceEEEEcchhhhccchhhhccccccccccccCCccc-chhhccchhhc-ccccccch-hhhhhhhhhhhhhhhHH Q lcl|NC_020883. 61 -ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEID-PDIEEDTDEMI-EGPQDEEE-AGKNENNTVIDLQNEII 136 (589) Q Consensus 61 -~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~i-~~~~~~~~-~~~~~~~~~~~~~~e~i 136 (589) ...++|=+-++.+.+.+- +++...+-.+....+. ++ .....+...-. .-..+... ..+.+..+. .-+ T Consensus 76 ~~~~~~~~l~~~l~~~~~n--~i~~~~I~t~~~~vA~--~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~-----~~l 146 (563) T protein:vir:99 76 SYMKNEHNLHDVLKKFGNN--PILNAIILTRSNQVAM--YCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEM-----KRI 146 (563) T ss_pred cCCCCcccHHHHHHHhhcc--hHHHHHHHHHHHHHHH--HhhhhhhhcccccceeEEeecCCCcchhhhhhh-----HHH Confidence 112222222222222221 2222222222221110 00 00000000000 00000000 000111111 112 Q ss_pred HHHHhhcccc--------ccchhhHHHH-HHcCceeEEEEEe-c-Cc--eeEEEecCceecccccCcceeEEEeecCCCc Q lcl|NC_020883. 137 EQITKNSKLE--------RRHWSNIVQH-QVDGGIVAAPVID-E-LG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQY 203 (589) Q Consensus 137 ~~v~kn~~~~--------~~~~~~l~~~-~v~Gg~~~~~~~~-~-~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~ 203 (589) +..+.++..+ ..|+..++.+ +.-|-..+-+++. + .+ +.+....+....+..+..+. T Consensus 147 ~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~----------- 215 (563) T protein:vir:99 147 EDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK----------- 215 (563) T ss_pred HHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc----------- Confidence 2233322221 1344445543 4444344434432 2 22 34555555554442111110 Q ss_pred cceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcce Q lcl|NC_020883. 204 GQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPF 283 (589) Q Consensus 204 ~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~pl 283 (589) .|.+.. .| +...+++.... ++ .+=. T Consensus 216 -~~~~~~--------------~y---------~~~~~g~~~~~-------------------~~------------~~ev 240 (563) T protein:vir:99 216 -IIKGGK--------------RF---------VQVVDKRVVAS-------------------FT------------SREL 240 (563) T ss_pred -eeccce--------------eE---------EEEeCCceeEE-------------------ec------------Ccce Confidence 000000 00 00001111000 00 0001 Q ss_pred EEEecCCC--CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCc--EEechhh-hhccccccccccccccccccc Q lcl|NC_020883. 284 ISYWANNE--TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPR--ISITKEM-MDTLLNIAYERDGHSAKEASM 358 (589) Q Consensus 284 vvyvPN~~--~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpR--I~VP~~~-L~t~~g~~~d~dge~~~~~~~ 358 (589) +.|+.|.. ....++|.|-+.-+...+.-...+-....+.|...+.|+ |.+|... |+.-+ .++-.+.+.. T Consensus 241 I~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~---~~~~~~~~~~--- 314 (563) T protein:vir:99 241 AMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHA---LENFKREWKS--- 314 (563) T ss_pred EEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHH---HHHHHHHHHH--- Confidence 34544533 334678999987666666544433334456665667888 4444321 21100 0000000000 Q ss_pred cccccc--cccccccccccccCcccee--eecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHH Q lcl|NC_020883. 359 MTPRID--HRDMEITTFDENGRSMEIH--QIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYD 434 (589) Q Consensus 359 ~~~~~d--~~dlev~~~de~g~~~~~i--q~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~ 434 (589) .+.+.+ +.-.-+. +.| +.|. +....-.+.++......++|..+=+.|+.-.|...+++.+++..+....++ T Consensus 315 ~~~G~~nagk~~~vl---~~G--~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~s 389 (563) T protein:vir:99 315 SLSGINGSWQIPVVM---ADD--IKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEA 389 (563) T ss_pred HhccccccccceEEc---CCC--ceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhc Confidence 000000 0000111 222 3333 333344456667777888888888999999987554433333222222211 Q ss_pred hhh-----HH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHH--HHhccchh Q lcl|NC_020883. 435 LLT-----TI-LKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAA--YAASKQGQ 506 (589) Q Consensus 435 ~~~-----~~-~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~--~l~~a~~~ 506 (589) .+. .+ ..+.-|...+...|.+.+ +. ..+ ....+.|..+ |.+ +.++... .+.+++++ T Consensus 390 n~e~~~~~f~~~tL~P~l~~ie~~ln~~L-----~~-~~~-----~~~~~~f~r~---D~~--~~~e~~~~~~~~~~G~l 453 (563) T protein:vir:99 390 DPGKKQQQSQNKGLQPLLRFIEDLVNRHI-----IS-EYG-----DKYTFQFVGG---DTK--SATDKLNILKLETQIFK 453 (563) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----ch-hcc-----cccEEEeccC---CHH--HHHHHHHHHHHhcCCcc Confidence 110 00 112223323333332211 11 111 1123455332 222 2222222 24566777 Q ss_pred hHHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccC------CCCCCCCCCCCCCCCcc--- Q lcl|NC_020883. 507 SLETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDE------DGNIIEEGDTEEEPSAE--- 576 (589) Q Consensus 507 S~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~------~~~p~deg~~~eep~~~--- 576 (589) |.-.+-+.+. |-.+. - +.+..-.. . .+.+...+.-......++. ...+.+.++++.+++.+ T Consensus 454 T~NE~R~~~gl~Pi~g--G----D~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (563) T protein:vir:99 454 TVNEAREEQGKKPIEG--G----DIILDASF-L--QGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSN 524 (563) T ss_pred CHHHHHHHhCCCCCCC--c----ceeecccc-c--ccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 7765554442 10110 0 00000000 0 0000000000000000000 00001111111112111 Q ss_pred hhhhhhcccccCC Q lcl|NC_020883. 577 ENEEIEKEGEPIA 589 (589) Q Consensus 577 ~~e~~~~~~~~~~ 589 (589) ++.+..+++.+=+ T Consensus 525 ~~~~~~~~~~~~~ 537 (563) T protein:vir:99 525 DDKEIGTDAQIKG 537 (563) T ss_pred Ccccccccccccc Confidence 1111111111111 No 153 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=36.93 E-value=1.1 Score=20.26 Aligned_cols=473 Identities=12% Similarity=0.081 Sum_probs=142.3 Q ss_pred Ccccee-----ccc-hhHHHHhhc--chhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceee------------ Q lcl|NC_020883. 1 MIDWTV-----RGW-TDKTTKNVH--GDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTAR------------ 60 (589) Q Consensus 1 ~~~~~~-----~~~-~~~~~~~~~--~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~------------ 60 (589) |.|..- |.+ .+|+|++|. +++.. -=++.|.=|++.+.|+++..=...-...++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 75 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQA-----NIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKR 75 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhh-----hHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccc Confidence 332211 011 134444432 12211 11233444455555554321111111111111 Q ss_pred -ecCcceEEEEcchhhhccchhhhccccccccccccCCccc-chhhccchhhc-ccccccch-hhhhhhhhhhhhhhhHH Q lcl|NC_020883. 61 -ETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEID-PDIEEDTDEMI-EGPQDEEE-AGKNENNTVIDLQNEII 136 (589) Q Consensus 61 -~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~i-~~~~~~~~-~~~~~~~~~~~~~~e~i 136 (589) ...++|=+-++.+.+.+- +++...+-.+....+. ++ .....+...-. .-..+... ..+.+..+. .-+ T Consensus 76 ~~~~~~~~l~~~l~~~~~n--~i~~~~I~t~~~~vA~--~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~-----~~l 146 (563) T protein:vir:95 76 SYMKNEHNLHDVLKKFGNN--PILNAIILTRSNQVAM--YCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEM-----KRI 146 (563) T ss_pred cCCCCcccHHHHHHHhhcc--hHHHHHHHHHHHHHHH--HhhhhhhhcccccceeEEeecCCCcchhhhhhh-----HHH Confidence 112222222222222221 2222222222221110 00 00000000000 00000000 000111111 112 Q ss_pred HHHHhhcccc--------ccchhhHHHH-HHcCceeEEEEEe-c-Cc--eeEEEecCceecccccCcceeEEEeecCCCc Q lcl|NC_020883. 137 EQITKNSKLE--------RRHWSNIVQH-QVDGGIVAAPVID-E-LG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQY 203 (589) Q Consensus 137 ~~v~kn~~~~--------~~~~~~l~~~-~v~Gg~~~~~~~~-~-~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~ 203 (589) +..+.++..+ ..|+..++.+ +.-|-..+-+++. + .+ +.+....+....+..+..+. T Consensus 147 ~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~----------- 215 (563) T protein:vir:95 147 EDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK----------- 215 (563) T ss_pred HHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc----------- Confidence 2233322221 1344445543 4444344434432 2 22 34555555554442111110 Q ss_pred cceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcce Q lcl|NC_020883. 204 GQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPF 283 (589) Q Consensus 204 ~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~pl 283 (589) .|.+.. .| +...+++.... ++ .+=. T Consensus 216 -~~~~~~--------------~y---------~~~~~g~~~~~-------------------~~------------~~ev 240 (563) T protein:vir:95 216 -IIKGGK--------------RF---------VQVVDKRVVAS-------------------FT------------SREL 240 (563) T ss_pred -eeccce--------------eE---------EEEeCCceeEE-------------------ec------------Ccce Confidence 000000 00 00001111000 00 0001 Q ss_pred EEEecCCC--CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCc--EEechhh-hhccccccccccccccccccc Q lcl|NC_020883. 284 ISYWANNE--TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPR--ISITKEM-MDTLLNIAYERDGHSAKEASM 358 (589) Q Consensus 284 vvyvPN~~--~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpR--I~VP~~~-L~t~~g~~~d~dge~~~~~~~ 358 (589) +.|+.|.. ....++|.|-+.-+...+.-...+-....+.|...+.|+ |.+|... |+.-+ .++-.+.+.. T Consensus 241 I~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~---~~~~~~~~~~--- 314 (563) T protein:vir:95 241 AMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHA---LENFKREWKS--- 314 (563) T ss_pred EEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHH---HHHHHHHHHH--- Confidence 34544533 334678999987666666544433334456665667888 4444321 21100 0000000000 Q ss_pred cccccc--cccccccccccccCcccee--eecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHH Q lcl|NC_020883. 359 MTPRID--HRDMEITTFDENGRSMEIH--QIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYD 434 (589) Q Consensus 359 ~~~~~d--~~dlev~~~de~g~~~~~i--q~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~ 434 (589) .+.+.+ +.-.-+. +.| +.|. +....-.+.++......++|..+=+.|+.-.|...+++.+++..+....++ T Consensus 315 ~~~G~~nagk~~~vl---~~G--~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~s 389 (563) T protein:vir:95 315 SLSGINGSWQIPVVM---ADD--IKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEA 389 (563) T ss_pred HhccccccccceEEc---CCC--ceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhc Confidence 000000 0000111 222 3333 333344456667777888888888999999987554433333222222211 Q ss_pred hhh-----HH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHH--HHhccchh Q lcl|NC_020883. 435 LLT-----TI-LKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAA--YAASKQGQ 506 (589) Q Consensus 435 ~~~-----~~-~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~--~l~~a~~~ 506 (589) .+. .+ ..+.-|...+...|.+.+ +. ..+ ....+.|..+ |.+ +.++... .+.+++++ T Consensus 390 n~e~~~~~f~~~tL~P~l~~ie~~ln~~L-----~~-~~~-----~~~~~~f~r~---D~~--~~~e~~~~~~~~~~G~l 453 (563) T protein:vir:95 390 DPGKKQQQSQNKGLQPLLRFIEDLVNRHI-----IS-EYG-----DKYTFQFVGG---DTK--SATDKLNILKLETQIFK 453 (563) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----ch-hcc-----cccEEEeccC---CHH--HHHHHHHHHHHhcCCcc Confidence 110 00 112223323333332211 11 111 1123455332 222 2222222 24566777 Q ss_pred hHHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccC------CCCCCCCCCCCCCCCcc--- Q lcl|NC_020883. 507 SLETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDE------DGNIIEEGDTEEEPSAE--- 576 (589) Q Consensus 507 S~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~------~~~p~deg~~~eep~~~--- 576 (589) |.-.+-+.+. |-.+. - +.+..-.. . .+.+...+.-......++. ...+.+.++++.+++.+ T Consensus 454 T~NE~R~~~gl~Pi~g--G----D~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (563) T protein:vir:95 454 TVNEAREEQGKKPIEG--G----DIILDASF-L--QGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSN 524 (563) T ss_pred CHHHHHHHhCCCCCCC--c----ceeecccc-c--ccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 7765554442 10110 0 00000000 0 0000000000000000000 00001111111112111 Q ss_pred hhhhhhcccccCC Q lcl|NC_020883. 577 ENEEIEKEGEPIA 589 (589) Q Consensus 577 ~~e~~~~~~~~~~ 589 (589) ++.+..+++.+=+ T Consensus 525 ~~~~~~~~~~~~~ 537 (563) T protein:vir:95 525 DDKEIGTDAQIKG 537 (563) T ss_pred Ccccccccccccc Confidence 1111111111111 No 154 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=35.52 E-value=1.2 Score=20.10 Aligned_cols=390 Identities=7% Similarity=-0.003 Sum_probs=120.2 Q ss_pred cCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccccccccccccCCccc-------- Q lcl|NC_020883. 29 EGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEID-------- 100 (589) Q Consensus 29 ~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~-------- 100 (589) =|=..-+|.|..+... +... .+ .|+.....+++....+ +...+. T Consensus 1 Mgl~~~~f~~~~~~~~----~~~~------------~~-----------~~~~~~~~~~~g~~v~-~~~al~~~~v~~~v 52 (409) T protein:vir:84 1 MSLFTRIFSGPSEERT----LTKI------------SG-----------IPSPAEDWAMHGDRPG-ANSAMTLGAFYACV 52 (409) T ss_pred CchhhhhhcCCCcccc----cccc------------cc-----------cccccchhhccCcccc-hhhhhccHHHHHHH Confidence 1111111111000000 0000 00 0000000000000000 000000 Q ss_pred chhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcccc---ccchhhHHHHHHcCceeE-EEEE-ecCc--e Q lcl|NC_020883. 101 PDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLE---RRHWSNIVQHQVDGGIVA-APVI-DELG--P 173 (589) Q Consensus 101 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~---~~~~~~l~~~~v~Gg~~~-~~~~-~~~~--~ 173 (589) .-.+++...+=.--......+......+.++ +....+.+ ..|+..++.+...-|-.+ .+.+ +.++ . T Consensus 53 ~~ia~~iA~lp~~~~~~~~~~~~~~~~l~~l-------L~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~ 125 (409) T protein:vir:84 53 TLLADTVASLSIDAYRKKDNVRIPVSPAPKL-------LESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPT 125 (409) T ss_pred HHHHHhhhhCceEEEEecCCcccccchHHHH-------hhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceE Confidence 0000000000000000000000001111110 00111111 123333443333322221 2221 1111 1 Q ss_pred eEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccc Q lcl|NC_020883. 174 RIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAED 253 (589) Q Consensus 174 ~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d 253 (589) .+....++.+. +.... +..+. ..+..| ...|..++ T Consensus 126 ~L~~l~p~~v~------------------------v~~~~-~~~~~-~~~~~~------------~~~g~~~~------- 160 (409) T protein:vir:84 126 AIMPIHPDCIH------------------------VTDAK-DEDGD-WIEPVY------------RIDGKVVP------- 160 (409) T ss_pred EEEEEcCceeE------------------------EEEcC-CCcce-EEEEEe------------cCCceEEc------- Confidence 12222221111 11000 00000 000001 01111110 Q ss_pred cchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEe Q lcl|NC_020883. 254 LEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISI 333 (589) Q Consensus 254 ~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~V 333 (589) .--|.|+.+......++|.|-++-+...++.....-....+.|...++|+.++ T Consensus 161 ---------------------------~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 213 (409) T protein:vir:84 161 ---------------------------NHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGIL 213 (409) T ss_pred ---------------------------hhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEE Confidence 01178999888888889999988777666665555445566676668887765 Q ss_pred ch-hhhhccccccccccccccccccccccccccccccccccccccCccceeeecc--cHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020883. 334 TK-EMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDI--SKIGDMDHVKNLIKLMLIETQTS 410 (589) Q Consensus 334 P~-~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Di--rveeh~~~ie~L~~~Il~~a~ts 410 (589) -- +.|+.-+ . ++-.+.+.. .. ...+..+ +. +.| +.|.+... .-.+..+..+...++|..+=+.| T Consensus 214 ~~~~~l~~e~-~--~~~~~~~~~---~~-~n~g~~~-vl---~~g--~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP 280 (409) T protein:vir:84 214 SSDADLTPDQ-V--KQTQKQWIQ---SH-HNRRLPA-VM---SAG--IKWQSVSITPNESQFLETRSFQRSEIAMWFRIP 280 (409) T ss_pred ecCCCCCHHH-H--HHHHHHHHH---Hh-ccCCCee-ec---CCC--ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 21 1111100 0 000000000 00 0011111 11 222 23333333 33445555667788888888999 Q ss_pred chhcccccCcccchhHHHHH-HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCC Q lcl|NC_020883. 411 EKAVDFYLDGGASGAQSGVA-KFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRA 489 (589) Q Consensus 411 ~~AFg~~~~~g~~~A~Sg~A-~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~ 489 (589) +.-.|...+++.. .|.+. ....+.+- .+.-|-..++.+|.+.+ .. + . .....+...+. .+ T Consensus 281 p~~lg~~~~~~~~--~sn~e~~~~~f~~~--~l~P~~~~ie~~l~~~L------~~--g--~---~i~fd~~~l~~--~d 341 (409) T protein:vir:84 281 PHMIGDVEKSTSW--GTGIEEQGINFVRH--TLLPWLRCIEQALDTFL------PR--G--Q---FVKFNVDGLMR--GD 341 (409) T ss_pred HHHhCCCCCcccc--cchHHHHHHHHHHH--HHHHHHHHHHHHHHHhc------cC--C--C---eEEEechhhhc--cC Confidence 9988864333221 12221 11122111 01111112223332211 10 1 1 12233444333 34 Q ss_pred HHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCC Q lcl|NC_020883. 490 ELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDT 569 (589) Q Consensus 490 El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~ 569 (589) .++.++....+.++++++.-.+-++++ +.+ .+ +++..+-+ +|--.-+..+..++.. T Consensus 342 ~~~~~~~~~~~~~~G~~t~NE~R~~~g--~~p-------------------~~--ggD~~~~~-~n~~~~~~~~~~~~~~ 397 (409) T protein:vir:84 342 VTARFTAYQMGLQNGIWSVNEVRAWED--APP-------------------IP--EGDIHLQP-MNFVPLGYVPPEEPAQ 397 (409) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhC--CCC-------------------CC--Ccceeeec-ccccccccCCccccCc Confidence 556678888888888888766555442 111 10 01000100 0000000001111111 Q ss_pred CCCCCcchhhhh Q lcl|NC_020883. 570 EEEPSAEENEEI 581 (589) Q Consensus 570 ~eep~~~~~e~~ 581 (589) +.+|.++.+-+- T Consensus 398 ~~~~~~~~~gn~ 409 (409) T protein:vir:84 398 EPQPNSATEGNK 409 (409) T ss_pred CCCCCCccCCCC Confidence 111211111111 No 155 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=32.87 E-value=1.4 Score=19.79 Aligned_cols=477 Identities=11% Similarity=0.038 Sum_probs=167.3 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcce-eeecCcceEEEEcchhhhccc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQT-ARETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~-~~~~~~~y~~~n~~~~i~~~p 79 (589) |-.=...|...+.+|+.-.-+..-|.=|+....|+ ...+- ++ ++.+.+.. .+...++| =-=-.+.+..+- T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~----~~~~l--P~--~~~~~~~~~~~~~~~~~-dst~~~a~~~La 71 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENC----AQYTI--PS--LFPKESDNESTDYTTPW-QAVGARGLNNLA 71 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHH----HHHhc--cc--ccCCCCCcccccccccc-cccHHHHHHHHH Confidence 44433344455555543222222232233222221 11111 00 11111111 11222222 001134445555 Q ss_pred hhhhccccccccccccCCccc-chhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHH Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEID-PDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQV 158 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v 158 (589) |-|++..++. .+.|- -.+. .+..+... +-......+.+..++ .+.+...+..|+|+......+.++.+ T Consensus 72 a~l~~~ltP~-~~WF~-l~~~d~~~~~~~~-----~~~~~~~v~~~l~~v----e~~~~~~~~~snf~~~~~~~~~~L~~ 140 (535) T protein:vir:33 72 SKLMLALFPM-QSWMK-LTISEYEAKQLVG-----DPDGLAKVDEGLSMV----ERIIMNYIESNSYRVTLFECLKQLIV 140 (535) T ss_pred HHHHHhhcCC-Ccccc-cccChHHHhcccc-----CcchHHHHHHHHHHH----HHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 5566555554 22222 1111 11111111 011111123333333 34566777788999999999998877 Q ss_pred cCc-eeEEEEEecCceeEEEecCceecccccCcc-eeEEEeec-CCC-----------------------ccceEEEEEe Q lcl|NC_020883. 159 DGG-IVAAPVIDELGPRIVFKARDVYFPHDDEKG-ADLAYYID-HGQ-----------------------YGQFLHIYRE 212 (589) Q Consensus 159 ~Gg-~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~-~div~~~e-~~~-----------------------~~~~l~~~~~ 212 (589) -|= |++.+--.+.+++...++-..|+=..|+.| ++-+|... .+- -+.|.|+++. T Consensus 141 ~G~a~l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~ 220 (535) T protein:vir:33 141 AGNALLYLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLD 220 (535) T ss_pred hCceeEEeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEee Confidence 654 333332113345566665555554444444 55555322 110 0001111110 Q ss_pred eeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCC-CcceEEEecCCC Q lcl|NC_020883. 213 RVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGR-NRPFISYWANNE 291 (589) Q Consensus 213 ~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv-~~plvvyvPN~~ 291 (589) + ++| .+....+..+. . ++.... . .|- .-|+++. -=+. T Consensus 221 ~--------------------------~~~-~~~~~~~~~~~--------~--~~~~~~-~---~~~~~~P~i~~-Rw~~ 258 (535) T protein:vir:33 221 E--------------------------ESG-DYLKYEEVEDV--------E--IDGSDA-T---YPTDAMPYIPV-RMVR 258 (535) T ss_pred C--------------------------CCC-cEEEEEEEeCc--------c--cccccc-c---cccccCCceee-eeee Confidence 0 001 01111111000 0 000000 0 011 1233333 2245 Q ss_pred CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccccccccccccccccccccccccc Q lcl|NC_020883. 292 TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEIT 371 (589) Q Consensus 292 ~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~ 371 (589) .....||||=..+..+-+..||..--....-.++..+|-+.||........ +..+...+.+.+.+..+ T Consensus 259 ~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~--------~~~~~~~g~~v~g~~~~---- 326 (535) T protein:vir:33 259 IDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPR--------RLTKAQTGDFVPGRRED---- 326 (535) T ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchh--------hcccCCceeeecCCccc---- Confidence 566779999889999999999975434444455779999888754443221 11121111111111111 Q ss_pred ccccccCccceeee--cccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHh-hhHH-HHHHHHHH Q lcl|NC_020883. 372 TFDENGRSMEIHQI--DISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDL-LTTI-LKSRRLQK 447 (589) Q Consensus 372 ~~de~g~~~~~iq~--Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~-~~~~-~Kv~~~R~ 447 (589) ++.+++ ..++....+.++.+...|-. +=+.. .+.. . ++...|++++..+... ...+ --..+.- T Consensus 327 --------v~~~~~~~~~~~~~~~~~i~~~~~~I~~-af~~~-~~~~-~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~- 393 (535) T protein:vir:33 327 --------IDFLQLEKQADFTVAKAVSDQIEARLSY-AFMLN-SAVQ-R-TGERVTAEEIRYVASELEDTLGGVYSILS- 393 (535) T ss_pred --------ceeeecccccchhHHHHHHHHHHHHHHH-HHhhh-hccc-C-CCccccHHHHHHHHHHHHHHHhHHHHHHH- Confidence 111221 12344444445444443321 10010 1211 1 2222344666665432 2222 1223322 Q ss_pred HHHHHHHHHHHHHHHHHhhcCcc--cCcccceeeeCCcCCCCCCHHHHHHHHHH---Hhcc------chhhHHHHHHHhC Q lcl|NC_020883. 448 EYIDFLKELYESCLWLLNDQDSS--IRIEEPNIETQDMILKPRAELVAENMAAY---AASK------QGQSLETTVRRMN 516 (589) Q Consensus 448 ~~~~aLk~li~~~l~L~~~~~~~--~~~e~p~I~f~D~lPvde~El~~A~t~~~---l~~a------~~~S~etaVr~Lh 516 (589) .+.|..+++.+..+....+.. ...+..+|++--+|..-.+-.....+.+. +.+. ..+....+++.+- T Consensus 394 --~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a 471 (535) T protein:vir:33 394 --QELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIA 471 (535) T ss_pred --HHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHH Confidence 344455554443333222211 11122345553333211111111111111 1110 0012223333330 Q ss_pred -----C-C---CCHHHHHHHHHHHHhhccc-cccccccccccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhcccc Q lcl|NC_020883. 517 -----P-D---ASEDWIQEEIARIEEEQAG-SDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEKEGE 586 (589) Q Consensus 517 -----p-d---w~dE~v~eEv~RI~~E~a~-~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~~~~ 586 (589) | . -++|++++..+.=.+.+.+ ..-...|++..+.+ .. .|+ .-+.-.+ T Consensus 472 ~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~------------~~------~~~-----~~~~~~~ 528 (535) T protein:vir:33 472 NAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALA------------TS------SPE-----AMQGAAA 528 (535) T ss_pred HHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchh------------hc------CCh-----hHHHHHH Confidence 1 1 1344443333221111110 01111111111111 00 000 0000111 Q ss_pred cCC Q lcl|NC_020883. 587 PIA 589 (589) Q Consensus 587 ~~~ 589 (589) .|. T Consensus 529 ~~g 531 (535) T protein:vir:33 529 KAG 531 (535) T ss_pred hcc Confidence 111 No 156 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=30.88 E-value=1.5 Score=19.55 Aligned_cols=425 Identities=11% Similarity=0.083 Sum_probs=135.5 Q ss_pred hhHHHHhhcchh-hhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccccc Q lcl|NC_020883. 10 TDKTTKNVHGDY-ERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGSIG 88 (589) Q Consensus 10 ~~~~~~~~~~~~-~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~~~ 88 (589) -.|..+.+-+-+ ..+.+ |-|.+.... .+.+...+......+-..-+.--+...+.+..-+ .++++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~g~~~s~~--------~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci--~~Ia~~ia 69 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLK-WLGVPISLT--------DGSFWSAWGGMGSSSGETVTADSALQLSAVWSCV--RLIAETIA 69 (437) T ss_pred CCcchhhhhhhhHHhhhh-hcCCcccCC--------chhHHHhhcccccCCCceechHhhhccHHHHHHH--HHHHHHHh Confidence 000000000000 00111 111111100 0011111100000000000000112223333333 44544444 Q ss_pred cccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccc---cccchhhHHHH-HHcCceeE Q lcl|NC_020883. 89 QIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKL---ERRHWSNIVQH-QVDGGIVA 164 (589) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~---~~~~~~~l~~~-~v~Gg~~~ 164 (589) +++-.......+ |.. ....+..+..+ +...-+. ...|+..++.+ +..|-.++ T Consensus 70 ~lp~~~~~~~~~------------g~~-----~~~~~~~l~~l-------L~~~PN~~~t~~~f~~~~~~~lll~Gnay~ 125 (437) T protein:vir:10 70 TLPLNLYQTKPD------------GTR-----VLAKQHRLYTV-------IHSQPNAENTAAEFWEVIVASMLLWGNGYA 125 (437) T ss_pred hCceeEEEEcCC------------Cce-----eeccccHHHHH-------hhccCCcCCCHHHHHHHHHHHHhhcCCeEE Confidence 433211100000 000 00000001000 0000111 11234444433 44565555 Q ss_pred EEEEecCce-eEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccc Q lcl|NC_020883. 165 APVIDELGP-RIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGE 243 (589) Q Consensus 165 ~~~~~~~~~-~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd 243 (589) .+..+++++ .+....++..-+ .. . ..+++.+.| + ...|. T Consensus 126 ~i~r~~g~~~~L~~l~p~~v~i---------~~---~-~~g~~~y~~---------------~------------~~~g~ 165 (437) T protein:vir:10 126 RKLRSAGVLIGLELMLPQRTTV---------KR---L-TSGALQYTY---------------R------------NVDGT 165 (437) T ss_pred EEEecCCcEEEEEEEcCcceEE---------EE---C-CCCeEEEEE---------------E------------ecCce Confidence 555555443 233333333222 10 0 011100000 0 01121 Q ss_pred cccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHH Q lcl|NC_020883. 244 LVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIY 323 (589) Q Consensus 244 ~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~sril 323 (589) +.. + +.--|.|+.+.. .+.++|.|-+.-+...+......-....+.| T Consensus 166 ~~~-------------------~-------------~~~dIih~r~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f 212 (437) T protein:vir:10 166 VST-------------------L-------------AEDDVFHVRGFS-LDGLMGLTPIQYAREVLGNSTAANKTSASVF 212 (437) T ss_pred EEE-------------------E-------------ccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 0 000178887754 4568999988766666554443333345556 Q ss_pred HHhCCCcEEechhhhhcccccccccccccccccccccccc--ccccccccccccccCccceeeec--ccHHHHHHHHHHH Q lcl|NC_020883. 324 EQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRI--DHRDMEITTFDENGRSMEIHQID--ISKIGDMDHVKNL 399 (589) Q Consensus 324 dk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~--d~~dlev~~~de~g~~~~~iq~D--irveeh~~~ie~L 399 (589) ...+.|+.++.. .+.+.-+....+...+...+.+. .+..+ +. +.| +++.+.. ..-.+..+..+.. T Consensus 213 ~ng~~p~gil~~-----~~~l~~e~~~~~~~~~~~~~~g~~nag~~~-vl---~~g--~~~~~l~~~~~d~q~~e~~~~~ 281 (437) T protein:vir:10 213 RNGLRPSGVLST-----DQILQKEKRAEIRTDLAEQFGGAMQAGKTM-VL---EAG--MKYQAITMNPGDVQLLETRAFN 281 (437) T ss_pred hccCCccEEEEc-----CCCCCHHHHHHHHHHHHHHhcCccccCcce-ec---cCC--ceEEeccCChhhHHHHHHHHHH Confidence 555677666532 11111111000100000001100 01111 11 222 3333333 3333445556667 Q ss_pred HHHHHHHhcCCchhcccccCcccchhHHHHH-HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCccccee Q lcl|NC_020883. 400 IKLMLIETQTSEKAVDFYLDGGASGAQSGVA-KFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNI 478 (589) Q Consensus 400 ~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A-~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I 478 (589) .++|..+=+.|+.-+|...+++.. .|.+. ....+.+- .+.-|...++..|.+-+ |... .. ......+ T Consensus 282 ~~~Ia~~fgVPp~~lg~~~~~t~~--~sn~e~~~~~f~~~--tl~P~~~~ie~~l~~kl-----l~~~-e~--~~~~~~f 349 (437) T protein:vir:10 282 IEEICRWYRVPPFMVGHSEKSTSW--GTGIEQQTLGFLTF--TLRPWLTRIEQAARRSL-----LRPG-ER--DQFYAEF 349 (437) T ss_pred HHHHHHHhCCCHHHhCCCCCcccc--cchHHHHHHHHHHH--HHHHHHHHHHHHHHhhc-----cCcc-cc--CceEEEE Confidence 788888889999999864433221 12221 11112111 11222222223332211 1110 00 0111233 Q ss_pred eeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccccCcc Q lcl|NC_020883. 479 ETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNR 557 (589) Q Consensus 479 ~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~ 557 (589) .+...+..| ..++++..+.+.++++++.-.+-+++. |-.... .++-.+ +....|....+... ..+. T Consensus 350 d~~~ll~~d--~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg---~~~~~~---~~~~~~~~~~~~~~-----~~~~ 416 (437) T protein:vir:10 350 SVEGLLRAD--SAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGN---AAVLTV---QSALLPIDKLGEHT-----TATA 416 (437) T ss_pred echhhhccC--HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC---cceEee---cCcccchhhccCcC-----CCcc Confidence 444444434 455677888888888888776655552 111100 000000 00001111000000 0000 Q ss_pred cCCCCCCCCCCCCCCCCcchhhh Q lcl|NC_020883. 558 DEDGNIIEEGDTEEEPSAEENEE 580 (589) Q Consensus 558 ~~~~~p~deg~~~eep~~~~~e~ 580 (589) .+ +....|+.++|+...+.|. T Consensus 417 ~~--~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 417 AQ--DALKAWLYQEEKTRATQER 437 (437) T ss_pred hh--ccccccCCCCCCCCccccC Confidence 01 1111222222332222222 No 157 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=30.11 E-value=1.6 Score=19.46 Aligned_cols=407 Identities=12% Similarity=0.068 Sum_probs=138.3 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcce----EEEEcchhhh Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPY----VIFNLPKVIA 76 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y----~~~n~~~~i~ 76 (589) |. ...+|+++ +.-+..+.+...+..+-.-.......+ .+...+.+.. T Consensus 1 m~---------------------~~~~~~~~--------~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~ 51 (421) T protein:vir:10 1 MF---------------------IPQMFEGK--------KRSVSGGGFWEAMLGGVRSSHSKAGVMITPETALALSAVRA 51 (421) T ss_pred CC---------------------Ccchhccc--------ccccCcchhhHHHhhhhccCcccCCceechHHhhccHHHHH Confidence 11 11122222 111222222222211111000001111 1233444444 Q ss_pred ccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccc---cccchhhH Q lcl|NC_020883. 77 EIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKL---ERRHWSNI 153 (589) Q Consensus 77 ~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~---~~~~~~~l 153 (589) -+ .++++.+.+++-..-....+ ...+.+ .+... ..-+...-|. ...|+..+ T Consensus 52 ~i--~~Ia~~iA~lp~~~~~~~~~-----g~~~~~------------~~~~l-------~~lL~~~PN~~~t~~~f~~~~ 105 (421) T protein:vir:10 52 CV--TLLAESVAQLPVELYRRDKN-----GGRQRA------------TDHPI-------YDLIHSQPNKKDTSFEYFEQQ 105 (421) T ss_pred HH--HHHHHhhccCceEEEEEcCC-----Cceeec------------ccchH-------HHHHhhcccCCCCHHHHHHHH Confidence 44 56666666544321100000 000000 00000 0000000111 11233333 Q ss_pred HH-HHHcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhcccc Q lcl|NC_020883. 154 VQ-HQVDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVK 230 (589) Q Consensus 154 ~~-~~v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~ 230 (589) +. .+..|-.++.+..+.++ +.+....++++-+..+. .+. + .| T Consensus 106 ~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~---------------------------~g~-~---~y---- 150 (421) T protein:vir:10 106 QGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGP---------------------------DGM-P---YY---- 150 (421) T ss_pred HHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECC---------------------------Cce-E---EE---- Confidence 32 33455554545545433 23333344433321110 111 0 01 Q ss_pred ccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHH Q lcl|NC_020883. 231 AKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQD 310 (589) Q Consensus 231 ~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~D 310 (589) .....|.+++ .--|.|+.+.. .+.++|.|-+.-+...++ T Consensus 151 ------~~~~~g~~~~----------------------------------~~eiih~~~~~-~d~~~G~spi~~~~~~i~ 189 (421) T protein:vir:10 151 ------EIPEIGETLP----------------------------------MRMMHHVKVFS-LDGYIGSSPIQTNADVLG 189 (421) T ss_pred ------EEcCCCcEEc----------------------------------hhhEEEecCcC-CCCcccccHHHHHHHHHH Confidence 0001222111 01177887754 456789998876666665 Q ss_pred HHHHHHhHHHHHHHHhCCCcEEe--chhhhhccccccccccccccccccccccccccccccccccccccCccceeeeccc Q lcl|NC_020883. 311 EINWTITRSAVIYEQNGKPRISI--TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDIS 388 (589) Q Consensus 311 eLd~t~S~~srildk~gkpRI~V--P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dir 388 (589) .....-......|...++|+.++ |..+-... ..+.-..........+.+.... ..+... +.|...+-++...+ T Consensus 190 ~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~---~~e~~~~~~~~~~~~~~g~~n~-~~~~vl-~~g~~~~~l~~~~~ 264 (421) T protein:vir:10 190 LNLAVEEHASAVFRRGATMSGVIERPKEAPAIK---SQEKIDQLLAKWTDRYSGINNM-FSVALL-QEGMSYKQMSQDNE 264 (421) T ss_pred HHHHHHHHHHHHHhcCCCccEEEEecCccCccC---CHHHHHHHHHHHHHHhcCcccc-Ccceec-CCCceEEecCCChh Confidence 43333223345555557777554 22111100 0000000000000000000000 001101 22322222333334 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHH-HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020883. 389 KIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGV-AKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQ 467 (589) Q Consensus 389 veeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~-A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~ 467 (589) -.+..+..+...++|..+=+.|+.-.|....+.- |.+ .....+..- .+.-|-..++..|.+.+ +..... T Consensus 265 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~----sn~e~~~~~f~~~--tl~P~~~~ie~~ln~kL----~~~~~~ 334 (421) T protein:vir:10 265 KAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATN----NNIEHQGLQFVMY--TLLAWLKRHEGALQRDL----LLPSER 334 (421) T ss_pred HHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCcc----ccHHHHHHHHHHH--HHHHHHHHHHHHHhhhc----cCcccc Confidence 4455566667788888888899888875432211 111 111122111 11222222223332211 111000 Q ss_pred CcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020883. 468 DSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGIN 547 (589) Q Consensus 468 ~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~ 547 (589) ..-.|+|+-.-....+.++.++..+.+.++++++.-.+-+.++ + .|.+ +++ T Consensus 335 ------~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g--l-------------------~p~~--ggD 385 (421) T protein:vir:10 335 ------RDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMEN--L-------------------PPIA--GGD 385 (421) T ss_pred ------CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC--C-------------------CCCC--Ccc Confidence 0112444322222334566688888888888777665544442 0 1111 111 Q ss_pred cccccccCccc-CCCCCCCCCCCCCCCCcchhhhhhccc Q lcl|NC_020883. 548 QTFEQMNDNRD-EDGNIIEEGDTEEEPSAEENEEIEKEG 585 (589) Q Consensus 548 ~~l~~~~~~~~-~~~~p~deg~~~eep~~~~~e~~~~~~ 585 (589) ..+-+ +|--+ .+..|.+ +.+..+ .+.|..++-.+| T Consensus 386 ~~~~~-~n~~~~~~~~~~~-~~~~~~-~~~e~d~~~~~~ 421 (421) T protein:vir:10 386 KYLTP-LNMVDSAQIIPGD-KKPTAQ-QMAEIDTILSRT 421 (421) T ss_pred eeeec-cccccccccccCC-CCcccc-cCcccccccccC Confidence 11111 11000 0011111 122222 223344455555 No 158 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=28.49 E-value=1.7 Score=19.26 Aligned_cols=427 Identities=9% Similarity=0.003 Sum_probs=145.2 Q ss_pred CccceeccchhH----HHHh-hcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhh Q lcl|NC_020883. 1 MIDWTVRGWTDK----TTKN-VHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVI 75 (589) Q Consensus 1 ~~~~~~~~~~~~----~~~~-~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i 75 (589) +-.|-.+|.|-. +.++ -.||+.+|..||+ ++-+|.-.+-. .+ .. ....| T Consensus 32 ~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e----~m~e~D~~i~s------~l-~~---------------Rk~av 85 (526) T protein:vir:99 32 FAQHPAKGLTPAKLARILVEAEQGNLQAQAELFM----DMEERDAHLFA------EM-SK---------------RKRAI 85 (526) T ss_pred hcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHH----HHHhhChHHHH------HH-HH---------------HHHHH Confidence 556777787763 3332 3578888888886 22222111111 10 00 00011 Q ss_pred hccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHH Q lcl|NC_020883. 76 AEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQ 155 (589) Q Consensus 76 ~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~ 155 (589) ..++=.+. . ..+++. ... .+-+++++.+.+-.-+..+-..+.+ T Consensus 86 ~~~~w~I~--------p----------~~~~~~---------------~~~----~~a~~v~~~l~~~~~~~~~i~~~ld 128 (526) T protein:vir:99 86 LGLDWAVE--------P----------PRNASA---------------AEK----ADADYLHELLLDLEGLEDLLLDALD 128 (526) T ss_pred hCCCceEe--------c----------CCCCCH---------------HHH----HHHHHHHHHHhcccCHHHHHHHHHH Confidence 11100111 0 000111 001 1123555555442111122222233 Q ss_pred HHHcCceeEEEEEecCceeEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchh Q lcl|NC_020883. 156 HQVDGGIVAAPVIDELGPRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDV 235 (589) Q Consensus 156 ~~v~Gg~~~~~~~~~~~~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~ 235 (589) +..-|=-+.-.+|.- ....+.| ++|+|+... + +.|. ...++. .+++...+ T Consensus 129 a~~~G~s~~Eivw~~--------~~g~~~~------~~l~~r~~~----~--f~~~---~~~~~~---l~~~~~~~---- 178 (526) T protein:vir:99 129 GIGHGYSCIELEWAL--------QGREWMP------LAFHHRPQS----W--FQLN---PEDQNE---LRLRDNSP---- 178 (526) T ss_pred hhhhcceeEEEEEee--------cCCceeE------EEeeeeccc----c--eeec---cCCCcE---EEecCCCC---- Confidence 222222222222221 1122333 334432221 1 1110 111110 01110000 Q ss_pred heeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHH Q lcl|NC_020883. 236 KKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWT 315 (589) Q Consensus 236 ~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t 315 (589) .| +.+| +.-+++|.+ .....+|+|.+-+..+....-.=+-. T Consensus 179 -----~g---------------------~~l~------------~~k~i~~~~-~~~~g~p~g~gLlr~~~w~~~fK~~~ 219 (526) T protein:vir:99 179 -----AG---------------------EALQ------------PFGWIIHRP-RARSGYVARSGLFRVLAWPYLFRHYA 219 (526) T ss_pred -----Cc---------------------eeec------------CCCeEEEee-cCCcCCccccchHHHHHHHHHHHHhh Confidence 00 0011 223567754 57889999999998776665555555 Q ss_pred HhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec-ccHHHHHH Q lcl|NC_020883. 316 ITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID-ISKIGDMD 394 (589) Q Consensus 316 ~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D-irveeh~~ 394 (589) +..+..-++|+|-|-.++.- ..+.. +++...... .. ..+......+.+. |..+++++-. ...+-+.. T Consensus 220 ~~~w~~f~E~yG~P~~igky-----~~~a~-~~ek~~L~~--av-~~i~~d~~~iiP~---~~~ie~~ea~~~~~~~f~~ 287 (526) T protein:vir:99 220 TSDLAEMLEIYGLPIRLGKY-----PPGTA-DEEKATLLR--AV-TGLGHAAAGIIPE---TMAIDFQQAAQGSSEPFLA 287 (526) T ss_pred HHHHHHHHHHcCCceEEEec-----CCCCC-HHHHHHHHH--HH-HHHhhCcEEEecC---CceeEEeecCCCCHHHHHH Confidence 66778888899999776631 01111 111110000 00 0000111122322 2335555532 33333444 Q ss_pred HHHHHHHHHHHHhcCCchhccc--ccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCccc Q lcl|NC_020883. 395 HVKNLIKLMLIETQTSEKAVDF--YLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKE-LYESCLWLLNDQDSSI 471 (589) Q Consensus 395 ~ie~L~~~Il~~a~ts~~AFg~--~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~-li~~~l~L~~~~~~~~ 471 (589) .++..=++|..+- ++ ++++. ..|++++.|.+.+-.. ...- .++.-++.+.+.|.+ ++..+..+. ++... T Consensus 288 li~~~d~~Isk~i-LG-qtlTs~~~~g~~gS~a~g~vh~~--v~~d--i~~aDa~~i~~tln~~Li~~l~~~N--~~~~~ 359 (526) T protein:vir:99 288 MMRQSEDAISKAV-LG-GTLTSTTSQSGGGAFALGQVHNE--VRHD--LLASDARQLAATLSRDLLWPLLVLN--RPGSP 359 (526) T ss_pred HHHHHHHHHHHHH-hh-hhhccccccCcchhhhHHHHHHH--HHHH--HHHHHHHHHHHHHHHHHHHHHHHhC--CCCcC Confidence 4444444433221 11 11111 1111122222222111 1111 133333445566643 554443332 22222 Q ss_pred Cc-ccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHh-C-CCCCHHHHHHHHHHHHhhcccc--cccccccc Q lcl|NC_020883. 472 RI-EEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRM-N-PDASEDWIQEEIARIEEEQAGS--DTSSLMGI 546 (589) Q Consensus 472 ~~-e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~L-h-pdw~dE~v~eEv~RI~~E~a~~--~p~~~g~~ 546 (589) .+ ..|.+.|+..=+.| -...|+..+.++..|..-.+..++.. . |.-.+. +++ ....... .+...+.. T Consensus 360 ~~~~~p~~~~~~~e~eD--l~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~---e~~---l~~~~~~~~~~~~~~~~ 431 (526) T protein:vir:99 360 DVRRAPRLVFDLREQAD--ITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKN---EPV---LRSAAQPAILSRQHGQR 431 (526) T ss_pred CccccceEEeCCCCccc--HHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCc---ccc---cCCCCCCcccccccccc Confidence 22 24778886643333 33457777778877764333344333 2 211111 111 1111100 00000111 Q ss_pred ccccccccCcccCCCCCCCCCCCCCCCCcchhhhhhc-ccccCC Q lcl|NC_020883. 547 NQTFEQMNDNRDEDGNIIEEGDTEEEPSAEENEEIEK-EGEPIA 589 (589) Q Consensus 547 ~~~l~~~~~~~~~~~~p~deg~~~eep~~~~~e~~~~-~~~~~~ 589 (589) ...+.....+..++....|.- -.+-.+.+++..-+ -.++|. T Consensus 432 ~~~~~~~~~~~~~~~~~~d~~--l~~~~~~~~~~~~~~~l~~i~ 473 (526) T protein:vir:99 432 VAALATIVGPRYGDQQALDKA--LADLPAKDMQNQANDLLAPLL 473 (526) T ss_pred cccccccccccCcchhhHHHH--HHHHHHHHHHHHHHHHHHHHH Confidence 001110000000000000000 00000011111111 112221 No 159 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=27.32 E-value=1.9 Score=19.11 Aligned_cols=491 Identities=13% Similarity=0.032 Sum_probs=184.3 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) |+| -+.|.++|++-+.-. .-=|.--+ +=.+-+++.|. ..+-+.++++-+-.-- -.+++++..-. T Consensus 15 ~~~--~~~~~~~v~~~~~~~-~~~r~~~~----~~w~e~~~yi~---~~~tr~t~~~~~~w~~----s~t~~k~~~~~-- 78 (599) T protein:vir:31 15 RDD--DRAFIDELVVLFTNM-ENARAQKD----REDKELMDYID---ATDTRKTSNSKLPFKN----STTINKLAHLH-- 78 (599) T ss_pred cCc--hHHHHHHHHHHHHhh-hhhhhhhh----cccHHHHHHHh---hhcccccccCCCCccc----ccchHHHHHHH-- Confidence 332 245666665433211 11111111 11133334433 2333455555442111 12334432211 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHcC Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVDG 160 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~G 160 (589) + .+.++|.+.-+-++--.+-..+. +.|+ ..+-.+++ +.++..=+.-|+|..-+..-|-+.+.-| T Consensus 79 ~-------~l~a~~~~~~fp~~~w~d~~~~~--~~~~----~~~~~~~i---~~yi~~Kl~e~~~~~~~~~~v~d~i~~G 142 (599) T protein:vir:31 79 L-------MITTSYMEHLLPNRNWVDFVGFD--NDSV----NAEKREIA---RSYVRGKVEASNLEGVIERMVDDFAVRG 142 (599) T ss_pred H-------HHHHHHHhhhcCCccceEeeecC--Cchh----HHHHHHHH---HHHhhhhhhhcchHHHHHHHHhhhcccC Confidence 1 12334443333323222222222 1111 11111111 2244444555667666666666666667 Q ss_pred ceeEEEEEecC--------------ceeEEEecCceeccc---ccCcceeEEEeecCCC--------ccce--------- Q lcl|NC_020883. 161 GIVAAPVIDEL--------------GPRIVFKARDVYFPH---DDEKGADLAYYIDHGQ--------YGQF--------- 206 (589) Q Consensus 161 g~~~~~~~~~~--------------~~~i~f~~~d~~~P~---~d~~~~div~~~e~~~--------~~~~--------- 206 (589) -|+.++-+.-. ++++.=+.|+-+||- .+...+..+-..-.|+ ++.| T Consensus 143 ~~vat~~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~ 222 (599) T protein:vir:31 143 FCVAHTRHVKRMTVTAENQVIKNYSGTVTERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQ 222 (599) T ss_pred ceeEeeeEEEcceeecccccccccccceEEeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHH Confidence 77777664421 267777777777772 1111222111222332 1111 Q ss_pred --EEEEEeee--ccccceeehhhhccc-cccchhheeecccccccccccccccchhh-----------hhhcccCCcccc Q lcl|NC_020883. 207 --LHIYRERV--EKDGLRTTNMLYPVV-KAKGDVKKEIKKGELVTNVEGAEDLEGEE-----------LIREVLNIPDDR 270 (589) Q Consensus 207 --l~~~~~~~--~~~~~~~~~~~y~~~-~~~~~~~~~~~~gd~~~~~~e~~d~e~e~-----------~i~~~i~ip~~~ 270 (589) .|.+++.- ..+++.+.+-.=... +..+++.+. +.+..+..++|.-|+.+++ .+++. .+. T Consensus 223 ~~~~~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY-~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~----~li 297 (599) T protein:vir:31 223 KLREERRTIREALADGYNGRRKFDSLHKKGYGSMMNY-INEGVVEVLTFMGDFYDEENDELWNNYEITVIDRK----IIG 297 (599) T ss_pred HHHhhccCCCccccchhhhhhhccccccccccchhhh-cccchhhhhhhhhhhhcccCCccccceEEEEecCc----EEe Confidence 13333311 223332222110000 111111111 1233455555553333332 11111 122 Q ss_pred ccccccCCCC-cceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCcEEechhhhhcccccccccc Q lcl|NC_020883. 271 PLENFYPGRN-RPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPRISITKEMMDTLLNIAYERD 349 (589) Q Consensus 271 e~~~i~TGv~-~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~d 349 (589) -.+..+++-. +|++++ ...+..++.||...+..+.+.++.||.+|-...-.++.+.+|.+-. .+.| ...| T Consensus 298 R~e~np~~~g~~Pyvv~-~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~-~~dl-------~~eD 368 (599) T protein:vir:31 298 RKQSKDTWDGSQNLHIA-VYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKK-VGDV-------REKG 368 (599) T ss_pred ecccCCCCCCCCCeEEE-EeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcccccc-cccc-------cccC Confidence 3344455554 355554 6667778899999999999999999998764444455667662211 1111 1112 Q ss_pred ccccccccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHH Q lcl|NC_020883. 350 GHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGV 429 (589) Q Consensus 350 ge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~ 429 (589) -++.| ..+....+.| ..++++...+.-+...-+..+...+--.++.|+.+.|. .+.|. -+++|+ T Consensus 369 ~~~~P-------------~~v~~~~d~~-~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~-~~ag~-~TA~~i 432 (599) T protein:vir:31 369 MRGGP-------------NHVFEVEETG-DVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQ-RTAGE-KTKFEV 432 (599) T ss_pred ccCCC-------------CcceeecCCC-ccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCC-cccch-hhHHHH Confidence 22212 2233223333 23455554443333333333334344468999999885 22222 222333 Q ss_pred HHHHHhhhHHHHH-HHHHHHHHHH-HHHHHHHHHHHHhhcCc----------------ccCcccceeeeC-CcCCC---- Q lcl|NC_020883. 430 AKFYDLLTTILKS-RRLQKEYIDF-LKELYESCLWLLNDQDS----------------SIRIEEPNIETQ-DMILK---- 486 (589) Q Consensus 430 A~r~~~~~~~~Kv-~~~R~~~~~a-Lk~li~~~l~L~~~~~~----------------~~~~e~p~I~f~-D~lPv---- 486 (589) +. ++....|+ +++-++|++. |++++..+......... .+.+.+.++.-+ +.+|. T Consensus 433 s~---l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~ 509 (599) T protein:vir:31 433 QL---LDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATL 509 (599) T ss_pred HH---HHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhH Confidence 32 23333222 2222334343 34455433222221100 111222222110 01121 Q ss_pred -CCCHHHHHHHHHHHhc--cch----hhHHH---H---HHHhC------CCCCHHHHHHHHHHHHhhccccccccccccc Q lcl|NC_020883. 487 -PRAELVAENMAAYAAS--KQG----QSLET---T---VRRMN------PDASEDWIQEEIARIEEEQAGSDTSSLMGIN 547 (589) Q Consensus 487 -de~El~~A~t~~~l~~--a~~----~S~et---a---Vr~Lh------pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~ 547 (589) .+.+-...+..+++.+ +.. +|.+. . ++.|| |...-.+.+.++ |..+++.+++.- T Consensus 510 v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~-~m~Q~~lq~~~~------ 582 (599) T protein:vir:31 510 FAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLA-RMAQKSTQQTEE------ 582 (599) T ss_pred HHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHHH-HHHHHHHHHhHh------ Confidence 1111111223333311 111 12221 1 11222 112222223333 222211111000 Q ss_pred cccccccCcccCCCCCC-CCCC Q lcl|NC_020883. 548 QTFEQMNDNRDEDGNII-EEGD 568 (589) Q Consensus 548 ~~l~~~~~~~~~~~~p~-deg~ 568 (589) -.|. +++-|.|. |-|+ T Consensus 583 ~~~~-----~~~~~~~~~~~~~ 599 (599) T protein:vir:31 583 TALT-----QEEVGGPTTDTGQ 599 (599) T ss_pred hhhh-----hhhcCCCCcccCC Confidence 0011 11222333 5555 No 160 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=26.60 E-value=1.9 Score=19.02 Aligned_cols=496 Identities=10% Similarity=0.028 Sum_probs=167.4 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCccee-eecCcceEEEEcchhhhccc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTA-RETQTPYVIFNLPKVIAEIP 79 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~-~~~~~~y~~~n~~~~i~~~p 79 (589) |-.-.-.|-+-+.++..-.-+..-|.=|+....|+.. .+- ++ ++.+.+... ....++|= -=-.+.+..+- T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~----y~l--P~--~~~~~~~~~~~~~~~~~d-st~~~a~~~La 71 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAK----VTI--PS--LFPKDSDNSSTDYTTPWQ-AVGARGLNNLS 71 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHH----Hhc--cc--cCCCCCCccccccccccc-chHHHHHHHHH Confidence 5444445556666664444444444444433333211 111 00 011111111 11122210 00123444555 Q ss_pred hhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhccccccchhhHHHHHHc Q lcl|NC_020883. 80 ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSKLERRHWSNIVQHQVD 159 (589) Q Consensus 80 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~~~~~~l~~~~v~ 159 (589) |-|++..++. .+.|- -.+.+....+. .++..+....+.|..++ .+.+..++-.|+|+......+.++.+- T Consensus 72 a~l~~~ltP~-~~WF~-l~~~d~~~~~~----~~~~~~~~~v~~~L~~v----e~~~~~~~~~snf~~~~~~~~~~L~~~ 141 (543) T protein:vir:88 72 AKVMLALFPL-QSWMK-LKVSEWQAKQL----VSDPSQLAVVEQGLGMV----ERILMSYMEANSYRVTLFELIRQLALA 141 (543) T ss_pred HHHHHhhcCC-Ccccc-cccChHHHhcc----cCChhhHHHHHHHHHHH----HHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 5566555543 22222 11111111110 11111222234454444 345667777889999999999987776 Q ss_pred CceeEEEEEecCcee-EEEecCceecccccCcceeEEEeecCCCccceEEEEEee-eccccceeehhhhccccccchhhe Q lcl|NC_020883. 160 GGIVAAPVIDELGPR-IVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRER-VEKDGLRTTNMLYPVVKAKGDVKK 237 (589) Q Consensus 160 Gg~~~~~~~~~~~~~-i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~-~~~~~~~~~~~~y~~~~~~~~~~~ 237 (589) |=.+ .|++++.-+ +.|... ++|| +.- |.......++-..++|.. +...-+. .-|+. .++. .. T Consensus 142 G~a~--ly~~~~~~~~~~~~~~-~~~p------l~~-y~v~~d~~G~v~~i~r~~~~~~~~l~---~~~~~-~v~~--~~ 205 (543) T protein:vir:88 142 GTAL--IYLPPPDASSNSYNPM-KLYT------LHN-HVVQRDAFGNVLQIVTLDKVAYAALP---EDVRN-SLSG--GQ 205 (543) T ss_pred Ccee--eeeccCccccceecce-EEeE------cce-EEEeeCCCCCeeeeeeeeeccHHHHh---HHhhH-HHHH--Hh Confidence 6543 466543322 122111 1234 110 111111223322232221 1111110 00000 0000 00 Q ss_pred eecccccccccccccccchh-hhhhcc--cCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHH Q lcl|NC_020883. 238 EIKKGELVTNVEGAEDLEGE-ELIREV--LNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINW 314 (589) Q Consensus 238 ~~~~gd~~~~~~e~~d~e~e-~~i~~~--i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~ 314 (589) ..+..+.+.+.+.+.--+.- ...... -+++-..+.....- -..|+++. -=+......||||=..+..+-+..||. T Consensus 206 ~~~p~~~~~v~~~V~pr~~~~~~~~~~~~~~~~v~~~~~~~~~-~e~P~i~~-Rw~~~~ge~YGrgp~~~~l~D~k~L~~ 283 (543) T protein:vir:88 206 EYKPEQELEVYTHIYIDDESGDFLSYQEIEGVEVDGSDGQYPQ-DALPWIAV-RWTKRDGEHYGRSHVEEYLGDLNSLES 283 (543) T ss_pred hcCCccceEEEEEEEeecCCCcccccccccCeeeecCCCcccc-ccCCceee-eeeecCCCccccchHHHHHHHHHHHHH Confidence 00001112222211100000 000000 00000000000000 02244333 224556677999988999999999997 Q ss_pred HHhHHHHHHHHhCCCcEEechhhhhccccccccccccccccccccccccccccccccccccccCccceeeec--ccHHHH Q lcl|NC_020883. 315 TITRSAVIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRDMEITTFDENGRSMEIHQID--ISKIGD 392 (589) Q Consensus 315 t~S~~srildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~D--irveeh 392 (589) .--....-.++..+|-+.||......... ..+...+.+.+.+.. .+..+++. .++... T Consensus 284 l~~~~l~~~~~~~~pp~~v~~~g~~~~~~--------~~~~~~g~~v~g~~~------------~v~~~~~~~~~~~~~~ 343 (543) T protein:vir:88 284 LNEAMIKFAMISSKVVGLVNPNGITQVRR--------LVKAQTGDFVAGRKA------------DIEFLQLEKTADFTVA 343 (543) T ss_pred HHHHHHHHHHHHhcCceeeccccccchhh--------cccCCCceeecCCCC------------cceeeecccccchhHH Confidence 53344455557899999997654432211 111111111111111 11112222 233444 Q ss_pred HHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHh-hhHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_020883. 393 MDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDL-LTTI-LKSRRLQKEYIDFLKELYESCLWLLNDQDSS 470 (589) Q Consensus 393 ~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~-~~~~-~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~ 470 (589) .+.++.+...|-..- +-. .+.. . ++...|++++..|... ...+ --..+.- .+.|..+++.+..+....+.- T Consensus 344 ~~~i~~~~~rI~~af-~~~-~~~~-~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~---~E~l~Pli~r~~~il~r~g~l 416 (543) T protein:vir:88 344 KSVADAIEARLSYVF-MLN-SAVQ-R-SGERVTAEEIRYVASELEDTLGGVYSILS---QELQLPIVRVLLNQLQATQQI 416 (543) T ss_pred HHHHHHHHHHHHHHH-hhh-hhcc-C-CCCcccHHHHHHHHHHHHHHHhHHHHHHH---HHHHHHHHHHHHHHHHhcCCC Confidence 444444444332100 000 1221 1 2222344666665432 2222 1233332 344455554443333222211 Q ss_pred --cCcccceeeeCCcCCCCCCHHHHH----HHHHHHhccch---------hhHHHHHHHh------CCC---CCHHHHHH Q lcl|NC_020883. 471 --IRIEEPNIETQDMILKPRAELVAE----NMAAYAASKQG---------QSLETTVRRM------NPD---ASEDWIQE 526 (589) Q Consensus 471 --~~~e~p~I~f~D~lPvde~El~~A----~t~~~l~~a~~---------~S~etaVr~L------hpd---w~dE~v~e 526 (589) +..+..++++--+|. .+.++ .+.+++..-+. +....+++.+ .|. -+++ T Consensus 417 P~~p~~~v~~~~vs~l~----~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~---- 488 (543) T protein:vir:88 417 PNLPQEAVEPTVTTGAE----ALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEA---- 488 (543) T ss_pred CCCchhceeeeEEecHH----HHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHH---- Confidence 111112334322221 11111 11111110011 1234444433 121 2233 Q ss_pred HHHHHHhhcccc-----cccccccccc--------ccccccCcccCCCCCCCCCCCCCCCCcc Q lcl|NC_020883. 527 EIARIEEEQAGS-----DTSSLMGINQ--------TFEQMNDNRDEDGNIIEEGDTEEEPSAE 576 (589) Q Consensus 527 Ev~RI~~E~a~~-----~p~~~g~~~~--------~l~~~~~~~~~~~~p~deg~~~eep~~~ 576 (589) |++.+.+++..+ .....|++.. ...+..++.. +++++.-+-- T Consensus 489 e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~p~~~~~ 543 (543) T protein:vir:88 489 EKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAMESAMDTAG--------VQPGPIATQV 543 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHHHHhhhcC--------CCCCCCCCCC Confidence 333343221110 0111111111 1111112211 1222111111 No 161 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=26.42 E-value=1.9 Score=19.00 Aligned_cols=475 Identities=12% Similarity=0.093 Sum_probs=148.4 Q ss_pred eccchhHHHHhhcchhhhhhhhhcCC-----------ccccCHHHH-HHH-------hhccccceeccCcce--eeecCc Q lcl|NC_020883. 6 VRGWTDKTTKNVHGDYERYRQLYEGK-----------HELLFPRAK-RLI-------EEGDAVGRFLDSSQT--ARETQT 64 (589) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~r~l~~g~-----------~~~~f~ra~-~~~-------~~~~~~~~~~~~~~~--~~~~~~ 64 (589) ..-|.||.+---..-++-+||-+-=+ +.+.|..++ +.- -..+.++....+..+ .-..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRN 80 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCC Confidence 55667776544444555555432111 334444332 111 101111111111100 001111 Q ss_pred ceEEEEcchhhhccc-hhhhccccccccccccCCcccchhhccchhhccccc---c-cchhhhhhhhhhhhhhhhHHHHH Q lcl|NC_020883. 65 PYVIFNLPKVIAEIP-ATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQ---D-EEEAGKNENNTVIDLQNEIIEQI 139 (589) Q Consensus 65 ~y~~~n~~~~i~~~p-a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~---~-~~~~~~~~~~~~~~~~~e~i~~v 139 (589) +| ++++++.-.. ..++++.+-.+...... . -.+.+++.-.+--.+ + +....+.+..+.. -+..+ T Consensus 81 ~~---~~~~~l~~~~~~~iv~~~i~~~~~~V~~--~-~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~-----~l~~l 149 (574) T protein:vir:80 81 SQ---DLHKTLKKFGNNIILNAIINTRSNQVSM--Y-CKPARNSETGVGYEIRLKDIEAEPTSHDIANIK-----RIESF 149 (574) T ss_pred cc---cHHHHHHhhccChhHHHHHHHHHHHHHH--H-HHHHHhhhccCceEEEEeccCCCccchhhhhhh-----HHHHH Confidence 11 1221111000 00000000000000000 0 000000000000000 0 0000001111111 12223 Q ss_pred Hhhc----cccc----cchhhHHHHH-HcCceeEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCccceEE Q lcl|NC_020883. 140 TKNS----KLER----RHWSNIVQHQ-VDGGIVAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLH 208 (589) Q Consensus 140 ~kn~----~~~~----~~~~~l~~~~-v~Gg~~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~ 208 (589) +.++ +.++ .|+..++.+. .-|..++.+..+.++ ..+....++...+..+..+. . .....+|++ T Consensus 150 l~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~----~--~~~~~~y~~ 223 (574) T protein:vir:80 150 LENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGK----L--IKNGERFVQ 223 (574) T ss_pred HhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccc----c--ccCceEEEE Confidence 3322 2222 3555555443 445555555555444 34455555544442111110 0 000000100 Q ss_pred EEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEec Q lcl|NC_020883. 209 IYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWA 288 (589) Q Consensus 209 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvP 288 (589) ....+.... + +.--|.|++ T Consensus 224 -----------------------------~~~g~~~~~-------------------~-------------~~~eiih~~ 242 (574) T protein:vir:80 224 -----------------------------VIDNRIVAK-------------------F-------------NERELAFAV 242 (574) T ss_pred -----------------------------EeCCceEEE-------------------E-------------ccccEEEEe Confidence 001111000 0 000166665 Q ss_pred CCC---CCCCcccCcchhhhhHHHHHHHHHHhHHHHHHHHhCCCc--EEechhh-hhccccccccccccccccccccccc Q lcl|NC_020883. 289 NNE---TFMNPYGISALDNLESKQDEINWTITRSAVIYEQNGKPR--ISITKEM-MDTLLNIAYERDGHSAKEASMMTPR 362 (589) Q Consensus 289 N~~---~~~~~lG~SD~~~ie~l~DeLd~t~S~~srildk~gkpR--I~VP~~~-L~t~~g~~~d~dge~~~~~~~~~~~ 362 (589) .++ ....++|.|-+.-+...+.....+-....+.|...+.|+ |.++... |+.- ..++-.+.+.. .+.+ T Consensus 243 ~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e---~~~~lk~~~~~---~~~G 316 (574) T protein:vir:80 243 RNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQ---ALDIFRREWRS---SLAG 316 (574) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHH---HHHHHHHHHHH---Hhcc Confidence 433 334678999887666666654444333455565557788 4443211 2110 00000000000 0000 Q ss_pred c--ccccccccccccccCccceeee--cccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccchhHH-----HHH--H Q lcl|NC_020883. 363 I--DHRDMEITTFDENGRSMEIHQI--DISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQS-----GVA--K 431 (589) Q Consensus 363 ~--d~~dlev~~~de~g~~~~~iq~--Dirveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~S-----g~A--~ 431 (589) . .+.-. +. ...| +.+.++ ...-.+..+..+...+.|..+=+.|+.-+|+...++..+..+ ..+ . T Consensus 317 ~~n~g~~~-vl--~~~G--~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~ 391 (574) T protein:vir:80 317 INGSWQIP-VV--SAED--VKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEK 391 (574) T ss_pred ccccccce-ee--cCCC--ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHH Confidence 0 00000 11 1112 233322 233334455666678888888899999998754332211110 111 1 Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHH Q lcl|NC_020883. 432 FYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETT 511 (589) Q Consensus 432 r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~eta 511 (589) ...+.+ ..+.-|...+..+|.+.+ +.. .+ ....+.|..+-..+++ ..+.+.. +..+++++.-.+ T Consensus 392 ~~~f~~--~tL~P~~~~ie~~ln~~L-----l~~-~~-----~~~~~~f~~~d~~~~~--~~~~~~~-~~~~G~lT~NE~ 455 (574) T protein:vir:80 392 MQASQN--KGLQPLLRFIEDTVNTYI-----VAE-FG-----EKYQFQFRGGDLSAQL--DKLKIIE-QEGKVFRTVNEI 455 (574) T ss_pred HHHHHH--HHHHHHHHHHHHHHHhhh-----hhh-cC-----CceEEEecccchhhHH--HHHHHHH-HHhCCccCHHHH Confidence 111211 122233333334443322 111 11 1123556543222222 2233333 345677777665 Q ss_pred HHHhC-CCCCHHHHHHHHHHHHhhccccccccccccccccccc--cCcccCCCCCCCCCCCCCCCCcchhhhhh-----c Q lcl|NC_020883. 512 VRRMN-PDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQM--NDNRDEDGNIIEEGDTEEEPSAEENEEIE-----K 583 (589) Q Consensus 512 Vr~Lh-pdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~--~~~~~~~~~p~deg~~~eep~~~~~e~~~-----~ 583 (589) -++++ |-.+. -+ ++ .+-......+...-+...++..+. .+...+...+.+++++.++|+..+....+ + T Consensus 456 R~~lgl~Pi~g--GD-~~-~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~ 531 (574) T protein:vir:80 456 RHDKGLEPIKG--GD-VI-LNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQ 531 (574) T ss_pred HHHhCCCCCCC--CC-Ee-eeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhh Confidence 55552 11110 00 00 000000000000000000000000 11111222233555555555432211111 0 Q ss_pred ---cc-ccCC Q lcl|NC_020883. 584 ---EG-EPIA 589 (589) Q Consensus 584 ---~~-~~~~ 589 (589) .+ ++.- T Consensus 532 ~~~~~~~~~~ 541 (574) T protein:vir:80 532 QGLNGKSKKV 541 (574) T ss_pred hhhccchhhh Confidence 00 0000 No 162 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=25.43 E-value=2.1 Score=18.87 Aligned_cols=491 Identities=12% Similarity=0.037 Sum_probs=178.3 Q ss_pred CccceeccchhHHHHhhcc----hhhhhhhhhcCCccccCHHHHH-HHhhccccceeccCcceeee-----------cCc Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHG----DYERYRQLYEGKHELLFPRAKR-LIEEGDAVGRFLDSSQTARE-----------TQT 64 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~----~~~~~r~l~~g~~~~~f~ra~~-~~~~~~~~~~~~~~~~~~~~-----------~~~ 64 (589) |-| .+++.|. .|.++.. --+.+++ -+.+.+|. +.+|+||-.. -.. T Consensus 1 ma~---------~~~~~l~~~~~~~~~~~~--------~~~~~r~~~~~d~~f~--~~~G~QW~~~~~~~~~~~l~~~~~ 61 (720) T protein:vir:35 1 MAE---------TLQKRHEQIMRKFDRAHS--------PQEAVREKCLEATRFA--RVPGGQWEGATAAGSELGKHFEKY 61 (720) T ss_pred Cch---------HHHHHHHHHHHHHHHHHh--------hhHHHHHHHHHHHhhh--ccCCCCCCHHHHHHHHHHHhhCCC Confidence 222 2222222 1111110 0000000 11111111 1246666432 245 Q ss_pred ceEEEEcchhhhccchhhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhhcc Q lcl|NC_020883. 65 PYVIFNLPKVIAEIPATMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKNSK 144 (589) Q Consensus 65 ~y~~~n~~~~i~~~pa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~ 144 (589) |-+++|+.+.++.. . +|.-..+-+ + .-+.|.+.. +.. .+++.=..++..+...|+ T Consensus 62 P~~~~N~i~~~v~~--v-----~g~~~~nr~------------d-~~v~P~~~~--~d~---~~Ae~l~~~~~~~~~~~~ 116 (720) T protein:vir:35 62 PKFEINKISTELNR--I-----ISEYRHNRI------------T-VKFRPGDKT--ASE---ALANKLNGLFRADYEETD 116 (720) T ss_pred CeEEEccHHHHHHH--H-----HhHHHhCCC------------c-eEEEcCCCc--chH---HHHHHHHHHHHHHHHhcC Confidence 77899999988776 2 333322222 1 112232211 111 222223447888888999 Q ss_pred ccccchhhHHHHHHcCceeEEEEEe---cCc-----eeEEEe----cC-ceec-ccc---cCcceeEEEeecCCCccc-- Q lcl|NC_020883. 145 LERRHWSNIVQHQVDGGIVAAPVID---ELG-----PRIVFK----AR-DVYF-PHD---DEKGADLAYYIDHGQYGQ-- 205 (589) Q Consensus 145 ~~~~~~~~l~~~~v~Gg~~~~~~~~---~~~-----~~i~f~----~~-d~~~-P~~---d~~~~div~~~e~~~~~~-- 205 (589) .....-+++.++++.|=.+.++..| +.. -+|.+. ++ .+|+ |+. |..-|.++++.++--.+. T Consensus 117 ~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~ 196 (720) T protein:vir:35 117 GGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYK 196 (720) T ss_pred chHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHH Confidence 8888888888888888888888765 111 122222 12 3444 321 111233333222211000 Q ss_pred --------------------------eEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhh- Q lcl|NC_020883. 206 --------------------------FLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEE- 258 (589) Q Consensus 206 --------------------------~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~- 258 (589) -|.+-..++.... .++.-+|+. -..|+.+. +.++...+. T Consensus 197 ~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~~~-~~~~~~~~~----------~~~g~~~~---~~~~~~~~~~ 262 (720) T protein:vir:35 197 AEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVKKE-SVDVVSFQN----------PLTSETVT---YDSDQLELVE 262 (720) T ss_pred HhCCCccccccccccccccccccCCCceEEEEeeEEEEE-EEEEEEeec----------CCCCCeee---cCCccHHHHH Confidence 0111000000000 000001100 00122111 111110000 Q ss_pred -hhhcccCC-------------------ccc-cccccccCCCCcceEEEecCCC-CCCCcccCcchhhhhHHHHHHHHHH Q lcl|NC_020883. 259 -LIREVLNI-------------------PDD-RPLENFYPGRNRPFISYWANNE-TFMNPYGISALDNLESKQDEINWTI 316 (589) Q Consensus 259 -~i~~~i~i-------------------p~~-~e~~~i~TGv~~plvvyvPN~~-~~~~~lG~SD~~~ie~l~DeLd~t~ 316 (589) .+.. .|+ +.. .+.....+|...|+|-+|-.-. ..+.+..-+-+.++.+.++.+|.+. T Consensus 263 ~~~~~-~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~ 341 (720) T protein:vir:35 263 DELAD-IGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQV 341 (720) T ss_pred HHHhh-hccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHH Confidence 0000 000 000 1111223455556665553211 1222222244677899999999998 Q ss_pred hHHHHHHHHhCCCcEEechhhhhcccc--ccccc-ccc--ccccccccccccccccccccccccccCccceeeecccHHH Q lcl|NC_020883. 317 TRSAVIYEQNGKPRISITKEMMDTLLN--IAYER-DGH--SAKEASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIG 391 (589) Q Consensus 317 S~~srildk~gkpRI~VP~~~L~t~~g--~~~d~-dge--~~~~~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirvee 391 (589) |+...++ ++.++..+...-+.-.+ ..|.. +.+ .+..++ ++...+-.+.. .+..+..++.---... T Consensus 342 s~~~~~~---~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~----~~~~~~G~~~~---~~~~~~~~~~~~~~~~ 411 (720) T protein:vir:35 342 SMLADSA---TQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLN----EIVDKQGNIIA---PPTPVGYTQPQPLNQA 411 (720) T ss_pred HHHHHHH---HcCCccccccCcchHHHHHHHhhccccccccccccc----cccccCccccc---CCCcccccCCCCCchH Confidence 8777776 44444433222111000 00000 000 000000 00000000000 0011222222223345 Q ss_pred HHHHHHHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-- Q lcl|NC_020883. 392 DMDHVKNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDS-- 469 (589) Q Consensus 392 h~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~-- 469 (589) +++.+..-...|=.+++....++|... + +||+|+..+..+........=..+..+.+++.+.++.|-..+.. T Consensus 412 ~~~llq~~~~~i~~vsGi~~~~lG~~s--n----~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~e 485 (720) T protein:vir:35 412 MAALLQQTGADIQEVTGSSQAMQPMPS--N----IAKETVNHLMHRSDMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSD 485 (720) T ss_pred HHHHHHHHHHHHHHHhCCChHHcCccc--c----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 556666655666677899999998522 2 38888765554444333444445555666666665555442210 Q ss_pred -ccC--------------------------------cccceeeeCCcCCC-CCCHHHHHHHHHHHhccchh-hHHHHHHH Q lcl|NC_020883. 470 -SIR--------------------------------IEEPNIETQDMILK-PRAELVAENMAAYAASKQGQ-SLETTVRR 514 (589) Q Consensus 470 -~~~--------------------------------~e~p~I~f~D~lPv-de~El~~A~t~~~l~~a~~~-S~etaVr~ 514 (589) .+- ..+-+|...++... ...+...+.+++++.+-... .....+.- T Consensus 486 r~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~~~~~~~~~ 565 (720) T protein:vir:35 486 RQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQDPMRQVLQG 565 (720) T ss_pred cEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCchhHHHHHH Confidence 000 00012233232111 12333334444443321110 01111110 Q ss_pred h-CCCCCHHHHHHHHHHHHhhccccccccccccccccccccCcccCCCCCCCCCCCCCC-CCcchhhhhhcccccCC Q lcl|NC_020883. 515 M-NPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQMNDNRDEDGNIIEEGDTEEE-PSAEENEEIEKEGEPIA 589 (589) Q Consensus 515 L-hpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~~~~~~~~~~~p~deg~~~ee-p~~~~~e~~~~~~~~~~ 589 (589) + -...+-...++-++||+.-...+ +.. .+. ..+++ -.++--.+.++..-.++ T Consensus 566 ~ile~~d~p~~~e~~erirk~~~~~-----~~~--------~~~----------~~e~qq~~a~~qq~~qq~~~e~~ 619 (720) T protein:vir:35 566 IILDNMEGEGLDEFKEYNRKQLLTQ-----GVV--------KPR----------NTEEEQMVAQMIQQAQQPNAELV 619 (720) T ss_pred HHHHhcCchhHHHHHHHHHhhcchh-----ccc--------Ccc----------ChhHHHHHHHHHHHHHhHhHHHH Confidence 0 01122223344556665544321 000 000 00000 00000011111111111 No 163 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=25.05 E-value=2.1 Score=18.81 Aligned_cols=395 Identities=11% Similarity=0.062 Sum_probs=135.0 Q ss_pred ccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccchhhhccc Q lcl|NC_020883. 7 RGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPATMVSGS 86 (589) Q Consensus 7 ~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa~~~~~~ 86 (589) -||-+++... |..+. +-.... .+.......+..++ +--+...+.+..-+ .++++. T Consensus 1 MG~~~~~~~~-----------~~~~~-~~~~~~------~~~~~~~~g~~~~~-----~~~al~~~~V~~~v--~~Ia~~ 55 (411) T protein:vir:81 1 MGWWSRLTRF-----------FRPRN-ETVDMT------NPLLLQWLGVDPDT-----PRNQLSEATYFACL--KILSES 55 (411) T ss_pred CchHHHHHhh-----------ccCcc-cccccc------hHHHHHHhcCcccC-----hhhhhccHHHHHHH--HHHHHh Confidence 5666554332 22211 000100 00111111222221 11111122222222 334443 Q ss_pred cccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHh-hccc---cccchhhHHHH-HHcCc Q lcl|NC_020883. 87 IGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITK-NSKL---ERRHWSNIVQH-QVDGG 161 (589) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~k-n~~~---~~~~~~~l~~~-~v~Gg 161 (589) +.+++-..-.. ++ .|.+ ....... ..++. .-+. ...|+..++.+ +..|- T Consensus 56 iA~lp~~~~~~---------~~---~~~~------~~~~~~l--------~~lL~~~PN~~~t~~~f~~~l~~~lll~Gn 109 (411) T protein:vir:81 56 LGKLPLKMYQK---------TE---RGIV------KSDREEL--------YNLLKLRPNPYMTSSVFWSTVEMNRNHYGN 109 (411) T ss_pred HhhCceeEEEe---------cC---Ccee------eecccHH--------HHHHhhccCCCCCHHHHHHHHHHHHhhcCC Confidence 33332211100 00 0000 0000000 01110 0111 11234444433 45565 Q ss_pred eeEEEEEecCce-eEEEecCceecccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeec Q lcl|NC_020883. 162 IVAAPVIDELGP-RIVFKARDVYFPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIK 240 (589) Q Consensus 162 ~~~~~~~~~~~~-~i~f~~~d~~~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 240 (589) .++.+..+++++ .+....++++-++-+..+. ...++..+|+.. ... T Consensus 110 a~~~i~r~~g~~~~l~~l~~~~v~~~~~~~~~---------~~~~~~~~~~~~------------------------~~~ 156 (411) T protein:vir:81 110 AYVWCQYSGPQLQALWILPSQYVTIVVDDRGL---------LGEKNAIWYRYN------------------------DPY 156 (411) T ss_pred eEEEEEecCCceEEEEEECCceEEEEEcCccc---------ccccceEEEEEE------------------------ecC Confidence 555555555443 3333444444331111100 001111111100 001 Q ss_pred ccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCCCCCCcccCcchhhhhHHHHHHHHHHhHHH Q lcl|NC_020883. 241 KGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNETFMNPYGISALDNLESKQDEINWTITRSA 320 (589) Q Consensus 241 ~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~~~lG~SD~~~ie~l~DeLd~t~S~~s 320 (589) .|.++. ++.--|+|+.+..+.+.++|.|-+.-+...++.....-.... T Consensus 157 ~g~~~~--------------------------------~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 204 (411) T protein:vir:81 157 DGKMYV--------------------------------FRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMN 204 (411) T ss_pred CceEEE--------------------------------EccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 121111 011118899866667788999988877776665554444445 Q ss_pred HHHHHhCCCcEEechhhhhccccccccccccccccccccccccc--cccccccccccccCccceeeecc--cHHHHHHHH Q lcl|NC_020883. 321 VIYEQNGKPRISITKEMMDTLLNIAYERDGHSAKEASMMTPRID--HRDMEITTFDENGRSMEIHQIDI--SKIGDMDHV 396 (589) Q Consensus 321 rildk~gkpRI~VP~~~L~t~~g~~~d~dge~~~~~~~~~~~~d--~~dlev~~~de~g~~~~~iq~Di--rveeh~~~i 396 (589) +.|...+.|+.++- ..+.+..+..-.....+...+.+.+ +.. -+. +.| +++-+.+. .-.+..+.. T Consensus 205 ~~f~ng~~p~gil~-----~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~-~vl---~~g--~~~~~l~~~~~d~q~~e~~ 273 (411) T protein:vir:81 205 NLYKTGLTGKAVLE-----YTGDLNQEARDRLVKGFEQFANGSKNAGKI-IPV---PLG--MKLVPLDIKLTDSQFFELK 273 (411) T ss_pred HHHhccCCCceEEE-----eCCCCCHHHHHHHHHHHHHHhcCccccCCc-eec---CCC--ceEEEccCCHHHHHHHHHH Confidence 55645566765441 1111110100000000000000000 111 111 222 23333333 333444555 Q ss_pred HHHHHHHHHHhcCCchhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccc Q lcl|NC_020883. 397 KNLIKLMLIETQTSEKAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEP 476 (589) Q Consensus 397 e~L~~~Il~~a~ts~~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p 476 (589) +...++|..+=+.|+.-+|...++. .+.+. ..+..+.+- .+.-|-..+...|.+-+ +-..... ... T Consensus 274 ~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e-~~~~~f~~~--~l~P~~~~ie~~l~~~l----l~~~~~~-----~~~ 339 (411) T protein:vir:81 274 KYTALQIAAAFGIKPNQINDYEKSS--YASAE-AQNLAFYVD--TLLYVLKQYEEEITYKI----LSNDLIS-----QGH 339 (411) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCC--chhHH-HHHHHHHHH--HHHHHHHHHHHHHHhhc----CChhhcC-----CCc Confidence 6678888888899999998543221 11111 111122111 11112212222222110 0000001 111 Q ss_pred eeee--CCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhCCCCCHHHHHHHHHHHHhhcccccccccccccccccc-- Q lcl|NC_020883. 477 NIET--QDMILKPRAELVAENMAAYAASKQGQSLETTVRRMNPDASEDWIQEEIARIEEEQAGSDTSSLMGINQTFEQ-- 552 (589) Q Consensus 477 ~I~f--~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lhpdw~dE~v~eEv~RI~~E~a~~~p~~~g~~~~~l~~-- 552 (589) .|.| .+.+. .+.++.++..+.+.++++++.-.+-+++. + .|.+ | ++..+.+ T Consensus 340 ~~~fd~~~ll~--~d~~~~~~~~~~~~~~g~~t~NE~R~~~g--l-------------------~p~~-g-gD~~~~~~n 394 (411) T protein:vir:81 340 YFKFNVNVILR--ADIKTQMDSLSTAVQNGIMTPNEARDYLD--M-------------------PADD-Y-GNNLMANGN 394 (411) T ss_pred EEEeechhhhc--cCHHHHHHHHHHHHhCCCcCHHHHHHHhC--C-------------------CCCC-C-CCeeeeccC Confidence 2444 33333 34556677888888888887765544432 1 1111 1 1111100 Q ss_pred c--cCcccCCCCCCCCCCCCCC Q lcl|NC_020883. 553 M--NDNRDEDGNIIEEGDTEEE 572 (589) Q Consensus 553 ~--~~~~~~~~~p~deg~~~ee 572 (589) . ++.- ..+.+..+|- T Consensus 395 ~~pl~~~-----~~~~~kgGd~ 411 (411) T protein:vir:81 395 YIPLSML-----GANYGKGGDS 411 (411) T ss_pred ccchhhh-----hhhhccCCCC Confidence 0 0000 0000001111 No 164 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=21.17 E-value=2.6 Score=18.27 Aligned_cols=460 Identities=10% Similarity=0.012 Sum_probs=138.7 Q ss_pred Cc-cceec-------cchhHHHHhhcchhhhhhhhhcCCccccCHH----------------HHHHHhhccccceeccCc Q lcl|NC_020883. 1 MI-DWTVR-------GWTDKTTKNVHGDYERYRQLYEGKHELLFPR----------------AKRLIEEGDAVGRFLDSS 56 (589) Q Consensus 1 ~~-~~~~~-------~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~r----------------a~~~~~~~~~~~~~~~~~ 56 (589) .| -+.|. .+.. -||.--+...|.+ =|-|=.-.+-+| --++|++-.++-.++.++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~ 84 (945) T protein:vir:10 7 IIKGFIVNANEQKRPSFSS-NIKANVDSLSRGK-DYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQE 84 (945) T ss_pred HhhhheeccccccCccccc-cchhchhhhhccc-CCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccc Confidence 00 01110 0111 1111111222211 122222222222 234445555555544333 Q ss_pred ceee------ecCcc-----------eEEEEcchhhhc-cch-----hhhccccccccccccCCcccchhhccchhhccc Q lcl|NC_020883. 57 QTAR------ETQTP-----------YVIFNLPKVIAE-IPA-----TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEG 113 (589) Q Consensus 57 ~~~~------~~~~~-----------y~~~n~~~~i~~-~pa-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 113 (589) +... +..++ +..+|++.-.+- .|+ .++++.+...+=.. ...+. .| T Consensus 85 ~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlkl---------Yrr~e---dG 152 (945) T protein:vir:10 85 PPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEI---------YKHIE---DK 152 (945) T ss_pred cchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEE---------EEecc---cC Confidence 3110 00111 111221110000 011 11111111111100 00000 00 Q ss_pred ccccchhhhhhhhhhhhhhhhHHHHHHhhccccc-------cchhhHH-HHHHcCceeEEEEEecCc--eeEEEecCcee Q lcl|NC_020883. 114 PQDEEEAGKNENNTVIDLQNEIIEQITKNSKLER-------RHWSNIV-QHQVDGGIVAAPVIDELG--PRIVFKARDVY 183 (589) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~e~i~~v~kn~~~~~-------~~~~~l~-~~~v~Gg~~~~~~~~~~~--~~i~f~~~d~~ 183 (589) ..+... .+. ....-+..++...+.++ .|...++ +...-|-.++.+..+..+ +.+....+++. T Consensus 153 ~~~~~~---kk~-----~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~V 224 (945) T protein:vir:10 153 HVNYYL---KRI-----RDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTTI 224 (945) T ss_pred cccccc---ccc-----ccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcce Confidence 000000 000 00111223333222221 1222232 334445555555555433 34444455444 Q ss_pred cccccCcceeEEEeecCCCccceEEEEEeeeccccceeehhhhccccccchhheeecccccccccccccccchhhhhhcc Q lcl|NC_020883. 184 FPHDDEKGADLAYYIDHGQYGQFLHIYRERVEKDGLRTTNMLYPVVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREV 263 (589) Q Consensus 184 ~P~~d~~~~div~~~e~~~~~~~l~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~ 263 (589) .|..+ +. +.....|++. ..|.... . T Consensus 225 ti~~d----------dD---G~~~y~Yv~~--------------------------idG~~~~--~-------------- 249 (945) T protein:vir:10 225 KPILS----------ED---TGIVVGYVQE--------------------------VDGAIVA--H-------------- 249 (945) T ss_pred EEEEc----------CC---CcEEEEEEEe--------------------------cCCceEE--E-------------- Confidence 43100 01 0000111000 0110000 0 Q ss_pred cCCccccccccccCCCCcceEEEecCCCCCC--CcccCcchhhhhHHHHHHHHHHhHHHHHHHHh-CCCcEE--echhhh Q lcl|NC_020883. 264 LNIPDDRPLENFYPGRNRPFISYWANNETFM--NPYGISALDNLESKQDEINWTITRSAVIYEQN-GKPRIS--ITKEMM 338 (589) Q Consensus 264 i~ip~~~e~~~i~TGv~~plvvyvPN~~~~~--~~lG~SD~~~ie~l~DeLd~t~S~~srildk~-gkpRI~--VP~~~L 338 (589) ++ .+=.+.|+.|..... .++|.|-+.-+...+..-..+-..-.+.|.++ +.|+-+ ++.... T Consensus 250 --v~------------a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~ 315 (945) T protein:vir:10 250 --FD------------KRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSY 315 (945) T ss_pred --ec------------CCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccc Confidence 00 001144555544333 34688877654444332221111123344344 356533 322111 Q ss_pred hccc-ccccccc-----cccccc-ccccccccccccccccccccccCccceeeecccHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_020883. 339 DTLL-NIAYERD-----GHSAKE-ASMMTPRIDHRDMEITTFDENGRSMEIHQIDISKIGDMDHVKNLIKLMLIETQTSE 411 (589) Q Consensus 339 ~t~~-g~~~d~d-----ge~~~~-~~~~~~~~d~~dlev~~~de~g~~~~~iq~Dirveeh~~~ie~L~~~Il~~a~ts~ 411 (589) ...+ ....+.+ .+.+.. .++. ..+.++ +. ++|...+-++....-.+..+..+...++|..+=+.|+ T Consensus 316 ~d~k~~~~LseEq~erlKe~wee~~sG~---NnG~pi-VL---deGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP 388 (945) T protein:vir:10 316 KEGDIYPQLSREQLESIQRQLQAIMMGD---YTQVPI-LS---GGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSP 388 (945) T ss_pred cccccccccCHHHHHHHHHHHHHHhCCc---ccccce-ec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 0000 0000000 000000 0000 001111 11 2222222223333444556667778888888889999 Q ss_pred hhcccccCcccchhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHH Q lcl|NC_020883. 412 KAVDFYLDGGASGAQSGVAKFYDLLTTILKSRRLQKEYIDFLKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAEL 491 (589) Q Consensus 412 ~AFg~~~~~g~~~A~Sg~A~r~~~~~~~~Kv~~~R~~~~~aLk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El 491 (589) .-.|...+.+. +.+.. ....+.+ ..+..|...+...|.+.+ +... ......+.|+..... +.. T Consensus 389 ~lLG~~e~st~--SNiEq-q~~~Fv~--~tL~Pil~~IEqeLNrkL-----l~~~-----eg~~i~fdFd~ldl~--D~k 451 (945) T protein:vir:10 389 QDVGILEGSNK--ATAEV-MASLTKA--KGLEPLMATISKGFDEVV-----SEFR-----NEKDIKLWFKEDDLE--KER 451 (945) T ss_pred HHcccCCCCCc--chHHH-HHHHHHH--HHHHHHHHHHHHHHHHhc-----cccc-----cCceeEEEecchhcc--CHH Confidence 99986433221 11111 1112211 123334333444443322 1111 111234566544333 345 Q ss_pred HHHHHHHHHhccchhhHHHHHHHhC-CCCC--HHHHHHHHHHHHhhccc-cccc---cccccccccccccCcccCCCCCC Q lcl|NC_020883. 492 VAENMAAYAASKQGQSLETTVRRMN-PDAS--EDWIQEEIARIEEEQAG-SDTS---SLMGINQTFEQMNDNRDEDGNII 564 (589) Q Consensus 492 ~~A~t~~~l~~a~~~S~etaVr~Lh-pdw~--dE~v~eEv~RI~~E~a~-~~p~---~~g~~~~~l~~~~~~~~~~~~p~ 564 (589) +.++..+.+.++|+++.-.+-++++ |-.+ |+- -+.....+ .+.. ..|+..+.+.+. +...|. T Consensus 452 sraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~l------li~~nn~~P~d~~~ka~~ga~p~q~aq~-----~~dqp~ 520 (945) T protein:vir:10 452 DWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVP------FSGLRNWKPEDEQAKAQQGAMPPQLAQA-----MADQPS 520 (945) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccee------eeccccccccccccccccCCCCcccccC-----CCCCCC Confidence 6788888889999999876655552 1111 110 00000000 0000 000000000000 000111 Q ss_pred CCCCCCCCCCcchhhhhhcccccCC Q lcl|NC_020883. 565 EEGDTEEEPSAEENEEIEKEGEPIA 589 (589) Q Consensus 565 deg~~~eep~~~~~e~~~~~~~~~~ 589 (589) .+|.+.+|.+ .+..+....++. T Consensus 521 ~kGGe~dEns---~~psE~kda~~e 542 (945) T protein:vir:10 521 QQGGGVDENS---SVPSEQKNAGLE 542 (945) T ss_pred CCCCCCCCCC---CCCCcccchHHH Confidence 1111111111 111111111111 No 165 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=20.38 E-value=2.8 Score=18.15 Aligned_cols=441 Identities=10% Similarity=0.051 Sum_probs=133.9 Q ss_pred CccceeccchhHHHHhhcchhhhhhhhhcCCccccCHHHHHHHhhccccceeccCcceeeecCcceEEEEcchhhhccch Q lcl|NC_020883. 1 MIDWTVRGWTDKTTKNVHGDYERYRQLYEGKHELLFPRAKRLIEEGDAVGRFLDSSQTARETQTPYVIFNLPKVIAEIPA 80 (589) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~r~l~~g~~~~~f~ra~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~n~~~~i~~~pa 80 (589) ..+=.++||..+..-.---.+.-++++|.|.. +-.++.+.+. |..-..+-+ T Consensus 52 ~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~--~~~~~i~t~~-------------------------~~va~~~~i-- 102 (535) T protein:vir:10 52 IADGNVAGQYSVASISDVLSTKKLLKAYADND--IVQAIIRTRT-------------------------NQVLTYSNP-- 102 (535) T ss_pred cccCCcccccccCccccccCHHHHHHHhccCh--hHHHHHHHHH-------------------------HHHHHHHHH-- Confidence 22223345544432211112233344444432 1122222221 000001111 Q ss_pred hhhccccccccccccCCcccchhhccchhhcccccccchhhhhhhhhhhhhhhhHHHHHHhh--cc------ccccchhh Q lcl|NC_020883. 81 TMVSGSIGQIKSSITTGEIDPDIEEDTDEMIEGPQDEEEAGKNENNTVIDLQNEIIEQITKN--SK------LERRHWSN 152 (589) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~e~i~~v~kn--~~------~~~~~~~~ 152 (589) .++++.+...+=-+- +.+.. .+...+ .++. -++.++.+ +. |+..|... T Consensus 103 ~~~s~~~~~~~i~l~----~~~~~-~~~~~~-----------~~~~--------~l~~lL~~~PN~~~~~~~~~~~~~~~ 158 (535) T protein:vir:10 103 SRYNRNGVGFKVELK----DATKV-MSKAQI-----------KRAH--------EIEDFIYNTGSEYYEWRDTFPRLLTK 158 (535) T ss_pred HHHhcccCcceeEEE----eccCC-Ccchhh-----------hhhh--------HHHHHHHhCCCCCCChhHHHHHHHHH Confidence 112222211110000 00000 000000 0000 11122211 11 11223333 Q ss_pred HHH-HHHcCce-eEEEEEecCc--eeEEEecCceecccccCcceeEEEeecCCCc-cceEEEEEeeeccccceeehhhhc Q lcl|NC_020883. 153 IVQ-HQVDGGI-VAAPVIDELG--PRIVFKARDVYFPHDDEKGADLAYYIDHGQY-GQFLHIYRERVEKDGLRTTNMLYP 227 (589) Q Consensus 153 l~~-~~v~Gg~-~~~~~~~~~~--~~i~f~~~d~~~P~~d~~~~div~~~e~~~~-~~~l~~~~~~~~~~~~~~~~~~y~ 227 (589) |+. ++..||. +..+..+..+ ..+....+++..+..++ +++. .++...+ T Consensus 159 lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~----------~~~~~~~~~~~~----------------- 211 (535) T protein:vir:10 159 IINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSP----------RSKDQPRKFEQF----------------- 211 (535) T ss_pred HHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcC----------ccccCceEEEEE----------------- Confidence 332 2344444 4444444433 34555566665542111 1111 1111111 Q ss_pred cccccchhheeecccccccccccccccchhhhhhcccCCccccccccccCCCCcceEEEecCCC---CCCCcccCcchhh Q lcl|NC_020883. 228 VVKAKGDVKKEIKKGELVTNVEGAEDLEGEELIREVLNIPDDRPLENFYPGRNRPFISYWANNE---TFMNPYGISALDN 304 (589) Q Consensus 228 ~~~~~~~~~~~~~~gd~~~~~~e~~d~e~e~~i~~~i~ip~~~e~~~i~TGv~~plvvyvPN~~---~~~~~lG~SD~~~ 304 (589) ...+.... + + ..+ |+|++.++ ....++|.|-+.- T Consensus 212 -----------~~~~~~~~---~----------------~-~~e------------iih~~~~~~~~~~~~~~G~Spi~~ 248 (535) T protein:vir:10 212 -----------VSETKSVK---F----------------S-ERN------------LTFINYWNLSDTDRRGYGYSPVEA 248 (535) T ss_pred -----------ecCceeEE---E----------------C-ccc------------EEEEeccCCCCcccccccccHHHH Confidence 01111000 0 0 001 66765433 3446789998877 Q ss_pred hhHHHHHHHHHHhHHHHHHHHhCCCcEEe--chhhhhcccccccccccccccccccccccccccc-ccccccccccCccc Q lcl|NC_020883. 305 LESKQDEINWTITRSAVIYEQNGKPRISI--TKEMMDTLLNIAYERDGHSAKEASMMTPRIDHRD-MEITTFDENGRSME 381 (589) Q Consensus 305 ie~l~DeLd~t~S~~srildk~gkpRI~V--P~~~L~t~~g~~~d~dge~~~~~~~~~~~~d~~d-lev~~~de~g~~~~ 381 (589) +...+.....+-..-.+.|...+.|+.++ |..+=..++.-..+.-.+... ..+.+.+..- .-+. ...| +. T Consensus 249 ~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~---~~~~G~~nag~~~vl--~~~g--~~ 321 (535) T protein:vir:10 249 SIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWT---SQGSGLGGAWKIPIL--AAKD--AK 321 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHH---HHhcCcccccccccc--cCCC--ce Confidence 66666555444334455554445665433 321100000000000000000 0000000000 0011 0112 23 Q ss_pred eeeec--ccHHHHHHHHHHHHHHHHHHhcCCchhcccccCcccch------hHHHHHHHHHhhhHH-HHHHHHHHHHHHH Q lcl|NC_020883. 382 IHQID--ISKIGDMDHVKNLIKLMLIETQTSEKAVDFYLDGGASG------AQSGVAKFYDLLTTI-LKSRRLQKEYIDF 452 (589) Q Consensus 382 ~iq~D--irveeh~~~ie~L~~~Il~~a~ts~~AFg~~~~~g~~~------A~Sg~A~r~~~~~~~-~Kv~~~R~~~~~a 452 (589) +.+.. ..-.+..+..+...+.|..+=+.|+.-.|...+.+-+. +..+.-....+..-+ ..+.-|...+... T Consensus 322 ~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ 401 (535) T protein:vir:10 322 FVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQV 401 (535) T ss_pred EEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 33332 22334455556677888777799999888744322110 000111111110000 1122233233333 Q ss_pred HHHHHHHHHHHHhhcCcccCcccceeeeCCcCCCCCCHHHHHHHHHHHhccchhhHHHHHHHhC-CCCC--HHHHH-HHH Q lcl|NC_020883. 453 LKELYESCLWLLNDQDSSIRIEEPNIETQDMILKPRAELVAENMAAYAASKQGQSLETTVRRMN-PDAS--EDWIQ-EEI 528 (589) Q Consensus 453 Lk~li~~~l~L~~~~~~~~~~e~p~I~f~D~lPvde~El~~A~t~~~l~~a~~~S~etaVr~Lh-pdw~--dE~v~-eEv 528 (589) |.+.+ |. ..+. ...+.|...+..| .+..+++.++..+ +.++.-.+-++++ |-.. |.-.. --. T Consensus 402 ln~~L-----l~-----~~~~-~~~f~f~~l~~~d--~~~r~~~~~~~~~-g~lT~NE~R~~~gl~piegGD~~~~~~~~ 467 (535) T protein:vir:10 402 INDKI-----MR-----YVDT-DYRFSFTLGDAQD--KLQEEQVWKLKLA-NGYFINEYRKDHGLKTVDGLDVPGFIGSA 467 (535) T ss_pred Hhhhc-----cc-----ccCC-eEEEEeccccccC--HHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCccccccccch Confidence 32211 11 1111 2246676655444 4455667666554 4466665544442 1111 11000 000 Q ss_pred HHHHhhccc-cccccccccccccccccCcccC-----CCCCCCCCCCCCCCC---cchhhhh--hccccc Q lcl|NC_020883. 529 ARIEEEQAG-SDTSSLMGINQTFEQMNDNRDE-----DGNIIEEGDTEEEPS---AEENEEI--EKEGEP 587 (589) Q Consensus 529 ~RI~~E~a~-~~p~~~g~~~~~l~~~~~~~~~-----~~~p~deg~~~eep~---~~~~e~~--~~~~~~ 587 (589) ......++. ....+-... ++-.+. +++++ +....+.|.+.++++ +.++++. .++++. T Consensus 468 ~~~~~~~~~~~~~~p~~~~-~~~~~~-~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 468 ENFINATGFGQPNVPDSSD-DSGSTL-GERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSNNEDADT 535 (535) T ss_pred hhcccccccccccCCCCCC-CccccC-CccccCcccccccccccCCCCCCCCCCcCCCCCccccccccCC Confidence 000000000 000000000 011110 01100 000111122221111 1111111 111222 Done!