Query lcl|NC_012418.1_cdsid_YP_002727853.1 [gene=PPphikF77_gp34] [protein=putative head-tail connector protein] [protein_id=YP_002727853.1] [location=22142..23674]
Match_columns 510
No_of_seqs 114 out of 153
Neff 7.6
Searched_HMMs 1612
Date Thu Nov 7 12:58:52 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_34 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_34_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:6322 Length: 510 # 100.0 2E-187 1E-190 1044.8 58.0 510 1-510 1-510 (510)
2 protein:vir:78942 Length: 510 100.0 3E-187 2E-190 1042.9 58.5 510 1-510 1-510 (510)
3 protein:vir:80211 Length: 514 100.0 2E-171 1E-174 956.1 57.3 505 1-510 1-511 (514)
4 protein:vir:99672 Length: 532 100.0 4E-166 2E-169 927.6 55.7 506 1-510 10-529 (532)
5 protein:vir:78696 Length: 542 100.0 2E-165 1E-168 923.5 55.1 507 1-510 1-536 (542)
6 protein:vir:94572 Length: 535 100.0 2E-164 1E-167 918.1 56.6 505 1-510 11-532 (535)
7 protein:vir:2198 Length: 536 # 100.0 4E-164 3E-167 916.2 56.1 506 1-510 9-532 (536)
8 protein:vir:10447 Length: 536 100.0 6E-164 4E-167 915.4 56.2 506 1-510 9-532 (536)
9 protein:vir:96988 Length: 516 100.0 2E-164 1E-167 917.8 52.7 496 1-507 12-516 (516)
10 protein:vir:103330 Length: 517 100.0 2E-163 1E-166 913.2 56.6 500 1-510 8-516 (517)
11 protein:vir:100039 Length: 522 100.0 7E-164 4E-167 915.1 54.2 502 1-510 1-513 (522)
12 protein:vir:7017 Length: 515 # 100.0 1E-163 8E-167 913.7 54.9 498 1-509 11-515 (515)
13 protein:vir:105641 Length: 516 100.0 1E-163 7E-167 913.8 54.4 497 1-508 12-516 (516)
14 protein:vir:1538 Length: 535 # 100.0 3E-162 2E-165 906.4 55.2 506 1-510 10-532 (535)
15 protein:vir:8883 Length: 543 # 100.0 4E-162 2E-165 905.6 54.4 504 1-510 10-534 (543)
16 protein:vir:103765 Length: 549 100.0 4E-162 2E-165 905.4 52.0 500 1-510 8-542 (549)
17 protein:vir:1785 Length: 555 # 100.0 3E-161 2E-164 900.3 54.7 503 1-510 1-552 (555)
18 protein:vir:3361 Length: 535 # 100.0 9E-161 5E-164 898.1 56.3 506 1-510 10-532 (535)
19 protein:vir:94709 Length: 522 100.0 1E-159 7E-163 891.9 55.1 502 1-510 8-521 (522)
20 protein:vir:107822 Length: 555 100.0 2E-159 1E-162 891.1 53.2 500 1-510 6-543 (555)
21 protein:vir:98506 Length: 555 100.0 2E-159 1E-162 891.1 53.2 500 1-510 6-543 (555)
22 protein:vir:107404 Length: 555 100.0 2E-159 1E-162 891.1 53.2 500 1-510 6-543 (555)
23 protein:vir:102668 Length: 547 100.0 1E-157 8E-161 880.6 53.0 496 1-510 2-541 (547)
24 protein:vir:7321 Length: 556 # 100.0 3E-156 2E-159 873.5 53.0 497 1-510 5-538 (556)
25 protein:vir:95315 Length: 559 100.0 3E-156 2E-159 873.0 53.2 499 1-510 5-538 (559)
26 protein:vir:94599 Length: 641 100.0 1.5E-88 9.5E-92 502.0 42.5 493 1-510 25-619 (641)
27 protein:vir:80165 Length: 651 100.0 6.3E-68 3.9E-71 389.0 45.7 496 1-510 21-629 (651)
28 protein:vir:95449 Length: 584 100.0 1.4E-34 8.7E-38 206.2 36.7 475 1-501 17-584 (584)
29 protein:vir:8846 Length: 705 # 100.0 2.5E-27 1.6E-30 166.4 44.1 487 1-510 15-632 (705)
30 protein:vir:3139 Length: 599 # 100.0 2.8E-27 1.7E-30 166.2 36.7 485 1-510 21-599 (599)
31 protein:vir:95821 Length: 763 99.9 3.8E-22 2.4E-25 138.0 42.0 489 1-510 28-682 (763)
32 protein:vir:93630 Length: 776 99.6 9.5E-14 5.9E-17 92.0 36.2 481 1-510 43-678 (776)
33 protein:vir:108295 Length: 711 99.5 5.9E-12 3.6E-15 82.2 40.6 494 1-510 28-664 (711)
34 protein:vir:105429 Length: 708 99.0 4.3E-09 2.7E-12 66.4 35.3 487 1-510 1-649 (708)
35 protein:vir:77597 Length: 725 98.9 1.9E-08 1.2E-11 62.9 33.5 487 1-510 4-645 (725)
36 protein:vir:9263 Length: 725 # 98.9 2.3E-08 1.4E-11 62.4 33.3 487 1-510 4-645 (725)
37 protein:vir:3296 Length: 714 # 98.8 3.8E-08 2.4E-11 61.3 39.3 483 1-510 17-666 (714)
38 protein:vir:2764 Length: 714 # 98.8 3.8E-08 2.4E-11 61.3 39.3 483 1-510 17-666 (714)
39 protein:vir:9950 Length: 714 # 98.8 3.8E-08 2.4E-11 61.3 39.3 483 1-510 17-666 (714)
40 protein:vir:817 Length: 714 # 98.8 3.8E-08 2.4E-11 61.3 39.3 483 1-510 17-666 (714)
41 protein:vir:10117 Length: 714 98.8 3.8E-08 2.4E-11 61.3 39.3 483 1-510 17-666 (714)
42 protein:vir:100920 Length: 725 98.8 3.8E-08 2.4E-11 61.2 32.8 490 1-510 4-641 (725)
43 protein:vir:172 Length: 708 # 98.7 8.3E-08 5.1E-11 59.4 30.6 481 1-510 1-649 (708)
44 protein:vir:104437 Length: 714 98.6 1.7E-07 1.1E-10 57.7 37.1 483 1-510 17-666 (714)
45 protein:vir:105520 Length: 706 98.6 2.2E-07 1.4E-10 57.1 31.7 487 1-510 1-656 (706)
46 protein:vir:3520 Length: 720 # 98.5 3.6E-07 2.3E-10 55.9 27.9 483 1-510 1-644 (720)
47 protein:vir:105619 Length: 772 98.4 7.1E-07 4.4E-10 54.3 35.0 475 1-510 20-660 (772)
48 protein:vir:3028 Length: 500 # 98.4 1.1E-06 6.6E-10 53.3 31.5 424 1-510 11-500 (500)
49 protein:vir:9815 Length: 500 # 98.4 1.1E-06 6.6E-10 53.3 31.5 424 1-510 11-500 (500)
50 protein:vir:2341 Length: 488 # 98.3 1.7E-06 1.1E-09 52.2 35.4 419 1-510 12-479 (488)
51 protein:vir:99916 Length: 504 98.3 2E-06 1.3E-09 51.8 33.0 417 1-510 18-496 (504)
52 protein:vir:80959 Length: 499 98.2 3.2E-06 2E-09 50.7 30.7 431 1-510 5-494 (499)
53 protein:vir:104082 Length: 485 98.1 5.3E-06 3.3E-09 49.5 36.8 416 1-510 17-479 (485)
54 protein:vir:1587 Length: 508 # 98.0 8E-06 4.9E-09 48.5 34.6 427 1-510 3-506 (508)
55 protein:vir:97447 Length: 474 98.0 8.1E-06 5E-09 48.5 34.1 414 1-510 28-468 (474)
56 protein:vir:94498 Length: 474 98.0 8.1E-06 5E-09 48.5 34.1 414 1-510 28-468 (474)
57 protein:vir:38 Length: 496 # N 98.0 8.8E-06 5.5E-09 48.3 35.2 426 1-506 5-496 (496)
58 protein:vir:80680 Length: 441 97.9 1.1E-05 6.7E-09 47.8 37.0 411 1-507 1-441 (441)
59 protein:vir:95113 Length: 474 97.9 1.1E-05 6.7E-09 47.8 34.4 416 1-510 28-471 (474)
60 protein:vir:79043 Length: 479 97.9 1.3E-05 8.2E-09 47.3 38.5 425 1-503 24-479 (479)
61 protein:vir:96179 Length: 468 97.8 1.8E-05 1.1E-08 46.6 34.4 407 1-506 27-468 (468)
62 protein:vir:98444 Length: 434 97.8 1.9E-05 1.2E-08 46.5 31.5 386 28-509 1-434 (434)
63 protein:vir:96266 Length: 474 97.7 2.3E-05 1.4E-08 46.0 33.6 413 1-510 28-468 (474)
64 protein:vir:95899 Length: 474 97.7 2.3E-05 1.4E-08 46.0 33.6 413 1-510 28-468 (474)
65 protein:vir:98883 Length: 517 97.7 2.3E-05 1.5E-08 46.0 37.2 452 1-510 3-515 (517)
66 protein:vir:101494 Length: 527 97.7 2.4E-05 1.5E-08 45.9 28.0 442 1-510 28-525 (527)
67 protein:vir:102239 Length: 527 97.7 2.5E-05 1.6E-08 45.8 28.0 442 1-510 28-525 (527)
68 protein:vir:7768 Length: 484 # 97.7 3E-05 1.8E-08 45.4 33.2 416 1-509 16-484 (484)
69 protein:vir:4782 Length: 522 # 97.7 3E-05 1.9E-08 45.4 36.9 447 1-510 7-520 (522)
70 protein:vir:78227 Length: 480 97.6 3.4E-05 2.1E-08 45.1 36.1 418 1-510 4-474 (480)
71 protein:vir:79703 Length: 505 97.6 3.9E-05 2.4E-08 44.8 36.5 418 1-501 3-505 (505)
72 protein:vir:96240 Length: 511 97.5 5E-05 3.1E-08 44.2 41.0 428 1-510 13-506 (511)
73 protein:vir:5961 Length: 503 # 97.5 5.9E-05 3.7E-08 43.8 35.9 437 1-510 26-501 (503)
74 protein:vir:4223 Length: 486 # 97.5 6.4E-05 3.9E-08 43.6 36.0 417 1-510 17-476 (486)
75 protein:vir:106571 Length: 499 97.4 6.7E-05 4.2E-08 43.5 37.8 421 1-510 17-485 (499)
76 protein:vir:2427 Length: 485 # 97.4 7.7E-05 4.8E-08 43.1 36.3 416 1-510 13-476 (485)
77 protein:vir:97171 Length: 512 97.4 7.7E-05 4.8E-08 43.1 40.4 425 1-510 38-505 (512)
78 protein:vir:3964 Length: 453 # 97.3 9.7E-05 6E-08 42.6 38.4 400 1-510 18-443 (453)
79 protein:vir:78537 Length: 480 97.2 0.00012 7.5E-08 42.1 36.0 417 1-510 4-474 (480)
80 protein:vir:9306 Length: 511 # 97.2 0.00013 7.8E-08 42.0 41.1 431 1-510 13-504 (511)
81 protein:vir:94805 Length: 492 97.2 0.00013 8E-08 41.9 35.2 412 1-506 45-492 (492)
82 protein:vir:99781 Length: 511 97.2 0.00014 8.6E-08 41.7 39.9 432 1-510 13-506 (511)
83 protein:vir:1236 Length: 483 # 97.2 0.00014 8.9E-08 41.6 35.2 412 1-510 36-479 (483)
84 protein:vir:9922 Length: 489 # 97.2 0.00015 9.1E-08 41.6 34.3 420 1-510 13-489 (489)
85 protein:vir:78805 Length: 511 97.2 0.00015 9.2E-08 41.6 40.3 428 1-510 13-504 (511)
86 protein:vir:96366 Length: 511 97.2 0.00015 9.2E-08 41.6 40.3 428 1-510 13-504 (511)
87 protein:vir:8184 Length: 474 # 97.2 0.00015 9.3E-08 41.5 30.5 408 1-503 12-474 (474)
88 protein:vir:733 Length: 453 # 97.1 0.00017 1.1E-07 41.3 39.2 404 1-501 17-453 (453)
89 protein:vir:95806 Length: 440 97.1 0.00017 1.1E-07 41.2 36.3 400 1-504 5-440 (440)
90 protein:vir:99522 Length: 470 97.1 0.00018 1.1E-07 41.1 38.8 409 1-505 25-470 (470)
91 protein:vir:106639 Length: 481 97.1 0.00019 1.2E-07 41.0 41.1 415 1-510 30-476 (481)
92 protein:vir:80453 Length: 535 97.0 0.00023 1.4E-07 40.6 28.9 435 1-510 32-530 (535)
93 protein:vir:93747 Length: 472 96.9 0.0003 1.8E-07 39.9 36.7 408 1-510 25-464 (472)
94 protein:vir:99072 Length: 479 96.8 0.00033 2.1E-07 39.7 36.4 410 1-510 14-474 (479)
95 protein:vir:94101 Length: 474 96.8 0.00034 2.1E-07 39.6 37.0 424 1-504 16-474 (474)
96 protein:vir:105889 Length: 474 96.8 0.00034 2.1E-07 39.6 37.0 424 1-504 16-474 (474)
97 protein:vir:2500 Length: 501 # 96.7 0.00036 2.3E-07 39.4 35.8 418 1-510 28-495 (501)
98 protein:vir:97336 Length: 492 96.7 0.00042 2.6E-07 39.1 35.8 407 1-510 45-486 (492)
99 protein:vir:107112 Length: 478 96.6 0.00048 3E-07 38.8 32.3 412 1-508 27-478 (478)
100 protein:vir:78907 Length: 518 96.5 0.00052 3.2E-07 38.6 31.5 427 1-509 7-518 (518)
101 protein:vir:3609 Length: 452 # 96.5 0.0006 3.7E-07 38.2 39.0 402 1-510 18-445 (452)
102 protein:vir:96839 Length: 474 96.4 0.00061 3.8E-07 38.2 35.1 412 1-506 27-474 (474)
103 protein:vir:105461 Length: 470 96.3 0.00074 4.6E-07 37.8 36.4 413 1-510 2-466 (470)
104 protein:vir:103951 Length: 511 96.3 0.00076 4.7E-07 37.7 41.0 421 1-510 13-506 (511)
105 protein:vir:9871 Length: 429 # 96.3 0.00082 5.1E-07 37.5 39.4 403 1-504 1-429 (429)
106 protein:vir:105292 Length: 478 96.1 0.001 6.4E-07 36.9 35.8 416 1-509 26-478 (478)
107 protein:vir:96494 Length: 501 96.0 0.0011 7E-07 36.8 37.6 417 1-510 43-495 (501)
108 protein:vir:4898 Length: 502 # 95.9 0.0013 8.3E-07 36.3 38.7 428 1-510 34-493 (502)
109 protein:vir:94546 Length: 506 95.6 0.0017 1.1E-06 35.8 37.5 422 1-510 23-500 (506)
110 protein:vir:102950 Length: 471 95.2 0.0025 1.5E-06 34.9 33.5 412 1-508 6-471 (471)
111 protein:vir:345 Length: 663 # 95.0 0.0029 1.8E-06 34.5 33.4 469 1-510 12-645 (663)
112 protein:vir:94599 Length: 641 94.4 0.0044 2.8E-06 33.5 21.8 444 1-510 39-632 (641)
113 protein:vir:9751 Length: 422 # 93.4 0.0075 4.7E-06 32.2 29.6 386 1-485 1-422 (422)
114 protein:vir:78393 Length: 489 93.1 0.0086 5.3E-06 31.9 26.2 427 1-505 17-489 (489)
115 protein:vir:2732 Length: 501 # 93.1 0.0087 5.4E-06 31.9 39.9 427 1-508 40-501 (501)
116 protein:vir:94742 Length: 409 92.9 0.0097 6E-06 31.6 34.1 375 1-467 1-409 (409)
117 protein:vir:7430 Length: 563 # 92.6 0.011 6.7E-06 31.4 29.4 462 1-510 1-533 (563)
118 protein:vir:78083 Length: 537 92.2 0.012 7.7E-06 31.0 38.7 437 1-510 13-523 (537)
119 protein:vir:94956 Length: 452 92.2 0.013 7.8E-06 31.0 23.6 419 1-503 1-452 (452)
120 protein:vir:95149 Length: 501 91.9 0.014 8.6E-06 30.8 29.0 430 1-506 1-501 (501)
121 protein:vir:80040 Length: 461 91.8 0.014 8.7E-06 30.7 21.9 420 1-499 1-461 (461)
122 protein:vir:95014 Length: 491 90.9 0.018 1.1E-05 30.1 27.7 421 1-506 17-491 (491)
123 protein:vir:3989 Length: 392 # 90.1 0.023 1.4E-05 29.6 25.8 327 27-446 1-392 (392)
124 protein:vir:1023 Length: 392 # 90.1 0.023 1.4E-05 29.6 25.8 327 27-446 1-392 (392)
125 protein:vir:1634 Length: 409 # 89.2 0.028 1.7E-05 29.1 31.9 376 1-467 1-409 (409)
126 protein:vir:9568 Length: 410 # 88.6 0.031 1.9E-05 28.8 34.8 377 14-487 1-410 (410)
127 protein:vir:102602 Length: 456 88.6 0.031 1.9E-05 28.8 33.8 413 1-502 9-456 (456)
128 protein:vir:105819 Length: 456 88.6 0.031 1.9E-05 28.8 33.8 413 1-502 9-456 (456)
129 protein:vir:7407 Length: 392 # 88.5 0.032 2E-05 28.8 25.1 311 63-446 1-392 (392)
130 protein:vir:7987 Length: 456 # 83.7 0.066 4.1E-05 27.1 34.9 412 1-507 9-456 (456)
131 protein:vir:1785 Length: 555 # 82.7 0.074 4.6E-05 26.8 22.6 455 5-510 1-523 (555)
132 protein:vir:96738 Length: 505 82.2 0.079 4.9E-05 26.6 20.8 435 1-510 19-504 (505)
133 protein:vir:100150 Length: 437 80.0 0.099 6.2E-05 26.1 15.6 390 1-505 1-437 (437)
134 protein:vir:3153 Length: 467 # 79.0 0.11 6.8E-05 25.9 20.2 409 44-504 1-467 (467)
135 protein:vir:96783 Length: 488 77.8 0.12 7.5E-05 25.6 32.8 403 1-465 22-488 (488)
136 protein:vir:4995 Length: 384 # 76.0 0.14 8.7E-05 25.3 23.4 346 1-448 1-384 (384)
137 protein:vir:97265 Length: 513 70.0 0.21 0.00013 24.2 30.4 433 1-510 6-507 (513)
138 protein:vir:78161 Length: 355 68.5 0.24 0.00015 24.0 17.9 292 163-509 1-355 (355)
139 protein:vir:1326 Length: 457 # 67.4 0.25 0.00016 23.9 15.9 415 1-510 1-452 (457)
140 protein:vir:4698 Length: 251 # 66.0 0.27 0.00017 23.7 10.8 239 1-360 1-251 (251)
141 protein:vir:10321 Length: 495 63.2 0.32 0.0002 23.3 18.0 422 1-510 16-492 (495)
142 protein:vir:79538 Length: 502 61.1 0.36 0.00022 23.0 28.8 432 1-510 11-489 (502)
143 protein:vir:4854 Length: 386 # 58.4 0.41 0.00026 22.7 21.4 345 45-506 1-386 (386)
144 protein:vir:1266 Length: 416 # 54.1 0.51 0.00032 22.2 23.3 385 7-509 1-416 (416)
145 protein:vir:3843 Length: 397 # 49.6 0.63 0.00039 21.7 18.8 359 29-506 1-397 (397)
146 protein:vir:4952 Length: 386 # 44.7 0.8 0.00049 21.1 23.4 347 1-452 1-386 (386)
147 protein:vir:102330 Length: 451 44.3 0.81 0.0005 21.1 35.9 409 1-502 1-451 (451)
148 protein:vir:81152 Length: 411 41.6 0.92 0.00057 20.8 23.9 363 1-465 3-411 (411)
149 protein:vir:4828 Length: 382 # 39.4 1 0.00063 20.5 25.7 330 29-449 1-382 (382)
150 protein:vir:5249 Length: 437 # 36.4 1.2 0.00073 20.2 18.9 405 18-502 1-437 (437)
151 protein:vir:95542 Length: 548 30.2 1.6 0.00099 19.5 24.7 442 1-510 12-523 (548)
152 protein:vir:107851 Length: 175 27.8 0.74 0.00046 21.3 3.2 106 1-115 39-175 (175)
153 protein:vir:101647 Length: 460 27.3 1.9 0.0012 19.1 25.1 395 1-489 1-460 (460)
154 protein:vir:107742 Length: 537 24.8 2.1 0.0013 18.8 20.0 425 1-510 70-533 (537)
155 protein:vir:6240 Length: 457 # 22.9 2.4 0.0015 18.5 18.7 394 6-510 1-442 (457)
156 protein:vir:98396 Length:
441 21.5 2.6 0.0016 18.3 16.6 366 1-466 15-441 (441) 157 protein:vir:9408 Length: 441 # 21.3 2.6 0.0016 18.3 17.2 366 1-466 15-441 (441) 158 protein:vir:79984 Length: 441 21.3 2.6 0.0016 18.3 17.2 366 1-466 15-441 (441) 159 protein:vir:93610 Length: 454 21.0 2.7 0.0017 18.2 18.9 397 8-503 1-454 (454) No 1 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=1.5e-187 Score=1044.75 Aligned_cols=510 Identities=98% Similarity=1.376 Sum_probs=501.1 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccCCC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~ 80 (510) ||+||++||++|||++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+||||++|||||+++ T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~ 80 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccccCCC Confidence 99999999999999999999999999999999999998888888999999999999999999999999999999999999 Q ss_pred hHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeCCC Q lcl|NC_012418. 
81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDAT 160 (510) Q Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d~~ 160 (510) |..+++.++.+.+.+++++||++||++++.+|++||||.++|++|+||++|||+++|++++..+|++|||++|||++|++ T Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~~~~~~~~~pl~~y~v~~d~~ 160 (510) T protein:vir:63 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDSDAATVVAWSLRSYAVRRDAT 160 (510) T ss_pred hHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcCCCcEEEEEEcceeEEeeCCC Confidence 99999998888899999999999999999999999999999999999999999999999998889999999999999999 Q ss_pred CCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceE Q lcl|NC_012418. 161 GRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYI 240 (510) Q Consensus 161 G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~ 240 (510) |+||+||||++||+++|+++|+++..++..+++|+++|+|||+|+|+++++|||+|||+|++|++++.+|+|+|++|||+ T Consensus 161 G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~~~~~~~~~~e~P~~ 240 (510) T protein:vir:63 161 GRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYI 240 (510) T ss_pred cCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCceeccccccccccCcee Confidence 99999999999999999999999998888889999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccccccccc Q lcl|NC_012418. 
241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) Q Consensus 241 ~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~ 320 (510) ++||++.+||+||||||+++|||+|+||+|+++.+++++++++|||+|+|+|+++|+++..+++|.++||++++++++++ T Consensus 241 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v~~~~~ 320 (510) T protein:vir:63 241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) T ss_pred eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhccCCCceeecCCcccceeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_012418. 321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) Q Consensus 321 ~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l 400 (510) ++++||+++++.|++++++|+++||+++.++|+++||||||++|++||+++||||||||++|||.|||+|+|+||+++++ T Consensus 321 ~~~~d~~~~~~~i~~~~~rI~~af~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl 400 (510) T protein:vir:63 321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) T ss_pred CcccchHHHHHHHHHHHHHHHHHHHhhcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHH Q lcl|NC_012418. 
401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAE 480 (510) Q Consensus 401 ~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~ 480 (510) +|+|++.+++.+|+|+|+|+|+|+++++.++.|+++++++++|++|+||+|++++++|+++||||..|+||+|||+++++ T Consensus 401 ~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~ 480 (510) T protein:vir:63 401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAE 480 (510) T ss_pred CCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999988889999999999999 Q ss_pred HHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 481 QRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 481 q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) |++|++++|+++++++..|+++..+|+||| T Consensus 481 ~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 481 QQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 999999999999999999999999999999 No 2 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=3.3e-187 Score=1042.91 Aligned_cols=510 Identities=98% Similarity=1.375 Sum_probs=500.8 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccCCC Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~ 80 (510) ||+|+++||++|||++|+++|+||++||+|++|++++++++++..++|||||++|+++|||||||+||||++|||||+++ T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~ 80 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccccCCC Confidence 99999999999999999999999999999999999998888888899999999999999999999999999999999999 Q ss_pred hHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeCCC Q lcl|NC_012418. 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDAT 160 (510) Q Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d~~ 160 (510) |..+++.++.+.+.++|++||++||++++.+|++||||.++|++|+||++|||+++|++++..+|++|||++|||++|++ T Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~~d~~ 160 (510) T protein:vir:78 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDAT 160 (510) T ss_pred hHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCCCeEEEEEcceeEEeeCCC Confidence 99999988888899999999999999999999999999999999999999999999999998899999999999999999 Q ss_pred CCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceE Q lcl|NC_012418. 
161 GRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYI 240 (510) Q Consensus 161 G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~ 240 (510) |+||+|||||+||+++|+++|+++..++..+++|+++|+|||+|+|+++++|||+|+|+|+||++++.+|+|+|++|||+ T Consensus 161 G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i~~~~~~~~~e~P~~ 240 (510) T protein:vir:78 161 GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYI 240 (510) T ss_pred cCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeeeccccccccccCCee Confidence 99999999999999999999999999888889999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccccccccc Q lcl|NC_012418. 241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) Q Consensus 241 ~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~ 320 (510) ++||++.+||+||||||+++|||+|+||+|+++.+++++++++|+|+|+|+|+++|+++..+++|.++||++++++++++ T Consensus 241 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~~~g~~v~g~~~~v~~~~~ 320 (510) T protein:vir:78 241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) T ss_pred eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccCCCceeecCCccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_012418. 
321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) Q Consensus 321 ~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l 400 (510) ++++||+++++.|++++++|+++||+++.++|+++||||||++|++||+++||||||||++|||.|||+|+|+||+++++ T Consensus 321 ~~~~d~~~~~~~i~~~~~rI~~aF~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl 400 (510) T protein:vir:78 321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) T ss_pred CcccchHHHHHHHHHHHHHHHHHHhhccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHH Q lcl|NC_012418. 401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAE 480 (510) Q Consensus 401 ~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~ 480 (510) +|+||+.+++.+|+|+|+|+|+|+++++.++.|+++++++++|++|+||+|++++++++++||||..|+||+|||+++|+ T Consensus 401 ~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~ 480 (510) T protein:vir:78 401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAE 480 (510) T ss_pred CCCCcccccceeeecccHHHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999987789999999999999 Q ss_pred HHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 
481 QRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 481 q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) |++|+++++++++++...++++..++++|| T Consensus 481 ~~~~q~~~~~~~~~a~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 481 EQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccCCCC Confidence 999999999999999999999999999999 No 3 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=100.00 E-value=2.3e-171 Score=956.07 Aligned_cols=505 Identities=46% Similarity=0.734 Sum_probs=470.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccccc--CCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLM--VDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~--~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) ||+++.+.|++.+|++|+++|+||++||+|+++ +.++++++.+..++|||||++|+++|||||||+||||++|||||+ T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQIE 80 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 999999999999999999999999999999976 445666677778999999999999999999999999999999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeC Q lcl|NC_012418. 
79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d 158 (510) ++|..++.....+.+..++++||++||++++++|++||||.++|++|+||++|||+++|++++..+|++|||++|||++| T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~~d 160 (514) T protein:vir:80 81 LDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGTGKMLVWTMQSYTVRRT 160 (514) T ss_pred cCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCCCcEEEEEcCeEEEeeC Confidence 99888777777888899999999999999999999999999999999999999999999999888999999999999999 Q ss_pred CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCc Q lcl|NC_012418. 159 ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCP 238 (510) Q Consensus 159 ~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P 238 (510) ++|+|||||||++||+++|+++|+++..+...+++++++|+|||||+|++++++||+|||+|++|++++++|+|++++|| T Consensus 161 ~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i~~es~y~~~e~P 240 (514) T protein:vir:80 161 SHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRVGPESSYPAHLCP 240 (514) T ss_pred CCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceeecccCccccccCC Confidence 99999999999999999999999999888777888999999999999999999999999999999999999999999999 Q ss_pred eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccccccc Q lcl|NC_012418. 
239 YIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAY 318 (510) Q Consensus 239 ~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~ 318 (510) |+++||++.+||+||||||+++|||+|+||+|+++.+++++++++|+|+++|+|+++|+++..+++|+++||++++++++ T Consensus 241 ~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~~~g~~v~g~~~~v~~~ 320 (514) T protein:vir:80 241 YVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPGQVGSVASY 320 (514) T ss_pred eeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcccCCceeecCCCccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh- Q lcl|NC_012418. 319 ERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD- 397 (510) Q Consensus 319 ~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~- 397 (510) +.++.+||+++++.|++++++|+++||++...+|+++||||||++|++||+++||||||||++|||.|||+|+|+||++ T Consensus 321 ~~~~~~d~~~~~~~i~~~~~rI~~aFml~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~ 400 (514) T protein:vir:80 321 ERGDYNKIAQASASVESIVMRLNRAFMYTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRG 400 (514) T ss_pred ecCcccchHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 9999999999999999999999999999877799999999999999999999999999999999999999999999976 Q ss_pred --cCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcCh-hhHhhccCHHHHHHHHHHHcCCCHhHccCCHHH Q lcl|NC_012418. 398 --ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEE 474 (510) Q Consensus 398 --~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~e 474 (510) +.+|++|++.+++.+++++++|+|+++++++.++.++++.+++. 
+++.++||+|++++++|+++|||++.|++++|+ T Consensus 401 ~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~ 480 (514) T protein:vir:80 401 NGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDV 480 (514) T ss_pred ccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHH Confidence 34566777788999999999999999999999999999999985 779999999999999999999999889999999 Q ss_pred HHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 475 LQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 475 v~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) +++++++++|++++++++++. .+++...+|| T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 511 (514) T protein:vir:80 481 VAAEAEQEAALAQQQLDVASG-----ALAAETSAGV 511 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHH-----HHHHhhhccc Confidence 999887776655555544322 2223334455 No 4 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=3.6e-166 Score=927.63 Aligned_cols=506 Identities=31% Similarity=0.460 Sum_probs=459.6 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =+.+|++||++|| |++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+||||++|||||+ T Consensus 10 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~ 89 (532) T protein:vir:99 10 AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 2678999999996 89999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC------CcEEEEEece Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE------ATVVAWSLRS 152 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~------~~~~~~pl~~ 152 (510) ++|+.+.+....+.+.++|++||++||++|+++|++||||.++|++|+||++|||++||++++. .+|++|||++ T Consensus 90 ~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f~~~pl~~ 169 (532) T protein:vir:99 90 VSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN 169 (532) T ss_pred CCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCcccceEEEEcCe Confidence 9999999988888899999999999999999999999999999999999999999999997643 2699999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCe-eeccccc Q lcl|NC_012418. 
153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGV-RVGEEGR 231 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~-~~~~~~~ 231 (510) |||++|++|+|++||||+++++++|++++++++.+...+++|+++|+|||+|+|+++ +|+|++ |++++|+ .++.+|+ T Consensus 170 y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~-~~~~~~-~~~~~g~~~~~~~~~ 247 (532) T protein:vir:99 170 FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPE-AMVFRS-YQEIDGEIVAGTEGE 247 (532) T ss_pred EEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCC-CCeeEE-EEeecCceecccccc Confidence 999999999999999999999999999999998887778899999999999999876 577765 5667775 5678999 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCC Q lcl|NC_012418. 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG 311 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~ 311 (510) |++++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|+|+|+|+|+++|+++..+++|+++||. T Consensus 248 ~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~ 327 (532) T protein:vir:99 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGR 327 (532) T ss_pred cccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhccCCCcceecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHH Q lcl|NC_012418. 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) Q Consensus 312 ~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) ++++++++.++++||+++++.|++++++|+++||++.. 
++|+++||||||++|++||+++||||||||++|||.|||+| T Consensus 328 ~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) T protein:vir:99 328 KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) T ss_pred cccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999855 69999999999999999999999999999999999999999 Q ss_pred HHHHHhhcCCCCCCcc-cccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHcc Q lcl|NC_012418. 391 CLSEVDDALLQGLITK-QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFY 469 (510) Q Consensus 391 ~~~il~~~~l~~~~~~-~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~ 469 (510) +|+||+++|++|++|+ .+++.+++|+|+|+|+|+++++.+|.+.++.+.+ ++.++||+|+++++||+++|||+..|+ T Consensus 408 ~~~il~r~g~lP~~p~~~~~~~iv~~is~Laraq~~~~l~~~~~~laq~~p--~~~d~id~d~~~~~~a~~~GV~~~~i~ 485 (532) T protein:vir:99 408 LLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAG--LQDDDINLLDVKMRLANSLGMDTTGLI 485 (532) T ss_pred HHHHHHhcCCCCCCChhhcccceeecchHHHHHHHHHHHHHHHHHHHhhcc--hhhhhCCHHHHHHHHHHHhCCChhhcc Confidence 9999998876665555 5688999999999999999999999999887765 356789999999999999999888899 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhHH---HhhhhhhhhhcccCC Q lcl|NC_012418. 
470 KSEEELQAEAEQRRQQAAQAQAAQET---LLEGASDMTNALAGV 510 (510) Q Consensus 470 rs~~ev~~~r~q~~q~~~~~~~~~~~---~~~ga~~~~~~~ag~ 510 (510) ||+||++++++|++++++++++++++ ++++++......+|- T Consensus 486 r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 529 (532) T protein:vir:99 486 LTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGM 529 (532) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhHHhhcCC Confidence 99999999997766555444433322 223333344444555 No 5 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=2e-165 Score=923.47 Aligned_cols=507 Identities=24% Similarity=0.341 Sum_probs=457.1 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) ||+++++||++|| |++|+++|+||++||+|++++.+++.++++..++|||||++|+++|||||||+||||++|||||. T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFFKLQ 80 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 9999999999996 89999999999999999999999988888889999999999999999999999999999999999 Q ss_pred CChHHHhhhcc-cchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEee Q lcl|NC_012418. 79 LTDAIRREADS-RDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRR 157 (510) Q Consensus 79 ~~d~~~~~~~~-~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~ 157 (510) ++|+.+.+... 
.+...++++.||++||++++++|++||||.++|++|+||++|||+++|++++ +|++|||++|+|++ T Consensus 81 ~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~--~~~~~pl~~y~v~~ 158 (542) T protein:vir:78 81 INDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKK--TLKVYPLDRYVIER 158 (542) T ss_pred CCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCC--CceEEecceeEEee Confidence 99999888655 4444688999999999999999999999999999999999999999999886 69999999999999 Q ss_pred CCCCCeEEEEEEEeecHHHhhHHhhHhhh----hhhhccCCCceEEEEEEEEeecC--------CCceEEEEEEEecCee Q lcl|NC_012418. 158 DATGRWMDIVLKQRYKSKDLDEAYKQDLM----RAGRNLSGSGSVDLYTHVQRKKG--------TAMEYAELYHEIDGVR 225 (510) Q Consensus 158 d~~G~vd~i~r~~~~t~~~l~~~~~~~~~----~~~~~~~~~~~v~i~~~v~~~~~--------~~~p~~sv~~e~~~~~ 225 (510) |++|+||+|||||+||+++|+++|+.+.. +...+++++.+++|+|+|+|+++ ++++|||+|++++|++ T Consensus 159 d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~ 238 (542) T protein:vir:78 159 DGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQECDGKE 238 (542) T ss_pred CCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEEEecccc Confidence 99999999999999999999999986543 34456788999999999999864 4789999999999998 Q ss_pred e-ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCC Q lcl|NC_012418. 
226 V-GEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEM 304 (510) Q Consensus 226 ~-~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~ 304 (510) + +.++.|+|++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++|+++..+++ T Consensus 239 v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~~~~~ 318 (542) T protein:vir:78 239 IKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLARAGT 318 (542) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCC Confidence 7 456777789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHH Q lcl|NC_012418. 305 GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQ 384 (510) Q Consensus 305 g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l 384 (510) |.+++|.+++++++++++++||+++++.|++++++|+++||++. .+|+++||||||++|++||+++||||||||++||| T Consensus 319 g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~-~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L 397 (542) T protein:vir:78 319 GAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILN-VRQSERTTATEVREVQMELDRQLSGIYGSLTVELL 397 (542) T ss_pred ceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcccc-cCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999864 68999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCC Q lcl|NC_012418. 
385 SPLAYVCLSEVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSV 463 (510) Q Consensus 385 ~Pli~r~~~il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv 463 (510) .|||+|+|+||+++|++| +|++.+++.++++|..++|+++++++.+|.+.++.+.+++++.+.||+|++++++|+++|| T Consensus 398 ~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gv 477 (542) T protein:vir:78 398 TPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGI 477 (542) T ss_pred HHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCC Confidence 999999999999877655 5555678889999999999999999999999998877778888999999999999999999 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhh-------hhhhhhhc-----ccCC Q lcl|NC_012418. 464 DTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLE-------GASDMTNA-----LAGV 510 (510) Q Consensus 464 p~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~-------ga~~~~~~-----~ag~ 510 (510) |++.|+||+||++++++|++++++++..+.++... +..++.++ ++|- T Consensus 478 p~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~~~~~~~a~~~~~~~~~ 536 (542) T protein:vir:78 478 DTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEKMMQQINAPGQEAPAGP 536 (542) T ss_pred CHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhhcCCCCcCCCCCC Confidence 98889999999999998877766655544333221 11112222 1121 No 6 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=2e-164 Score=918.06 Aligned_cols=505 Identities=31% Similarity=0.458 Sum_probs=459.5 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =+.++++||++|| |++|+++|+||++||+|++++.++++++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 11 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 89 (535) T protein:vir:94 11 AENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLALFPM-QTWMKLT 89 (535) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhhhcCC-CCccccc Confidence 3667999999996 899999999999999999999999988888899999999999999999999999976 7999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEeceEEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSYAV 155 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~~~i 155 (510) ++|..++++...+.+.+++++||++||++++.+|++||||.++|++|+||++|||+++|++++.+ +|++|||++||| T Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~y~v 169 (535) T protein:vir:94 90 ISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYRLSSYVV 169 (535) T ss_pred cChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccceEEEEcCeEEE Confidence 99999999888888999999999999999999999999999999999999999999999987654 699999999999 Q ss_pred eeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-cccccccc Q lcl|NC_012418. 156 RRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGRWPI 234 (510) Q Consensus 156 ~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~y~~ 234 (510) ++|++|+||+|||||++++++|+++|++.+.++. 
+++++++|+|||||+|++ ++|||.+ |++++|+.+ +.++.|+| T Consensus 170 ~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~-~~~~~~~v~v~~~v~~~~-~~~~~~~-~~e~~g~~~~~~~~~~g~ 246 (535) T protein:vir:94 170 QRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQ-EHKGDEMIDVYTHIYLDE-ESGEYLK-YEEIDGVEVEGTDASYPV 246 (535) T ss_pred eeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhcc-ccCCCceeEEEEEEEeeC-CCCcEEE-EEEecCeeeccccccCcc Confidence 9999999999999999999999999999886554 568999999999999875 4688765 668899876 57888889 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccc Q lcl|NC_012418. 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEA 314 (510) Q Consensus 235 ~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~ 314 (510) ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|+|+++|+|+++|+++..+++|+++||.+++ T Consensus 247 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~ 326 (535) T protein:vir:94 247 DACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPED 326 (535) T ss_pred ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccCCCceeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHH Q lcl|NC_012418. 315 VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 315 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) ++++++++++||+++++.|++++++|+++||++.. 
++|+++||||||++|++||+++||||||||++|||.|||+|+|+ T Consensus 327 v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~ 406 (535) T protein:vir:94 327 ISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLK 406 (535) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999855 69999999999999999999999999999999999999999999 Q ss_pred HHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCH Q lcl|NC_012418. 394 EVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSE 472 (510) Q Consensus 394 il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~ 472 (510) ||+++|++| +|++.+++.+|++|++|+|+++++++.+|.+.++.++| ..+++.||+|++++++++++|||++.|+||+ T Consensus 407 il~r~g~lP~~p~~~v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P-~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~ 485 (535) T protein:vir:94 407 QLQATNQIPELPKEAVEPTISTGMEALGRGQDLDKLERCIAAWSALAP-MQGDPDINIATIKLRIANAIGIDTSGILKTP 485 (535) T ss_pred HHHhCCCCCCCChhhccceEeehHHHHHHHHHHHHHHHHHHHHHhhCh-HHhhhcCCHHHHHHHHHHHhCCChhhhcCCH Confidence 998876554 56667899999999999999999999999998888654 5667789999999999999999988899999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHh-hhh--------hhhhhcccCC Q lcl|NC_012418. 473 EELQAEAEQRRQQAAQAQAAQETLL-EGA--------SDMTNALAGV 510 (510) Q Consensus 473 ~ev~~~r~q~~q~~~~~~~~~~~~~-~ga--------~~~~~~~ag~ 510 (510) ||++|+++|++++++++++++++.. +++ ++.....+|. 
T Consensus 486 eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~ 532 (535) T protein:vir:94 486 EEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAGM 532 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhcc Confidence 9999999888777666666544322 221 2233445555 No 7 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=4.3e-164 Score=916.21 Aligned_cols=506 Identities=30% Similarity=0.429 Sum_probs=460.6 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) -.++|++||++|| |++|+++|+||++||+|++|+.+++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 9 ~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WFrl~ 87 (536) T protein:vir:21 9 AEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM-QTWMRLT 87 (536) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 5669999999996 899999999999999999999999988888899999999999999999999999976 7999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC----cEEEEEeceEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----TVVAWSLRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~----~~~~~pl~~~~ 154 (510) ++|+++.+....+...+++++||+.||++++.+|++||||.++|++|+||++|||+++|++++.. 
+|++|||++|| T Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl~~~~ 167 (536) T protein:vir:21 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) T ss_pred cChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEE Confidence 99999998887788899999999999999999999999999999999999999999999987754 38999999999 Q ss_pred EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeec-cccccc Q lcl|NC_012418. 155 VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVG-EEGRWP 233 (510) Q Consensus 155 i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~-~~~~y~ 233 (510) |++|++|+||+|||||+||+++|+++|+.++.+...+++|+++|+|||+|+|+++ +++ |++|++++|+++. ++|.|+ T Consensus 168 v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~-~~~~~e~~g~~v~~~~g~~~ 245 (536) T protein:vir:21 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDED-SGE-YLRYEEVEGMEVQGSDGTYP 245 (536) T ss_pred EeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecC-CCc-EEEEeccCCeeeccccCccc Confidence 9999999999999999999999999999998888888899999999999999865 455 5789999999885 555667 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcc Q lcl|NC_012418. 
234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~ 313 (510) |++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|+|+++|+|+++|+++..+++|.++||.++ T Consensus 246 f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~ 325 (536) T protein:vir:21 246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPE 325 (536) T ss_pred cccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHH Q lcl|NC_012418. 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) +++++++++++||+++++.|++++++|+++||++.. ++|+++||||||++|++||+++||||||||++|||.|||+|+| T Consensus 326 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~ 405 (536) T protein:vir:21 326 DISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 (536) T ss_pred cceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999854 6999999999999999999999999999999999999999999 Q ss_pred HHHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCC Q lcl|NC_012418. 
393 SEVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS 471 (510) Q Consensus 393 ~il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs 471 (510) +||+++|++| +|++.+++.++++|++|+|+++++++.+|.+.++.++| ..+++.||+|++++++|+++||+|..|+|| T Consensus 406 ~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P-e~ld~~id~d~~~~~~a~~~Gv~p~~~irt 484 (536) T protein:vir:21 406 KQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAP-MRDDPDINLAMIKLRIANAIGIDTSGILLT 484 (536) T ss_pred HHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhch-hhhcccCCHHHHHHHHHHHcCCChhhhcCC Confidence 9998877655 55556799999999999999999999999999887664 446678999999999999999966679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHhhhh---------hhhhhcccCC Q lcl|NC_012418. 472 EEELQAEAEQRRQQAAQAQAAQETLLEGA---------SDMTNALAGV 510 (510) Q Consensus 472 ~~ev~~~r~q~~q~~~~~~~~~~~~~~ga---------~~~~~~~ag~ 510 (510) +|||+++|+|+++++++++++++....-+ +++....+|+ T Consensus 485 ~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~ 532 (536) T protein:vir:21 485 EEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhcccc Confidence 99999999988777776666654332211 2233345666 No 8 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=6.2e-164 Score=915.35 Aligned_cols=506 Identities=30% Similarity=0.434 Sum_probs=460.6 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) -.++|++||++|| |++|+++|+||++||+|++|+.++++++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 9 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WFrl~ 87 (536) T protein:vir:10 9 AEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM-QTWMRLT 87 (536) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcCC-Ccccccc Confidence 5669999999996 899999999999999999999999988888899999999999999999999999976 7999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC----cEEEEEeceEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----TVVAWSLRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~----~~~~~pl~~~~ 154 (510) ++|+++.+....+...+++++||+.||++++.+|++||||.++|++|+||++|||+++|++++.. +|++|||++|| T Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl~~~~ 167 (536) T protein:vir:10 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) T ss_pred cChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEE Confidence 99999998887788899999999999999999999999999999999999999999999987754 38999999999 Q ss_pred EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-ccccccc Q lcl|NC_012418. 
155 VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGRWP 233 (510) Q Consensus 155 i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~y~ 233 (510) |++|++|+||+|||||+||+++|+++|+.+..+...+++|+++|+|||+|+|+++ +++ |++|++++|+++ ..+|.|+ T Consensus 168 v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~-~~~-~~~~~e~~g~~v~~~~g~~~ 245 (536) T protein:vir:10 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEA-SGE-YLRYEEVEGMEVQGSDGTYP 245 (536) T ss_pred EeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecC-CCc-EEEEEeecCccccccccccc Confidence 9999999999999999999999999999998888888899999999999999865 344 578899999988 4566678 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcc Q lcl|NC_012418. 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~ 313 (510) |++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|+|+++|+|+++|+++..+++|.++||.++ T Consensus 246 f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~ 325 (536) T protein:vir:10 246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPE 325 (536) T ss_pred cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHH Q lcl|NC_012418. 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) +++++++++++||+++++.|++++++|+++||++.. 
++|+++||||||++|++||+++||||||||++|||.|||+|+| T Consensus 326 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~ 405 (536) T protein:vir:10 326 DISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 (536) T ss_pred cceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999854 6999999999999999999999999999999999999999999 Q ss_pred HHHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCC Q lcl|NC_012418. 393 SEVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS 471 (510) Q Consensus 393 ~il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs 471 (510) +||+++|++| +|++.+++.++++|++|+|+++++++.+|.+.++.++| ..+++.||+|++++++|+++||+|..|+|| T Consensus 406 ~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P-~~ld~~id~d~~~~~~a~~~Gv~p~~~irt 484 (536) T protein:vir:10 406 KQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAP-MRDDPDINLAMIKLRIANAIGIDTSGILLT 484 (536) T ss_pred HHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhch-hhhcccCCHHHHHHHHHHHcCCCchhhcCC Confidence 9998877655 55556799999999999999999999999999888664 445667999999999999999966679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHhhhh---------hhhhhcccCC Q lcl|NC_012418. 
472 EEELQAEAEQRRQQAAQAQAAQETLLEGA---------SDMTNALAGV 510 (510) Q Consensus 472 ~~ev~~~r~q~~q~~~~~~~~~~~~~~ga---------~~~~~~~ag~ 510 (510) +|||+++|+|++++++++++++++...-+ +++....+|+ T Consensus 485 ~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~ 532 (536) T protein:vir:10 485 EEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhcccc Confidence 99999999988777776666654332211 2233445666 No 9 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=2.3e-164 Score=917.75 Aligned_cols=496 Identities=30% Similarity=0.447 Sum_probs=455.4 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||++|| |++|+++|+||++||+|++|++++++ .+.+++|||||++|+++|||||||+||||++|||||+ T Consensus 12 ~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~L~ 89 (516) T protein:vir:96 12 KRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDN--ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 89 (516) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCc--cccCCcccchHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 6789999999996 89999999999999999999887644 3456899999999999999999999999999999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeC Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d 158 (510) ++|..++.++..+.+..++++||++||++++.+|++||||.++|++|+||++|||++||++++. 
+|++|||++|||++| T Consensus 90 ~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~-~~~~~pl~~y~v~~d 168 (516) T protein:vir:96 90 LTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG-AISAIPMHHYVVNRD 168 (516) T ss_pred cChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC-CEEEEEcCeEEEeeC Confidence 9999998888888899999999999999999999999999999999999999999999998764 799999999999999 Q ss_pred CCCCeEEEEEEEeecHHHhhHHhhHhhh--hhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccccccccc Q lcl|NC_012418. 159 ATGRWMDIVLKQRYKSKDLDEAYKQDLM--RAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHL 236 (510) Q Consensus 159 ~~G~vd~i~r~~~~t~~~l~~~~~~~~~--~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+|+++|||+++++++|+++|++... +...+++|+++|+|||+|+|++++ |+++|+++|+++++.+|+|+|++ T Consensus 169 ~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~---~~~~~~~~d~~~~~~es~~~~~e 245 (516) T protein:vir:96 169 TNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDG---FWELKQSADDIPVGKVSKIKSEK 245 (516) T ss_pred CCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCc---eeEEEEEeCceeecccccccccc Confidence 9999999999999999999999976542 334567899999999999998876 79999999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccccc Q lcl|NC_012418. 
237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~ 316 (510) |||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|+|+++|+|+++|+++..+++|.++||++++++ T Consensus 246 ~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~i~~g~~~~v~ 325 (516) T protein:vir:96 246 LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIH 325 (516) T ss_pred CCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccCCCceeecCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCcccchHHHHHHHHHHHHHHHHHhhcc-ccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+++++.|++++++|+++||++ +.++|+++||||||++|++||+.+||||||||++|||.|||+|++.++ T Consensus 326 ~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~ 405 (516) T protein:vir:96 326 IVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA 405 (516) T ss_pred eeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhc Confidence 9999999999999999999999999999998 567899999999999999999999999999999999999999998876 Q ss_pred hhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcCh-hhHhhccCHHHHHHHHHHHcCCCHhHccCCHHH Q lcl|NC_012418. 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEE 474 (510) Q Consensus 396 ~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~e 474 (510) . |++|+.++++.+|+|+++|+|+++++++.++.++++.+++. 
+++.++||+|++++++++++|||++ ++||+|| T Consensus 406 ~----p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~-~irs~ee 480 (516) T protein:vir:96 406 G----ESFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELP-FLKSAEE 480 (516) T ss_pred C----CCCccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCcc-ccCCHHH Confidence 4 77888889999999999999999999999999999998874 6788999999999999999999985 9999999 Q ss_pred HHHHHHHHHHHHHHHHHhhHHHhh--h-hhhhhhcc Q lcl|NC_012418. 475 LQAEAEQRRQQAAQAQAAQETLLE--G-ASDMTNAL 507 (510) Q Consensus 475 v~~~r~q~~q~~~~~~~~~~~~~~--g-a~~~~~~~ 507 (510) |+++++|++++++++++++++.++ | +.++...+ T Consensus 481 v~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 481 MAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 999998887766666655444322 1 11122222 No 10 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=1.5e-163 Score=913.19 Aligned_cols=500 Identities=29% Similarity=0.416 Sum_probs=456.5 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =|+||++||++|| |++|+++|+||++||+|+++++++++ .+.+++|||||++|+++|||||+|+||||++|||||+ T Consensus 8 e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 85 (517) T protein:vir:10 8 NKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD--LSSQNAWQDDGASATNFLSNKLSQVLFPAQRSFFRID 85 (517) T ss_pred cHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC--ccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 2599999999995 99999999999999999999877643 3446899999999999999999999999999999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeC Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d 158 (510) ++|+.+++.+......++|++||++||++++.+|++||||.++|++|+||++|||+++|+++...+|++|||++|||++| T Consensus 86 ~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~pl~~y~v~~d 165 (517) T protein:vir:10 86 LTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHPDKTSPIQAVPLHHYCVRRD 165 (517) T ss_pred CCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEeCCCCcEEEEEcCeEEEeeC Confidence 99999999988889999999999999999999999999999999999999999999999998888999999999999999 Q ss_pred CCCCeEEEEEEEeecHHHhhHHhhHhhhh--hhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccccccccc Q lcl|NC_012418. 159 ATGRWMDIVLKQRYKSKDLDEAYKQDLMR--AGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHL 236 (510) Q Consensus 159 ~~G~vd~i~r~~~~t~~~l~~~~~~~~~~--~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+||+||||+++|+++|+++|+++... 
...+++|+++|+|||+|+|++++ ++++|+++||++++.+|+|+|++ T Consensus 166 ~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~---~~~~~~~~d~~~~~~~s~y~~~e 242 (517) T protein:vir:10 166 NNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDG---KYLIRQSADDVPVGKESTVTEDK 242 (517) T ss_pred CCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCC---ceEEEEEeCceeecccccccccc Confidence 99999999999999999999999987643 23467899999999999998765 57889999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccccc Q lcl|NC_012418. 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~ 316 (510) |||+++||++.+||+||||||+++|||+|+||+|+++.+++++++++|||+++|+|+++|+++..+++|+++||+++++. T Consensus 243 ~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~~~g~~~~g~~~~v~ 322 (517) T protein:vir:10 243 SPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEGGSGAVLHGVEGDIH 322 (517) T ss_pred CCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCCCccccccCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCcccchHHHHHHHHHHHHHHHHHhhccc-cCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 
317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+++++.|++++++|+++||+++ .++|+++||||||++|++||+++||||||||++|||+|||+|+|++| T Consensus 323 ~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l 402 (517) T protein:vir:10 323 IVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGI 402 (517) T ss_pred eeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999986 56899999999999999999999999999999999999999999999 Q ss_pred hhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcCh-hhHhhccCHHHHHHHHHHHcCCCHhHccCCHHH Q lcl|NC_012418. 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEE 474 (510) Q Consensus 396 ~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~e 474 (510) .+. +|+..++|.+++++++|+|+++++++.++.++++.+++. +++++.||+|++++++|+++|||++ ++||++| T Consensus 403 ~~~----l~~~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~-~irs~~e 477 (517) T protein:vir:10 403 SSI----LTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFP-FFKTQDE 477 (517) T ss_pred hhh----cCCCCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChh-hcCCHHH Confidence 753 444568999999999999999999999999999988764 5677889999999999999999984 9999999 Q ss_pred HHHHHHHHHHHHHHHHHhhHH-Hhhhhhhhhhc--ccCC Q lcl|NC_012418. 475 LQAEAEQRRQQAAQAQAAQET-LLEGASDMTNA--LAGV 510 (510) Q Consensus 475 v~~~r~q~~q~~~~~~~~~~~-~~~ga~~~~~~--~ag~ 510 (510) |++++++++++++++++++++ ..+++.++.+. 
++|= T Consensus 478 v~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~ 516 (517) T protein:vir:10 478 LNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGG 516 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCC Confidence 999998777766665555333 22333333333 4444 No 11 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=7e-164 Score=915.06 Aligned_cols=502 Identities=27% Similarity=0.343 Sum_probs=453.7 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCC--CCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) || +++||++|| |++|+++|+||++||+|++++.++ +.++++..++|||||++|+++||||||++||||++|||| T Consensus 1 m~--~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (522) T protein:vir:10 1 MK--ARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFK 78 (522) T ss_pred Cc--hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 87 889999996 899999999999999999988764 456677789999999999999999999999999999999 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEe Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVR 156 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~ 156 (510) |+++|+.+.+. 
..+...+++++||++||++++++|++||||.++|++|+||++|||+++|++++ +|++|||++|||+ T Consensus 79 l~~~d~~l~~~-~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~--~~~~~pl~~y~v~ 155 (522) T protein:vir:10 79 LQVRDDKLGEE-LDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKD--GLKTFPLTRYVIN 155 (522) T ss_pred ccCChHHHhhh-cChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCC--CceEEEcceEEEe Confidence 99999987764 34456688999999999999999999999999999999999999999999987 5899999999999 Q ss_pred eCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhh--hccCCCceEEEEEEEEeecCCCceEEEEEEEecCee-eccccccc Q lcl|NC_012418. 157 RDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAG--RNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVR-VGEEGRWP 233 (510) Q Consensus 157 ~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~--~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~-~~~~~~y~ 233 (510) +|++|+||+|||||+||+++|+++|+.+..+.. ..++++++|+|||||+|+++.+ ++++|++++|+. ++.+|.|+ T Consensus 156 ~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~--~~~~~~~~~~~~~~~~~s~~g 233 (522) T protein:vir:10 156 RDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSG--RWVWHQEAFDKIIPDSRSTAP 233 (522) T ss_pred eCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCC--ceEEEEccCCccccccccccc Confidence 999999999999999999999999998765433 2468999999999999986643 477888888865 56788888 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcc Q lcl|NC_012418. 
234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~ 313 (510) +++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|+|+|+|+|+++|.++..+++|.+++|.++ T Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~~~~~~~v~g~~~ 313 (522) T protein:vir:10 234 KNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAKAGNGAIVQGRPE 313 (522) T ss_pred cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccCCCCcceecCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHH Q lcl|NC_012418. 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) ++.+++.++++||+.+++.|++++++|+++||+. .++|+++||||||++|++||+++||||||||++|||.|||+|+|+ T Consensus 314 ~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~-~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 392 (522) T protein:vir:10 314 DVAVIQVGKTADFSTAANMATAIEKRLLEAFLVM-NVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLL 392 (522) T ss_pred cceeecccccccchHHHHHHHHHHHHHHHHHhhc-cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999874 689999999999999999999999999999999999999999999 Q ss_pred HHhhcCCCCCCcc-cccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCH Q lcl|NC_012418. 
394 EVDDALLQGLITK-QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSE 472 (510) Q Consensus 394 il~~~~l~~~~~~-~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~ 472 (510) ||++.|++|++|+ .+++.+|+|+|+|+|+|+++++.+|.+.++.+.+++++.+.||+|++++++|+++|||++.|+||+ T Consensus 393 il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~ 472 (522) T protein:vir:10 393 VLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTE 472 (522) T ss_pred HHHhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCH Confidence 9999886665555 458999999999999999999999999999888778888999999999999999999988899999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHh-hhhhhhhhc--ccCC Q lcl|NC_012418. 473 EELQAEAEQRRQQAAQAQAAQETLL-EGASDMTNA--LAGV 510 (510) Q Consensus 473 ~ev~~~r~q~~q~~~~~~~~~~~~~-~ga~~~~~~--~ag~ 510 (510) |||+++||+++|+++++++++++.+ +|...++++ +-|- T Consensus 473 eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 513 (522) T protein:vir:10 473 QQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQLM 513 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHHH Confidence 9999999999888888877766533 333333222 2222 No 12 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=1.2e-163 Score=913.70 Aligned_cols=498 Identities=31% Similarity=0.451 Sum_probs=453.3 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||++|| |++|+++|+||++||+|++|++++++. 
+.+++|||||++|+++|||||||+||||++|||||+ T Consensus 11 ~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 88 (515) T protein:vir:70 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE--TSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) T ss_pred CHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcc--cccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 5789999999995 999999999999999999998776543 446899999999999999999999999999999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeC Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d 158 (510) ++|+.++.++..+.+..++++||+.||+.++.+|++||||.++|++|+||++|||+++|++++. +|++|||++|||++| T Consensus 89 ~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~-~~~~~pl~~y~v~~d 167 (515) T protein:vir:70 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-AMSAVPMHHYVVNRD 167 (515) T ss_pred cChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCCC-CeEEEEcCeEEEeeC Confidence 9999988888888889999999999999999999999999999999999999999999998764 699999999999999 Q ss_pred CCCCeEEEEEEEeecHHHhhHHhhHhhhh--hhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccccccccc Q lcl|NC_012418. 159 ATGRWMDIVLKQRYKSKDLDEAYKQDLMR--AGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHL 236 (510) Q Consensus 159 ~~G~vd~i~r~~~~t~~~l~~~~~~~~~~--~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+||||||||+||+++|+++|++.... 
...+++|+++|+|||+|+|+++ +|+++|++++|++++.+|+|+|++ T Consensus 168 ~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~---~~~~~~~e~d~~~~~~es~y~~~e 244 (515) T protein:vir:70 168 TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE---GFWKINQSADDIPVGKESRIKSEK 244 (515) T ss_pred CCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCC---CceEEEEecCceeecccccccccc Confidence 99999999999999999999999977532 2345679999999999999864 589999999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccccc Q lcl|NC_012418. 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~ 316 (510) |||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|+|+++|+|+++|+++..+++|.++||.+++++ T Consensus 245 ~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~~~g~iv~g~~~~v~ 324 (515) T protein:vir:70 245 LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVAEDIH 324 (515) T ss_pred CCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccccCCceeecCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCcccchHHHHHHHHHHHHHHHHHhhcc-ccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+.+++.|++++++|+++||+| +.++|+++||||||++|++||+++||||||||++|||.|||.|++. 
T Consensus 325 ~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~-- 402 (515) T protein:vir:70 325 IVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-- 402 (515) T ss_pred eeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH-- Confidence 9999999999999999999999999999997 5678999999999999999999999999999999999999998753 Q ss_pred hhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChh-hHhhccCHHHHHHHHHHHcCCCHhHccCCHHH Q lcl|NC_012418. 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIA-QLDPRISLPKMMDTIWAAFSVDTSQFYKSEEE 474 (510) Q Consensus 396 ~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~-q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~e 474 (510) +.+|++|++++++.+|+|+++|+|+|+++++.++.|+++.+++.+ ++.+.||+|++++++++.+|+|++ ++||+|| T Consensus 403 --~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~-~~rs~ee 479 (515) T protein:vir:70 403 --EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELP-FLKSEEE 479 (515) T ss_pred --hhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCcc-ccCCHHH Confidence 335778888899999999999999999999999999999887754 577889999999999999999985 9999999 Q ss_pred HHHHHHHHHHHHHHHHHhhHHHhh-hhhhhhhcccC Q lcl|NC_012418. 
475 LQAEAEQRRQQAAQAQAAQETLLE-GASDMTNALAG 509 (510) Q Consensus 475 v~~~r~q~~q~~~~~~~~~~~~~~-ga~~~~~~~ag 509 (510) |+++|+|++|+++++++++.+.++ +.+..+..--| T Consensus 480 v~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 480 MQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 999999988877776666544322 12222222222 No 13 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=100.00 E-value=1.2e-163 Score=913.85 Aligned_cols=497 Identities=30% Similarity=0.445 Sum_probs=454.2 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||++|| |++|+++|+||++||+|++|++++++. +.+++|||||++|+++|||||||+||||++|||||+ T Consensus 12 ~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~L~ 89 (516) T protein:vir:10 12 KRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNE--TSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 89 (516) T ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcc--cccccccchHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 4578999999995 999999999999999999998876543 446899999999999999999999999999999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeC Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d 158 (510) ++|..+++++..+.+.+++++||++||++++.+|++||||.++|++|+||++|||+++|+|++. 
+|++|||++|||++| T Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~-~~~~~pl~~y~v~~d 168 (516) T protein:vir:10 90 LTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG-AISAIPMHHYVVNRD 168 (516) T ss_pred CChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC-CeEEEEcCeEEEeeC Confidence 9999998888888899999999999999999999999999999999999999999999998764 699999999999999 Q ss_pred CCCCeEEEEEEEeecHHHhhHHhhHhhh--hhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccccccccc Q lcl|NC_012418. 159 ATGRWMDIVLKQRYKSKDLDEAYKQDLM--RAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHL 236 (510) Q Consensus 159 ~~G~vd~i~r~~~~t~~~l~~~~~~~~~--~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+|+++|||++||+++|+++|++... +...+++|+++++|||+|++++++ ||++|+++|+++++.+|+|+|++ T Consensus 169 ~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~---~~~~~~~~d~~~~~~~s~~~~~e 245 (516) T protein:vir:10 169 TNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEG---FWELKQSADDIPVGKVSKIKSEK 245 (516) T ss_pred CCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCC---ceEEEEeeCceeecccccccccc Confidence 9999999999999999999999976542 334566899999999999998765 79999999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccccc Q lcl|NC_012418. 
237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~ 316 (510) |||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|+|+|+|+|+++|+++..+++|.++||.+++++ T Consensus 246 ~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~~~~g~~~~v~ 325 (516) T protein:vir:10 246 LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIH 325 (516) T ss_pred CCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhccCCCceeecCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCcccchHHHHHHHHHHHHHHHHHhhcc-ccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+.+++.|++++++|+++||++ +.++|+++||||||++|++||+++||||||||++|||.|||+|++..+ T Consensus 326 ~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~ 405 (516) T protein:vir:10 326 IVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA 405 (516) T ss_pred eeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhh Confidence 9999999999999999999999999999998 567899999999999999999999999999999999999999998655 Q ss_pred hhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcCh-hhHhhccCHHHHHHHHHHHcCCCHhHccCCHHH Q lcl|NC_012418. 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEE 474 (510) Q Consensus 396 ~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~e 474 (510) +|++|++.+++.+|+||++|+|+|+++++.++.|+++.+++. 
+++.++||+|+++|++++++|||++ ++||+|| T Consensus 406 ----~p~~P~~lv~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~-~irs~ee 480 (516) T protein:vir:10 406 ----GDSFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELP-FLKSAEE 480 (516) T ss_pred ----CCCCChhhcCcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChh-ccCCHHH Confidence 478888899999999999999999999999999999988874 5688999999999999999999985 9999999 Q ss_pred HHHHHHHHHHHHHHHHHhhHH--Hhhhhhhhhhccc Q lcl|NC_012418. 475 LQAEAEQRRQQAAQAQAAQET--LLEGASDMTNALA 508 (510) Q Consensus 475 v~~~r~q~~q~~~~~~~~~~~--~~~ga~~~~~~~a 508 (510) |+++|+|++++++.+.+++.+ +..|++.....-+ T Consensus 481 v~~~r~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 481 MEQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccchhhhhhhcC Confidence 999998887655544433322 2234444333333 No 14 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=2.7e-162 Score=906.36 Aligned_cols=506 Identities=31% Similarity=0.416 Sum_probs=453.9 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =+++|++||++|| |++|+++|+||++||+|++|++++++++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 10 ~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 88 (535) T protein:vir:15 10 GEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFPM-QSWMKLT 88 (535) T ss_pred chHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 4677889999996 899999999999999999999999888888899999999999999999999999976 7999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEeceEEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSYAV 155 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~~~i 155 (510) ++|..+++....+...+++++||++||++|+.+|++||||.++|++|+||++|||+++|++++.+ +|++|||++||| T Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v 168 (535) T protein:vir:15 89 ISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVV 168 (535) T ss_pred cChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEE Confidence 99999999888888999999999999999999999999999999999999999999999987654 599999999999 Q ss_pred eeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-cccccccc Q lcl|NC_012418. 
156 RRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGRWPI 234 (510) Q Consensus 156 ~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~y~~ 234 (510) ++|++|+||+|||||+||+++|+++|++++.+...+++++++|+|||+|+++++ +++ +++|++++|..+ +.+++|+| T Consensus 169 ~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~-~~~~~e~~g~~~~~~~~~~~~ 246 (535) T protein:vir:15 169 QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEE-SGD-YLKYEEVEDVEIDGSDATYPT 246 (535) T ss_pred eeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecC-CCc-EEEEEEeeCcccccccccccc Confidence 999999999999999999999999999999888888899999999999998755 344 567888988776 68899999 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccc Q lcl|NC_012418. 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEA 314 (510) Q Consensus 235 ~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~ 314 (510) ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+++|+|+++|.++..+++|.+++|.+++ T Consensus 247 ~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~g~~v~g~~~~ 326 (535) T protein:vir:15 247 DAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRED 326 (535) T ss_pred ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHH Q lcl|NC_012418. 
315 VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 315 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) +++++.++++||+++++.|++++++|+++||++.+ ++|+++||||||++|++|++++||||||||++|||.|||+|+|+ T Consensus 327 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~ 406 (535) T protein:vir:15 327 IDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLK 406 (535) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999855 69999999999999999999999999999999999999999999 Q ss_pred HHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCH Q lcl|NC_012418. 394 EVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSE 472 (510) Q Consensus 394 il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~ 472 (510) +|+++|++| +|++++++.+++++..++|+++++++.+|.+.++.+.| .++++.||+|++++++++++|||++.|+||+ T Consensus 407 il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P-~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~ 485 (535) T protein:vir:15 407 QLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAP-MQGDPDINLAVIKLRIANAIGIDTSGILLTD 485 (535) T ss_pred HHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcCh-hhhhccCCHHHHHHHHHHHcCCChhhhcCCH Confidence 998876555 66667788888777777777777777777777766544 5677789999999999999999998899999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHh-hhhh--------hhhhcccCC Q lcl|NC_012418. 473 EELQAEAEQRRQQAAQAQAAQETLL-EGAS--------DMTNALAGV 510 (510) Q Consensus 473 ~ev~~~r~q~~q~~~~~~~~~~~~~-~ga~--------~~~~~~ag~ 510 (510) ||++++++|+++++++++++.++.+ .++. 
++....+|+ T Consensus 486 eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~~~~~~~~~~g~ 532 (535) T protein:vir:15 486 EQKQALMMQDAAQTGIENAAATGGAGVGALATSSPEAMQGAAAQAGL 532 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccchhccChHHHHHHHhccCC Confidence 9999999877666655555543221 1111 122223444 No 15 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=3.6e-162 Score=905.65 Aligned_cols=504 Identities=28% Similarity=0.432 Sum_probs=454.4 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||++|| |++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 10 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 88 (543) T protein:vir:88 10 AEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFPL-QSWMKLK 88 (543) T ss_pred hHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 4778999999996 899999999999999999999998888888889999999999999999999999987 7999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCc------EEEEEece Q lcl|NC_012418. 
79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEAT------VVAWSLRS 152 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~------~~~~pl~~ 152 (510) ++|..+.+....+.+.++|+.||++||++++.+|++||||.++|++|+||++|||+++|++++.++ |+.|||++ T Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~~pl~~ 168 (543) T protein:vir:88 89 VSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKLYTLHN 168 (543) T ss_pred cChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceecceEEeEcce Confidence 999999888777888999999999999999999999999999999999999999999999987542 67799999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-ccccc Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGR 231 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~ 231 (510) |+|++|++|+||+||||+++|+++|+++|++++.+. .+++|+++|+|||+|+|+++++ + +++|++++|+.+ +.+|+ T Consensus 169 y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~-~~~~p~~~~~v~~~V~pr~~~~-~-~~~~~~~~~~~v~~~~~~ 245 (543) T protein:vir:88 169 HVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGG-QEYKPEQELEVYTHIYIDDESG-D-FLSYQEIEGVEVDGSDGQ 245 (543) T ss_pred EEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHH-hhcCCccceEEEEEEEeecCCC-c-ccccccccCeeeecCCCc Confidence 999999999999999999999999999999887554 4678999999999999987653 3 557889999987 67888 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCC Q lcl|NC_012418. 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG 311 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~ 311 (510) |++++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++|.++..+++|.++||. 
T Consensus 246 ~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~~~~g~~v~g~ 325 (543) T protein:vir:88 246 YPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTGDFVAGR 325 (543) T ss_pred cccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCceeecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHH Q lcl|NC_012418. 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) Q Consensus 312 ~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) ++++.+++.++++||+++++.|++++++|+++||++.+ ++|+++||||||++|++||+++||||||||++|||.|||+| T Consensus 326 ~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r 405 (543) T protein:vir:88 326 KADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRV 405 (543) T ss_pred CCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999855 59999999999999999999999999999999999999999 Q ss_pred HHHHHhhcCCC-CCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHcc Q lcl|NC_012418. 
391 CLSEVDDALLQ-GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFY 469 (510) Q Consensus 391 ~~~il~~~~l~-~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~ 469 (510) +|+||+++|++ ++|++++++.++++|++|+|+++++++.++.++++.+++ +++.++||+|++++++++++|||+..|+ T Consensus 406 ~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~-p~vld~id~d~~~~~~a~~~Gv~~~~i~ 484 (543) T protein:vir:88 406 LLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQ-LNGDPDLNVNNIKLRLANAIGIDTAGLL 484 (543) T ss_pred HHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccc-hhhhccCCHHHHHHHHHHHhCCChhhhc Confidence 99999887654 566667899999999999999999999999999999987 5677899999999999999999888899 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhh----------hhhhcccCC Q lcl|NC_012418. 470 KSEEELQAEAEQRRQQAAQAQAAQETLLEGAS----------DMTNALAGV 510 (510) Q Consensus 470 rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~----------~~~~~~ag~ 510 (510) |++||++++|+|+++++++++++++ ++.|.. ++....||. T Consensus 485 r~~~e~~~~~~q~~~q~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (543) T protein:vir:88 485 LTEAEKAQAQSQEMLKQGGLNAAAG-IGSGVAAQATASPEAMESAMDTAGV 534 (543) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHH-HhhchhhhhccChHHHHHHhhhcCC Confidence 9999999999877654444333321 111111 011112222 No 16 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=4e-162 Score=905.41 Aligned_cols=500 Identities=14% Similarity=0.085 Sum_probs=441.0 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccc------cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL------MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~------~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) +.++|++||+.|+ |++|+++|+||++||+|++ .+.++..+..+.+++|||||++|+++||||||++||||++ T Consensus 8 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~~ltpp~~ 87 (549) T protein:vir:10 8 ILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDSMITPATQ 87 (549) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHhhccCCCC Confidence 7788999999996 9999999999999999986 2335556777788999999999999999999999999999 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHH--HhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRL--FQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVA 147 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l--~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~ 147 (510) |||||+++|+.+.+ ..++++||++||++++..+ ++||||.++|++|+||++|||+++|++++.+ +|++ T Consensus 88 ~wF~l~~~~~~~~e-------~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~f~~ 160 (549) T protein:vir:10 88 LWHRLKTGNDALNE-------IASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIVYRN 160 (549) T ss_pred ccccccCCccchhh-------hhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeEEEE Confidence 99999999987654 3579999999999999955 5899999999999999999999999997754 5899 Q ss_pred EEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhh----hhhhhccCCCceEEEEEEEEeec--------CCCceEE Q lcl|NC_012418. 148 WSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDL----MRAGRNLSGSGSVDLYTHVQRKK--------GTAMEYA 215 (510) Q Consensus 148 ~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~----~~~~~~~~~~~~v~i~~~v~~~~--------~~~~p~~ 215 (510) |||++|||++|++|+||+|||||+||+++|.++|+.+. .++..+++|+++|+|||+|+|++ .++|||. 
T Consensus 161 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~ 240 (549) T protein:vir:10 161 VPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDGRNMQFA 240 (549) T ss_pred EEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccccccccCceE Confidence 99999999999999999999999999999999887643 23445678999999999999874 4689999 Q ss_pred EEEEEecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccc Q lcl|NC_012418. 216 ELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV 295 (510) Q Consensus 216 sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~ 295 (510) |||++.++.+++++||| ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+++++|+++ T Consensus 241 sv~~e~~~~~il~esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~ 318 (549) T protein:vir:10 241 SYWLDEGRDRIVQNSGF--RTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLD 318 (549) T ss_pred EEEEEecCCEeeccCCc--ccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc Confidence 99999999999999998 7899999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcccc--CCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 
296 VDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN--QRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 296 p~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~--~~~~~~~TAtEi~~r~~E~~~~LG 373 (510) |.++..++.+.+..|..++....+++.+++|+++++.|++++++|+++||+|++ ++++++||||||++|++|++++|| T Consensus 319 ~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LG 398 (549) T protein:vir:10 319 GFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLA 398 (549) T ss_pred cceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhh Confidence 998887776665555444444444556789999999999999999999999964 589999999999999999999999 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccc----cceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh--- Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQH----KPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP--- 446 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~----~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~--- 446 (510) ||||||++|||.|||+|+|+||+++|++|++|+++ ....|+|+|+|+|+|+.+++.++.|+++++++++|++| T Consensus 399 pv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~l 478 (549) T protein:vir:10 399 PTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAA 478 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHH Confidence 99999999999999999999999988877766654 24569999999999999999999999988888766665 Q ss_pred -ccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 
447 -RISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 447 -~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) +||+|++++++++++|||++ ++||++||+++|++++|+++++++++++..++..+++.+.+-- T Consensus 479 d~id~d~~~~~~a~~~Gvp~~-~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~~t 542 (549) T protein:vir:10 479 KVPNGARIARLLADYGGVPVE-AMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDAQT 542 (549) T ss_pred hcCCHHHHHHHHHHhcCCCcc-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcC Confidence 69999999999999999985 9999999999999988888888887776665555433333222 No 17 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=3.4e-161 Score=900.35 Aligned_cols=503 Identities=26% Similarity=0.360 Sum_probs=451.7 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) ||++|++||++|| |++|+++|+||++||+|++++++++.++.+..++|||||++|+++|||||||+||||++|||||. T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFKLQ 80 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 9999999999996 89999999999999999999999988888889999999999999999999999999999999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeC Q lcl|NC_012418. 
79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d 158 (510) ++|+++++....+....+++.||++||++++.+|++||||.++|++|+||++|||+++|++++ +|++|||++|||++| T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~--~~~~~pl~~y~v~~d 158 (555) T protein:vir:17 81 INDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKK--NLKLYPLDRFVVSRD 158 (555) T ss_pred cCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCC--ceeEEEcCeEEEeeC Confidence 999999888777778889999999999999999999999999999999999999999999876 588999999999999 Q ss_pred CCCCeEEEEEEEeecHHHhhHHhhHhhh----hhhhcc-----------------CCCceEEEEEEEEeecCCCceEEEE Q lcl|NC_012418. 159 ATGRWMDIVLKQRYKSKDLDEAYKQDLM----RAGRNL-----------------SGSGSVDLYTHVQRKKGTAMEYAEL 217 (510) Q Consensus 159 ~~G~vd~i~r~~~~t~~~l~~~~~~~~~----~~~~~~-----------------~~~~~v~i~~~v~~~~~~~~p~~sv 217 (510) ++|+||+|||||+||+++|+++|+++.. +...++ +++.++++|+++.++++ ++++ T Consensus 159 ~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~----~~~~ 234 (555) T protein:vir:17 159 GEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDG----QVKW 234 (555) T ss_pred CCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccCC----eeEE Confidence 9999999999999999999999987532 122222 33445566666555443 5889 Q ss_pred EEEecCeee-ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch Q lcl|NC_012418. 
218 YHEIDGVRV-GEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV 296 (510) Q Consensus 218 ~~e~~~~~~-~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p 296 (510) |++++|+.+ +.++.|+|++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++| T Consensus 235 ~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~ 314 (555) T protein:vir:17 235 HQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKP 314 (555) T ss_pred EEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCc Confidence 999999887 5567777799999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHHHHHHHHHHHHHhchhH Q lcl|NC_012418. 297 DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTY 376 (510) Q Consensus 297 ~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~ 376 (510) .++..+++|.+++|.+++++|++.++++||+.+++.|++++++|+++||++ ..+|+++||||||++|++||+++||||| T Consensus 315 ~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~-~~~d~~r~TAtEV~~r~~E~~~~LGpv~ 393 (555) T protein:vir:17 315 QNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML-QVRQSERTTATEVQATVQELNEQIGGIY 393 (555) T ss_pred ceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc-CCCCcccchHHHHHHHHHHHHHHHhHHH Confidence 999999999999999999999999999999999999999999999999985 5689999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHH Q lcl|NC_012418. 
377 SLLAENLQSPLAYVCLSEVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMD 455 (510) Q Consensus 377 ~rl~~E~l~Pli~r~~~il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~ 455 (510) +||++|||.|||+|+|+||++.|++| +|++.+++.+++++..|+|+++++++.+|.+.++.+.+++++.++||+|++++ T Consensus 394 ~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~ 473 (555) T protein:vir:17 394 SNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIK 473 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHH Confidence 99999999999999999999988655 55556789999999999999999999999999998888888999999999999 Q ss_pred HHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHh-hhhhhhhhc-----------------------ccCC Q lcl|NC_012418. 456 TIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLL-EGASDMTNA-----------------------LAGV 510 (510) Q Consensus 456 ~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~-~ga~~~~~~-----------------------~ag~ 510 (510) .|++++|||+..|+||+||++++||++++++++++.+++++. +|+..++.. +.|. T Consensus 474 ~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~ 552 (555) T protein:vir:17 474 RLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQEGAQDAGAAESETSSAEAQ 552 (555) T ss_pred HHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchhhhhHHHHHHhhcCCcccc Confidence 999999998888999999999999988877777776655433 333222221 1111 No 18 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=8.7e-161 Score=898.10 Aligned_cols=506 Identities=31% Similarity=0.422 Sum_probs=455.3 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =+++|++||++|| |++|+++|+||++||+|++|++++++++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 10 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 88 (535) T protein:vir:33 10 GEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFPM-QSWMKLT 88 (535) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 4677889999996 899999999999999999999999988888899999999999999999999999976 7999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEeceEEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSYAV 155 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~~~i 155 (510) ++|..+.+....+...+++++||++||++++.+|++||||.++|++|+||++|||+++|++++.+ +|++|||++||| T Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v 168 (535) T protein:vir:33 89 ISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVV 168 (535) T ss_pred cChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEE Confidence 99999999988889999999999999999999999999999999999999999999999987754 599999999999 Q ss_pred eeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-cccccccc Q lcl|NC_012418. 156 RRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGRWPI 234 (510) Q Consensus 156 ~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~y~~ 234 (510) ++|++|+||+|||||+||+++|+++|+.+..+...++++++++++||||++++ ++++|. 
+|++++|..+ +.+++|+| T Consensus 169 ~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~ 246 (535) T protein:vir:33 169 QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDE-ESGDYL-KYEEVEDVEIDGSDATYPT 246 (535) T ss_pred eeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeC-CCCcEE-EEEEEeCcccccccccccc Confidence 99999999999999999999999999999888888889999999999999854 456654 5668888776 78899999 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCccc Q lcl|NC_012418. 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEA 314 (510) Q Consensus 235 ~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~ 314 (510) ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+++|+|+++|.++..+++|.+++|.+++ T Consensus 247 ~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~g~~v~g~~~~ 326 (535) T protein:vir:33 247 DAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRED 326 (535) T ss_pred ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHH Q lcl|NC_012418. 
315 VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 315 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) +++++.++++||+++++.|++++++|+++||++.+ ++|+++||||||++|++|++++||||||||++|||.|||+|+|+ T Consensus 327 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~ 406 (535) T protein:vir:33 327 IDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLK 406 (535) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999855 69999999999999999999999999999999999999999999 Q ss_pred HHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCH Q lcl|NC_012418. 394 EVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSE 472 (510) Q Consensus 394 il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~ 472 (510) +|+++|++| +|++++++.+++++..++|+++++++.+|.+.++.+.| .++++.||+|++++++++++|||++.|+||+ T Consensus 407 il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P-~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ 485 (535) T protein:vir:33 407 QLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAP-MQGDPDINLAVIKLRIANAIGIDTSGILLTD 485 (535) T ss_pred HHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhCh-hhhhccCCHHHHHHHHHHHcCCCHhHhcCCH Confidence 998876555 66667788888777777777777777777777766544 5667789999999999999999998899999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHH-hhh--------hhhhhhcccCC Q lcl|NC_012418. 473 EELQAEAEQRRQQAAQAQAAQETL-LEG--------ASDMTNALAGV 510 (510) Q Consensus 473 ~ev~~~r~q~~q~~~~~~~~~~~~-~~g--------a~~~~~~~ag~ 510 (510) ||++++++|+++++++++++.++. 
..+ ++++....+|+ T Consensus 486 ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~ 532 (535) T protein:vir:33 486 EQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAMQGAAAKAGL 532 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhHHHHHHhccC Confidence 999999988776666666554321 111 12233444555 No 19 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=1.2e-159 Score=891.89 Aligned_cols=502 Identities=29% Similarity=0.445 Sum_probs=448.8 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =+++|++||++|| |++|+++|+||++||+|++|+.+++.++.+..++|||||++|+++|||||||+||| ++|||||. T Consensus 8 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP-~~~WFrl~ 86 (522) T protein:vir:94 8 AAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFP-QSPWMRLT 86 (522) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCC-CCcccccc Confidence 4788999999996 89999999999999999999999888888888999999999999999999999996 67999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC----cEEEEEeceEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----TVVAWSLRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~----~~~~~pl~~~~ 154 (510) +.|..+++.........++++||++||++|+++|++||||.++|++|+||++|||+++|++++.. 
+|++|||++|| T Consensus 87 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~ 166 (522) T protein:vir:94 87 VSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYV 166 (522) T ss_pred cchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEE Confidence 99988887777777888999999999999999999999999999999999999999999976532 48999999999 Q ss_pred EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-ccccccc Q lcl|NC_012418. 155 VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGRWP 233 (510) Q Consensus 155 i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~y~ 233 (510) |++|++|+||+|||||++++++|++++++.+.+ .+++|+++|+|||+|+|++++ +++|++++|+.+ +.+|+|+ T Consensus 167 v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~--~~~~p~~~v~v~~~v~~~~~~----~~~~~~~~g~~~~~~~~~~~ 240 (522) T protein:vir:94 167 VQRDAFGNILQIVTIDKVAFSALPEDVKSQLNA--DDYEPDTELEVYTHIYRQDDE----YLRYEEVEGIEVTGTDGSYP 240 (522) T ss_pred EeeCCCcCeEEEeeeeeccHHhcchHHHHHHhc--ccCCccceEEEEEEEEeeCCc----eeEEeeccCceecccCCCCc Confidence 999999999999999999999999999988743 346789999999999999887 567888999876 7889999 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcc Q lcl|NC_012418. 
234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~ 313 (510) |++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++|+++..+++|.++||.++ T Consensus 241 ~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~g~~v~g~~~ 320 (522) T protein:vir:94 241 LTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEFVAGRVE 320 (522) T ss_pred cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCCceeecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHH Q lcl|NC_012418. 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) +++++++++++||+++++.|++++++|+++||++.+ ++|+++||||||++|++|++++||||||||++|||.|||+|+| T Consensus 321 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~ 400 (522) T protein:vir:94 321 DINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLM 400 (522) T ss_pred cceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999855 6999999999999999999999999999999999999999999 Q ss_pred HHHhhcCCC-CCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCC Q lcl|NC_012418. 
393 SEVDDALLQ-GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS 471 (510) Q Consensus 393 ~il~~~~l~-~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs 471 (510) ++|+++|++ ++|++++++.++++|..++|+++++++.+|.+.++.++| ..++++||+|++++++++++|||+..|+|| T Consensus 401 ~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P-~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~ 479 (522) T protein:vir:94 401 NQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQP-LSQDPDINLPTLKLRLLNALGIDTAGLLLT 479 (522) T ss_pred HHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccc-hhhhhcCCHHHHHHHHHHHcCCChhhccCC Confidence 999887755 556667888888888888888888888888888877665 344678999999999999999988889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHH-Hhhhh--hhhhhcccCC Q lcl|NC_012418. 472 EEELQAEAEQRRQQAAQAQAAQET-LLEGA--SDMTNALAGV 510 (510) Q Consensus 472 ~~ev~~~r~q~~q~~~~~~~~~~~-~~~ga--~~~~~~~ag~ 510 (510) ++|++++++|+++++++++++.+. +..++ .....+-+++ T Consensus 480 ~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 521 (522) T protein:vir:94 480 QDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDMAQ 521 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhhhc Confidence 999999988766555544444322 22222 2222233333 No 20 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=1.7e-159 Score=891.05 Aligned_cols=500 Identities=15% Similarity=0.137 Sum_probs=435.7 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccc---cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) -+++|++||++|+ |++||++|+||++||+|++ +.+++++++.+.+++|||||++|+++|||||||+||||++||| T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF 85 (555) T protein:vir:10 6 ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSPARPWF 85 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 7899999999996 8999999999999999984 5677778888889999999999999999999999999999999 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEece Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||++.|+++.+ ..++++||++||++++++|++||||.++|++|+||++|||+++|++++.. +|++|||++ T Consensus 86 ~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:10 86 RLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 99999887653 46799999999999999999999999999999999999999999987644 588899999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhh-----hhhhhccCCCceEEEEEEEEeec--------CCCceEEEEEE Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDL-----MRAGRNLSGSGSVDLYTHVQRKK--------GTAMEYAELYH 219 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~v~i~~~v~~~~--------~~~~p~~sv~~ 219 (510) |||++|+.|+||+|||||+||+++|.++|+.+. ++...+++++++|+|+|+|+|+. 
.++|||.|||| T Consensus 159 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:10 159 YAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred eEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 999999999999999999999999999887543 33333444567899999999874 35799999999 Q ss_pred E--ecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh Q lcl|NC_012418. 220 E--IDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 220 e--~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~ 297 (510) + ++|++++++||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||++++++.+++- T Consensus 239 ~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~ 316 (555) T protein:vir:10 239 EPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI 316 (555) T ss_pred EeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc Confidence 7 467889999998 689999999999999999999999999999999999999999999999999999999988876 Q ss_pred hhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc----ccCCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 
298 DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~~TAtEi~~r~~E~~~~LG 373 (510) ++..++.+.+.+|..++....+.++.+||+.+++.|++++++|+++||.| +.++|+++||||||++|++|++++|| T Consensus 317 ~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG 396 (555) T protein:vir:10 317 STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLG 396 (555) T ss_pred eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhh Confidence 66666666677887776544456677899999999999999999999988 44689999999999999999999999 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc-c-eeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh----c Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK-P-AIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP----R 447 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~-~-~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~----~ 447 (510) |||+||++|||.|||+|+|+||+++|++|++|+.+. + ..|+|+|+|+|+|+..++.++.|+++++++++|++| + T Consensus 397 ~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~ 476 (555) T protein:vir:10 397 PVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDK 476 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhc Confidence 999999999999999999999999988777777654 3 448999999999999998888888877776655544 6 Q ss_pred cCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhh-----hhhhhhcccCC Q lcl|NC_012418. 448 ISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEG-----ASDMTNALAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~g-----a~~~~~~~ag~ 510 (510) ||+|++++++++++|||+ .++||++||+++|+||++++++++++++..++. 
.+.++.+.+|+ T Consensus 477 id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~ 543 (555) T protein:vir:10 477 FDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNA 543 (555) T ss_pred CCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchh Confidence 999999999999999998 599999999999999888777666654433221 11133344444 No 21 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=1.7e-159 Score=891.05 Aligned_cols=500 Identities=15% Similarity=0.137 Sum_probs=435.7 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccc---cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) -+++|++||++|+ |++||++|+||++||+|++ +.+++++++.+.+++|||||++|+++|||||||+||||++||| T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF 85 (555) T protein:vir:98 6 ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSPARPWF 85 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 7899999999996 8999999999999999984 5677778888889999999999999999999999999999999 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEece Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||++.|+++.+ ..++++||++||++++++|++||||.++|++|+||++|||+++|++++.. 
+|++|||++ T Consensus 86 ~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:98 86 RLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 99999887653 46799999999999999999999999999999999999999999987644 588899999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhh-----hhhhhccCCCceEEEEEEEEeec--------CCCceEEEEEE Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDL-----MRAGRNLSGSGSVDLYTHVQRKK--------GTAMEYAELYH 219 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~v~i~~~v~~~~--------~~~~p~~sv~~ 219 (510) |||++|+.|+||+|||||+||+++|.++|+.+. ++...+++++++|+|+|+|+|+. .++|||.|||| T Consensus 159 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:98 159 YAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred eEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 999999999999999999999999999887543 33333444567899999999874 35799999999 Q ss_pred E--ecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh Q lcl|NC_012418. 
220 E--IDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 220 e--~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~ 297 (510) + ++|++++++||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||++++++.+++- T Consensus 239 ~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~ 316 (555) T protein:vir:98 239 EPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI 316 (555) T ss_pred EeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc Confidence 7 467889999998 689999999999999999999999999999999999999999999999999999999988876 Q ss_pred hhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc----ccCCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 298 DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~~TAtEi~~r~~E~~~~LG 373 (510) ++..++.+.+.+|..++....+.++.+||+.+++.|++++++|+++||.| +.++|+++||||||++|++|++++|| T Consensus 317 ~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG 396 (555) T protein:vir:98 317 STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLG 396 (555) T ss_pred eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhh Confidence 66666666677887776544456677899999999999999999999988 44689999999999999999999999 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc-c-eeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh----c Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK-P-AIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP----R 447 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~-~-~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~----~ 447 (510) |||+||++|||.|||+|+|+||+++|++|++|+.+. 
+ ..|+|+|+|+|+|+..++.++.|+++++++++|++| + T Consensus 397 ~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~ 476 (555) T protein:vir:98 397 PVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDK 476 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhc Confidence 999999999999999999999999988777777654 3 448999999999999998888888877776655544 6 Q ss_pred cCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhh-----hhhhhhcccCC Q lcl|NC_012418. 448 ISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEG-----ASDMTNALAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~g-----a~~~~~~~ag~ 510 (510) ||+|++++++++++|||+ .++||++||+++|+||++++++++++++..++. .+.++.+.+|+ T Consensus 477 id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~ 543 (555) T protein:vir:98 477 FDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNA 543 (555) T ss_pred CCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchh Confidence 999999999999999998 599999999999999888777666654433221 11133344444 No 22 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=1.7e-159 Score=891.05 Aligned_cols=500 Identities=15% Similarity=0.137 Sum_probs=435.7 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccc---cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) -+++|++||++|+ |++||++|+||++||+|++ +.+++++++.+.+++|||||++|+++|||||||+||||++||| T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF 85 (555) T protein:vir:10 6 ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSPARPWF 85 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 7899999999996 8999999999999999984 5677778888889999999999999999999999999999999 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEece Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||++.|+++.+ ..++++||++||++++++|++||||.++|++|+||++|||+++|++++.. +|++|||++ T Consensus 86 ~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:10 86 RLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 99999887653 46799999999999999999999999999999999999999999987644 588899999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhh-----hhhhhccCCCceEEEEEEEEeec--------CCCceEEEEEE Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDL-----MRAGRNLSGSGSVDLYTHVQRKK--------GTAMEYAELYH 219 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~v~i~~~v~~~~--------~~~~p~~sv~~ 219 (510) |||++|+.|+||+|||||+||+++|.++|+.+. ++...+++++++|+|+|+|+|+. 
.++|||.|||| T Consensus 159 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:10 159 YAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred eEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 999999999999999999999999999887543 33333444567899999999874 35799999999 Q ss_pred E--ecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh Q lcl|NC_012418. 220 E--IDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 220 e--~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~ 297 (510) + ++|++++++||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||++++++.+++- T Consensus 239 ~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~ 316 (555) T protein:vir:10 239 EPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI 316 (555) T ss_pred EeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc Confidence 7 467889999998 689999999999999999999999999999999999999999999999999999999988876 Q ss_pred hhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc----ccCCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 
298 DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~~TAtEi~~r~~E~~~~LG 373 (510) ++..++.+.+.+|..++....+.++.+||+.+++.|++++++|+++||.| +.++|+++||||||++|++|++++|| T Consensus 317 ~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG 396 (555) T protein:vir:10 317 STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLG 396 (555) T ss_pred eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhh Confidence 66666666677887776544456677899999999999999999999988 44689999999999999999999999 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc-c-eeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh----c Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK-P-AIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP----R 447 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~-~-~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~----~ 447 (510) |||+||++|||.|||+|+|+||+++|++|++|+.+. + ..|+|+|+|+|+|+..++.++.|+++++++++|++| + T Consensus 397 ~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~ 476 (555) T protein:vir:10 397 PVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDK 476 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhc Confidence 999999999999999999999999988777777654 3 448999999999999998888888877776655544 6 Q ss_pred cCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhh-----hhhhhhcccCC Q lcl|NC_012418. 448 ISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEG-----ASDMTNALAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~g-----a~~~~~~~ag~ 510 (510) ||+|++++++++++|||+ .++||++||+++|+||++++++++++++..++. 
.+.++.+.+|+ T Consensus 477 id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~ 543 (555) T protein:vir:10 477 FDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNA 543 (555) T ss_pred CCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchh Confidence 999999999999999998 599999999999999888777666654433221 11133344444 No 23 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=1.3e-157 Score=880.60 Aligned_cols=496 Identities=16% Similarity=0.142 Sum_probs=432.0 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCC------CCccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPM------SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~------~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) =+++|++||++|+ |++|+++|+||++||+|+++...+ .....+..++|||||++|+++|||||||+||||++ T Consensus 2 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~~ 81 (547) T protein:vir:10 2 ENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPAT 81 (547) T ss_pred CHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 4678999999996 899999999999999999754221 22235677899999999999999999999999999 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC---C--cEEE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE---A--TVVA 147 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~---~--~~~~ 147 (510) |||||++.|.++.+ ..++++||++||+.|+++|++||||.++|++|+||++|||+++|++++. 
+ +|++ T Consensus 82 ~WF~l~~~d~~~~~-------~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~ 154 (547) T protein:vir:10 82 KWFELAFRDKELNS-------DDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQS 154 (547) T ss_pred cccccccCCccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEE Confidence 99999999886644 4579999999999999999999999999999999999999999997653 1 5899 Q ss_pred EEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhh----hhhhccCCCc---eEEEEEEEEeecC----------- Q lcl|NC_012418. 148 WSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLM----RAGRNLSGSG---SVDLYTHVQRKKG----------- 209 (510) Q Consensus 148 ~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~----~~~~~~~~~~---~v~i~~~v~~~~~----------- 209 (510) |||++|||++|++|+||+|||||+||++||.++|+.+.. ++..++++++ ++++||+|+|+.+ T Consensus 155 ~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~ 234 (547) T protein:vir:10 155 SPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVL 234 (547) T ss_pred eecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCcccccee Confidence 999999999999999999999999999999998875432 2223445544 7999999998743 Q ss_pred --CCceEEEEEEEecC-eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCce Q lcl|NC_012418. 
210 --TAMEYAELYHEIDG-VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLN 286 (510) Q Consensus 210 --~~~p~~sv~~e~~~-~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~ 286 (510) ++|||.|+|++++| ++++++|+| ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++||| T Consensus 235 ~~~~~p~~s~~~e~~~~~~~l~esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 312 (547) T protein:vir:10 235 APTERPFGKKWILKEGAVQLGEEGGY--YEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAI 312 (547) T ss_pred eccccceeEEEEEecCceeeeecCCc--ccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 48999999999886 688999998 6899999999999999999999999999999999999999999999999999 Q ss_pred eeCCCcccchhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-cCCCCCCCCHHHHHHHH Q lcl|NC_012418. 287 LVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITA 365 (510) Q Consensus 287 l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~~TAtEi~~r~ 365 (510) +++|+|++++- ...++|.++.|..++++|++.+ ++|+++++.|++++++|+++||.|+ .++++++||||||++|+ T Consensus 313 ~v~~~g~~~~~--~~~pgg~~~~~~~~~v~pl~~~--~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r~ 388 (547) T protein:vir:10 313 MVTERGLISDI--DLGASGLTVVRDMESMKPFESR--ARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVRY 388 (547) T ss_pred ecccccccccc--eecCCeeeecCCcccceeeecc--cchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHHH Confidence 99999999875 4568888899999999998654 6999999999999999999999996 56899999999999999 Q ss_pred HHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccc-----cceeeecHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_012418. 
366 EEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQH-----KPAIETGLPALSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 366 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~-----~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~ 440 (510) +|++++|||||+||++|||.|||+|+|++|++.|++|++|+++ ....|+|+|+|+|+|++.++.++.|+++++++ T Consensus 389 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~ 468 (547) T protein:vir:10 389 ELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQ 468 (547) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999988877666553 35679999999999999999888888888777 Q ss_pred hhhHhh----ccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 441 IAQLDP----RISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 441 ~~q~~~----~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ++|++| +||+|++++++++++|||++ ++||++||+++|+||+++++++++++.+..+|-+......++- T Consensus 469 laq~~P~vld~id~d~~~~~~a~~~Gvp~~-~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a 541 (547) T protein:vir:10 469 LAEINPEVLDIPDWDEMVRMLGSLLGAPQT-LMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQA 541 (547) T ss_pred hhccChhhhhcCCHHHHHHHHHHHhCCChh-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Confidence 655554 69999999999999999985 9999999999999988877777766555544444433332222 No 24 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=2.6e-156 Score=873.54 Aligned_cols=497 Identities=14% Similarity=0.138 Sum_probs=431.0 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccC---CCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMV---DPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~---~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) -+++|++||++|+ |++||++|+||++||+|++++ ++...++.+..++|||||++|+++|||||||+||||++||| T Consensus 5 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~~~WF 84 (556) T protein:vir:73 5 EKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSPARPWF 84 (556) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 6778999999996 999999999999999998754 33445566778999999999999999999999999999999 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEece Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) +|+++|+.+.+ ..+|++||++||++++++|++||||.++|++|+||++|||+++|++++.. +|++|||++ T Consensus 85 ~l~~~d~~~~~-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l~~ 157 (556) T protein:vir:73 85 KLATPDPDMMD-------YGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPIGS 157 (556) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeecce Confidence 99999887654 45799999999999999999999999999999999999999999987644 588999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHh-----hhhhhhccCCCceEEEEEEEEeec--------CCCceEEEEEE Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQD-----LMRAGRNLSGSGSVDLYTHVQRKK--------GTAMEYAELYH 219 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~-----~~~~~~~~~~~~~v~i~~~v~~~~--------~~~~p~~sv~~ 219 (510) |||++|+.|+||+|||||+||+++|.++|+.+ +++...+++++++|+|+|+|+|+. 
.++|||.|+|| T Consensus 158 ~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 237 (556) T protein:vir:73 158 YYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSVYF 237 (556) T ss_pred eEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccceEEEEEE Confidence 99999999999999999999999999888754 334444545577899999999864 45899999999 Q ss_pred Ee--cCeeeccccccccccCceEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch Q lcl|NC_012418. 220 EI--DGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV 296 (510) Q Consensus 220 e~--~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrg-p~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p 296 (510) +. ++++++++||| ++|||+++||++.+|++|||| |++++|||+|+||.++++.+++++++++|||++++++.+.+ T Consensus 238 ~~~~~~~~vl~esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~ 315 (556) T protein:vir:73 238 ESGGDSDKLLRESGF--DEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQR 315 (556) T ss_pred EecCCCceecccCCc--ccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc Confidence 85 56788999997 789999999999999999999 89999999999999999999999999999999999986654 Q ss_pred hhhccCCCc---eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc----cCCCCCCCCHHHHHHHHHHHH Q lcl|NC_012418. 
297 DDYQDAEMG---DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA----NQRDAERVTAEEVRITAEEAE 369 (510) Q Consensus 297 ~~~~~~~~g---~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~----~~~~~~~~TAtEi~~r~~E~~ 369 (510) - ...++| ...+|+.+++.|++.++ +|++.+.+.|++++++|+++||.|+ .++++++||||||++|++|++ T Consensus 316 ~--~~~pgg~~~~~~~~~~~~i~p~~~~~-~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~ 392 (556) T protein:vir:73 316 V--SLLPGDVTYLDVISGQDGFKPAYLVN-PNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKL 392 (556) T ss_pred e--eeccCccccccCCCCccceeeecccc-ccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHH Confidence 3 333433 33567778889988776 6799999999999999999999885 458999999999999999999 Q ss_pred HHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc--ceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh- Q lcl|NC_012418. 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK--PAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP- 446 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~--~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~- 446 (510) .+|||||+||++|||.|||+|+|+||+++|++|++|+.+. ...|+|+|+|+++|+..++..+.++++++++++|++| T Consensus 393 ~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe 472 (556) T protein:vir:73 393 LMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPE 472 (556) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChh Confidence 9999999999999999999999999999998877777653 3458999999999999998888888777777665554 Q ss_pred ---ccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 447 ---RISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 447 ---~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) +||+|++++.+++++|||+ .|+||++||+++|+||+++++++|++++.+.+...+++.+.++. 
T Consensus 473 ~~d~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~ 538 (556) T protein:vir:73 473 ALDKLDVDQAIDAFSEMSGVSP-TVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQT 538 (556) T ss_pred hHhcCCHHHHHHHHHHHcCCCh-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Confidence 6999999999999999998 59999999999999988888877777666554334444444444 No 25 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=3.2e-156 Score=873.05 Aligned_cols=499 Identities=14% Similarity=0.142 Sum_probs=427.9 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccC---CCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMV---DPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~---~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) .+++|++||+.|+ |++||++|+||++||+|++++ ++.+.++.+..++|||||++|+++|||||||+||||++||| T Consensus 5 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~~~WF 84 (559) T protein:vir:95 5 TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWF 84 (559) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 7888999999996 999999999999999999855 23455667778999999999999999999999999999999 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEece Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||+++|+.+.+ ..++++||++||++++++|++||||.++|++|+||++|||+++|++++.. 
+|++|||++ T Consensus 85 ~l~~~d~~~~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~~l~~ 157 (559) T protein:vir:95 85 RLATPDPEMMD-------YGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPFPIGS 157 (559) T ss_pred ccccCCccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEeecCe Confidence 99999886654 45799999999999999999999999999999999999999999987543 588999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHh-----hhhhhhccCCCceEEEEEEEEeec--------CCCceEEEEEE Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQD-----LMRAGRNLSGSGSVDLYTHVQRKK--------GTAMEYAELYH 219 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~-----~~~~~~~~~~~~~v~i~~~v~~~~--------~~~~p~~sv~~ 219 (510) |||++|+.|+||+|||||+||+++|.++|+.+ +++...++.++++|+|+|+|+|+. .++|||.|+|| T Consensus 158 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~ 237 (559) T protein:vir:95 158 YYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) T ss_pred EEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccceEEEEEE Confidence 99999999999999999999999999988754 334444444466799999999874 35899999999 Q ss_pred Ee--cCeeeccccccccccCceEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch Q lcl|NC_012418. 220 EI--DGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV 296 (510) Q Consensus 220 e~--~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrg-p~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p 296 (510) +. 
++++++++||| ++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++|||++++++.+++ T Consensus 238 e~~~~~~~~l~esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~ 315 (559) T protein:vir:95 238 EVGGDNDKLLRESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR 315 (559) T ss_pred EecCCCceeeecCCc--ccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc Confidence 97 45688999998 789999999999999999999 89999999999999999999999999999999999998877 Q ss_pred hhhccCCCceeecCC-cccccccccCcccchHHHHHHHHHHHHHHHHHhhccc----cCCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_012418. 297 DDYQDAEMGDYVPGG-AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA----NQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 297 ~~~~~~~~g~~~pg~-~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~----~~~~~~~~TAtEi~~r~~E~~~~ 371 (510) .++..++.+.+..+. .+.+.|.+..+ .+++.+...|++++++|+++||.|+ .++++++||||||++|++||+.+ T Consensus 316 ~~l~pgg~~~~~~~~~~~~i~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~ 394 (559) T protein:vir:95 316 ASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLM 394 (559) T ss_pred eeeeccceeeeCCCCCcccceeecccc-cchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHH Confidence 665533322222222 23466665544 5788889999999999999999884 56899999999999999999999 Q ss_pred hchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccc--cceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh--- Q lcl|NC_012418. 
372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQH--KPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP--- 446 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~--~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~--- 446 (510) |||||+||++|||.|||+|+|+||+++|++|++|+.+ ....|+|+|+|+|+|+..++..+.+++++++.++|++| T Consensus 395 LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevl 474 (559) T protein:vir:95 395 LGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEAL 474 (559) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhh Confidence 9999999999999999999999999999888777765 45668999999999999988888887777777666555 Q ss_pred -ccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 447 -RISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 447 -~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) +||+|++++.+|+++|||++ ++||++||+++|+|++|+++++|++++...++..+++.+.|+- T Consensus 475 d~id~d~~~~~~a~~~Gvp~~-~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~ 538 (559) T protein:vir:95 475 DKLNVDQAIDAFADMSGVSPT-VIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKT 538 (559) T ss_pred hcCCHHHHHHHHHHHhCCchh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccC Confidence 59999999999999999985 9999999999999988888877777665544333333333333 No 26 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=1.5e-88 Score=502.01 Aligned_cols=493 Identities=13% Similarity=0.081 Sum_probs=365.5 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcc----------cccCCCCCCccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLP----------YLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP----------~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) +=+++.+||+.+| |++||.+|+||++|..+ ..+...++.......+++|++..+++++|+++||+++| T Consensus 25 ~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~ 104 (641) T protein:vir:94 25 IGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAYFKGATF 104 (641) T ss_pred HHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhHHhhhhc Confidence 6677999999996 99999999999987655 22222333333334589999999999999999999999 Q ss_pred CcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeC-------- Q lcl|NC_012418. 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS-------- 140 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~-------- 140 (510) | +++||++.+.+++..+ ..++ ++..+...+.+++|+..+++.+.+.+.+||+++-++= T Consensus 105 p-~~~wf~~~p~~~ed~~----------~A~~---~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~ 170 (641) T protein:vir:94 105 P-SDDWFDLKGMVPELAD----------AARV---VKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQF 170 (641) T ss_pred C-CCceEEEecCCCChHH----------HHHH---HHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhh Confidence 7 8999999987665432 1222 2234556678999999999999999999998764320 Q ss_pred ---------------------CCCcEEEEEeceEEEeeCCCCCeE----EEEEEEeecHHHhhHH--hhHhh-----hhh Q lcl|NC_012418. 141 ---------------------DEATVVAWSLRSYAVRRDATGRWM----DIVLKQRYKSKDLDEA--YKQDL-----MRA 188 (510) Q Consensus 141 ---------------------~~~~~~~~pl~~~~i~~d~~G~vd----~i~r~~~~t~~~l~~~--~~~~~-----~~~ 188 (510) ....+++.||..|-|-.|+.++++ ++||++++|+++|..+ +..+. ... 
T Consensus 171 ~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~ 250 (641) T protein:vir:94 171 KRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVD 250 (641) T ss_pred hhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhccc Confidence 011256677766655566666665 5678888888888543 22111 111 Q ss_pred hhccCCCce----------EEEEEEEEeecCCCceEEEEEEEecCeeecccccccc-ccCceEEEeeeecCCCccccchH Q lcl|NC_012418. 189 GRNLSGSGS----------VDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPI-HLCPYIVPTWNLAPGEHYGRGHV 257 (510) Q Consensus 189 ~~~~~~~~~----------v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~-~~~P~~~~Rw~~~~g~~YGrgp~ 257 (510) ....+++.. .++|++....++++++|+|+|++++|++++++++|++ +++||+++||.+.++++||+||+ T Consensus 251 ~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~ 330 (641) T protein:vir:94 251 YKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVL 330 (641) T ss_pred ccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCCcccCCChH Confidence 111112211 2344444456678899999999999999998888763 68999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHH Q lcl|NC_012418. 258 EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVV 337 (510) Q Consensus 258 ~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~ 337 (510) +++|||+|+||.+++..++++.++++|+|+++++|+++|+++...|+|.+..+..+++.|+..+. 
.+|+..+..++.++ T Consensus 331 ~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~~v~pl~~~~-~~~~~~~~~~~~~~ 409 (641) T protein:vir:94 331 HPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHGSLQPIDMGR-QDFVVTYQEAQVQE 409 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCCcceeecCCc-cccchhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999988889989999987655 58999999999999 Q ss_pred HHHHHHhhccc----cC-CCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh--------------- Q lcl|NC_012418. 338 VRLNQAFMYGA----NQ-RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD--------------- 397 (510) Q Consensus 338 ~~I~~af~~~~----~~-~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~--------------- 397 (510) .+|+++|+.+. .+ +++++||||||++|.+|+...||+++++++.||+.||+.+++.++++ T Consensus 410 ~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~ 489 (641) T protein:vir:94 410 SSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEE 489 (641) T ss_pred HHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhh Confidence 99999998663 23 67788999999999999999999999999999999999999998866 Q ss_pred --cCCCCCCccccccee-eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcC--CCHhHccCCH Q lcl|NC_012418. 398 --ALLQGLITKQHKPAI-ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS--VDTSQFYKSE 472 (510) Q Consensus 398 --~~l~~~~~~~~~~~~-v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G--vp~~~i~rs~ 472 (510) ++++|+||++++..+ +.+++..+++.+++++..+.++++.+++.+++.+.+|+|.+++.+++..| +|. 
.++|++ T Consensus 490 ~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~-~~ir~~ 568 (641) T protein:vir:94 490 QMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDPM-RYIKKA 568 (641) T ss_pred hcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCch-hhccCc Confidence 367888888886532 33444444444555556666666666677888899999999999998755 676 478888 Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhc--------------ccCC Q lcl|NC_012418. 473 EELQAEAEQRRQQAAQAQAAQETLLEGASDMTNA--------------LAGV 510 (510) Q Consensus 473 ~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~--------------~ag~ 510 (510) |..++.+.+++++ ++++++..++..|+...+.+ -+|+ T Consensus 569 ~~~~~~~~~~~~~-~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 619 (641) T protein:vir:94 569 EAPPAAPPIAPAE-PGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGI 619 (641) T ss_pred cCchhHHHHHHHH-HHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcC Confidence 7543322221111 11222222222222222222 2333 No 27 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=6.3e-68 Score=389.02 Aligned_cols=496 Identities=12% Similarity=0.086 Sum_probs=355.8 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccc-------cCCC---CCCccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL-------MVDP---MSGSRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~-------~~~~---~~~~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) +=+.+.++|++.+ |+.|+.+|++|+++..+.- .... 
.........++++++-..+++.+.+.|+..+| T Consensus 21 ~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~ 100 (651) T protein:vir:80 21 VSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKAFEAIETIHAYLMSATF 100 (651) T ss_pred HHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccChhHHHHHHHHHHHHHHhhc Confidence 3445778888774 8999999999999877731 1111 11222233568999999999999999999999 Q ss_pred CcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCC----- Q lcl|NC_012418. 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSD----- 141 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~----- 141 (510) | +..||++.+..+.. +.+++-.-|+..+...++.++|+...+.+++|.+..||+++-+ +.. T Consensus 101 ~-~~~~~~~~p~~~~d-----------~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~ 168 (651) T protein:vir:80 101 P-NKNWFDVVPAKPGQ-----------DNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVK 168 (651) T ss_pred C-CCceeEeccCCchh-----------HHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeee Confidence 7 58899998864432 1244445566667777899999999999999999999987632 110 Q ss_pred ---------------------------CCcEEEEEeceEEEeeCCCCCeEEEE-EEEeecHHHhhHHh------------ Q lcl|NC_012418. 142 ---------------------------EATVVAWSLRSYAVRRDATGRWMDIV-LKQRYKSKDLDEAY------------ 181 (510) Q Consensus 142 ---------------------------~~~~~~~pl~~~~i~~d~~G~vd~i~-r~~~~t~~~l~~~~------------ 181 (510) .-+++.+|+.+|++..++.+--|+-| .+..+|.+++-+.. 
T Consensus 169 ~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~ 248 (651) T protein:vir:80 169 KKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLD 248 (651) T ss_pred hheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHH Confidence 01367789999999999877666533 34456666642211 Q ss_pred --hHhhh----------hhhh-----ccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecc--ccccccccCceEEE Q lcl|NC_012418. 182 --KQDLM----------RAGR-----NLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGE--EGRWPIHLCPYIVP 242 (510) Q Consensus 182 --~~~~~----------~~~~-----~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~--~~~y~~~~~P~~~~ 242 (510) .+... .... ..++.++|+||+|+.+.+.+++.++++|+..+|+.+++ +..|+ ++|||+++ T Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~~~~~~~-~~~Pf~~~ 327 (651) T protein:vir:80 249 VVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRFEQNPYW-CGRPFVIG 327 (651) T ss_pred HHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecccccCCC-CCCCeeee Confidence 00000 0000 12456789999999888888999999999888888865 44444 68999999 Q ss_pred eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcccccccccCc Q lcl|NC_012418. 243 TWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGD 322 (510) Q Consensus 243 Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~~~ 322 (510) ||.+.+|+.||+||++.++||.+.||.++++.++++.++++|+|++++||+++|+++...++|.++.|..+++.+++.+. 
T Consensus 328 ~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~vi~~~~~~~~~~l~~~~ 407 (651) T protein:vir:80 328 TYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGKVFLVSDHGDLQPLANQS 407 (651) T ss_pred cceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCceEEecCCCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999887664 Q ss_pred ccchHHHHHHHHHHHHHHHHHhhccc-cC----CCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 323 YNKMAAIQQSLQAVVVRLNQAFMYGA-NQ----RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD 397 (510) Q Consensus 323 ~~~~~~~~~~i~~~~~~I~~af~~~~-~~----~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~ 397 (510) .+++.++..|+.++++|++.|+.+- .+ ++.+++|||||+.+++|+..+||++|++++.||+.||++|++.++++ T Consensus 408 -~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~ 486 (651) T protein:vir:80 408 -SNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQ 486 (651) T ss_pred -ccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5799999999999999999997752 22 45578999999999999999999999999999999999999999987 Q ss_pred cCCCCCCc-----------------ccccce-eeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHH Q lcl|NC_012418. 398 ALLQGLIT-----------------KQHKPA-IETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWA 459 (510) Q Consensus 398 ~~l~~~~~-----------------~~~~~~-~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~ 459 (510) .+-.|..+ +++... 
.+.++++.+..++.+.+....++++.+++.+++.+.+|+..++..+++ T Consensus 487 ~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~ 566 (651) T protein:vir:80 487 FTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQ 566 (651) T ss_pred hcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHH Confidence 65433211 122211 134566655555555555555566666666777777899999999999 Q ss_pred HcCCC-HhHccCCHHHHHHHH-HH----HHHH---HHHHHHhhHHHhh---hhhhhhhcccCC Q lcl|NC_012418. 460 AFSVD-TSQFYKSEEELQAEA-EQ----RRQQ---AAQAQAAQETLLE---GASDMTNALAGV 510 (510) Q Consensus 460 ~~Gvp-~~~i~rs~~ev~~~r-~q----~~q~---~~~~~~~~~~~~~---ga~~~~~~~ag~ 510 (510) .+|++ +..++..+++.+... ++ +.++ +.+++++++.+.+ ...+++...+-. T Consensus 567 ~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 629 (651) T protein:vir:80 567 HWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQMMSEMYGTPN 629 (651) T ss_pred HcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99984 344666655443211 11 0100 0011111111100 111111111111 No 28 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=1.4e-34 Score=206.17 Aligned_cols=475 Identities=12% Similarity=0.095 Sum_probs=311.5 Q ss_pred ChhHHHHHHHHH--hhccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~l--kr~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) ..+.+.++|+.. +|++|+..|.|+.+|..-+.-...+....+...++|=+-....+.++.+.||+.+||. ..||++. 
T Consensus 17 ~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~l~~~~Fp~-~~w~~~v 95 (584) T protein:vir:95 17 SAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSNYFSSLFPN-DDWLRWV 95 (584) T ss_pred hHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHHHHHhhcCc-cceeeee Confidence 678889999877 4999999999999998876543333333333457787888889999999999999995 8899999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCC-------------- Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-------------- 142 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~-------------- 142 (510) ...+.... .++ =+.+++-+..-|..+||+.++...+++++.+|++.+=+ .... T Consensus 96 ~~~~~~~~---------~~~--~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~~~ 164 (584) T protein:vir:95 96 GYGKGDST---------KTK--AKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVPDYIG 164 (584) T ss_pred cCCCchhh---------HHH--HHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecccccccccc Confidence 88775432 111 23345556667799999999999999999999987522 2221 Q ss_pred CcEEEEEeceEEEeeCCCCCeEEEE--EEEeecHHHhhHHhhHh--------hhh---------hhhccC----C----- Q lcl|NC_012418. 143 ATVVAWSLRSYAVRRDATGRWMDIV--LKQRYKSKDLDEAYKQD--------LMR---------AGRNLS----G----- 194 (510) Q Consensus 143 ~~~~~~pl~~~~i~~d~~G~vd~i~--r~~~~t~~~l~~~~~~~--------~~~---------~~~~~~----~----- 194 (510) -++.-++.-++++..++ +.+++.. +|..+|..+|.....+. ..+ .....+ + T Consensus 165 prieriSP~d~~~Dpsa-~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (584) T protein:vir:95 165 PRLVRISPLDIVFNPLA-TSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKAAGFDV 243 (584) T ss_pred ceEEeeChhheeecCCC-CCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccccccccccccc Confidence 23544566788899998 6666633 46668998885544221 000 000000 0 Q ss_pred -----------CceEEEEEEEE---eec-CCCceEEEEEEEecCeeecc--ccccccccCceEEEeeeecCCCccccchH Q lcl|NC_012418. 
195 -----------SGSVDLYTHVQ---RKK-GTAMEYAELYHEIDGVRVGE--EGRWPIHLCPYIVPTWNLAPGEHYGRGHV 257 (510) Q Consensus 195 -----------~~~v~i~~~v~---~~~-~~~~p~~sv~~e~~~~~~~~--~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~ 257 (510) ...|++++.+- -+. +....+..|.+ .+|.++++ +.-|+++.+||++..|.....+.||.|+. T Consensus 244 d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v-~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~ 322 (584) T protein:vir:95 244 DGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITV-VDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPL 322 (584) T ss_pred ccccccccccCCceeEEEeecccccccccCCCcccceEEE-EeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCCch Confidence 11244444321 111 22222233333 35555544 66688899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHH Q lcl|NC_012418. 258 EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVV 337 (510) Q Consensus 258 ~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~ 337 (510) +-++|-.+.+|.+.+.++.+...+++|++. .+++++++..++++.+.+|...++.++.... .++-.+...|+-+. T Consensus 323 ~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k----~~~~~~~~~~~pg~~~~~~~~~~~q~~~p~a-~~~~s~~~~lq~~e 397 (584) T protein:vir:95 323 DNLVGMQYRIDHLENAKADAVDLIIQPPLK----IIGEVEEFVWGPGAEIHLDQGGDVQEIAKNV-NYIINADNQIQMLE 397 (584) T ss_pred hhhhhHHHHHhHHHHHHHHHHHHhcCccee----eccccchhcccCCceeecCCCCCcceecCch-hhhhHHHHHHHHHH Confidence 999999999999999999999999999543 4667788888888888898888887776432 34444555555555 Q ss_pred HHHHHHhhcccc--CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcC---------------- Q lcl|NC_012418. 338 VRLNQAFMYGAN--QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL---------------- 399 (510) Q Consensus 338 ~~I~~af~~~~~--~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~---------------- 399 (510) +...+ +.+. 
..+|.+.++++.+--.+..++-.+.++-+....|-.|+++|++.+|..-+ T Consensus 398 ~~me~---~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e 474 (584) T protein:vir:95 398 DRMEL---YAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTD 474 (584) T ss_pred HHHHh---hhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccc Confidence 55444 2222 12333344444444446677788899999999999999999999886432 Q ss_pred -----CCCCCcccccce-ee--ecHH-HHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccC Q lcl|NC_012418. 400 -----LQGLITKQHKPA-IE--TGLP-ALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYK 470 (510) Q Consensus 400 -----l~~~~~~~~~~~-~v--~~is-~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~r 470 (510) ...+.+++++.. .+ .+-+ -+.|+|..+++.+|.|. .+++ +++|..+--++.+.+++..+.|.-.|.+ T Consensus 475 ~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~--~~~~--~i~p~~~~~~l~~~ladl~~~p~~~~~~ 550 (584) T protein:vir:95 475 LGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNS--QIGQ--MILPHTSGKALATFVDDVTGLQGYEIFR 550 (584) T ss_pred cccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHh--hhhh--hccccchHHHHHHHHHHHhCCCcccccC Confidence 123344555533 12 2222 25667888888877664 2222 5778888888999999999999877777 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhHHHh---hhhh Q lcl|NC_012418. 471 SEEELQAEAEQRRQQAAQAQAAQETLL---EGAS 501 (510) Q Consensus 471 s~~ev~~~r~q~~q~~~~~~~~~~~~~---~ga~ 501 (510) .+-.+++-.+.++.+.++|+.+.+-++ .||. 
T Consensus 551 ~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 551 PNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred CCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 665554321111111111122211111 1222 No 29 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.96 E-value=2.5e-27 Score=166.39 Aligned_cols=487 Identities=12% Similarity=0.070 Sum_probs=276.8 Q ss_pred ChhHHHHHHHHHh--hc-cchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DG-SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~-~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) +.+.+..++...+ .. .+...+.+-.+|.+=.......+ + ..+++.+.-...++.+.+.|+..+|+ +.+||++ T Consensus 15 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~~-~---~s~~~~~~v~~~v~~~~~~l~~~~~~-~~~~~~~ 89 (705) T protein:vir:88 15 VLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERP-G---KSGIVSRDVQETVDWIMPSLMKVFTS-GGQVVKY 89 (705) T ss_pred HHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCcccC-C---CCccccHHHHHHHHHHHHHHHHhhcC-CCceEEE Confidence 2223333332221 11 12223333444443221111111 1 23566777777889999999988775 8999999 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHH-HHHhcCCHHHHHHHHHHHHhhCeEEE---EEe-------------- Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQ-RLFQNASLAVLTQVIKLLIVTGNALL---YRN-------------- 139 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~-~l~~snfy~~~~~~~~dl~~~G~~~l---~~~-------------- 139 (510) .|..+.+.+. . +.++..+.- ....++.+..++.++++.+..|++++ |.. 
T Consensus 90 ~p~~~~D~~~----------a---~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~ 156 (705) T protein:vir:88 90 EPDTAEDVEQ----------A---EQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSE 156 (705) T ss_pred eeCChhHHHH----------H---HHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCCh Confidence 9876654321 1 111222222 24566678889999999998888765 311 Q ss_pred ---------CC-------------------------CCcEEEEEeceEEEeeCCCCCeEE--EEEEEeecHHHhhH---- Q lcl|NC_012418. 140 ---------SD-------------------------EATVVAWSLRSYAVRRDATGRWMD--IVLKQRYKSKDLDE---- 179 (510) Q Consensus 140 ---------~~-------------------------~~~~~~~pl~~~~i~~d~~G~vd~--i~r~~~~t~~~l~~---- 179 (510) ++ .-+++.+|..+|++..++.+--|. +++++.+|.++|.. T Consensus 157 ~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~ 236 (705) T protein:vir:88 157 DMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVP 236 (705) T ss_pred hhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCC Confidence 10 113566788899999998775543 67788999888732 Q ss_pred -----HhhHhh-----------hhhh------------hccCCCceEEEEEEEEeecCCCceEEEEEEE-ecCeeecccc Q lcl|NC_012418. 180 -----AYKQDL-----------MRAG------------RNLSGSGSVDLYTHVQRKKGTAMEYAELYHE-IDGVRVGEEG 230 (510) Q Consensus 180 -----~~~~~~-----------~~~~------------~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e-~~~~~~~~~~ 230 (510) ++..+- .+.. ........|.+|.|+.+.+..+-++..+|.- ..|.++.+.. T Consensus 237 ~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~ 316 (705) T protein:vir:88 237 EDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNE 316 (705) T ss_pred hhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCccccccc Confidence 111100 0000 0011223578888877655333233333322 2355555544 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeec- Q lcl|NC_012418. 
231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVP- 309 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~p- 309 (510) .+ +.+||++..+.+.++..||.|++....+-.+.+|.+.+..+.++-.+++|.+++++ |..+++++.+..+|.++. T Consensus 317 ~~--~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~-g~v~~~d~~~~~pg~vv~~ 393 (705) T protein:vir:88 317 PW--DCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLD-GQVNLEDLLTNEAAGIVRV 393 (705) T ss_pred cC--CCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccc-cccCcccccccCCCeeEEe Confidence 44 46999999999999999999999999999999999999999999999999999965 555666655544444442 Q ss_pred CCcccccccccCcccchHHHHHHHHHHHHHHHHHh-hccccC---CC--CCCCCHHHHHHHHHHHHHHhchhHhHHHHHH Q lcl|NC_012418. 310 GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGANQ---RD--AERVTAEEVRITAEEAENTLGGTYSLLAENL 383 (510) Q Consensus 310 g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~---~~--~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 383 (510) ...+.+.+++... -.+.+...++.+.+.|++.. +.+..+ .+ ..+.||+.|....+.....+.-..-.+...+ T Consensus 394 ~~~~~i~~~~~~~--~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~ 471 (705) T protein:vir:88 394 KSMNSITPLETPQ--LSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETG 471 (705) T ss_pred cCCCccccccCCc--CcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2234455554432 23556677788888887765 222222 11 2357999999988888888888777777789 Q ss_pred HHHHHHHHHHHHhhcCC-----------CCCCcccc----cceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhcc Q lcl|NC_012418. 384 QSPLAYVCLSEVDDALL-----------QGLITKQH----KPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRI 448 (510) Q Consensus 384 l~Pli~r~~~il~~~~l-----------~~~~~~~~----~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~i 448 (510) +.+++++++.++....- .+..+.+. 
.+.+...++...+.++...+..+.+..+.+.+.+++.+.+ T Consensus 472 ~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~ 551 (705) T protein:vir:88 472 VKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLV 551 (705) T ss_pred HHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhhc Confidence 99999999998754321 12222222 2333456677777777777777777666665555555544 Q ss_pred C---HHHHHHHHHHHcCCC-HhHccCCHHHHHHHHHH--HHH--------HHH---HHHHhhHHH--hhhhhhhhhcccC Q lcl|NC_012418. 449 S---LPKMMDTIWAAFSVD-TSQFYKSEEELQAEAEQ--RRQ--------QAA---QAQAAQETL--LEGASDMTNALAG 509 (510) Q Consensus 449 d---~d~~~~~~a~~~Gvp-~~~i~rs~~ev~~~r~q--~~q--------~~~---~~~~~~~~~--~~ga~~~~~~~ag 509 (510) + ..++...++...|+- +..+...+...++++.+ +.+ +++ .++.+++.+ ...-.+....-+- T Consensus 552 ~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q 631 (705) T protein:vir:88 552 SEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQ 631 (705) T ss_pred ChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4 335555566665532 11222222222111100 000 000 000000000 0000000000000 Q ss_pred C Q lcl|NC_012418. 510 V 510 (510) Q Consensus 510 ~ 510 (510) . T Consensus 632 ~ 632 (705) T protein:vir:88 632 I 632 (705) T ss_pred H Confidence 0 No 30 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.95 E-value=2.8e-27 Score=166.15 Aligned_cols=485 Identities=11% Similarity=0.041 Sum_probs=290.2 Q ss_pred ChhHHHHHHHHH--hhccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~l--kr~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) .-.-++.+|++. 
+|+.-+..|.|+.+|+.-+.-+..++..-+=..+++-+.....+.+|-+.+++++|| +..||++. T Consensus 21 ~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~~~~l~a~~~~~~fp-~~~w~d~~ 99 (599) T protein:vir:31 21 FIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHLHLMITTSYMEHLLP-NRNWVDFV 99 (599) T ss_pred HHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHHHHHHHHHHHhhhcC-CccceEee Confidence 222366888876 488899999999999876543333333333334567777889999999999999999 79999999 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--------CCCC------- Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--------SDEA------- 143 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--------~~~~------- 143 (510) ..+++.. .+..=+.++.-+..-|+.|+|..+....+.|++..|++.-=++ ++.. T Consensus 100 ~~~~~~~-----------~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~~~~~ 168 (599) T protein:vir:31 100 GFDNDSV-----------NAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSG 168 (599) T ss_pred ecCCchh-----------HHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeeccccccccccc Confidence 8876532 1222234556677889999999999999999999999864333 1111 Q ss_pred -cEEEEEeceEEEeeCCCCCeEEEE--EEEeecHHHhhHHhhHh---------hhh---hhh-----cc----------- Q lcl|NC_012418. 144 -TVVAWSLRSYAVRRDATGRWMDIV--LKQRYKSKDLDEAYKQD---------LMR---AGR-----NL----------- 192 (510) Q Consensus 144 -~~~~~pl~~~~i~~d~~G~vd~i~--r~~~~t~~~l~~~~~~~---------~~~---~~~-----~~----------- 192 (510) +++-+..-++++..++ +.++..+ +|...|..+|-..+.+. +.. ... +. 
T Consensus 169 P~~ervsP~Di~~Dp~A-~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~ 247 (599) T protein:vir:31 169 TVTERLSPSDVFWDVTA-DSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDS 247 (599) T ss_pred ceEEeecccceeeCCCC-CCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhhccc Confidence 2445566788999888 6666644 57777877775433210 000 000 00 Q ss_pred ---CCCce---------EEEEEEEE-eecCCCceEEE--EEEEecCeeecc--ccccccccCceEEEeeeecCCCccccc Q lcl|NC_012418. 193 ---SGSGS---------VDLYTHVQ-RKKGTAMEYAE--LYHEIDGVRVGE--EGRWPIHLCPYIVPTWNLAPGEHYGRG 255 (510) Q Consensus 193 ---~~~~~---------v~i~~~v~-~~~~~~~p~~s--v~~e~~~~~~~~--~~~y~~~~~P~~~~Rw~~~~g~~YGrg 255 (510) ++..+ |++++.+- --+..+-+.+. |-..++++.+.+ ..-|++++.||++..|....++.||.| T Consensus 248 ~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G 327 (599) T protein:vir:31 248 LHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIG 327 (599) T ss_pred cccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccCCCC Confidence 11111 22222220 00111111221 112246555543 344888889999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceeecCCcccccccccCcccchHHHHHHHHH Q lcl|NC_012418. 256 HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQA 335 (510) Q Consensus 256 p~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~ 335 (510) |-..++|-.-.||.+.+..+.+...++.|+ ++-.|.+.|.++...|+..+..+...++.+..-. .+..-+...|+. 
T Consensus 328 ~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~--l~~~~dl~~eD~~~~P~~v~~~~d~~~vq~~~p~--s~~~~a~~~is~ 403 (599) T protein:vir:31 328 PLHRLTGMQYKLDKRENFREDLHDRFLHPS--LKKVGDVREKGMRGGPNHVFEVEETGDVQYMTPP--AEVLQPDNQLSI 403 (599) T ss_pred CchhcchHHHHHHHHHHHhhhhhhhhhccc--ccccccccccCccCCCCcceeecCCCccccccCc--hhhhhHHHHHHH Confidence 999999999999999999999999999883 3345668888888777766666666666544322 233344445555 Q ss_pred HHHHHHHHh----hccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcC------------ Q lcl|NC_012418. 336 VVVRLNQAF----MYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------ 399 (510) Q Consensus 336 ~~~~I~~af----~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~------------ 399 (510) .+.+..+.= +..+.+..++ -||+||....++...........+..+++.||+++++...++.. T Consensus 404 ~e~~mee~sGvp~~~~G~~~ag~-~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e 482 (599) T protein:vir:31 404 TLQLMEDLSGAPKESIGQRTAGE-KTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSE 482 (599) T ss_pred HHHHHHHhhccchhhcCCcccch-hhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeccc Confidence 555443311 1112222333 49999999999999999999999999999999999998865421 Q ss_pred -----CCCCCcccccc-eeeec---HHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccC Q lcl|NC_012418. 400 -----LQGLITKQHKP-AIETG---LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYK 470 (510) Q Consensus 400 -----l~~~~~~~~~~-~~v~~---is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~r 470 (510) +..+-.++++. ..+.. 
-.-+.|++-.+++.+|.+ +.+++ .++|+..-.++...++.....-.-.|.+ T Consensus 483 ~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~--~~~~q--~~~P~~~~k~l~~~l~~~~~l~~~~~~~ 558 (599) T protein:vir:31 483 LGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILG--GPLGA--ALAPHMSRTKLFNAVEYLGDLDAYGIFT 558 (599) T ss_pred ccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhc--ccCCC--ccchhhHHHHHHHHHHHHHhccccccCC Confidence 11122223321 11211 222567777788877765 22222 1344444434444444432222223444 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhh----hhhcccCC Q lcl|NC_012418. 471 SEEELQAEAEQRRQQAAQAQAAQETLLEGASD----MTNALAGV 510 (510) Q Consensus 471 s~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~----~~~~~ag~ 510 (510) ..--| +||+.+.+++|.++++...--.++ ..++--|- T Consensus 559 ~~va~---~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 559 FGIGV---QEDQQLARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 (599) T ss_pred CchhH---HHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCcccCC Confidence 33322 222222222333332222211111 00000011 No 31 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.91 E-value=3.8e-22 Score=138.00 Aligned_cols=489 Identities=10% Similarity=0.003 Sum_probs=243.5 Q ss_pred ChhHHHHHHHHHh------hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcc Q lcl|NC_012418. 1 MKSTAAMLWEKLR------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~~~~~~~r~~~lk------r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 74 (510) .=++|++-++..+ ++.- ..|-+++-|.- ...++. 
..+ ..++.+..-.+.++.+=+.|+-.+++ +..| T Consensus 28 ~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~-~~g---rs~vv~~~v~~~ve~~~~~l~~~f~~-~~~~ 100 (763) T protein:vir:95 28 SLQALKADLDAAKPSHTAMMIKV-KEWNDLMRIEG-KAKPPK-VKG---RSQVQPKLVRRQAEWRYSALTEPFLG-SNKL 100 (763) T ss_pred HHHHHHHHHHhhhcchhHHHHHH-HHHHHhhhccc-cCcccc-cCC---CccccCHHHHHHHHHHHHHHHHhhcC-CCcE Confidence 1122222222221 1111 23555433331 111111 111 23567777888888888888888777 5679 Q ss_pred cccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEe----C-------- Q lcl|NC_012418. 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRN----S-------- 140 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~----~-------- 140 (510) |.+.|-.+.+.+... ..+.+-.| -....++-+..++..+++.+..|++++ |-+ . T Consensus 101 ~~~~P~~~~D~~~A~---q~t~~~n~---------~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~ 168 (763) T protein:vir:95 101 FKVTPVTWEDVQGAR---QNELVLNY---------QFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVF 168 (763) T ss_pred EEEecCCcchHHHHH---HHHHHHHH---------HHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhh Confidence 999998776543211 11111111 234566677888899999888888753 211 0 Q ss_pred ----------------------------------C---------------------------------CCcEEEEEeceE Q lcl|NC_012418. 141 ----------------------------------D---------------------------------EATVVAWSLRSY 153 (510) Q Consensus 141 ----------------------------------~---------------------------------~~~~~~~pl~~~ 153 (510) . .-+++.+|..+| T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~ 248 (763) T protein:vir:95 169 SLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENI 248 (763) T ss_pred hhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHh Confidence 0 002445777889 Q ss_pred EEeeCCCCCeE---EEEEEEeecHHHhhHH---------hhHh----hhhhh---------hccCC-CceEEEEEEEEee Q lcl|NC_012418. 
154 AVRRDATGRWM---DIVLKQRYKSKDLDEA---------YKQD----LMRAG---------RNLSG-SGSVDLYTHVQRK 207 (510) Q Consensus 154 ~i~~d~~G~vd---~i~r~~~~t~~~l~~~---------~~~~----~~~~~---------~~~~~-~~~v~i~~~v~~~ 207 (510) +|..++.+.++ -+++++.+|..+|... +..+ ..... ...++ .+.|.||.|+.+- T Consensus 249 ~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~ 328 (763) T protein:vir:95 249 IIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFW 328 (763) T ss_pred eecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeee Confidence 99988876444 3578899999988321 1111 00000 00011 3568888887764 Q ss_pred cCCCceEEEEEE--EecCeee-ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCC Q lcl|NC_012418. 208 KGTAMEYAELYH--EIDGVRV-GEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 (510) Q Consensus 208 ~~~~~p~~sv~~--e~~~~~~-~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p 284 (510) +-.+..++.+|. ..++..+ ..++-|+++++||++..+.+.++..||.|.++.+.+..+.+|++.+..+..+..+++| T Consensus 329 d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~ 408 (763) T protein:vir:95 329 DIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANG 408 (763) T ss_pred ccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCC Confidence 422222223322 2344433 2456677788999999999999999999999999999999999999999999999999 Q ss_pred ceeeCCCcccchhhhccCCCceee--cCCccc--ccccccCc-ccchHHHHHHHHHHHHHHHHHh-hccccCCCCCCCCH Q lcl|NC_012418. 285 LNLVDEAKGAVVDDYQDAEMGDYV--PGGAEA--VRAYERGD-YNKMAAIQQSLQAVVVRLNQAF-MYGANQRDAERVTA 358 (510) Q Consensus 285 ~~l~~~~g~~~p~~~~~~~~g~~~--pg~~~~--v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~~TA 358 (510) .|+++.+.+...+.+...+.+.+. ||.... +.+...+. 
.+.+..+.+.++...+.|.-.- +..+...++...|| T Consensus 409 ~~~v~~gav~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~ta 488 (763) T protein:vir:95 409 QRGMPKGMLDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVA 488 (763) T ss_pred cEEeecccccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchh Confidence 999977555444434443433332 443322 12222111 1222333333333222221111 12222333345799 Q ss_pred HHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcC------------CCCCCcccccce--eeecHHH-HHHHH Q lcl|NC_012418. 359 EEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------LQGLITKQHKPA--IETGLPA-LSRSA 423 (510) Q Consensus 359 tEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~------------l~~~~~~~~~~~--~v~~is~-L~raq 423 (510) ++|..+.+.....+..++.++. +.+.+++++++.++.... ..+..+++.... ++..+++ -.+.+ T Consensus 489 t~v~~l~qa~~~~~~~~~r~~~-~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~~as~~~q 567 (763) T protein:vir:95 489 AGIRGVLDAASKREMAILRRLA-KGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDISTAEVDNQ 567 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecccchHHHH Confidence 9998888888888877776665 578999999999886521 223333333221 1112222 22333 Q ss_pred HHHHHHHHHHHHHhhcCh-------hhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHH--HHHHHhhH Q lcl|NC_012418. 424 AVQSMLNASQVIAGLAPI-------AQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQA--AQAQAAQE 494 (510) Q Consensus 424 ~~~~~~~~~q~l~~~~~~-------~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~--~~~~~~~~ 494 (510) ..+.+..+.+.++...+. ..+....+...+++.+.....-| ..+-.-..+.++.+.+.+++. 
.+++..++ T Consensus 568 ~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~-d~~~q~qaqle~~~~q~e~~~~~akaq~~qa 646 (763) T protein:vir:95 568 KSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQP-DPVQEQLKQLAVEKAQLENEELRSKIRLNDA 646 (763) T ss_pred HHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCc-cchhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 344444333333211111 01112233333333333322211 111111111111110000000 00000000 Q ss_pred HHhh----------------hhhh----hhhcccCC Q lcl|NC_012418. 495 TLLE----------------GASD----MTNALAGV 510 (510) Q Consensus 495 ~~~~----------------ga~~----~~~~~ag~ 510 (510) -+.. ...+ .....+.. T Consensus 647 qa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~ 682 (763) T protein:vir:95 647 QAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQS 682 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0000 00000011 No 32 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.62 E-value=9.5e-14 Score=91.98 Aligned_cols=481 Identities=12% Similarity=-0.023 Sum_probs=229.5 Q ss_pred ChhHHHHHHHHHh-----hccchHHHHHHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLR-----DGSVEQRAIEFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lk-----r~~~~~~w~e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) +++.+....++++ ...|...+.+-.+|..=. -.++..... .....+ |+=+ ...++.+.+..-. + T Consensus 43 ~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i-~~~i~~v~g~~~~-----n 115 (776) T protein:vir:93 43 AVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNI-QWSQDEIDELKERGQAPTVYNVI-SQSVNWIIGSEKR-----G 115 (776) T ss_pred HHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC-CCCHHHHHHHHhcCCceEEecch-HHHHHHHHHHHHh-----C Confidence 4544444444333 223444444445554211 111111000 001112 2322 3333333322222 5 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCCC--c--E Q lcl|NC_012418. 
72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA--T--V 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~~--~--~ 145 (510) ++=+++.+.++... ++.+.| +..+......+++..+...++.+.+..|++++ +.+.+.. . . T Consensus 116 r~~~~~~p~~~~d~----------~~Ae~l---~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~ 182 (776) T protein:vir:93 116 RSDFKVLPRRKDGG----------KAAERK---TALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYA 182 (776) T ss_pred CcceEEecCChhHH----------HHHHHH---HHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEe Confidence 55566666544321 123333 33444456788899999999999888887764 4454322 2 3 Q ss_pred EEEEeceEEEeeCCCC----CeEEEEEEEeecHHHhhHHhhHhhh---hhh----------------------------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEAYKQDLM---RAG----------------------------- 189 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~~---~~~----------------------------- 189 (510) ++++..++++..++.- ...-+|++.++|.+++-..|+.... +.. T Consensus 183 ~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (776) T protein:vir:93 183 GAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNS 262 (776) T ss_pred eccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccc Confidence 4456677887665532 2334788889999988655543110 000 Q ss_pred ----hccCCCceEEEEEEEEeecCC-----------------------------------CceEEEEE--EEecCeee-c Q lcl|NC_012418. 190 ----RNLSGSGSVDLYTHVQRKKGT-----------------------------------AMEYAELY--HEIDGVRV-G 227 (510) Q Consensus 190 ----~~~~~~~~v~i~~~v~~~~~~-----------------------------------~~p~~sv~--~e~~~~~~-~ 227 (510) ......+.|.|+.++++++.. .....++| +.+++..+ . 
T Consensus 263 ~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~ 342 (776) T protein:vir:93 263 VTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWA 342 (776) T ss_pred ccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhc Confidence 000122467778887654210 01122232 22344332 2 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhcc---CCC Q lcl|NC_012418. 228 EEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD---AEM 304 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~---~~~ 304 (510) ..+-|+++.|||++......+.+.||.|.+....+-.+.+|++...++..+ .+..+++..+.+-+.+.+.. .++ T Consensus 343 ~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l---~~~~~~~~~gav~~~d~~~~~~~rp~ 419 (776) T protein:vir:93 343 GPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL---STNKVLMEEGAVDDIDEFRREAARPD 419 (776) T ss_pred cCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhh---cCCceeeccccccchHHHHHhcccCC Confidence 346677788999999999999999999999999999999999887776543 35568888877767766553 333 Q ss_pred ceee--cCCcccccccccCcccchHHHHHHHHHHHHHHHHHh-hcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHH Q lcl|NC_012418. 305 GDYV--PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLA 380 (510) Q Consensus 305 g~~~--pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 380 (510) +.+. +|....+..... .+-.+...+.++...+.|+..- ..+.+ ...+...+..-|..|.+.-...+..++.++. 
T Consensus 420 ~vi~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~ 497 (776) T protein:vir:93 420 AVMTVKNGKLGAVKMDVD--RDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLR 497 (776) T ss_pred ceeeeCCccccccccccC--cCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 4443 333333222111 1112344555566666665542 22221 1233346777888888888888888888876 Q ss_pred HHHHHHHHHHHHHHHhhc----CCCCCCc----------------ccc-----cceeeec-HHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 381 ENLQSPLAYVCLSEVDDA----LLQGLIT----------------KQH-----KPAIETG-LPALSRSAAVQSMLNASQV 434 (510) Q Consensus 381 ~E~l~Pli~r~~~il~~~----~l~~~~~----------------~~~-----~~~~v~~-is~L~raq~~~~~~~~~q~ 434 (510) . .+.=+.+.++.++... .+.-+.. .++ .+.+..+ -++..|.+....++ ++ T Consensus 498 ~-~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~---ql 573 (776) T protein:vir:93 498 L-AFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELM---EV 573 (776) T ss_pred H-HHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHH---HH Confidence 5 3333555555554331 1111111 111 1111122 22333444444443 44 Q ss_pred HHhhcChhh---------HhhccCHHHHHHHHHHHcCCCH-hHccCCHHHHHHHHHHHHHHHHHHHHhhHH--------- Q lcl|NC_012418. 435 IAGLAPIAQ---------LDPRISLPKMMDTIWAAFSVDT-SQFYKSEEELQAEAEQRRQQAAQAQAAQET--------- 495 (510) Q Consensus 435 l~~~~~~~q---------~~~~id~d~~~~~~a~~~Gvp~-~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~--------- 495 (510) ++.+.+-.+ ..+--+.+++...+-...+-+. ..-...+++.++.+.++++++.+++.+.+. T Consensus 574 ~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~ 653 (776) T protein:vir:93 574 IGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKAR 653 (776) T ss_pred HhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHH Confidence 443322111 1111255666666666555221 111222222221111111111110000000 Q ss_pred ---Hhh-------hhhhhhhcccCC Q lcl|NC_012418. 
496 ---LLE-------GASDMTNALAGV 510 (510) Q Consensus 496 ---~~~-------ga~~~~~~~ag~ 510 (510) +.+ -..+......++ T Consensus 654 ~~~aea~~~~aqa~~~~~~a~~~~~ 678 (776) T protein:vir:93 654 KAAAEAQVAEAKAKHISRMAIREGV 678 (776) T ss_pred HHHHHHHHHhhhhhhhhhcchhhhh Confidence 000 000000000111 No 33 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.49 E-value=5.9e-12 Score=82.16 Aligned_cols=494 Identities=10% Similarity=0.011 Sum_probs=225.7 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHH----HHHhcccccCCCCCCccc--c-ccc-cccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEF----AKTTLPYLMVDPMSGSRG--V-VEH-DFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~----~~~~lP~~~~~~~~~~~~--~-~~~-~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++.+....+.++ ...|...|++- .+|..=.-. ++.+.... + .+. .|+=++... +...+..-. + T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw-~~~~~~~l~~~g~p~~~~N~i~~~v-~~v~g~~~~-----n 100 (711) T protein:vir:10 28 DRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW-PSQVRTERELEQRPCLVNNVLPTFV-DQVLGDQRQ-----N 100 (711) T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCC-CHHHHHHHHhcCCCcEEEcchHHHH-HHHhhhHhh-----C Confidence 2222222222332 33455555432 233321000 11100000 0 011 133333322 222222111 2 Q ss_pred CcccccCCChH------------HHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--E Q lcl|NC_012418. 
72 IPFFRSELTDA------------IRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--Y 137 (510) Q Consensus 72 ~~WF~l~~~d~------------~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~ 137 (510) ++=+++.+.++ -.......+..-.++.+.| +..+......++...+...++.+.+..|.+.+ + T Consensus 101 r~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~ 177 (711) T protein:vir:10 101 RPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVF---TGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) T ss_pred CcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHH---HHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEEE Confidence 22233333210 0000111111111233333 33333445677888889999999888887654 2 Q ss_pred Ee---CC--CC--cEEEEE-eceEEEeeCC---CCC-eEEEEEEEeecHHHhhHHhhHhhhh----hhhc-cC---CCce Q lcl|NC_012418. 138 RN---SD--EA--TVVAWS-LRSYAVRRDA---TGR-WMDIVLKQRYKSKDLDEAYKQDLMR----AGRN-LS---GSGS 197 (510) Q Consensus 138 ~~---~~--~~--~~~~~p-l~~~~i~~d~---~G~-vd~i~r~~~~t~~~l~~~~~~~~~~----~~~~-~~---~~~~ 197 (510) .+ ++ .+ .++.++ ..++++..++ ++. ..-+|++.+|+.+++...|+..... .... .+ ..+. T Consensus 178 ~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~ 257 (711) T protein:vir:10 178 SDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKS 257 (711) T ss_pred ecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCcccCcce Confidence 22 22 12 244443 4557765433 332 3448999999999998777643211 1110 00 1234 Q ss_pred EEEEEEEEeecC-----------------------------------CCceEEEEEEE--ecCeeeccccccccccCceE Q lcl|NC_012418. 198 VDLYTHVQRKKG-----------------------------------TAMEYAELYHE--IDGVRVGEEGRWPIHLCPYI 240 (510) Q Consensus 198 v~i~~~v~~~~~-----------------------------------~~~p~~sv~~e--~~~~~~~~~~~y~~~~~P~~ 240 (510) |.+..++++++. +......+|+. 
.++..+...+.|++..|||+ T Consensus 258 vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~~~P~v 337 (711) T protein:vir:10 258 VRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVI 337 (711) T ss_pred eeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCCcccEE Confidence 555555544210 00122344432 33333434456777789998 Q ss_pred EE--eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhcc---CCCceee---cCCc Q lcl|NC_012418. 241 VP--TWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD---AEMGDYV---PGGA 312 (510) Q Consensus 241 ~~--Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~---~~~g~~~---pg~~ 312 (510) +. .+...++..++.|.+....+-.+.+|++....+..+....++.|++.++.+-+.+.... ..+|.++ ||.. T Consensus 338 p~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~ 417 (711) T protein:vir:10 338 PVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQ 417 (711) T ss_pred EEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHHhccccCCCeeEeccccc Confidence 65 35567788888889999999999999999999999999999999998887777665321 2333333 4443 Q ss_pred ccccccccCcccchHHHHHHHHHHHHHHHHHh-hccccC-CCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHH Q lcl|NC_012418. 313 EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGANQ-RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) Q Consensus 313 ~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~-~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) +.-.+......+-.+.....++...+.|.+.- ..+.++ ..+...|..-|..|.+.-...|...+.++.. ...=+.+. 
T Consensus 418 ~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~-~~~~~g~~ 496 (711) T protein:vir:10 418 GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKI 496 (711) T ss_pred CcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 32222222222333556666676777776653 223222 2333478888999999988888888877764 22233333 Q ss_pred HHHHHhhcC----CCCCCcc-------------------------ccc---cee---eecHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 391 CLSEVDDAL----LQGLITK-------------------------QHK---PAI---ETGLPALSRSAAVQSMLNASQVI 435 (510) Q Consensus 391 ~~~il~~~~----l~~~~~~-------------------------~~~---~~~---v~~is~L~raq~~~~~~~~~q~l 435 (510) ++.++.... +.-+..+ ++. -.+ +.+-++-.|.+.+..++ +++ T Consensus 497 ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~---ql~ 573 (711) T protein:vir:10 497 LVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMI---QFA 573 (711) T ss_pred HHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHH---HHH Confidence 333332211 1111110 000 011 12233333444344443 333 Q ss_pred HhhcChh--------hHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHH-HHHHHHHHH-HH---HHHHhh-----HHHh Q lcl|NC_012418. 436 AGLAPIA--------QLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQ-AEAEQRRQQ-AA---QAQAAQ-----ETLL 497 (510) Q Consensus 436 ~~~~~~~--------q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~-~~r~q~~q~-~~---~~~~~~-----~~~~ 497 (510) ..+.+.. ...+--+.++++..+....+-+. ......+-+ +..++++++ ++ +++.++ +.+. T Consensus 574 ~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~--~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae 651 (711) T protein:vir:10 574 QAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNV--LSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEAD 651 (711) T ss_pred hhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCccc--CcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3332211 11222377888888877766331 222222111 111000000 00 000000 0000 Q ss_pred hhhhhhhhcccCC Q lcl|NC_012418. 
498 EGASDMTNALAGV 510 (510) Q Consensus 498 ~ga~~~~~~~ag~ 510 (510) .-..+++...+.. T Consensus 652 ~~~Aqae~~qa~~ 664 (711) T protein:vir:10 652 TAQAQADMLKAQL 664 (711) T ss_pred HHHHHHHHHHHHH Confidence 0000011000110 No 34 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.04 E-value=4.3e-09 Score=66.43 Aligned_cols=487 Identities=12% Similarity=0.033 Sum_probs=217.0 Q ss_pred ChhHHHHHHHHHh-----hccchHHHHHHH----HHh-cccccCCCCCCc----ccc---cccc-ccchHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKLR-----DGSVEQRAIEFA----KTT-LPYLMVDPMSGS----RGV---VEHD-FQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~~~~~~~r~~~lk-----r~~~~~~w~e~~----~~~-lP~~~~~~~~~~----~~~---~~~~-~dstg~~a~~~LAa~ 62 (510) |=+++.++..+++ ...|.+.|++-+ +|. .+..=.++.+.. +.+ .+.+ |+=++...-..+... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 6666666655543 123444444322 222 122111111100 000 0111 343333322222222 Q ss_pred HHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--e- Q lcl|NC_012418. 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--N- 139 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~- 139 (510) .. 
+++=+++.+.++.- + .++.+.|+ ..+......++...+...+|.+.+..|.+.+-+ + T Consensus 81 ~~------nr~d~~v~P~~~~~------d---~~~Ae~l~---~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~ 142 (708) T protein:vir:10 81 RN------NRITVKFRPGDREA------S---EELANKLN---GLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSML 142 (708) T ss_pred Hh------CCcceEEEcCCCCc------h---HHHHHHHH---HHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeecc Confidence 11 45555666553321 0 12333333 333344567888889999999998888776532 1 Q ss_pred --C-----CCC--cEEEE--EeceEEEeeCC---CCC-eEEEEEEEeecHHHhhHHhhHhhhhhhhc---cC------CC Q lcl|NC_012418. 140 --S-----DEA--TVVAW--SLRSYAVRRDA---TGR-WMDIVLKQRYKSKDLDEAYKQDLMRAGRN---LS------GS 195 (510) Q Consensus 140 --~-----~~~--~~~~~--pl~~~~i~~d~---~G~-vd~i~r~~~~t~~~l~~~~~~~~~~~~~~---~~------~~ 195 (510) + +.. +++++ |..++++.-++ ++. -.-+||..+|+.+++-..|+......... .+ .. T Consensus 143 ~~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~ 222 (708) T protein:vir:10 143 VNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGA 222 (708) T ss_pred ccccCCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCC Confidence 1 111 23332 44455544332 221 22377888999999988777532211000 00 00 Q ss_pred ceEEEEEEEE-----------eecC-----------------------------CCceEEEEEEE-ecCeeec-cccccc Q lcl|NC_012418. 196 GSVDLYTHVQ-----------RKKG-----------------------------TAMEYAELYHE-IDGVRVG-EEGRWP 233 (510) Q Consensus 196 ~~v~i~~~v~-----------~~~~-----------------------------~~~p~~sv~~e-~~~~~~~-~~~~y~ 233 (510) +.+-|..+++ +.+. +......||+. 
..|..++ ..+-|+ T Consensus 223 d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p 302 (708) T protein:vir:10 223 DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIP 302 (708) T ss_pred CceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCC Confidence 1122222211 1110 00112223333 3455444 335577 Q ss_pred cccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhc----------- Q lcl|NC_012418. 234 IHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ----------- 300 (510) Q Consensus 234 ~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~----------- 300 (510) +..|||++.-+.. .+|..++.|.+....+-.+.+|+..-..+..+..+.+.+++++++.+.....-. T Consensus 303 ~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~ 382 (708) T protein:vir:10 303 GEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFL 382 (708) T ss_pred CCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhh Confidence 7789998774433 257787889999999999999999988888888888888888776654332111 Q ss_pred -----cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHh-hccccCCCCCCCCHHHHHHHHHHHHHHhch Q lcl|NC_012418. 301 -----DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGANQRDAERVTAEEVRITAEEAENTLGG 374 (510) Q Consensus 301 -----~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~~TAtEi~~r~~E~~~~LGp 374 (510) ..+.|.++++... +.......-.....+.++...+.|.+.. ..+.+.....+.+..-|..|.+.-...+.. 
T Consensus 383 ~~~~~~~~~G~~~~~~~~---~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~~ 459 (708) T protein:vir:10 383 PLREVRDKSGNIIAGATP---AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFI 459 (708) T ss_pred ccccccccccccccccCC---ccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHHH Confidence 1122222222111 1011111112234455566666666654 222222222345778899999999999998 Q ss_pred hHhHHH------HHHHHHHHHHHHH------HHhhcCCC----------CCCcc------ccc---ceeeecHHHHHHHH Q lcl|NC_012418. 375 TYSLLA------ENLQSPLAYVCLS------EVDDALLQ----------GLITK------QHK---PAIETGLPALSRSA 423 (510) Q Consensus 375 v~~rl~------~E~l~Pli~r~~~------il~~~~l~----------~~~~~------~~~---~~~v~~is~L~raq 423 (510) .+.+|. -+++.-||...+. |+...|-+ ++-.+ ++. -.++--..|-.-++ T Consensus 460 ~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~ 539 (708) T protein:vir:10 460 YLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTAR 539 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhH Confidence 888775 3444444444431 22211100 01111 111 11222222222233 Q ss_pred HHHHHHHHHHHHHhhcChh----hHhh-------ccCHHHHHHHHHHHcCCCHhHccCCH-HHHHHHHHHHHHHHHHHH- Q lcl|NC_012418. 424 AVQSMLNASQVIAGLAPIA----QLDP-------RISLPKMMDTIWAAFSVDTSQFYKSE-EELQAEAEQRRQQAAQAQ- 490 (510) Q Consensus 424 ~~~~~~~~~q~l~~~~~~~----q~~~-------~id~d~~~~~~a~~~Gvp~~~i~rs~-~ev~~~r~q~~q~~~~~~- 490 (510) +.+.+..+.+++..+++.. .+.+ --+.++++..+-..++.+. ....+ +|.+++.+++++++++++ T Consensus 540 r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~--~~~~~~~ee~q~~~~~q~~~q~q~~ 617 (708) T protein:vir:10 540 RDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISG--IAKPRNEKEQQIVQQAQMAAQSQPN 617 (708) T ss_pred HHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccc--cccccchhhHHHHHHHHHHHHHHHH Confidence 3334444445555544321 1211 1256677877777665432 22221 122222111111111000 Q ss_pred ----HhhH------HHhhhhhh-hhhc-ccCC Q lcl|NC_012418. 
491 ----AAQE------TLLEGASD-MTNA-LAGV 510 (510) Q Consensus 491 ----~~~~------~~~~ga~~-~~~~-~ag~ 510 (510) .+++ +....+.+ +... .-++ T Consensus 618 ~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~ 649 (708) T protein:vir:10 618 PEMVLAQAQMVAAQAEAQKATNETAQTQIKAF 649 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 00000000 0000 0001 No 35 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=98.89 E-value=1.9e-08 Score=62.86 Aligned_cols=487 Identities=10% Similarity=-0.002 Sum_probs=208.2 Q ss_pred ChhHHHHHHHHHhh-ccchHHHHHH----HHHhcccccCCCCCCccccc-cc-cccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLRD-GSVEQRAIEF----AKTTLPYLMVDPMSGSRGVV-EH-DFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~e~----~~~~lP~~~~~~~~~~~~~~-~~-~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) -|+++....+.+++ -.+.+.|+.- .+|..= .-.++......+. .+ .|+=++. .++.+.+.--. +++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G-~Qw~~~~~~~l~~q~rp~~N~i~~-~i~~v~g~~~~-----nr~ 76 (725) T protein:vir:77 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRV-SQWDDWLSQYTTLQYRGQFDVVRP-VVRKLVSEMRQ-----NPI 76 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCC-CCCCHHHHHHHHhcCCCccccHHH-HHHHHHhhHHh-----CCc Confidence 34444444433331 2233444332 233221 1111111100000 11 1332222 22222222111 455 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE-----EEeCCCC----c Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL-----YRNSDEA----T 144 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l-----~~~~~~~----~ 144 (510) =+++.+.++... ++.+.|+. .+......|+..-+-..+|.+.+..|.+.+ |.+++.. 
+ T Consensus 77 d~~v~P~~~~d~----------~~Ae~l~~---~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~ 143 (725) T protein:vir:77 77 DVLYRPKDGARP----------DAADVLMG---MYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) T ss_pred ceEEecCCccHH----------HHHHHHHH---HHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCcee Confidence 556666543221 23333333 333345588899999999999888887764 2233221 2 Q ss_pred EEEEE----eceEEEeeCCCCC--eE--EEEEEEeecHHH---hhHHhhHhhhhhhh----c-----cCCCceEEEEEEE Q lcl|NC_012418. 145 VVAWS----LRSYAVRRDATGR--WM--DIVLKQRYKSKD---LDEAYKQDLMRAGR----N-----LSGSGSVDLYTHV 204 (510) Q Consensus 145 ~~~~p----l~~~~i~~d~~G~--vd--~i~r~~~~t~~~---l~~~~~~~~~~~~~----~-----~~~~~~v~i~~~v 204 (510) ++.+| ..++++..++.-. -| -+||..+|+.+. +.+.++.+...... . -...+.|.|..++ T Consensus 144 i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~ 223 (725) T protein:vir:77 144 IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEE Confidence 34443 3445555543211 01 166777888764 33444332211100 0 0112345555555 Q ss_pred EeecC----------------------------------------CCceEEEEEEE-ecCeeeccc-cccccccCceEEE Q lcl|NC_012418. 205 QRKKG----------------------------------------TAMEYAELYHE-IDGVRVGEE-GRWPIHLCPYIVP 242 (510) Q Consensus 205 ~~~~~----------------------------------------~~~p~~sv~~e-~~~~~~~~~-~~y~~~~~P~~~~ 242 (510) ++++. +......|||. +.|..++.. +-|+.+.|||++. T Consensus 224 ~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:77 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEE Confidence 43310 01123455554 355555443 4466667999854 Q ss_pred e--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCcee------e---cCC Q lcl|NC_012418. 
243 T--WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDY------V---PGG 311 (510) Q Consensus 243 R--w~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~------~---pg~ 311 (510) - .....|..|+.|.+....+-.+.+|+.....+..+..+.+.++++..+-+-..+.....+++.. + +|. T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:77 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGD 383 (725) T ss_pred eeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCc Confidence 3 2357899999999999999999999999999988888888888887654432222222222210 1 121 Q ss_pred cccccccccCcccch-HHHHHHHHHHHHHHHHHh-hcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHH------HH Q lcl|NC_012418. 312 AEAVRAYERGDYNKM-AAIQQSLQAVVVRLNQAF-MYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLA------EN 382 (510) Q Consensus 312 ~~~v~~~~~~~~~~~-~~~~~~i~~~~~~I~~af-~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~------~E 382 (510) . ..+++..-....+ +.....++...+.|.+.- +.+.+ ...+..++.--|..|.+.-...+...+.+|. -+ T Consensus 384 ~-~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:77 384 L-PTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred c-cccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1111111111222 345566677777776654 22222 2222234556677787777777776666543 34 Q ss_pred HHHHHHHHHH------HHHhhcCCC-----CCC---cc--------cc--cceeeecHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 383 LQSPLAYVCL------SEVDDALLQ-----GLI---TK--------QH--KPAIETGLPALSRSAAVQSMLNASQVIAGL 438 (510) Q Consensus 383 ~l~Pli~r~~------~il~~~~l~-----~~~---~~--------~~--~~~~v~~is~L~raq~~~~~~~~~q~l~~~ 438 (510) ++.-||...+ .|+...+-. ..+ +. 
++ +-.++--..|-.-+++.+.+..+.+++..+ T Consensus 463 ~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~ 542 (725) T protein:vir:77 463 IYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhc Confidence 4444444443 222211111 000 00 01 011111222333333344444444555555 Q ss_pred cChhh-----Hhh---ccCH---HHHHHHHHHHcCCCHhHcc--CCHHHHHHHHHHHHHHHHHHH--Hh---------hH Q lcl|NC_012418. 439 APIAQ-----LDP---RISL---PKMMDTIWAAFSVDTSQFY--KSEEELQAEAEQRRQQAAQAQ--AA---------QE 494 (510) Q Consensus 439 ~~~~q-----~~~---~id~---d~~~~~~a~~~Gvp~~~i~--rs~~ev~~~r~q~~q~~~~~~--~~---------~~ 494 (510) ++... +.. ..|. +++.+.+...... .... .++++-++..+++++++++++ ++ ++ T Consensus 543 ~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~--~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa 620 (725) T protein:vir:77 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ--MGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) T ss_pred cccchhHHHHHHHhhccccchHHHHHHHHHHhhhhh--hhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 44221 111 1233 4444444443321 1121 122221111111111000000 00 00 Q ss_pred H-Hhhhhh----h----hhhcccCC Q lcl|NC_012418. 495 T-LLEGAS----D----MTNALAGV 510 (510) Q Consensus 495 ~-~~~ga~----~----~~~~~ag~ 510 (510) . +.+.+. + +..+-|-+ T Consensus 621 ~~~kaq~e~~k~q~~a~~~~~~a~~ 645 (725) T protein:vir:77 621 ELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000000 0 00000011 No 36 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.87 E-value=2.3e-08 Score=62.45 Aligned_cols=487 Identities=9% Similarity=-0.016 Sum_probs=211.4 Q ss_pred ChhHHHHHHHHHhh-ccchHHHHH----HHHHhcccccCCCCCCccccc-cc-cccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 
1 MKSTAAMLWEKLRD-GSVEQRAIE----FAKTTLPYLMVDPMSGSRGVV-EH-DFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~e----~~~~~lP~~~~~~~~~~~~~~-~~-~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) =++++....+.+++ -.+.+.|+. =.+|.. ..-.++......+. .+ .|+-++. .++...+.-- .+++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~-G~Qw~~~~~~~l~~q~rp~~N~i~~-~i~~v~g~e~-----~nr~ 76 (725) T protein:vir:92 4 NENRLESILSRFDADWTASDEARREAKNDLFFSR-ISQWDDWLSQYTTLQYRGQFDVVRP-VVRKLVSEMR-----QNPI 76 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc-CCCCCHHHHHHHHhcCCCcccchHH-HHHHHHhhHH-----hCCc Confidence 23333333333321 123334433 233332 11111111100000 01 2333332 2222211111 1455 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE-----EEeCCCC----c Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL-----YRNSDEA----T 144 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l-----~~~~~~~----~ 144 (510) =+++.+.++... ++.+.|+.+ +......|+..-+-..+|.+.+..|.+.+ |.+++.. . T Consensus 77 d~~v~P~~~~d~----------~~Ae~l~~~---~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~ 143 (725) T protein:vir:92 77 DVLYRPKDGASP----------DAADVLMGM---YRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQV 143 (725) T ss_pred ceEEecCCccHH----------HHHHHHHHH---HHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCcee Confidence 555565443221 233333333 33345588899999999999888887764 2222221 2 Q ss_pred EEEEE----eceEEEeeCCCCC--eE--EEEEEEeecHH---HhhHHhhHhhhhhhh----c-c----CCCceEEEEEEE Q lcl|NC_012418. 145 VVAWS----LRSYAVRRDATGR--WM--DIVLKQRYKSK---DLDEAYKQDLMRAGR----N-L----SGSGSVDLYTHV 204 (510) Q Consensus 145 ~~~~p----l~~~~i~~d~~G~--vd--~i~r~~~~t~~---~l~~~~~~~~~~~~~----~-~----~~~~~v~i~~~v 204 (510) ++..| +.+.++..++.-. -| -+||..+|+.. ++.+.++.....-.. . . 
...+.|.|+.++ T Consensus 144 i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~ 223 (725) T protein:vir:92 144 IRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEE Confidence 34444 4455555443211 01 15677778865 344444432111000 0 0 012345555555 Q ss_pred EeecC------------C----------------------------CceEEEEEEE-ecCeeeccc-cccccccCceEEE Q lcl|NC_012418. 205 QRKKG------------T----------------------------AMEYAELYHE-IDGVRVGEE-GRWPIHLCPYIVP 242 (510) Q Consensus 205 ~~~~~------------~----------------------------~~p~~sv~~e-~~~~~~~~~-~~y~~~~~P~~~~ 242 (510) ++.+. + ......|||. ..|..++.. +-|+.+.|||++. T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:92 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEE Confidence 43210 0 0122345554 345555433 3455566899965 Q ss_pred eee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCcee------e---cCC Q lcl|NC_012418. 243 TWN--LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDY------V---PGG 311 (510) Q Consensus 243 Rw~--~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~------~---pg~ 311 (510) -.. ...|..|+.|.+....+-.+.+|+..-..+..+..+.+.++++..+-+-.-......+++.. + +|. 
T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:92 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred EeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeecccccccccc Confidence 323 35899999999999999999999999888888888888888887654422222222222211 1 111 Q ss_pred ccccccccc-CcccchHHHHHHHHHHHHHHHHHhh-ccc-cCCCCCCCCHHHHHHHHHHHHHHhchhHhHHH------HH Q lcl|NC_012418. 312 AEAVRAYER-GDYNKMAAIQQSLQAVVVRLNQAFM-YGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLA------EN 382 (510) Q Consensus 312 ~~~v~~~~~-~~~~~~~~~~~~i~~~~~~I~~af~-~~~-~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~------~E 382 (510) .. ..++.. ....-.+.....++..++.|++.-= .+. ..+.+..++.--|..|.+.-...|...+..|. -+ T Consensus 384 ~~-~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:92 384 MP-TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred cc-ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111111 1122234566777777777777652 222 22222335566788888888888887776554 34 Q ss_pred HHHHHHHHHHH------HHhhcCCCC-----CC-c--c--------ccc--ceeeecHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 383 LQSPLAYVCLS------EVDDALLQG-----LI-T--K--------QHK--PAIETGLPALSRSAAVQSMLNASQVIAGL 438 (510) Q Consensus 383 ~l~Pli~r~~~------il~~~~l~~-----~~-~--~--------~~~--~~~v~~is~L~raq~~~~~~~~~q~l~~~ 438 (510) ++.-||...+. |+...|-.. .+ . . ++. 
-.++--..|-.-+++.+.+..+.+++..+ T Consensus 463 ~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~ 542 (725) T protein:vir:92 463 IYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhc Confidence 44444444442 221111100 00 0 0 111 11122233333344444445555556555 Q ss_pred cChhh-----Hhhc---cC---HHHHHHHHHHHcCCCHhHcc--CCHHHHHHHHHHHHHH--HHHHHHh---------hH Q lcl|NC_012418. 439 APIAQ-----LDPR---IS---LPKMMDTIWAAFSVDTSQFY--KSEEELQAEAEQRRQQ--AAQAQAA---------QE 494 (510) Q Consensus 439 ~~~~q-----~~~~---id---~d~~~~~~a~~~Gvp~~~i~--rs~~ev~~~r~q~~q~--~~~~~~~---------~~ 494 (510) .+... +... .| .+++.+.+....+.. ... .++++.+++.++++++ +++++++ ++ T Consensus 543 ~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~--~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qa 620 (725) T protein:vir:92 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM--GVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) T ss_pred ccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchh--ccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 54321 1111 23 244555554443311 111 1233222222111110 0000000 00 Q ss_pred H-Hhhhhhh--------hhhcccCC Q lcl|NC_012418. 495 T-LLEGASD--------MTNALAGV 510 (510) Q Consensus 495 ~-~~~ga~~--------~~~~~ag~ 510 (510) . +.+.+.+ +..+-|-+ T Consensus 621 e~~kaqaE~~k~q~~a~~~~~~a~~ 645 (725) T protein:vir:92 621 ELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0000000 00000000 No 37 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=98.81 E-value=3.8e-08 Score=61.26 Aligned_cols=483 Identities=14% Similarity=0.031 Sum_probs=209.2 Q ss_pred ChhHHHHHHHHHhh-ccchHHHH----HHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 
1 MKSTAAMLWEKLRD-GSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~----e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++...+.+..+.+ ....+.|+ +-.+|..= .=.++..... ...+.+ |+=++...-..+.... .+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G-~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~------~n 89 (714) T protein:vir:32 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG-DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA------KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH------hC Confidence 22222333333332 22333454 44444431 1111111000 000111 3333332222222211 23 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE--EEEeCCCC----cE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.++..+ -.++.+.| +..+......+++..+...++.+.+..|.+. +|.+.+.. ++ T Consensus 90 r~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:32 90 RTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 44455555332110 01233333 3344445567888889999999987766554 55554422 36 Q ss_pred EEEEeceEEEeeCCCC----CeEEEEEEEeecHHHhhHHhhHhh--hh-hh-----------h----------------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEAYKQDL--MR-AG-----------R----------------- 190 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~--~~-~~-----------~----------------- 190 (510) +.+|..++++..++.. .-.-++++.++|.+++...||... .. .. . 
T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:32 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 6778888888765432 122488999999999866655311 00 00 0 Q ss_pred c-------cCCCceEEEEEEEEeecC----------------C-----------------CceEEEE--EEEecCeeecc Q lcl|NC_012418. 191 N-------LSGSGSVDLYTHVQRKKG----------------T-----------------AMEYAEL--YHEIDGVRVGE 228 (510) Q Consensus 191 ~-------~~~~~~v~i~~~v~~~~~----------------~-----------------~~p~~sv--~~e~~~~~~~~ 228 (510) . ......|.|+.|+++... . ..+...+ ++. .|.+++. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~~~L~ 317 (714) T protein:vir:32 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGPHFIV 317 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecCcccc Confidence 0 011245677777764310 0 0011111 222 3334443 Q ss_pred --ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh-hh---c Q lcl|NC_012418. 229 --EGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DY---Q 300 (510) Q Consensus 229 --~~~y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~-~~---~ 300 (510) .+.|++..|||++.-... ..|..| |.+....+-.+.+|+..-..+-+ +..+-.+ +.++++..-+ .+ . T Consensus 318 ~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:32 318 DRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQI 392 (714) T ss_pred cCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHhc Confidence 456776779998654443 456676 58888999999999854443332 3566555 4444543322 22 1 Q ss_pred cCCCceee--cCCcccc---cccccC-cccchHHHHHHHHHHHHHHHHHh-hccccC-CCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 
301 DAEMGDYV--PGGAEAV---RAYERG-DYNKMAAIQQSLQAVVVRLNQAF-MYGANQ-RDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 301 ~~~~g~~~--pg~~~~v---~~~~~~-~~~~~~~~~~~i~~~~~~I~~af-~~~~~~-~~~~~~TAtEi~~r~~E~~~~L 372 (510) ..++|.+. |+..+.. .+++.. ..+-.+...+.++...+.|++.- ..+.++ +.+...+..-|..|.+.-...| T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:32 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTL 472 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHH Confidence 33444442 3322111 112221 12233445556666666665553 222221 2233355566899999988888 Q ss_pred chhHhHHHHH------HHHHHHHHHHH------HHhhcC------CCCCCcc--------cc---cceee---ecHHHHH Q lcl|NC_012418. 373 GGTYSLLAEN------LQSPLAYVCLS------EVDDAL------LQGLITK--------QH---KPAIE---TGLPALS 420 (510) Q Consensus 373 Gpv~~rl~~E------~l~Pli~r~~~------il~~~~------l~~~~~~--------~~---~~~~v---~~is~L~ 420 (510) ...+.+|..- ++.-||...+. |....+ ..++-++ ++ +-.++ .+-++-. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:32 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 8877666532 23333322221 111000 0000001 11 11111 2334444 Q ss_pred HHHHHHHHHHHHHHHHhhcChhh---------HhhccCHHHHHHHHHHHcCCCHhHccCCHHHH-HH-HHHHHHHHHHHH Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQ---------LDPRISLPKMMDTIWAAFSVDTSQFYKSEEEL-QA-EAEQRRQQAAQA 489 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q---------~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev-~~-~r~q~~q~~~~~ 489 (510) |.+.+..++.+ ++.+.+..+ ..+.=+.+++++.+-+.+|.+...=-.++++- ++ .+++.++++.+. 
T Consensus 553 r~~~~~~l~~l---~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:32 553 KAQLAQRMSEV---IQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHH---HhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 55555555544 443333111 11112678899999998886421001122211 11 111100000000 Q ss_pred HH---------hhHH-Hhhhh-hhhhhccc-----CC Q lcl|NC_012418. 490 QA---------AQET-LLEGA-SDMTNALA-----GV 510 (510) Q Consensus 490 ~~---------~~~~-~~~ga-~~~~~~~a-----g~ 510 (510) ++ +++. +.+-+ ..++..-| .- T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:32 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 00000 00000000 00 No 38 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=98.81 E-value=3.8e-08 Score=61.26 Aligned_cols=483 Identities=14% Similarity=0.031 Sum_probs=209.2 Q ss_pred ChhHHHHHHHHHhh-ccchHHHH----HHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLRD-GSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~----e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++...+.+..+.+ ....+.|+ +-.+|..= .=.++..... ...+.+ |+=++...-..+.... 
.+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G-~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~------~n 89 (714) T protein:vir:27 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG-DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA------KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH------hC Confidence 22222333333332 22333454 44444431 1111111000 000111 3333332222222211 23 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE--EEEeCCCC----cE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.++..+ -.++.+.| +..+......+++..+...++.+.+..|.+. +|.+.+.. ++ T Consensus 90 r~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:27 90 RTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 44455555332110 01233333 3344445567888889999999987766554 55554422 36 Q ss_pred EEEEeceEEEeeCCCC----CeEEEEEEEeecHHHhhHHhhHhh--hh-hh-----------h----------------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEAYKQDL--MR-AG-----------R----------------- 190 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~--~~-~~-----------~----------------- 190 (510) +.+|..++++..++.. .-.-++++.++|.+++...||... .. .. . T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:27 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 6778888888765432 122488999999999866655311 00 00 0 Q ss_pred c-------cCCCceEEEEEEEEeecC----------------C-----------------CceEEEE--EEEecCeeecc Q lcl|NC_012418. 
191 N-------LSGSGSVDLYTHVQRKKG----------------T-----------------AMEYAEL--YHEIDGVRVGE 228 (510) Q Consensus 191 ~-------~~~~~~v~i~~~v~~~~~----------------~-----------------~~p~~sv--~~e~~~~~~~~ 228 (510) . ......|.|+.|+++... . ..+...+ ++. .|.+++. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~~~L~ 317 (714) T protein:vir:27 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGPHFIV 317 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecCcccc Confidence 0 011245677777764310 0 0011111 222 3334443 Q ss_pred --ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh-hh---c Q lcl|NC_012418. 229 --EGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DY---Q 300 (510) Q Consensus 229 --~~~y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~-~~---~ 300 (510) .+.|++..|||++.-... ..|..| |.+....+-.+.+|+..-..+-+ +..+-.+ +.++++..-+ .+ . T Consensus 318 ~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:27 318 DRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQI 392 (714) T ss_pred cCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHhc Confidence 456776779998654443 456676 58888999999999854443332 3566555 4444543322 22 1 Q ss_pred cCCCceee--cCCcccc---cccccC-cccchHHHHHHHHHHHHHHHHHh-hccccC-CCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 301 DAEMGDYV--PGGAEAV---RAYERG-DYNKMAAIQQSLQAVVVRLNQAF-MYGANQ-RDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 301 ~~~~g~~~--pg~~~~v---~~~~~~-~~~~~~~~~~~i~~~~~~I~~af-~~~~~~-~~~~~~TAtEi~~r~~E~~~~L 372 (510) ..++|.+. |+..+.. .+++.. 
..+-.+...+.++...+.|++.- ..+.++ +.+...+..-|..|.+.-...| T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:27 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTL 472 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHH Confidence 33444442 3322111 112221 12233445556666666665553 222221 2233355566899999988888 Q ss_pred chhHhHHHHH------HHHHHHHHHHH------HHhhcC------CCCCCcc--------cc---cceee---ecHHHHH Q lcl|NC_012418. 373 GGTYSLLAEN------LQSPLAYVCLS------EVDDAL------LQGLITK--------QH---KPAIE---TGLPALS 420 (510) Q Consensus 373 Gpv~~rl~~E------~l~Pli~r~~~------il~~~~------l~~~~~~--------~~---~~~~v---~~is~L~ 420 (510) ...+.+|..- ++.-||...+. |....+ ..++-++ ++ +-.++ .+-++-. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:27 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 8877666532 23333322221 111000 0000001 11 11111 2334444 Q ss_pred HHHHHHHHHHHHHHHHhhcChhh---------HhhccCHHHHHHHHHHHcCCCHhHccCCHHHH-HH-HHHHHHHHHHHH Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQ---------LDPRISLPKMMDTIWAAFSVDTSQFYKSEEEL-QA-EAEQRRQQAAQA 489 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q---------~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev-~~-~r~q~~q~~~~~ 489 (510) |.+.+..++.+ ++.+.+..+ ..+.=+.+++++.+-+.+|.+...=-.++++- ++ .+++.++++.+. 
T Consensus 553 r~~~~~~l~~l---~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:27 553 KAQLAQRMSEV---IQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHH---HhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 55555555544 443333111 11112678899999998886421001122211 11 111100000000 Q ss_pred HH---------hhHH-Hhhhh-hhhhhccc-----CC Q lcl|NC_012418. 490 QA---------AQET-LLEGA-SDMTNALA-----GV 510 (510) Q Consensus 490 ~~---------~~~~-~~~ga-~~~~~~~a-----g~ 510 (510) ++ +++. +.+-+ ..++..-| .- T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:27 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 00000 00000000 00 No 39 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=98.81 E-value=3.8e-08 Score=61.26 Aligned_cols=483 Identities=14% Similarity=0.031 Sum_probs=209.2 Q ss_pred ChhHHHHHHHHHhh-ccchHHHH----HHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLRD-GSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~----e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++...+.+..+.+ ....+.|+ +-.+|..= .=.++..... ...+.+ |+=++...-..+.... 
.+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G-~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~------~n 89 (714) T protein:vir:99 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG-DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA------KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH------hC Confidence 22222333333332 22333454 44444431 1111111000 000111 3333332222222211 23 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE--EEEeCCCC----cE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.++..+ -.++.+.| +..+......+++..+...++.+.+..|.+. +|.+.+.. ++ T Consensus 90 r~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:99 90 RTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 44455555332110 01233333 3344445567888889999999987766554 55554422 36 Q ss_pred EEEEeceEEEeeCCCC----CeEEEEEEEeecHHHhhHHhhHhh--hh-hh-----------h----------------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEAYKQDL--MR-AG-----------R----------------- 190 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~--~~-~~-----------~----------------- 190 (510) +.+|..++++..++.. .-.-++++.++|.+++...||... .. .. . T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:99 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 6778888888765432 122488999999999866655311 00 00 0 Q ss_pred c-------cCCCceEEEEEEEEeecC----------------C-----------------CceEEEE--EEEecCeeecc Q lcl|NC_012418. 
191 N-------LSGSGSVDLYTHVQRKKG----------------T-----------------AMEYAEL--YHEIDGVRVGE 228 (510) Q Consensus 191 ~-------~~~~~~v~i~~~v~~~~~----------------~-----------------~~p~~sv--~~e~~~~~~~~ 228 (510) . ......|.|+.|+++... . ..+...+ ++. .|.+++. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~~~L~ 317 (714) T protein:vir:99 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGPHFIV 317 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecCcccc Confidence 0 011245677777764310 0 0011111 222 3334443 Q ss_pred --ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh-hh---c Q lcl|NC_012418. 229 --EGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DY---Q 300 (510) Q Consensus 229 --~~~y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~-~~---~ 300 (510) .+.|++..|||++.-... ..|..| |.+....+-.+.+|+..-..+-+ +..+-.+ +.++++..-+ .+ . T Consensus 318 ~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:99 318 DRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQI 392 (714) T ss_pred cCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHhc Confidence 456776779998654443 456676 58888999999999854443332 3566555 4444543322 22 1 Q ss_pred cCCCceee--cCCcccc---cccccC-cccchHHHHHHHHHHHHHHHHHh-hccccC-CCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 301 DAEMGDYV--PGGAEAV---RAYERG-DYNKMAAIQQSLQAVVVRLNQAF-MYGANQ-RDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 301 ~~~~g~~~--pg~~~~v---~~~~~~-~~~~~~~~~~~i~~~~~~I~~af-~~~~~~-~~~~~~TAtEi~~r~~E~~~~L 372 (510) ..++|.+. |+..+.. .+++.. 
..+-.+...+.++...+.|++.- ..+.++ +.+...+..-|..|.+.-...| T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:99 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTL 472 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHH Confidence 33444442 3322111 112221 12233445556666666665553 222221 2233355566899999988888 Q ss_pred chhHhHHHHH------HHHHHHHHHHH------HHhhcC------CCCCCcc--------cc---cceee---ecHHHHH Q lcl|NC_012418. 373 GGTYSLLAEN------LQSPLAYVCLS------EVDDAL------LQGLITK--------QH---KPAIE---TGLPALS 420 (510) Q Consensus 373 Gpv~~rl~~E------~l~Pli~r~~~------il~~~~------l~~~~~~--------~~---~~~~v---~~is~L~ 420 (510) ...+.+|..- ++.-||...+. |....+ ..++-++ ++ +-.++ .+-++-. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:99 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 8877666532 23333322221 111000 0000001 11 11111 2334444 Q ss_pred HHHHHHHHHHHHHHHHhhcChhh---------HhhccCHHHHHHHHHHHcCCCHhHccCCHHHH-HH-HHHHHHHHHHHH Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQ---------LDPRISLPKMMDTIWAAFSVDTSQFYKSEEEL-QA-EAEQRRQQAAQA 489 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q---------~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev-~~-~r~q~~q~~~~~ 489 (510) |.+.+..++.+ ++.+.+..+ ..+.=+.+++++.+-+.+|.+...=-.++++- ++ .+++.++++.+. 
T Consensus 553 r~~~~~~l~~l---~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:99 553 KAQLAQRMSEV---IQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHH---HhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 55555555544 443333111 11112678899999998886421001122211 11 111100000000 Q ss_pred HH---------hhHH-Hhhhh-hhhhhccc-----CC Q lcl|NC_012418. 490 QA---------AQET-LLEGA-SDMTNALA-----GV 510 (510) Q Consensus 490 ~~---------~~~~-~~~ga-~~~~~~~a-----g~ 510 (510) ++ +++. +.+-+ ..++..-| .- T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:99 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 00000 00000000 00 No 40 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=98.81 E-value=3.8e-08 Score=61.26 Aligned_cols=483 Identities=14% Similarity=0.031 Sum_probs=209.2 Q ss_pred ChhHHHHHHHHHhh-ccchHHHH----HHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLRD-GSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~----e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++...+.+..+.+ ....+.|+ +-.+|..= .=.++..... ...+.+ |+=++...-..+.... 
.+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G-~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~------~n 89 (714) T protein:vir:81 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG-DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA------KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH------hC Confidence 22222333333332 22333454 44444431 1111111000 000111 3333332222222211 23 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE--EEEeCCCC----cE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.++..+ -.++.+.| +..+......+++..+...++.+.+..|.+. +|.+.+.. ++ T Consensus 90 r~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:81 90 RTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 44455555332110 01233333 3344445567888889999999987766554 55554422 36 Q ss_pred EEEEeceEEEeeCCCC----CeEEEEEEEeecHHHhhHHhhHhh--hh-hh-----------h----------------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEAYKQDL--MR-AG-----------R----------------- 190 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~--~~-~~-----------~----------------- 190 (510) +.+|..++++..++.. .-.-++++.++|.+++...||... .. .. . T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:81 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 6778888888765432 122488999999999866655311 00 00 0 Q ss_pred c-------cCCCceEEEEEEEEeecC----------------C-----------------CceEEEE--EEEecCeeecc Q lcl|NC_012418. 
191 N-------LSGSGSVDLYTHVQRKKG----------------T-----------------AMEYAEL--YHEIDGVRVGE 228 (510) Q Consensus 191 ~-------~~~~~~v~i~~~v~~~~~----------------~-----------------~~p~~sv--~~e~~~~~~~~ 228 (510) . ......|.|+.|+++... . ..+...+ ++. .|.+++. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~~~L~ 317 (714) T protein:vir:81 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGPHFIV 317 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecCcccc Confidence 0 011245677777764310 0 0011111 222 3334443 Q ss_pred --ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh-hh---c Q lcl|NC_012418. 229 --EGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DY---Q 300 (510) Q Consensus 229 --~~~y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~-~~---~ 300 (510) .+.|++..|||++.-... ..|..| |.+....+-.+.+|+..-..+-+ +..+-.+ +.++++..-+ .+ . T Consensus 318 ~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:81 318 DRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQI 392 (714) T ss_pred cCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHhc Confidence 456776779998654443 456676 58888999999999854443332 3566555 4444543322 22 1 Q ss_pred cCCCceee--cCCcccc---cccccC-cccchHHHHHHHHHHHHHHHHHh-hccccC-CCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 301 DAEMGDYV--PGGAEAV---RAYERG-DYNKMAAIQQSLQAVVVRLNQAF-MYGANQ-RDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 301 ~~~~g~~~--pg~~~~v---~~~~~~-~~~~~~~~~~~i~~~~~~I~~af-~~~~~~-~~~~~~TAtEi~~r~~E~~~~L 372 (510) ..++|.+. |+..+.. .+++.. 
..+-.+...+.++...+.|++.- ..+.++ +.+...+..-|..|.+.-...| T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:81 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTL 472 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHH Confidence 33444442 3322111 112221 12233445556666666665553 222221 2233355566899999988888 Q ss_pred chhHhHHHHH------HHHHHHHHHHH------HHhhcC------CCCCCcc--------cc---cceee---ecHHHHH Q lcl|NC_012418. 373 GGTYSLLAEN------LQSPLAYVCLS------EVDDAL------LQGLITK--------QH---KPAIE---TGLPALS 420 (510) Q Consensus 373 Gpv~~rl~~E------~l~Pli~r~~~------il~~~~------l~~~~~~--------~~---~~~~v---~~is~L~ 420 (510) ...+.+|..- ++.-||...+. |....+ ..++-++ ++ +-.++ .+-++-. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:81 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 8877666532 23333322221 111000 0000001 11 11111 2334444 Q ss_pred HHHHHHHHHHHHHHHHhhcChhh---------HhhccCHHHHHHHHHHHcCCCHhHccCCHHHH-HH-HHHHHHHHHHHH Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQ---------LDPRISLPKMMDTIWAAFSVDTSQFYKSEEEL-QA-EAEQRRQQAAQA 489 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q---------~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev-~~-~r~q~~q~~~~~ 489 (510) |.+.+..++.+ ++.+.+..+ ..+.=+.+++++.+-+.+|.+...=-.++++- ++ .+++.++++.+. 
T Consensus 553 r~~~~~~l~~l---~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:81 553 KAQLAQRMSEV---IQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHH---HhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 55555555544 443333111 11112678899999998886421001122211 11 111100000000 Q ss_pred HH---------hhHH-Hhhhh-hhhhhccc-----CC Q lcl|NC_012418. 490 QA---------AQET-LLEGA-SDMTNALA-----GV 510 (510) Q Consensus 490 ~~---------~~~~-~~~ga-~~~~~~~a-----g~ 510 (510) ++ +++. +.+-+ ..++..-| .- T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:81 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 00000 00000000 00 No 41 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=98.81 E-value=3.8e-08 Score=61.26 Aligned_cols=483 Identities=14% Similarity=0.031 Sum_probs=209.2 Q ss_pred ChhHHHHHHHHHhh-ccchHHHH----HHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLRD-GSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~----e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++...+.+..+.+ ....+.|+ +-.+|..= .=.++..... ...+.+ |+=++...-..+.... 
.+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G-~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~------~n 89 (714) T protein:vir:10 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG-DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA------KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH------hC Confidence 22222333333332 22333454 44444431 1111111000 000111 3333332222222211 23 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE--EEEeCCCC----cE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.++..+ -.++.+.| +..+......+++..+...++.+.+..|.+. +|.+.+.. ++ T Consensus 90 r~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:10 90 RTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 44455555332110 01233333 3344445567888889999999987766554 55554422 36 Q ss_pred EEEEeceEEEeeCCCC----CeEEEEEEEeecHHHhhHHhhHhh--hh-hh-----------h----------------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEAYKQDL--MR-AG-----------R----------------- 190 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~--~~-~~-----------~----------------- 190 (510) +.+|..++++..++.. .-.-++++.++|.+++...||... .. .. . T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:10 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 6778888888765432 122488999999999866655311 00 00 0 Q ss_pred c-------cCCCceEEEEEEEEeecC----------------C-----------------CceEEEE--EEEecCeeecc Q lcl|NC_012418. 
191 N-------LSGSGSVDLYTHVQRKKG----------------T-----------------AMEYAEL--YHEIDGVRVGE 228 (510) Q Consensus 191 ~-------~~~~~~v~i~~~v~~~~~----------------~-----------------~~p~~sv--~~e~~~~~~~~ 228 (510) . ......|.|+.|+++... . ..+...+ ++. .|.+++. T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~~~L~ 317 (714) T protein:vir:10 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGPHFIV 317 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecCcccc Confidence 0 011245677777764310 0 0011111 222 3334443 Q ss_pred --ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh-hh---c Q lcl|NC_012418. 229 --EGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DY---Q 300 (510) Q Consensus 229 --~~~y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~-~~---~ 300 (510) .+.|++..|||++.-... ..|..| |.+....+-.+.+|+..-..+-+ +..+-.+ +.++++..-+ .+ . T Consensus 318 ~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:10 318 DRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQI 392 (714) T ss_pred cCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHhc Confidence 456776779998654443 456676 58888999999999854443332 3566555 4444543322 22 1 Q ss_pred cCCCceee--cCCcccc---cccccC-cccchHHHHHHHHHHHHHHHHHh-hccccC-CCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 301 DAEMGDYV--PGGAEAV---RAYERG-DYNKMAAIQQSLQAVVVRLNQAF-MYGANQ-RDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 301 ~~~~g~~~--pg~~~~v---~~~~~~-~~~~~~~~~~~i~~~~~~I~~af-~~~~~~-~~~~~~TAtEi~~r~~E~~~~L 372 (510) ..++|.+. |+..+.. .+++.. 
..+-.+...+.++...+.|++.- ..+.++ +.+...+..-|..|.+.-...| T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:10 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTL 472 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHH Confidence 33444442 3322111 112221 12233445556666666665553 222221 2233355566899999988888 Q ss_pred chhHhHHHHH------HHHHHHHHHHH------HHhhcC------CCCCCcc--------cc---cceee---ecHHHHH Q lcl|NC_012418. 373 GGTYSLLAEN------LQSPLAYVCLS------EVDDAL------LQGLITK--------QH---KPAIE---TGLPALS 420 (510) Q Consensus 373 Gpv~~rl~~E------~l~Pli~r~~~------il~~~~------l~~~~~~--------~~---~~~~v---~~is~L~ 420 (510) ...+.+|..- ++.-||...+. |....+ ..++-++ ++ +-.++ .+-++-. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:10 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 8877666532 23333322221 111000 0000001 11 11111 2334444 Q ss_pred HHHHHHHHHHHHHHHHhhcChhh---------HhhccCHHHHHHHHHHHcCCCHhHccCCHHHH-HH-HHHHHHHHHHHH Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQ---------LDPRISLPKMMDTIWAAFSVDTSQFYKSEEEL-QA-EAEQRRQQAAQA 489 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q---------~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev-~~-~r~q~~q~~~~~ 489 (510) |.+.+..++.+ ++.+.+..+ ..+.=+.+++++.+-+.+|.+...=-.++++- ++ .+++.++++.+. 
T Consensus 553 r~~~~~~l~~l---~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:10 553 KAQLAQRMSEV---IQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHH---HhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 55555555544 443333111 11112678899999998886421001122211 11 111100000000 Q ss_pred HH---------hhHH-Hhhhh-hhhhhccc-----CC Q lcl|NC_012418. 490 QA---------AQET-LLEGA-SDMTNALA-----GV 510 (510) Q Consensus 490 ~~---------~~~~-~~~ga-~~~~~~~a-----g~ 510 (510) ++ +++. +.+-+ ..++..-| .- T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:10 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 00000 00000000 00 No 42 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.81 E-value=3.8e-08 Score=61.24 Aligned_cols=490 Identities=9% Similarity=-0.025 Sum_probs=209.4 Q ss_pred ChhHHHHHHHHHhh-ccchHHHHHH----HHHhcccccCCCCCCccccc-cc-cccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLRD-GSVEQRAIEF----AKTTLPYLMVDPMSGSRGVV-EH-DFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lkr-~~~~~~w~e~----~~~~lP~~~~~~~~~~~~~~-~~-~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) =|+++....+.+++ -.+.+.|++- .+|..= .=.++.+....+. .+ .|+-++. .++...+.-- .+++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G-~QW~~~~~~~l~~q~rp~~N~i~~-~v~~v~g~e~-----~nr~ 76 (725) T protein:vir:10 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRV-SQWDDWLSQYTTLQYRGQFDVVRP-VVRKLVSEMR-----QNPI 76 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcccchHH-HHHHHHhhHH-----hCCc Confidence 23333333333321 1233444332 233221 1111111100000 01 2333332 2222111111 1445 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE-----EEeCCCC----c Q lcl|NC_012418. 
74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL-----YRNSDEA----T 144 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l-----~~~~~~~----~ 144 (510) =+++.+.++... ++.+.|+.+ +......++..-+-..+|.+.+..|.+.+ |.+++.. . T Consensus 77 d~~v~p~~~~d~----------~~Ae~l~~~---~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~ 143 (725) T protein:vir:10 77 DVLYRPKDGASP----------DAADVLMGM---YRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) T ss_pred ceEEecCCcchH----------HHHHHHHHH---HHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCcee Confidence 455555443221 233333333 33335578888888999999888887764 3333321 2 Q ss_pred EEEE----EeceEEEeeCC---CCC-eEEEEEEEeecHH---HhhHHhhHhhhhhhh----c-----cCCCceEEEEEEE Q lcl|NC_012418. 145 VVAW----SLRSYAVRRDA---TGR-WMDIVLKQRYKSK---DLDEAYKQDLMRAGR----N-----LSGSGSVDLYTHV 204 (510) Q Consensus 145 ~~~~----pl~~~~i~~d~---~G~-vd~i~r~~~~t~~---~l~~~~~~~~~~~~~----~-----~~~~~~v~i~~~v 204 (510) ++.+ |..++++..++ ++. -.-+||..+|+.. ++++.++.+...-.. . -...+.|.|+.++ T Consensus 144 i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~ 223 (725) T protein:vir:10 144 IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEE Confidence 3344 34455555443 221 2236678888854 344444432211000 0 0012345555554 Q ss_pred EeecC------------C----------------------------CceEEEEEEE-ecCeeeccc-cccccccCceEEE Q lcl|NC_012418. 205 QRKKG------------T----------------------------AMEYAELYHE-IDGVRVGEE-GRWPIHLCPYIVP 242 (510) Q Consensus 205 ~~~~~------------~----------------------------~~p~~sv~~e-~~~~~~~~~-~~y~~~~~P~~~~ 242 (510) ++.+. + ......|||. ..|..++.. +-|+.+.|||++. 
T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~ 303 (725) T protein:vir:10 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEE Confidence 43310 0 0122345554 345555433 3455556899965 Q ss_pred eee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCceee---------cCC Q lcl|NC_012418. 243 TWN--LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYV---------PGG 311 (510) Q Consensus 243 Rw~--~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~---------pg~ 311 (510) -.. ..+|..|+.|.+....+-.+.+|+.....+..+..+.+.++++..+.+-.-......++++.+ +|. T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:10 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred EeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcc Confidence 323 358899999999999999999999999999998888888888876544322222222222211 111 Q ss_pred cccccccccCcccchHHHHHHHHHHHHHHHHHhh-cccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHH------HH Q lcl|NC_012418. 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAE------NL 383 (510) Q Consensus 312 ~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~-~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~------E~ 383 (510) .....+.......-.+.....++..++.|++.-= .+.+ .+.+..++.--|..|.+.-...|...+.++.. 
++ T Consensus 384 ~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~ 463 (725) T protein:vir:10 384 MPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) T ss_pred cccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111111222234566777777888877652 2222 22222345566888888888888777766643 34 Q ss_pred HHHHHHHHHH------HHhhcCCC-----C-CC--cc--------cc--cceeeecHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 384 QSPLAYVCLS------EVDDALLQ-----G-LI--TK--------QH--KPAIETGLPALSRSAAVQSMLNASQVIAGLA 439 (510) Q Consensus 384 l~Pli~r~~~------il~~~~l~-----~-~~--~~--------~~--~~~~v~~is~L~raq~~~~~~~~~q~l~~~~ 439 (510) +.-||...+. |+...|-. . .. +. ++ +-.++--..|-.-+++.+.+..+.+++..+. T Consensus 464 lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~ 543 (725) T protein:vir:10 464 YQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTP 543 (725) T ss_pred HHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhcc Confidence 4444444432 22111110 0 00 00 11 1111222233333334444444445555554 Q ss_pred Chhh-----Hhhcc------CHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHH--HHHHHHhhHHH---------- Q lcl|NC_012418. 440 PIAQ-----LDPRI------SLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQ--AAQAQAAQETL---------- 496 (510) Q Consensus 440 ~~~q-----~~~~i------d~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~--~~~~~~~~~~~---------- 496 (510) +... +...+ ..+++++.+....+.....=-.++++.+++.++++++ ++.+++.++.. T Consensus 544 ~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ 623 (725) T protein:vir:10 544 QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELA 623 (725) T ss_pred ccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 3211 11111 3345555555444321000011222222221111110 00000000000 Q ss_pred hhhhh----hhhhcccCC Q lcl|NC_012418. 
497 LEGAS----DMTNALAGV 510 (510) Q Consensus 497 ~~ga~----~~~~~~ag~ 510 (510) .+.+. +.+..-+.+ T Consensus 624 ka~aE~~k~~~~a~~~~~ 641 (725) T protein:vir:10 624 KAQNQTLSLQIDAAKVEA 641 (725) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 00000 000000000 No 43 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=98.72 E-value=8.3e-08 Score=59.41 Aligned_cols=481 Identities=13% Similarity=0.078 Sum_probs=211.7 Q ss_pred ChhHHHHHHHHH----hh-----ccchHHHHHHHHHh-cccccCCCCCCc----ccc---ccc-cccchHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKL----RD-----GSVEQRAIEFAKTT-LPYLMVDPMSGS----RGV---VEH-DFQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~~~~~~~r~~~l----kr-----~~~~~~w~e~~~~~-lP~~~~~~~~~~----~~~---~~~-~~dstg~~a~~~LAa~ 62 (510) |=+++.+..+++ ++ +.|...|++=.+|. .+..=.++.+.. ..+ .+. .|+=++...-..+... T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 655555554444 22 12223333322221 111111111100 000 011 1343333322222221 Q ss_pred HHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE-----E Q lcl|NC_012418. 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL-----Y 137 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l-----~ 137 (510) . 
.+++=+++.+.++.- + .++.+.|+ ..+......++...+-..+|.+.+..|.+.+ | T Consensus 81 ~------~nr~d~~v~p~~~~~------d---~~~Ae~l~---~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~ 142 (708) T protein:vir:17 81 R------NNRITVKFRPGDREA------S---EELANKLN---GLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSML 142 (708) T ss_pred h------hCCcceEEecCCCcc------h---HHHHHHHH---HHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecc Confidence 1 234445555543210 0 12333333 3344445678888999999999998887764 3 Q ss_pred EeCC-----CC--cEEEE--EeceEEEeeCC---CCC-eEEEEEEEeecHHHhhHHhhHhhhh-----hhhccC----CC Q lcl|NC_012418. 138 RNSD-----EA--TVVAW--SLRSYAVRRDA---TGR-WMDIVLKQRYKSKDLDEAYKQDLMR-----AGRNLS----GS 195 (510) Q Consensus 138 ~~~~-----~~--~~~~~--pl~~~~i~~d~---~G~-vd~i~r~~~~t~~~l~~~~~~~~~~-----~~~~~~----~~ 195 (510) .+++ .. .++++ |..++++.-++ ++. -.-+||..+|+.+++-..|+..... ...... .. T Consensus 143 ~~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~ 222 (708) T protein:vir:17 143 VNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDA 222 (708) T ss_pred cccCCCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCC Confidence 2221 11 23332 55677766554 322 1227889999999987777643211 000000 11 Q ss_pred ceEEEEEEEEeec-------------C--------------------C-------CceEEEEEEE-ecCeeecc-ccccc Q lcl|NC_012418. 196 GSVDLYTHVQRKK-------------G--------------------T-------AMEYAELYHE-IDGVRVGE-EGRWP 233 (510) Q Consensus 196 ~~v~i~~~v~~~~-------------~--------------------~-------~~p~~sv~~e-~~~~~~~~-~~~y~ 233 (510) +.|-|..+++++. + + ....+.||+. ..|..++. 
.+-|+ T Consensus 223 d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p 302 (708) T protein:vir:17 223 DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIP 302 (708) T ss_pred CeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCC Confidence 2333333222211 0 0 0122233433 35555543 34466 Q ss_pred cccCceEEE---eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhh----------- Q lcl|NC_012418. 234 IHLCPYIVP---TWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY----------- 299 (510) Q Consensus 234 ~~~~P~~~~---Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~----------- 299 (510) +..|||++. ||.. +|...-.|.+..+.+-.+.+|+.....+..+.++.+-+++++.+.+-..... T Consensus 303 ~~~fP~vP~~g~r~~~-d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~ 381 (708) T protein:vir:17 303 GEHIPLIPVYGKRWFI-DDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAF 381 (708) T ss_pred CCccceEEEecccccc-cCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhh Confidence 777898865 4544 5666556899999999999999999999988888888888876433111100 Q ss_pred -----ccCCCceeecCCcc--cccccccCcccchHHHHHHHHHHHHHHHHHh-hccccCCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_012418. 300 -----QDAEMGDYVPGGAE--AVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGANQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 300 -----~~~~~g~~~pg~~~--~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~~TAtEi~~r~~E~~~~ 371 (510) ...+.|.+++|..- -+.+.++ .+.....++...+.|.++- ..+.++....+++.--|..|.+.-... 
T Consensus 382 ~~~~~~~~~~g~v~~~a~~~~~~~~~~~-----~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~ 456 (708) T protein:vir:17 382 LPLREVRDKYGNIIAGATPAGYTQPAVM-----NQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMA 456 (708) T ss_pred hhhhccCCcccccccccCCcccCCCccc-----cHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHH Confidence 01122333333221 1122121 1334455555555555553 122222222334556678888888888 Q ss_pred hchhHhHHH------HHHHHHHHHHHHH------HHhhcCC-----------CCCCcc-----cccc---eeee---cHH Q lcl|NC_012418. 372 LGGTYSLLA------ENLQSPLAYVCLS------EVDDALL-----------QGLITK-----QHKP---AIET---GLP 417 (510) Q Consensus 372 LGpv~~rl~------~E~l~Pli~r~~~------il~~~~l-----------~~~~~~-----~~~~---~~v~---~is 417 (510) +.-.+.++. -+++.-||...+. |+...|- .+.+.. ++.. .++- +-+ T Consensus 457 ~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~ 536 (708) T protein:vir:17 457 SFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCc Confidence 887777655 5666666666652 2221111 011111 1111 1111 223 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcChh----hHhh-------ccCHHHHHHHHHHHcCCCHhHccC--CHHHHHHHHHHHHH Q lcl|NC_012418. 418 ALSRSAAVQSMLNASQVIAGLAPIA----QLDP-------RISLPKMMDTIWAAFSVDTSQFYK--SEEELQAEAEQRRQ 484 (510) Q Consensus 418 ~L~raq~~~~~~~~~q~l~~~~~~~----q~~~-------~id~d~~~~~~a~~~Gvp~~~i~r--s~~ev~~~r~q~~q 484 (510) +-.|.+..+.++ +++..+.+.. .+.+ .-+.++++..+...++... ... .+++.++..++++. T Consensus 537 ~t~r~~~~~~l~---qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~--~~~~~~~e~~q~~~q~qq~ 611 (708) T protein:vir:17 537 TARRDATVSVLT---NVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISG--IAKPRNEKEQQIVQQAQMA 611 (708) T ss_pred hhHHHHHHHHHH---HHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccc--cccCcchhhHHHHHHHHHH Confidence 333444444444 4555444321 1111 1255777777777665432 222 22222221111110 Q ss_pred HHHHHHHhh--HHHhhhhhh----hhh---ccc---CC Q lcl|NC_012418. 
485 QAAQAQAAQ--ETLLEGASD----MTN---ALA---GV 510 (510) Q Consensus 485 ~~~~~~~~~--~~~~~ga~~----~~~---~~a---g~ 510 (510) ++++++++. +-++....+ ... ..+ ++ T Consensus 612 ~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~ 649 (708) T protein:vir:17 612 AQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAF 649 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 000000000 000 000 00 No 44 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=98.63 E-value=1.7e-07 Score=57.71 Aligned_cols=483 Identities=14% Similarity=0.044 Sum_probs=208.3 Q ss_pred ChhHHHHHHHHHh-hccchHHHH----HHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~----e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) ........|.++. ...+.+.|+ +-.+|..= .=.++..... ...+.+ |+=++.. ++...+..- .+ T Consensus 17 ~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G-~Qw~~~~~~~l~~~g~p~~~~N~i~~~-v~~v~g~~~-----~n 89 (714) T protein:vir:10 17 TPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDG-DQLAPEVIQVLKDRGQPMTIHNLIAPT-VDGVLGMEA-----KT 89 (714) T ss_pred hhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcEEeccHHHH-HHHHHHHHH-----hC Confidence 2222223333322 222344564 33444321 1011110000 000111 3333322 222222111 13 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCC--C--cE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE--A--TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~--~--~~ 145 (510) ++=+++.+.++..+ -.++. +.++..+......++...+...+|.+-+..|.+.+ +.+.+. 
+ ++ T Consensus 90 r~~~~v~pr~~~~~--------~~~~A---e~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~~i~i 158 (714) T protein:vir:10 90 RTDLIVMSDDPNDE--------TEKLA---EAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGPEFKV 158 (714) T ss_pred CcceEEecCCCChh--------hHHHH---HHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCCCeEE Confidence 44445555332211 01122 23344455556788888899999999887776554 565442 2 35 Q ss_pred EEEEeceEEEeeCCCC----CeEEEEEEEeecHHHhhHHhhHhh--hh-hh----------------------------h Q lcl|NC_012418. 146 VAWSLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEAYKQDL--MR-AG----------------------------R 190 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~--~~-~~----------------------------~ 190 (510) +.+|..++++..++.. .-.-++++.+|+.+++...|+... .. .. . T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:10 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhccc Confidence 6677788888765432 122378889999999876665311 00 00 0 Q ss_pred c-------cCCCceEEEEEEEEeecC----------C------------------C-----ceEEEEEEE-ecCeeeccc Q lcl|NC_012418. 191 N-------LSGSGSVDLYTHVQRKKG----------T------------------A-----MEYAELYHE-IDGVRVGEE 229 (510) Q Consensus 191 ~-------~~~~~~v~i~~~v~~~~~----------~------------------~-----~p~~sv~~e-~~~~~~~~~ 229 (510) . ......|.|+.|+++... . . .....||+. ..|.+++.+ T Consensus 239 ~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~ 318 (714) T protein:vir:10 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVD 318 (714) T ss_pred ccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhc Confidence 0 011245777777654321 0 0 011122221 244444433 Q ss_pred --cccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch-hhhc---c Q lcl|NC_012418. 
230 --GRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV-DDYQ---D 301 (510) Q Consensus 230 --~~y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p-~~~~---~ 301 (510) +-|++..|||++.-... ..|..| |.+....+-.+.+|+..-..+.+ +..+-. ++.++++..- +.+. . T Consensus 319 ~~~p~p~~~fp~vP~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~-~~~~gav~~~d~~~~e~~~ 393 (714) T protein:vir:10 319 RPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRV-IMDEDATQLSDNDLMEQLE 393 (714) T ss_pred CCCCCCCCceeeEEecceeeeccCccc--eehhhhhhHHHHHHHHHHHHHHH--HhCCce-eeccccccccHHHHHHhcc Confidence 45777778998653333 445555 67888889999999865443332 345544 4445554332 2222 2 Q ss_pred CCCceeec--CCc---ccccccccCccc-chHHHHHHHHHHHHHHHHHh-hccccC-CCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 302 AEMGDYVP--GGA---EAVRAYERGDYN-KMAAIQQSLQAVVVRLNQAF-MYGANQ-RDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 302 ~~~g~~~p--g~~---~~v~~~~~~~~~-~~~~~~~~i~~~~~~I~~af-~~~~~~-~~~~~~TAtEi~~r~~E~~~~LG 373 (510) .++|++.. +.. +...+++..... -.+.....++...+.|++.- ..+.++ ..+...+..-|..|.+.-...|. T Consensus 394 rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~ 473 (714) T protein:vir:10 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLA 473 (714) T ss_pred CCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHH Confidence 34454432 211 111122222212 23445566666666666653 222211 23333566678899998888888 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhh----cCCCCCCc---------------c--------cc---cceee---ecHHHHH Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDD----ALLQGLIT---------------K--------QH---KPAIE---TGLPALS 420 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~----~~l~~~~~---------------~--------~~---~~~~v---~~is~L~ 420 (510) ..+.++..- ..=+.+.++.++.. ..+.-+.. + ++ +-.++ .+-++-. 
T Consensus 474 ~~~dnl~~~-~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~ 552 (714) T protein:vir:10 474 EINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHH Confidence 888877652 22223333333211 11111100 0 01 11111 1233334 Q ss_pred HHHHHHHHHHHHHHHHhhcChhh---------HhhccCHHHHHHHHHHHcCCCHh-HccCCHHH-HHHHHHHHHHHHHHH Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQ---------LDPRISLPKMMDTIWAAFSVDTS-QFYKSEEE-LQAEAEQRRQQAAQA 489 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q---------~~~~id~d~~~~~~a~~~Gvp~~-~i~rs~~e-v~~~r~q~~q~~~~~ 489 (510) |.+.+..+. +++..+.+..+ ..+.-+.+++++.+-+.+|.+.. .-...+++ .++.+++.++++.+. T Consensus 553 r~~~~~~l~---ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~~~~~~~~q~~l 629 (714) T protein:vir:10 553 KAQLAQRMS---EVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHH---HHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHHHHHHHHHHHHH Confidence 444444444 44444332111 11112677899999999987521 01111111 111111111000000 Q ss_pred HH---------hhHH-Hhhhhh-hhhhccc-----CC Q lcl|NC_012418. 490 QA---------AQET-LLEGAS-DMTNALA-----GV 510 (510) Q Consensus 490 ~~---------~~~~-~~~ga~-~~~~~~a-----g~ 510 (510) ++ +++. 
+.+-+- .++.+-| .- T Consensus 630 ~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~ 666 (714) T protein:vir:10 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 000000 0000000 00 No 45 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.60 E-value=2.2e-07 Score=57.09 Aligned_cols=487 Identities=12% Similarity=0.014 Sum_probs=202.3 Q ss_pred ChhHHHHHHHHHh-----hccchHHHHHHH----HHhc-ccccCCCCCCc----cc---ccccc-ccchHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKLR-----DGSVEQRAIEFA----KTTL-PYLMVDPMSGS----RG---VVEHD-FQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~~~~~~~r~~~lk-----r~~~~~~w~e~~----~~~l-P~~~~~~~~~~----~~---~~~~~-~dstg~~a~~~LAa~ 62 (510) |=++...++++++ ...|.+.|+.-+ +|.. +..=.++.+.. .. ..+.+ |+=++. .++...+. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~-~v~~v~g~ 79 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVAT-ELNRIISE 79 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHH-HHHHHhhH Confidence 7766666665543 223445554333 3332 22111111110 00 01122 233332 22222222 Q ss_pred HHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE-----E Q lcl|NC_012418. 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL-----Y 137 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l-----~ 137 (510) .-. +++=+++.|.++.. 
+ .++.+.| +..+......++...+...++.+.+..|.+.+ | T Consensus 80 ~~~-----nr~~~~v~P~~~~~------d---~~~Ae~l---~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~ 142 (706) T protein:vir:10 80 YRN-----NRISVKFRPGDNAA------S---EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSF 142 (706) T ss_pred HHh-----CCCceEEecCCCCc------h---HHHHHHH---HHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeecc Confidence 211 33335555432210 0 1122333 33344445688999999999999988887764 2 Q ss_pred EeCC---CC--cEEE----EEeceEEEeeC---CCCC-eEEEEEEEeecHHHhhHHhhHhhhhh---hh--------cc- Q lcl|NC_012418. 138 RNSD---EA--TVVA----WSLRSYAVRRD---ATGR-WMDIVLKQRYKSKDLDEAYKQDLMRA---GR--------NL- 192 (510) Q Consensus 138 ~~~~---~~--~~~~----~pl~~~~i~~d---~~G~-vd~i~r~~~~t~~~l~~~~~~~~~~~---~~--------~~- 192 (510) .+++ .. .+.. .|+.++++.-+ .++. -.-++|..+|+.+++-..|+.....- .. .. T Consensus 143 ~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d 222 (706) T protein:vir:10 143 VNEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPD 222 (706) T ss_pred ccccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCC Confidence 2211 11 2221 36677776654 3333 22488999999999877776432110 00 00 Q ss_pred -----CCCc--eEEEEEEEEeec-----------------------------CCCceEEEEEEE-ecCeeec-ccccccc Q lcl|NC_012418. 193 -----SGSG--SVDLYTHVQRKK-----------------------------GTAMEYAELYHE-IDGVRVG-EEGRWPI 234 (510) Q Consensus 193 -----~~~~--~v~i~~~v~~~~-----------------------------~~~~p~~sv~~e-~~~~~~~-~~~~y~~ 234 (510) ..++ .+.+..++++++ .+.++...|||. +.|..+. 
..+-|++ T Consensus 223 ~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~ 302 (706) T protein:vir:10 223 VVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPG 302 (706) T ss_pred cceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCC Confidence 0000 111112222221 012233445543 3454444 3356777 Q ss_pred ccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCccc-------chhhh------ Q lcl|NC_012418. 235 HLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGA-------VVDDY------ 299 (510) Q Consensus 235 ~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~-------~p~~~------ 299 (510) +.|||++.-... .++..+..|.+....+-.+.+|+..-.++.....+.+-.+.+.++.+- ++... T Consensus 303 ~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~ 382 (706) T protein:vir:10 303 EHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLP 382 (706) T ss_pred CccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchh Confidence 889999653322 255666778899999999999988777776655544444344322211 01000 Q ss_pred ---ccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHh-hccccCCCCCCCCHHHHHHHHHHHHHHhchh Q lcl|NC_012418. 300 ---QDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGANQRDAERVTAEEVRITAEEAENTLGGT 375 (510) Q Consensus 300 ---~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv 375 (510) ....+|.+++... ..+.+ ....-.+...+.++...+.|.+.- ..+.+.....+++.--|..|.+.-...+... 
T Consensus 383 ~~~~~~~~g~i~~~~~-~~~~~--~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~ 459 (706) T protein:vir:10 383 LRTVTDKTGNVVAPAN-VAGYT--QAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIY 459 (706) T ss_pred cccccCCCCccccccc-ccccC--CCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHH Confidence 0011233222110 00111 111112234455555566665553 2222211122356677888888888888877 Q ss_pred HhHH------HHHHHHHHHHHHH------HHHhhcCCCC--------CCcc--------cccc---eeeecHHHHHHHHH Q lcl|NC_012418. 376 YSLL------AENLQSPLAYVCL------SEVDDALLQG--------LITK--------QHKP---AIETGLPALSRSAA 424 (510) Q Consensus 376 ~~rl------~~E~l~Pli~r~~------~il~~~~l~~--------~~~~--------~~~~---~~v~~is~L~raq~ 424 (510) +.++ .-+++.-||...+ .|+...+-.. ..+. +++. .++--..|-.-+++ T Consensus 460 ~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r 539 (706) T protein:vir:10 460 LDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARR 539 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHH Confidence 7544 3444555554333 2332211110 0011 1111 11111122222333 Q ss_pred HHHHHHHHHHHHhhcChhh----Hhhc----c---CHHHHHHHHHHHcCCCHhHccCCHH-HHHHHHHHHHHHHH---HH Q lcl|NC_012418. 425 VQSMLNASQVIAGLAPIAQ----LDPR----I---SLPKMMDTIWAAFSVDTSQFYKSEE-ELQAEAEQRRQQAA---QA 489 (510) Q Consensus 425 ~~~~~~~~q~l~~~~~~~q----~~~~----i---d~d~~~~~~a~~~Gvp~~~i~rs~~-ev~~~r~q~~q~~~---~~ 489 (510) .+.+..+.+++..+.+..+ +.+. . +.+++++.+-..++.. ...+... +.+++.++++|+++ +. T Consensus 540 ~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q--~~~~~~~~~eq~~~~q~qq~q~~q~~~ 617 (706) T protein:vir:10 540 DATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQ--GIVKPRNQQEQAIVQQAQQAQATQPDP 617 (706) T ss_pred HHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhccc--CCccccchhHHHHHHHHHHHHHHHHHH Confidence 4444444455555443221 2121 2 4445666665555422 1222222 11222111111111 00 Q ss_pred HHhhHHHhhhhhh------------------hhhcccCC Q lcl|NC_012418. 
490 QAAQETLLEGASD------------------MTNALAGV 510 (510) Q Consensus 490 ~~~~~~~~~ga~~------------------~~~~~ag~ 510 (510) ++..+-++....+ .....|+. T Consensus 618 ~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~ 656 (706) T protein:vir:10 618 NMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAME 656 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000000000 00111111 No 46 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.52 E-value=3.6e-07 Score=55.89 Aligned_cols=483 Identities=13% Similarity=0.063 Sum_probs=204.5 Q ss_pred ChhHHHHHHHHH----h-hccchHHHHHHH----HHh-cccccCCCCCCcc-----cccccc---ccchHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKL----R-DGSVEQRAIEFA----KTT-LPYLMVDPMSGSR-----GVVEHD---FQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~~~~~~~r~~~l----k-r~~~~~~w~e~~----~~~-lP~~~~~~~~~~~-----~~~~~~---~dstg~~a~~~LAa~ 62 (510) |=.++.+++.++ + ...|.+.|+.-+ +|. .+..=.++..... .+-.++ |+-++...-.. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v---- 76 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRI---- 76 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHH---- Confidence 766666666554 2 234666676433 222 1221111111100 001122 44444332222 Q ss_pred HHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--e- Q lcl|NC_012418. 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--N- 139 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~- 139 (510) .+.-- .+++=+++.+.++.- + .++.+.|+. 
.+......++...+-..+|.+.+..|.++.-+ + T Consensus 77 -~g~~~-~nr~d~~v~P~~~~~------d---~~~Ae~l~~---~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~ 142 (720) T protein:vir:35 77 -ISEYR-HNRITVKFRPGDKTA------S---EALANKLNG---LFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNL 142 (720) T ss_pred -HhHHH-hCCCceEEEcCCCcc------h---HHHHHHHHH---HHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecc Confidence 22221 144555555543210 0 123333333 33334567888888899999998888777533 1 Q ss_pred --C-CC----CcE--EE--EEeceEEEeeCCC---CC-eEEEEEEEeecHHHhhHHhhHhhhhhh---h-----ccCCCc Q lcl|NC_012418. 140 --S-DE----ATV--VA--WSLRSYAVRRDAT---GR-WMDIVLKQRYKSKDLDEAYKQDLMRAG---R-----NLSGSG 196 (510) Q Consensus 140 --~-~~----~~~--~~--~pl~~~~i~~d~~---G~-vd~i~r~~~~t~~~l~~~~~~~~~~~~---~-----~~~~~~ 196 (510) + +. .++ +. .|..++++..++. +. -.-++|..+|+.+++...|+....... . +..... T Consensus 143 ~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~ 222 (720) T protein:vir:35 143 VNALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVD 222 (720) T ss_pred cccCCCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCC Confidence 1 11 122 22 2445666654432 21 223677788999998887775432100 0 011122 Q ss_pred eEEEEEEEEeec-----------C-----------------------------CCceEEEEEEE-ecCeeecc-cccccc Q lcl|NC_012418. 197 SVDLYTHVQRKK-----------G-----------------------------TAMEYAELYHE-IDGVRVGE-EGRWPI 234 (510) Q Consensus 197 ~v~i~~~v~~~~-----------~-----------------------------~~~p~~sv~~e-~~~~~~~~-~~~y~~ 234 (510) .|.|..++++++ . +....+-|||. +.|..++. 
.+-+++ T Consensus 223 ~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~ 302 (720) T protein:vir:35 223 VVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPG 302 (720) T ss_pred ceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCC Confidence 344444432221 0 00112334443 35554443 344555 Q ss_pred ccCceEEE---eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccC--------- Q lcl|NC_012418. 235 HLCPYIVP---TWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA--------- 302 (510) Q Consensus 235 ~~~P~~~~---Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~--------- 302 (510) ..|||+++ ||.. +|..+..|.+....+-.+.+|+..-..+..+...-..++....+++-..+.-... T Consensus 303 ~~fP~vP~~g~r~~~-d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l 381 (720) T protein:vir:35 303 EHIPLIPVYGKRWFI-DDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFL 381 (720) T ss_pred CccceEEEEeeeecc-CCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhcccccccccc Confidence 66888865 4444 6778788999999999999999877777776544443443322222111111111 Q ss_pred -------CCceeec--CCcccccccccCcccchHHHHHHHHHHHHHHHHHh-hcc-ccCCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_012418. 303 -------EMGDYVP--GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYG-ANQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 303 -------~~g~~~p--g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~~TAtEi~~r~~E~~~~ 371 (510) .+|.+++ +......+.++. +.....++.-...|.++- ..+ ++.+.+ +.+.--|..|.+.-... 
T Consensus 382 ~~~~~~~~~G~~~~~~~~~~~~~~~~~~-----~~~~~llq~~~~~i~~vsGi~~~~lG~~s-n~SG~Ai~~rq~qg~~~ 455 (720) T protein:vir:35 382 PLNEIVDKQGNIIAPPTPVGYTQPQPLN-----QAMAALLQQTGADIQEVTGSSQAMQPMPS-NIAKETVNHLMHRSDMS 455 (720) T ss_pred ccccccccCcccccCCCcccccCCCCCc-----hHHHHHHHHHHHHHHHHhCCChHHcCccc-chHHHHHHHHHHHHHHH Confidence 1233321 111112222222 222344444444454443 111 111222 24556778888888888 Q ss_pred hchhHhHHH------HHHHHHHHHHHHH------HHhhcCCC-----------CCCcc-----ccc---ceeeecHHHHH Q lcl|NC_012418. 372 LGGTYSLLA------ENLQSPLAYVCLS------EVDDALLQ-----------GLITK-----QHK---PAIETGLPALS 420 (510) Q Consensus 372 LGpv~~rl~------~E~l~Pli~r~~~------il~~~~l~-----------~~~~~-----~~~---~~~v~~is~L~ 420 (510) +...+..+. -+++.-||...+. |+...+-. +.++. ++. -.++.-..|-. T Consensus 456 ~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~ 535 (720) T protein:vir:35 456 SFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSY 535 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCc Confidence 877766654 4555566655553 33222211 11111 111 11122223323 Q ss_pred HHHHHHHHHHHHHHHHhhcChh---h-Hhh----ccC---HHHHHHHHHHHcCCCHhHccC--CHHHHHHHHHH---HHH Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIA---Q-LDP----RIS---LPKMMDTIWAAFSVDTSQFYK--SEEELQAEAEQ---RRQ 484 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~---q-~~~----~id---~d~~~~~~a~~~Gvp~~~i~r--s~~ev~~~r~q---~~q 484 (510) -+++.+.+..+.+++..+.+.. + +.+ ..| .++++..+-..+. +...+. ..++.+++.++ .+| T Consensus 536 ~s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~--~~~~~~~~~~e~qq~~a~~qq~~qq 613 (720) T protein:vir:35 536 TARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLL--TQGVVKPRNTEEEQMVAQMIQQAQQ 613 (720) T ss_pred ccHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcc--hhcccCccChhHHHHHHHHHHHHHh Confidence 3333444444445555554321 1 111 123 3455555544432 111111 12222222111 111 Q ss_pred HHHHHHHhhHHH---hhhhhh--hhhcccCC Q lcl|NC_012418. 
485 QAAQAQAAQETL---LEGASD--MTNALAGV 510 (510) Q Consensus 485 ~~~~~~~~~~~~---~~ga~~--~~~~~ag~ 510 (510) ++.+.+.+++.+ ++.+.. ....-+.+ T Consensus 614 ~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa 644 (720) T protein:vir:35 614 PNAELVAAQGVLMQGQAEVQKAKNEELAIQV 644 (720) T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111110 000000 00000000 No 47 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=98.42 E-value=7.1e-07 Score=54.29 Aligned_cols=475 Identities=13% Similarity=0.030 Sum_probs=209.8 Q ss_pred ChhH--HHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKST--AAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~--~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~---~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) .+.+ +..+|..-. ...|-...++-.+|..= .-.++.+... ...+.+ |+=++.. ++...+..- .++ T Consensus 20 ~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G-~QW~~~~~~~l~~~g~p~~~~N~i~~~-v~~v~g~~~-----~nr 92 (772) T protein:vir:10 20 TPLTVDEYADINYEIEDQPAWRAVADKEMDYADG-NQLDTELLRRQQALGIPPAVEDLIGPA-LLSLQGYEA-----VTR 92 (772) T ss_pred cccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcEEEcchHHH-HHHHHHHHH-----hcC Confidence 2222 223333221 11233333344444431 1111111000 000111 3333332 222222221 134 Q ss_pred cccccCCChH-HHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCCC----cE Q lcl|NC_012418. 73 PFFRSELTDA-IRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA----TV 145 (510) Q Consensus 73 ~WF~l~~~d~-~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~~----~~ 145 (510) +=+++.+.++ .. .++.+.|+ ..+......+++..+...+|.+.+..|.+.+ +.+++.. 
++ T Consensus 93 ~d~~v~Pr~~~~d----------~~~Ae~l~---~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i 159 (772) T protein:vir:10 93 TDWRVTPNGDVGG----------QEVADALN---YRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPYRC 159 (772) T ss_pred cceEEecCCCchH----------HHHHHHHH---HHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCeEE Confidence 4455555321 11 12333333 3344445688888999999999887776543 4443322 35 Q ss_pred EEEEeceEEEeeCCCCCeEE---EEEEEeecHHHhhHHhhHhh--hh-hh---------------h-------------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATGRWMD---IVLKQRYKSKDLDEAYKQDL--MR-AG---------------R-------------- 190 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G~vd~---i~r~~~~t~~~l~~~~~~~~--~~-~~---------------~-------------- 190 (510) +.++..++++.-++.....+ +||..+|+.+++...|+... .. .. . T Consensus 160 ~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (772) T protein:vir:10 160 RPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEA 239 (772) T ss_pred EeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchh Confidence 66777888888877554445 88899999998866555321 00 00 0 Q ss_pred ----------ccCCCceEEEEEEEEeecCC--------C-------------------------ceEEEEEEE-ecCeee Q lcl|NC_012418. 191 ----------NLSGSGSVDLYTHVQRKKGT--------A-------------------------MEYAELYHE-IDGVRV 226 (510) Q Consensus 191 ----------~~~~~~~v~i~~~v~~~~~~--------~-------------------------~p~~sv~~e-~~~~~~ 226 (510) .....++|.|+.+++++... + .....||+. +-|.++ T Consensus 240 ~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~ 319 (772) T protein:vir:10 240 RAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHC 319 (772) T ss_pred hccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEeccee Confidence 00112568888887765310 0 011122221 234455 Q ss_pred c--cccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhh--h- Q lcl|NC_012418. 
227 G--EEGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD--Y- 299 (510) Q Consensus 227 ~--~~~~y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~--~- 299 (510) + ..+.|++..|||++.-... ..|..| |.+....+-.+.+|+..-..+... +.+. .+.. .|-++..+ + T Consensus 320 L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~--G~vr~~kd~Qr~~N~~~S~~~~~l--~~~~-~~~~-~gav~~~d~~~~ 393 (772) T protein:vir:10 320 LHDGPTPYTHRHFPYVPFFGFREDATGIPY--GYVRGMKYAQDSLNSGVSKLRWGM--SVAR-VERT-KGAVAMTDAQFR 393 (772) T ss_pred eccCCCCCCCCccceEEEeeeEeccCCccc--chhhhhhhHHHHHHHHHHHHHHHH--hccc-cccc-CCCccchhHHHH Confidence 4 4577888889999653333 455666 788999999999998655444432 2222 2333 34333211 1 Q ss_pred --ccCCCceee--cCCcccc-cccccCcccch-HHHHHHHHHHHHHHHHHh-hcc-ccCCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_012418. 300 --QDAEMGDYV--PGGAEAV-RAYERGDYNKM-AAIQQSLQAVVVRLNQAF-MYG-ANQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 300 --~~~~~g~~~--pg~~~~v-~~~~~~~~~~~-~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~~TAtEi~~r~~E~~~~ 371 (510) ...+++.++ ||..+.. .+++......+ ....+.++...+.|.+.- ..+ ...+.+...+..-|..|.+.-... T Consensus 394 e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~ 473 (772) T protein:vir:10 394 RQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQSNQS 473 (772) T ss_pred HhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHHH Confidence 122334432 3332221 11222111222 344455555556666542 112 111223335677789999999999 Q ss_pred hchhHhHHHHH------HHHHHHHHHHHHHhhcCCCCCCcc---------------------------ccc---ceeeec Q lcl|NC_012418. 372 LGGTYSLLAEN------LQSPLAYVCLSEVDDALLQGLITK---------------------------QHK---PAIETG 415 (510) Q Consensus 372 LGpv~~rl~~E------~l~Pli~r~~~il~~~~l~~~~~~---------------------------~~~---~~~v~~ 415 (510) |...+.+|..- ++.-||...+.- ....-+..+ ++. 
-.++.- T Consensus 474 l~~~~Dnl~~~~~~~g~~lL~li~~~y~~---er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~ 550 (772) T protein:vir:10 474 IGRIMDNFRAGRTLVGELLLAMIVEDIGQ---ERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALE 550 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCC---CcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEee Confidence 99888776543 333333333311 111111100 111 011222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCh-hh-Hhh-------ccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHH Q lcl|NC_012418. 416 LPALSRSAAVQSMLNASQVIAGLAPI-AQ-LDP-------RISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQA 486 (510) Q Consensus 416 is~L~raq~~~~~~~~~q~l~~~~~~-~q-~~~-------~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~ 486 (510) ..|..-+++.+.+..+.++++.+.+. .+ +.+ .=+.+++++.+-+..+-+ ++++.++..+++.|++ T Consensus 551 ~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~------~peq~~~~~~q~~qq~ 624 (772) T protein:vir:10 551 DVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQ------TPEQIQQQIDQAVQDA 624 (772) T ss_pred ccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccC------ChHHHHHHHHHHHHHH Confidence 23333344445555555666544331 11 111 114557777776666532 3344333322222211 Q ss_pred HHHHHhhH-----H-------HhhhhhhhhhcccCC Q lcl|NC_012418. 487 AQAQAAQE-----T-------LLEGASDMTNALAGV 510 (510) Q Consensus 487 ~~~~~~~~-----~-------~~~ga~~~~~~~ag~ 510 (510) ++++++.. . +.+...++.....++ T Consensus 625 ~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~ 660 (772) T protein:vir:10 625 LAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGV 660 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111100 0 000000000000111 No 48 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.36 E-value=1.1e-06 Score=53.34 Aligned_cols=424 Identities=10% Similarity=0.055 Sum_probs=171.7 Q ss_pred ChhHHHHHHH-HHh-------------hccchHHHHHHHHHhcccccCCCCCCcccccccccc--chHHHHHHHHHHHHH Q lcl|NC_012418. 
1 MKSTAAMLWE-KLR-------------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~~~~~~~r~~-~lk-------------r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d--stg~~a~~~LAa~l~ 64 (510) +|+...+.+. .++ +..-...|+.+|+=--|.+.-...+. ....+-.. ..+...++.+|+-+. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~--~~~~~~~~slnl~~~i~~~~A~lv~ 88 (500) T protein:vir:30 11 VTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDG--ETKKRDLNHLPIARTAAKKIASLVF 88 (500) T ss_pred HHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCC--CcccCceeecchHHHHHHHHhhhhc Confidence 2222222121 111 11234467777653223221001111 11111122 344555555555333 Q ss_pred HhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCC Q lcl|NC_012418. 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE 142 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~ 142 (510) +-. | .++++++. ..++| .+.+..++|+..+.++..+..+.|.+++ |.+.+. T Consensus 89 ~e~--~-----~i~~~d~~-------------~~~~l-------~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~ 141 (500) T protein:vir:30 89 NEQ--A-----EIKVDDDA-------------ANEFI-------SETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK 141 (500) T ss_pred CCc--c-----eEecCChH-------------HHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc Confidence 311 1 12333322 33444 3347778999999999999999898764 566554 Q ss_pred CcEEEEEeceEEE-eeCCCCCeEEEEEEEe-ecHHHhhHHhhHhhhhhhhccCCCceEEEE-----------EEEEeec- Q lcl|NC_012418. 143 ATVVAWSLRSYAV-RRDATGRWMDIVLKQR-YKSKDLDEAYKQDLMRAGRNLSGSGSVDLY-----------THVQRKK- 208 (510) Q Consensus 143 ~~~~~~pl~~~~i-~~d~~G~vd~i~r~~~-~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~-----------~~v~~~~- 208 (510) -++.+++...++- .-|..|.+..+|.... .+... +...+..++.| +.++... 
T Consensus 142 ~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~--------------~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~ 207 (500) T protein:vir:30 142 VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTING--------------KEVYYTLIEFHEWQSSDDYVISNELYRSDD 207 (500) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecC--------------CceEEEEEEEEEEeCCceeEEEEEEEeccc Confidence 4577788777654 5566665544433221 11100 01111111111 1111111 Q ss_pred ----CCCceEEEEEEEecCeeeccccccccccCce-EEE---e-eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 209 ----GTAMEYAELYHEIDGVRVGEEGRWPIHLCPY-IVP---T-WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYEL 279 (510) Q Consensus 209 ----~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~-~~~---R-w~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~ 279 (510) +...|...+|-.+.+. ....++ ..|. ..+ - =+...++.||.|--..+.+-+..|+..--+.....+ T Consensus 208 ~~~lG~~v~l~~~~~~l~~~--~~~~~~---~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~ 282 (500) T protein:vir:30 208 KAKVGSRVPLSEVYKDLKDE--AKVTDV---TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVK 282 (500) T ss_pred ccccCcccccccccCCcCcc--eEeccC---CCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHH Confidence 1122333332111111 011222 1232 222 1 233457889999999999999999998777776655 Q ss_pred HhhCCceeeCCCcccchhhhccCCCceeec---------------CCcccccccccCcccch--HHHHHHHHHHHHHHHH Q lcl|NC_012418. 280 ESLEVLNLVDEAKGAVVDDYQDAEMGDYVP---------------GGAEAVRAYERGDYNKM--AAIQQSLQAVVVRLNQ 342 (510) Q Consensus 280 ~a~~p~~l~~~~g~~~p~~~~~~~~g~~~p---------------g~~~~v~~~~~~~~~~~--~~~~~~i~~~~~~I~~ 342 (510) . .+....+++ .++.++. .+.+|...+ +..++-..++.- ..++ ......++.+-+.|.. 
T Consensus 283 ~-g~~~i~v~~-~~l~~~~--~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~l~~~l~~i~~ 357 (500) T protein:vir:30 283 M-GQRRVAVPE-SLTALTV--RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDL-TTPIRADDYIKAINEGLSLFEM 357 (500) T ss_pred h-Ccceeeech-HHhcccC--CCCCccccCCcccCCCcceEEEcCCCCCcCcceeEe-ccccChHHHHHHHHHHHHHHHH Confidence 4 444445533 4443221 111121111 111110000000 0111 1122333333333322 Q ss_pred Hh-hcc-ccCC-CCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCC---CCCcccccceeeecH Q lcl|NC_012418. 343 AF-MYG-ANQR-DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQ---GLITKQHKPAIETGL 416 (510) Q Consensus 343 af-~~~-~~~~-~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~---~~~~~~~~~~~v~~i 416 (510) .. |.. .+.- .....|||||..+.+.......- ..+.-..-|.-|++-++.+..-..+. +++..++.+..-.++ T Consensus 358 ~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~-~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i 436 (500) T protein:vir:30 358 QIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNS-IVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGV 436 (500) T ss_pred HhCCCccccccCcCccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCC Confidence 21 111 1111 12335999999888888777654 22222334444555554443221111 111222322221111 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhH Q lcl|NC_012418. 417 --PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQE 494 (510) Q Consensus 417 --s~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~ 494 (510) +.-+.+ +.. .+.+++ |+ +-...+ +.+.+|+ |++|.+++.++.+..+ T Consensus 437 ~~d~~~~~---~~~---~~~v~a--Gi------~s~~~~---i~~~~g~-------~eeea~~~l~~i~~E~-------- 484 (500) T protein:vir:30 437 FTDRDAEL---DYW---IKVVNA--GF------GTREMA---IQKVLNV-------TEEKAQEIAAEINTGI-------- 484 (500) T ss_pred CCCHHHHH---HHH---HHHHHc--CC------CCHHHH---HHhcCCC-------CHHHHHHHHHHHHHhc-------- Confidence 121211 111 122221 21 111112 3455564 3556555544332211 Q ss_pred HHhhhhhhhhhcccCC Q lcl|NC_012418. 
495 TLLEGASDMTNALAGV 510 (510) Q Consensus 495 ~~~~ga~~~~~~~ag~ 510 (510) +...+..+.+.-++|= T Consensus 485 ~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 485 VDEINQQRTDTHLYGE 500 (500) T ss_pred cccCCCCCccccccCC Confidence 1111222222223333 No 49 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.36 E-value=1.1e-06 Score=53.34 Aligned_cols=424 Identities=10% Similarity=0.055 Sum_probs=171.7 Q ss_pred ChhHHHHHHH-HHh-------------hccchHHHHHHHHHhcccccCCCCCCcccccccccc--chHHHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWE-KLR-------------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~~~~~~~r~~-~lk-------------r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d--stg~~a~~~LAa~l~ 64 (510) +|+...+.+. .++ +..-...|+.+|+=--|.+.-...+. ....+-.. ..+...++.+|+-+. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~--~~~~~~~~slnl~~~i~~~~A~lv~ 88 (500) T protein:vir:98 11 VTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDG--ETKKRDLNHLPIARTAAKKIASLVF 88 (500) T ss_pred HHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCC--CcccCceeecchHHHHHHHHhhhhc Confidence 2222222121 111 11234467777653223221001111 11111122 344555555555333 Q ss_pred HhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCC Q lcl|NC_012418. 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE 142 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~ 142 (510) +-. | .++++++. ..++| .+.+..++|+..+.++..+..+.|.+++ |.+.+. 
T Consensus 89 ~e~--~-----~i~~~d~~-------------~~~~l-------~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~ 141 (500) T protein:vir:98 89 NEQ--A-----EIKVDDDA-------------ANEFI-------SETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK 141 (500) T ss_pred CCc--c-----eEecCChH-------------HHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc Confidence 311 1 12333322 33444 3347778999999999999999898764 566554 Q ss_pred CcEEEEEeceEEE-eeCCCCCeEEEEEEEe-ecHHHhhHHhhHhhhhhhhccCCCceEEEE-----------EEEEeec- Q lcl|NC_012418. 143 ATVVAWSLRSYAV-RRDATGRWMDIVLKQR-YKSKDLDEAYKQDLMRAGRNLSGSGSVDLY-----------THVQRKK- 208 (510) Q Consensus 143 ~~~~~~pl~~~~i-~~d~~G~vd~i~r~~~-~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~-----------~~v~~~~- 208 (510) -++.+++...++- .-|..|.+..+|.... .+... +...+..++.| +.++... T Consensus 142 ~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~--------------~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~ 207 (500) T protein:vir:98 142 VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTING--------------KEVYYTLIEFHEWQSSDDYVISNELYRSDD 207 (500) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecC--------------CceEEEEEEEEEEeCCceeEEEEEEEeccc Confidence 4577788777654 5566665544433221 11100 01111111111 1111111 Q ss_pred ----CCCceEEEEEEEecCeeeccccccccccCce-EEE---e-eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 209 ----GTAMEYAELYHEIDGVRVGEEGRWPIHLCPY-IVP---T-WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYEL 279 (510) Q Consensus 209 ----~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~-~~~---R-w~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~ 279 (510) +...|...+|-.+.+. ....++ ..|. 
..+ - =+...++.||.|--..+.+-+..|+..--+.....+ T Consensus 208 ~~~lG~~v~l~~~~~~l~~~--~~~~~~---~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~ 282 (500) T protein:vir:98 208 KAKVGSRVPLSEVYKDLKDE--AKVTDV---TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVK 282 (500) T ss_pred ccccCcccccccccCCcCcc--eEeccC---CCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHH Confidence 1122333332111111 011222 1232 222 1 233457889999999999999999998777776655 Q ss_pred HhhCCceeeCCCcccchhhhccCCCceeec---------------CCcccccccccCcccch--HHHHHHHHHHHHHHHH Q lcl|NC_012418. 280 ESLEVLNLVDEAKGAVVDDYQDAEMGDYVP---------------GGAEAVRAYERGDYNKM--AAIQQSLQAVVVRLNQ 342 (510) Q Consensus 280 ~a~~p~~l~~~~g~~~p~~~~~~~~g~~~p---------------g~~~~v~~~~~~~~~~~--~~~~~~i~~~~~~I~~ 342 (510) . .+....+++ .++.++. .+.+|...+ +..++-..++.- ..++ ......++.+-+.|.. T Consensus 283 ~-g~~~i~v~~-~~l~~~~--~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~l~~~l~~i~~ 357 (500) T protein:vir:98 283 M-GQRRVAVPE-SLTALTV--RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDL-TTPIRADDYIKAINEGLSLFEM 357 (500) T ss_pred h-Ccceeeech-HHhcccC--CCCCccccCCcccCCCcceEEEcCCCCCcCcceeEe-ccccChHHHHHHHHHHHHHHHH Confidence 4 444445533 4443221 111121111 111110000000 0111 1122333333333322 Q ss_pred Hh-hcc-ccCC-CCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCC---CCCcccccceeeecH Q lcl|NC_012418. 343 AF-MYG-ANQR-DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQ---GLITKQHKPAIETGL 416 (510) Q Consensus 343 af-~~~-~~~~-~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~---~~~~~~~~~~~v~~i 416 (510) .. |.. .+.- .....|||||..+.+.......- ..+.-..-|.-|++-++.+..-..+. 
+++..++.+..-.++ T Consensus 358 ~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~-~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i 436 (500) T protein:vir:98 358 QIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNS-IVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGV 436 (500) T ss_pred HhCCCccccccCcCccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCC Confidence 21 111 1111 12335999999888888777654 22222334444555554443221111 111222322221111 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhH Q lcl|NC_012418. 417 --PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQE 494 (510) Q Consensus 417 --s~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~ 494 (510) +.-+.+ +.. .+.+++ |+ +-...+ +.+.+|+ |++|.+++.++.+..+ T Consensus 437 ~~d~~~~~---~~~---~~~v~a--Gi------~s~~~~---i~~~~g~-------~eeea~~~l~~i~~E~-------- 484 (500) T protein:vir:98 437 FTDRDAEL---DYW---IKVVNA--GF------GTREMA---IQKVLNV-------TEEKAQEIAAEINTGI-------- 484 (500) T ss_pred CCCHHHHH---HHH---HHHHHc--CC------CCHHHH---HHhcCCC-------CHHHHHHHHHHHHHhc-------- Confidence 121211 111 122221 21 111112 3455564 3556555544332211 Q ss_pred HHhhhhhhhhhcccCC Q lcl|NC_012418. 495 TLLEGASDMTNALAGV 510 (510) Q Consensus 495 ~~~~ga~~~~~~~ag~ 510 (510) +...+..+.+.-++|= T Consensus 485 ~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 485 VDEINQQRTDTHLYGE 500 (500) T ss_pred cccCCCCCccccccCC Confidence 1111222222223333 No 50 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.28 E-value=1.7e-06 Score=52.18 Aligned_cols=419 Identities=11% Similarity=0.011 Sum_probs=167.3 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhc-----ccccCCCCCCccccccccccchHHHHHHHHHHHHH--HhhcCcCCc Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTL-----PYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLA--RSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~l-----P~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~--~~ltpp~~~ 73 (510) +.+.|...|.. + .++++.+.+|.. |..-. .. ....+..++..+-+..+++++|..|. +..+|.... T Consensus 12 ~i~~L~~~~~~--~---~~r~~~~~~Yy~g~~~i~~~~~-~~-~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~ 84 (488) T protein:vir:23 12 LRDQLLDAFEN--K---QNELKSSKAYYDAERRPDAIGL-AV-PLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSANG 84 (488) T ss_pred HHHHHHHHHHH--H---HHHHHHHHHHHhcccchhhcCc-cc-chhhhhhhhhcchHHHHHHHHHHhhhccceeccCCcc Confidence 44444444431 1 123333444432 21100 00 01111223456677777777777664 222222222 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC----------- Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE----------- 142 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~----------- 142 (510) +=-....+. ++.+ .+.+.+..+||.....++.++..++|.+.+++.... T Consensus 85 ~~~~~~~d~-------------~~~~-------~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~ 144 (488) T protein:vir:23 85 EEPESGGEN-------------DPAS-------ELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEV 144 (488) T ss_pred cccccccch-------------hHHH-------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCc Confidence 211111111 1222 234456788999999999999999999876654321 Q ss_pred CcEEEEEece-EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEe Q lcl|NC_012418. 143 ATVVAWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEI 221 (510) Q Consensus 143 ~~~~~~pl~~-~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~ 221 (510) .++++++..+ |++--+..+++...++.+. . ..+..+..++.+.+ +. ...|... 
T Consensus 145 ~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~-~-------------------~~~~~~~~~~~y~~--~~----~~~~~~~ 198 (488) T protein:vir:23 145 PLIRVEPPTALYAEVDPRTRKVLYAIRAIY-G-------------------ADGNEIVSATLYLP--DT----TMTWLRA 198 (488) T ss_pred ceEEEeccceeEEEEecCCCceEEEEEEEE-e-------------------cCCCcEEEEEEEec--Cc----EEEEEec Confidence 1366666655 5555456677776666543 0 00111111111111 11 0011122 Q ss_pred cCee-eccccccccccCceEEEeeeecCCCccccchHHHH-HHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCcc-cc Q lcl|NC_012418. 222 DGVR-VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDY-IGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKG-AV 295 (510) Q Consensus 222 ~~~~-~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~-L~d~r~L~~l~~~~l~~~~~a~~p~~l~~---~~g~-~~ 295 (510) +|.. +.....+++..+|++.++.+...++.+|+|=..+. ++=+..++...-.....++..+.|...+. ++.. .. T Consensus 199 ~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~ 278 (488) T protein:vir:23 199 EGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGIN 278 (488) T ss_pred CCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccc Confidence 2322 22233345678999999999888999999855432 34456667666666666776666654442 1110 00 Q ss_pred h---hhhccCCCcee--ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHHH--- Q lcl|NC_012418. 296 V---DDYQDAEMGDY--VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEV--- 361 (510) Q Consensus 296 p---~~~~~~~~g~~--~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtEi--- 361 (510) . ..+.....|.+ .+++ ..++..+.+. ++ +...++.++.-|...+.... +..+... 
-++.-+ T Consensus 279 ~~~~~~~~~~~~~~v~~~~~g-~~~~~~q~~~-~~---~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~ 353 (488) T protein:vir:23 279 AETGQRMFDAYMARILAFEGG-EGAHAEQFSA-AE---LRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAA 353 (488) T ss_pred ccccchhhhhhhhhhccCCCC-CCceeEecCC-CC---hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHH Confidence 0 00111111111 1111 1122222221 22 34555666666655443221 1111111 133333 Q ss_pred ----HHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHh--hcCCCCCCcccccceeeecHHHH--HHHHHHHHHHHHHH Q lcl|NC_012418. 362 ----RITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVD--DALLQGLITKQHKPAIETGLPAL--SRSAAVQSMLNASQ 433 (510) Q Consensus 362 ----~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~--~~~l~~~~~~~~~~~~v~~is~L--~raq~~~~~~~~~q 433 (510) ..+++++...+|.. +.+++.++. .++ ...+. +..-..+++-.++ ..++.++.+..+ T Consensus 354 ~~~l~~k~~~~~~~f~~~------------l~~~~~l~~~~~~~-~~~~~-~~~~i~v~f~~~~~~s~~~~ada~~kl-- 417 (488) T protein:vir:23 354 ESRLVKKVERKNKIFGGA------------WEQAMRLAYKMVKG-GDIPT-EYYRMETVWRDPSTPTYAAKADAAAKL-- 417 (488) T ss_pred HHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHhcC-CCcch-hhccceEEecCCCCCCHHHHHHHHHHH-- Confidence 22333344444433 344444332 122 11122 2222222332221 222222222221 Q ss_pred HHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHH--HHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 434 VIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAA--QAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 434 ~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~--~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ++...+ .+..+. +...+|.... ..++++++.++++.+.. ..++.......+. .. 
.+.+|- T Consensus 418 -~~~g~~------~~s~et----~~~~l~~~~d----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~ 479 (488) T protein:vir:23 418 -FANGAG------LIPRER----GWVDMGYTIV----EREQMRQWLEQDQKQGLGLIGSLYGASTPEGK-PG-EAPVGE 479 (488) T ss_pred -Hhcccc------cCCHHH----HHHhCCCCch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCccc-CC-CCCCCC Confidence 221111 011111 3334442211 12344443333222111 1111111111110 01 111222 No 51 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.25 E-value=2e-06 Score=51.80 Aligned_cols=417 Identities=10% Similarity=0.005 Sum_probs=166.9 Q ss_pred ChhH----HHHHHHHHhhccchHHHHHHHHHhcccccCCC-CCCcccccc--ccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKST----AAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDP-MSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~----~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~-~~~~~~~~~--~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) |-.. +...+.++.+ ..++.+.+.+|..-..-... +..-...+. +..-+-+..++++||..|. .-+ T Consensus 18 l~~~e~~~i~~L~~~~~~--~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~----~~G-- 89 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVD--RTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCN----LES-- 89 (504) T ss_pred CCHHHHHHHHHHHHHHHH--HhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhc----cce-- Confidence 3322 3333333321 22445555555432211000 000011111 1233455666676665442 222 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCCC---cEEEE Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA---TVVAW 148 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~~---~~~~~ 148 (510) |+ .++... . . ..+++....++|.....++.++..++|.+.+++. ++.. 
.++++ T Consensus 90 -f~--~~d~~~--------~----~-------~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~ 147 (504) T protein:vir:99 90 -FV--WPDGDY--------G----S-------IGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVK 147 (504) T ss_pred -ee--CCCCCh--------h----h-------HHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEe Confidence 22 221110 0 0 1233445778999999999999999999877664 3322 36666 Q ss_pred Eece-EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEec--Cee Q lcl|NC_012418. 149 SLRS-YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEID--GVR 225 (510) Q Consensus 149 pl~~-~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~--~~~ 225 (510) +-.+ |++--+..+++...++.... + +......+++|. ++ ..+|+..+ +.. T Consensus 148 sP~~~~~iyD~~~~~~~~a~~~~~~---d--------------~~g~~~~~~~y~-----~~-----~~~~~~~~~~~~~ 200 (504) T protein:vir:99 148 SAMQATGEWNSRRNAMDSLLSITSR---D--------------AEGHPTGIALYE-----DG-----VTVTADMDDDGDW 200 (504) T ss_pred ccceeEEEEeCCCCceeEEEEEEEe---c--------------CCCeEEEEEEEc-----CC-----cEEEEEEcCCcee Confidence 5544 44443445555443332210 0 001111233331 11 12222222 122 Q ss_pred eccccccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch-------- Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV-------- 296 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p-------- 296 (510) ......+++. +|++.+..+...++.||+|-- ...++=+..+|+..-..+..+++.+.|...+- |+... 
T Consensus 201 ~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~--G~~~~~~~~~d~~ 277 (504) T protein:vir:99 201 HADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILL--GADAKNFRNKDGS 277 (504) T ss_pred eeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--cCCcccccccccc Confidence 2222233333 899998888888899999954 35567788888888888888888777754441 11110 Q ss_pred -hhhccCCCcee--ecCCccc-------ccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-------CCCCCCCCHH Q lcl|NC_012418. 297 -DDYQDAEMGDY--VPGGAEA-------VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-------QRDAERVTAE 359 (510) Q Consensus 297 -~~~~~~~~g~~--~pg~~~~-------v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-------~~~~~~~TAt 359 (510) ........+.+ +|...+. ++.-++. .++++. -++.++.-|....+.... ..+..+-+|. T Consensus 278 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~-~~~l~~---~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~ 353 (504) T protein:vir:99 278 MKPAWQIALARVFALPDDEDEPDAARARADVKQFP-ASSPQP---HIEMLEQIAMMFSGETSIPVESLGFSNRANPTSAD 353 (504) T ss_pred ccchhhhhhhhhhcCCCccccccccCccceeeecC-CCChHH---HHHHHHHHHHHHHhhhCCCHHHhcccccccccHHH Confidence 00000000111 1221111 1111111 123433 334444444443322211 1111122443 Q ss_pred HH-------HHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee-ecHHHHHHHHHHHHHHHH Q lcl|NC_012418. 360 EV-------RITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE-TGLPALSRSAAVQSMLNA 431 (510) Q Consensus 360 Ei-------~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v-~~is~L~raq~~~~~~~~ 431 (510) -| ..++++|...+|..+.++- +..+.++. 
+....+.+.....++ ....+-..++.++.+..+ T Consensus 354 Ai~~~~~~L~~ka~~k~~~f~~~l~~~~--------rla~~~~~--~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl 423 (504) T protein:vir:99 354 AYIASREDLIAEAEGATDDWSPAFRRSM--------IRALAIKN--GLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKM 423 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHhc--CCCccccccccceeEecCCCccCHHHHHHHHHHH Confidence 33 4456666777776665441 22233333 222333333332211 112222233333333222 Q ss_pred HHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHH---HHhhHHH-hhhhhh----h Q lcl|NC_012418. 432 SQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQA---QAAQETL-LEGASD----M 503 (510) Q Consensus 432 ~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~---~~~~~~~-~~ga~~----~ 503 (510) .+ ....+ . ... +.+...+|+++ +|++.+.+++++++.++ +++.... ..++.. . T Consensus 424 ~~---ag~~l--~---~~~----~~l~~~lg~~~-------~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 484 (504) T protein:vir:99 424 LG---AGPEW--L---KET----EVGLELLGLTP-------QQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQG 484 (504) T ss_pred Hh---hcccc--c---cch----HHHHhhcCCCH-------HHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcC Confidence 12 11110 0 011 12344557654 44443332222211111 1111100 011111 0 Q ss_pred hhcccC-----C Q lcl|NC_012418. 504 TNALAG-----V 510 (510) Q Consensus 504 ~~~~ag-----~ 510 (510) ...+|+ . T Consensus 485 ~~e~a~~~~~~~ 496 (504) T protein:vir:99 485 AGEPPANEPPAA 496 (504) T ss_pred CCCCCCCCCCcc Confidence 011111 1 No 52 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.17 E-value=3.2e-06 Score=50.70 Aligned_cols=431 Identities=10% Similarity=0.001 Sum_probs=173.1 Q ss_pred ChhHHHHHHHHH------h-------------hccchHHHHHHHHHhcccccCCCCC-Cccc-cccccccchHHHHHHHH Q lcl|NC_012418. 
1 MKSTAAMLWEKL------R-------------DGSVEQRAIEFAKTTLPYLMVDPMS-GSRG-VVEHDFQSAGALLVNNL 59 (510) Q Consensus 1 ~~~~~~~r~~~l------k-------------r~~~~~~w~e~~~~~lP~~~~~~~~-~~~~-~~~~~~dstg~~a~~~L 59 (510) ++..++.-+.++ + .......|+++|+=--|-....... .+.. ...++--..+...++.+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~iv~~~ 84 (499) T protein:vir:80 5 IIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYM 84 (499) T ss_pred HHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHHHHHH Confidence 333333333321 1 0123456777764211211111111 1111 11223345666667777 Q ss_pred HHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--E Q lcl|NC_012418. 60 AAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--Y 137 (510) Q Consensus 60 Aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~ 137 (510) |+-|.+- |+ .+++++. +..++|.+ .+..++|...+.++..+....|.+++ | T Consensus 85 a~~l~~e--p~-----~i~~~d~-------------~~~e~l~~-------~~~~n~f~~~~~~~~~~a~~~G~~~~~~~ 137 (499) T protein:vir:80 85 SKLLFNE--KV-----KINIDDE-------------TAEEFVLN-------VLKTNGFTKNMERYIEYGEAMGGFVIKVY 137 (499) T ss_pred HHhhhCC--cc-----eEeeCCH-------------HHHHHHHH-------HHhhccHHHHHHHHHHHHhhcCcEEEEEE Confidence 6655432 22 1333332 23334333 35667899999999999999998776 4 Q ss_pred EeCCC-CcEEEEEeceEE-EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhh-hhc--cCCCceEEEEEEEEeecC--- Q lcl|NC_012418. 138 RNSDE-ATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRA-GRN--LSGSGSVDLYTHVQRKKG--- 209 (510) Q Consensus 138 ~~~~~-~~~~~~pl~~~~-i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~-~~~--~~~~~~v~i~~~v~~~~~--- 209 (510) .|.+. -++..++-.+++ +..| .|++..+.+....+... . ...+. ... 
...+....|-+.++...+ T Consensus 138 ~D~~~~~~i~~v~a~~~~Pi~~d-~~~~~~~~f~~~~~~~~---~---~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~ 210 (499) T protein:vir:80 138 HDGNKNVKVSFATADCMYPLSND-SENVDECLIANSFHKNN---K---YYKLLEWNEWKGEKEEVYTVTTELYQSDDPNE 210 (499) T ss_pred ECCCCcEEEEEEcCCceEEEEec-CCCeEEEEEEEEEeecC---e---EEEEEEEEEecccceeeEEEEEEEEeccCccc Confidence 55442 246677777766 4555 58787766555443211 0 00000 000 000001111111111111 Q ss_pred --CCceEEEEEEEecCeeeccccccccccCceEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhC Q lcl|NC_012418. 210 --TAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPT----WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLE 283 (510) Q Consensus 210 --~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~R----w~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~ 283 (510) ...|...+|= +........++ ...||+.++ .++..++++|+|-...+.+-+..|+..--......+. .+ T Consensus 211 lG~~v~l~~~~~--~~~~~~~~~~~--~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~ 285 (499) T protein:vir:80 211 LGGKVSLKLLFN--DIEPVVPLPSL--TRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GK 285 (499) T ss_pred cCcccchhhhcc--CcCCceeecCC--CccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh-cc Confidence 1112222210 00000011122 234455443 3445688999999999999999999887777665554 34 Q ss_pred CceeeCCCcccchhhhc--------cCCCce--eecCCccccc-ccccCcccchHHHHHHHHHHHHHHHHHhhccc---- Q lcl|NC_012418. 284 VLNLVDEAKGAVVDDYQ--------DAEMGD--YVPGGAEAVR-AYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA---- 348 (510) Q Consensus 284 p~~l~~~~g~~~p~~~~--------~~~~g~--~~pg~~~~v~-~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~---- 348 (510) ..+.|++ .++.+..-. ...... .+++...+-+ .++.-. .++. ..+-++.+..-++...+.-. 
T Consensus 286 ~~i~v~~-~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~ir-~e~~~~~l~~~l~~i~~~~g~s~~ 362 (499) T protein:vir:80 286 KKVLVPS-SFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDIS-VEIR-STEFIESINAMLRIYAMQVGLSAG 362 (499) T ss_pred cceecch-hhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEec-CcCC-hHHHHHHHHHHHHHHHHhcCCChh Confidence 4444432 333221000 000000 1111111100 011000 1111 12223333333333333211 Q ss_pred -cC-CCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCC---CCCcccccceeee--cHHHHHH Q lcl|NC_012418. 349 -NQ-RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQ---GLITKQHKPAIET--GLPALSR 421 (510) Q Consensus 349 -~~-~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~---~~~~~~~~~~~v~--~is~L~r 421 (510) +. ......|||||..+.+.......- ..+.-..-|..|++-++.+..-.+.. +.++..+....=. ..+..+. T Consensus 363 ~fg~~~~g~~TAtei~s~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 441 (499) T protein:vir:80 363 TFTFDENGLKTATEVVSEKSETYQTKNS-HSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTT 441 (499) T ss_pred hcCCCcccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHH Confidence 11 122346999999888877776543 22222233444444444433221211 1222233322211 1222222 Q ss_pred HHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhh Q lcl|NC_012418. 422 SAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGAS 501 (510) Q Consensus 422 aq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~ 501 (510) ++.. .+.++ +|+-. .. ..++...|++ ++|.+++.++.++ +++..+ + T Consensus 442 ~~~~------~~~~~--~Gi~S------~e---t~l~~~~~~~-------d~ea~~el~~i~~--E~~~~~--~------ 487 (499) T protein:vir:80 442 INRY------TTAKN--QGMIP------LK---IALQRAWNIT-------EAEADEWAEMLAK--EKQAEI--P------ 487 (499) T ss_pred HHHH------HHHHH--cCCCC------HH---HHHhhcCCCC-------hHHHHHHHHHHHH--HhhcCC--C------ Confidence 2111 12121 12100 01 1244555543 3444333332221 111111 1 Q ss_pred hhhhcccCC Q lcl|NC_012418. 
502 DMTNALAGV 510 (510) Q Consensus 502 ~~~~~~ag~ 510 (510) .+...|. T Consensus 488 --~~d~~g~ 494 (499) T protein:vir:80 488 --NNDMTGI 494 (499) T ss_pred --CCCcccc Confidence 1111222 No 53 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.07 E-value=5.3e-06 Score=49.49 Aligned_cols=416 Identities=10% Similarity=0.004 Sum_probs=163.5 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc---cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL---MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) +-+.|.+.|++ +. ++.+.+.+|..=.. ..+.......+..+..-+-+..++++++..| ++-+ |+. T Consensus 17 ~~~~l~~~~~~--~~---~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g---~~~ 84 (485) T protein:vir:10 17 ARDEMVSAFED--ST---QNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ----AVEG---FRF 84 (485) T ss_pred HHHHHHHHHHH--HH---HHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh----cccc---eec Confidence 22333333321 11 23333444433110 0000000011111233456666677666655 3332 222 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-----------CcEE Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-----------ATVV 146 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-----------~~~~ 146 (510) . .++. .. ..+.+.+.+++|.....++.++..++|.+.+++..+. 
.+++ T Consensus 85 ~-~~~~-------------~~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~ 143 (485) T protein:vir:10 85 G-DADE-------------AD-------EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIR 143 (485) T ss_pred C-CCch-------------hH-------HHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEE Confidence 1 1111 01 1223345678999999999999999999876654332 1367 Q ss_pred EEEeceEEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCe- Q lcl|NC_012418. 147 AWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGV- 224 (510) Q Consensus 147 ~~pl~~~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~- 224 (510) +++..+.++..| ..+++...++.+. .. +.+.-..+++|+ ++. .+. |...++. T Consensus 144 ~~~p~~~~~~~D~~~~~~~~~~~~~~-~~----------------~~~~~~~~~~y~-----~~~---~~~-~~~~~~~~ 197 (485) T protein:vir:10 144 VEPPTRMYAEIDPRIGRVSKAIRVAY-DA----------------EGNEIQAATLYT-----PND---IFG-WYRVENEW 197 (485) T ss_pred EEccceeEEEEcCCCCceeEEEEEEE-ee----------------CCCeEEEEEEEe-----CCe---EEE-EEEcCCce Confidence 776656444444 4566655555432 10 001111222322 110 011 1111121 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCcc-cch--- Q lcl|NC_012418. 225 RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKG-AVV--- 296 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~---~~g~-~~p--- 296 (510) ........++..+|++.+..+...++.||+|=... ..+-+..++...-.+...++..+.|...+. ++.+ ... 
T Consensus 198 ~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~ 277 (485) T protein:vir:10 198 QEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETG 277 (485) T ss_pred EEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCccccccccccc Confidence 11122223456799999999999999999996554 344566777776677777777777754442 1110 000 Q ss_pred hhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHHH-------HH Q lcl|NC_012418. 297 DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEV-------RI 363 (510) Q Consensus 297 ~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtEi-------~~ 363 (510) ..+.....|.+..-...+.+..+... +++ ...++.++.-|++...... +...... -++.-+ .. T Consensus 278 ~~~~~~~~~~i~~~~~~d~k~~q~~~-~~~---~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ 353 (485) T protein:vir:10 278 QTLFDAYLARILAFEDAEGKIQQFSA-AEL---ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIK 353 (485) T ss_pred chhhhhcccceeccCCCCceEEeecc-cch---HHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHH Confidence 00011111222110111223333321 223 3445555555655443221 1111111 233333 33 Q ss_pred HHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccc--cccee--eecHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 364 TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ--HKPAI--ETGLPALSRSAAVQSMLNASQVIAGLA 439 (510) Q Consensus 364 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~--~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~ 439 (510) +++++...+|+.+.++ ++.++.+ ... ...+.+. +++.. ..+-+.++.++-+.++ ++... 
T Consensus 354 k~~~k~~~f~~~l~~~--------~~l~~~~-~~~--~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl------~~ag~ 416 (485) T protein:vir:10 354 KVERKNSIFGGAWEEA--------MRLAYRM-MKG--GDVPPDMLRMETVWRDPSTPTYAAKADAASKL------YNGGT 416 (485) T ss_pred HHHHHHHHHHHHHHHH--------HHHHHHH-hCC--CCCcccceeeeEEecCCCCCCHHHHHHHHHHH------Hhccc Confidence 3445555555544433 1222222 221 1222222 22221 1122222222222222 22111 Q ss_pred ChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhh---hhhhhh--hcc-cCC Q lcl|NC_012418. 440 PIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLE---GASDMT--NAL-AGV 510 (510) Q Consensus 440 ~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~---ga~~~~--~~~-ag~ 510 (510) ++ +.-+.+ ...+|+.+.. .++++++.++++.+...+..+...... ++.+.. +.+ ++- T Consensus 417 ~~------~s~et~----~~~lg~~~~~----~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (485) T protein:vir:10 417 GV------IPRERA----RKDMGYSIAE----REEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALE 479 (485) T ss_pred cC------CCHHHH----HHhCCCCHhH----HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCC Confidence 11 111122 2346655431 134444433332221111111011111 111111 111 111 No 54 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=97.99 E-value=8e-06 Score=48.54 Aligned_cols=427 Identities=12% Similarity=0.086 Sum_probs=177.9 Q ss_pred ChhHHHHHHHH----------Hh-------------hccchHHHHHHHHHhcccccCCCCCCcccccccccc--chHHHH Q lcl|NC_012418. 1 MKSTAAMLWEK----------LR-------------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALL 55 (510) Q Consensus 1 ~~~~~~~r~~~----------lk-------------r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d--stg~~a 55 (510) |-+.++..+.+ |+ +..-...|+.+|+=..|.+--... .+.+..+... ..+... 
T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~--~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 3 LIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQAS--DGIKKKRLKNTINMAKTA 80 (508) T ss_pred hHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccC--CCCccccceeecchHHHH Confidence 22222222211 11 112245677776654443211111 1122223333 345566 Q ss_pred HHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE Q lcl|NC_012418. 56 VNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL 135 (510) Q Consensus 56 ~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~ 135 (510) ++.+|+-+.+-... | ++++. +...+||+ +.+..++|+..+.++..+..+.|.++ T Consensus 81 ~~~~A~lv~~e~~~-----i--~v~~~------------~~~~e~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~ 134 (508) T protein:vir:15 81 ARRIASVVFNEKAE-----I--HVKDN------------NEADKFLN-------DVLEDNDFKNKFEEALEKGVALGGFA 134 (508) T ss_pred HHHHHhhhhCCCce-----E--EeCCc------------hHHHHHHH-------HHHHhccHHHHHHHHHHHHhhcCceE Confidence 66666655433111 1 22111 11234443 34778899999999999999999876 Q ss_pred E--EEeCCCCcEEEEEeceEE-EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeec-CCC Q lcl|NC_012418. 136 L--YRNSDEATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKK-GTA 211 (510) Q Consensus 136 l--~~~~~~~~~~~~pl~~~~-i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~-~~~ 211 (510) + |.+.+..++..++...++ +..|. |++.++.........+ +.+-.+|+.++... ... T Consensus 135 ~k~~~d~~~~~i~~v~ad~~~P~~~d~-~~~~~~af~~~~~~~~------------------~~~~~~yt~lE~h~~~~~ 195 (508) T protein:vir:15 135 MRPYIDGNHIKIAWVRADQFYPLQSNT-NDISEAAIASRTQRTE------------------SNQTKYYTLLEFHQWQDN 195 (508) T ss_pred EEEEEeCCeeEEEEEcCCeeEEEEEcC-CCeEEEEEEEEEEeec------------------CCCceEEEEEEEEEEecC Confidence 4 667665567778887876 45564 5555443322221110 01112333332211 000 Q ss_pred ceEE---EEEEEec----Ceeec--------------cccccccccCceEEEeee----ecCCCccccchHHHHHHHHHH Q lcl|NC_012418. 
212 MEYA---ELYHEID----GVRVG--------------EEGRWPIHLCPYIVPTWN----LAPGEHYGRGHVEDYIGDFAK 266 (510) Q Consensus 212 ~p~~---sv~~e~~----~~~~~--------------~~~~y~~~~~P~~~~Rw~----~~~g~~YGrgp~~~~L~d~r~ 266 (510) .++. .+|..-+ |..+. ...+. ..-||+.++.. ...++.||+|--..+.+-++. T Consensus 196 ~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~--~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~ 273 (508) T protein:vir:15 196 GSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGL--QRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDD 273 (508) T ss_pred cceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCC--CcceeEEecCCccccccCCCCcCCchHhhhHHHHHH Confidence 1111 1121111 11110 00111 12334444432 233678999999999999999 Q ss_pred HHHHHHHHHHHHHHhhCCceeeCCCcccchhhh--c--cCCCceee--cCCcc---cccccccCcccchHHHHHHHHHHH Q lcl|NC_012418. 267 LSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY--Q--DAEMGDYV--PGGAE---AVRAYERGDYNKMAAIQQSLQAVV 337 (510) Q Consensus 267 L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~--~--~~~~g~~~--pg~~~---~v~~~~~~~~~~~~~~~~~i~~~~ 337 (510) ||..--+...-. ...++...+++ .+++++.- . ....-.++ ++..+ .+..++.. -....-...++.+. T Consensus 274 lD~~~s~~~~e~-~~~~~~i~v~~-~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e~~~~~~~~~l 349 (508) T protein:vir:15 274 INDTHDQFIWEI-RLGQKHIAVQP-GMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTP--IRTVQYKDAIDHFI 349 (508) T ss_pred HHHHHHHHHHHH-Hhcccceeech-HHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecc--cChHHHHHHHHHHH Confidence 998776666555 45666655543 33332210 0 00000011 11111 01111100 01112234444444 Q ss_pred HHHHHHhhcc--ccCCC-CCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhc-----CCC----CCCc Q lcl|NC_012418. 338 VRLNQAFMYG--ANQRD-AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-----LLQ----GLIT 405 (510) Q Consensus 338 ~~I~~af~~~--~~~~~-~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~-----~l~----~~~~ 405 (510) +.|....-+. .+.-+ ...-|||||..+.+...+...- ..+.-..-|..|++-++.++.-. |.+ ..+. 
T Consensus 350 ~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~-~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~ 428 (508) T protein:vir:15 350 KEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSS-YLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSAS 428 (508) T ss_pred HHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccc Confidence 4444433121 11112 1225999999888887777664 44444455556666666554322 211 1222 Q ss_pred ccccce--eeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHH Q lcl|NC_012418. 406 KQHKPA--IETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRR 483 (510) Q Consensus 406 ~~~~~~--~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~ 483 (510) .+..+. .=.++.+= +.++++.. .+.++ +++ .+ .. ..+....|++ ++|++++.++.+ T Consensus 429 ~~~~v~v~f~D~i~~d-~~~~~~~~---~~~v~--aGi---~s---~e---~~i~~~~g~~-------deea~~el~ri~ 486 (508) T protein:vir:15 429 QPLDIECHFDDGVFVN-KDKQLEED---AKVLA--IGA---LS---KQ---TFLQRNYGMT-------DEQAAEELAKIQ 486 (508) T ss_pred CCcceEEEeCCCCCCC-HHHHHHHH---HHHHh--cCC---CC---HH---HHHHhcCCCC-------hHHHHHHHHHHH Confidence 222222 11222221 11111111 12221 121 11 11 2233445543 455555544333 Q ss_pred HHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 484 QQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 484 q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) +.+. ......++..+.--|. T Consensus 487 ~E~~-------~~~~~~~~~~~~~g~~ 506 (508) T protein:vir:15 487 SEAP-------TDTFEGGRSAILNGGD 506 (508) T ss_pred Hhcc-------ccCccccccccCCCCC Confidence 2211 0111111111111111 No 55 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.98 E-value=8.1e-06 Score=48.49 Aligned_cols=414 Identities=10% Similarity=0.019 Sum_probs=181.7 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHHh--cccc-cC--CCCC-CccccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL-MV--DPMS-GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~~--lP~~-~~--~~~~-~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) ..+.+.+.+.+.+ |.....+++++|+=- +..+ .. .... .......++..+-+...++..++-|++ -| T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g--~p---- 101 (474) T protein:vir:97 28 QEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--KP---- 101 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhc--CC---- Confidence 3333444444332 334455556555421 1111 11 1111 111222356677777778877776654 22 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCC-CcEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~-~~~~~~pl 150 (510) +.++..|+. ..+.| ..+..+||.....++.++...+|.+.+++ +++. -++.+++. T Consensus 102 -~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:97 102 -VTYSCEDEN-------------VLKVI--------HDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred -ceeccCcHH-------------HHHHH--------HHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcc Confidence 123333322 12222 12235789999999999999999876554 4432 24666766 Q ss_pred ceEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceEEEEEEEecCee Q lcl|NC_012418. 151 RSYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 151 ~~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~~sv~~e~~~~~ 225 (510) .+.++..|. .+++.-++|.+... ....+++|+- .+.+..++..........++.. 
T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:97 160 EQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQ 219 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEEEEcCCccccccccCcCccc Confidence 664444443 57887777776521 1112333321 1111111111111111111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhh-hccC-C Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-YQDA-E 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~-~~~~-~ 303 (510) . .....++..+|++.++. +.+|.|=.....+-+..+|.+.-......+....|.+++.....-.... ..+. . T Consensus 220 ~-~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:97 220 S-HFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKY 293 (474) T ss_pred c-cccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhc Confidence 1 11112345688887654 4689998899999999999988888888888899887764211111111 1111 1 Q ss_pred CceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHH-------HHHHHHHHHHhch Q lcl|NC_012418. 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEV-------RITAEEAENTLGG 374 (510) Q Consensus 304 ~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi-------~~r~~E~~~~LGp 374 (510) .+.+.....++++.+.. ..+.......++.++..|...-+. +. ....+...|+..+ ..++.++...++. 
T Consensus 294 ~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~ 371 (474) T protein:vir:97 294 YKAINVDGDGGVETIQV--EVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATV 371 (474) T ss_pred cceeeccCCCceeEEee--cCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222222233444332 235667777788887777554332 11 1112234566543 2344445555554 Q ss_pred hHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHH Q lcl|NC_012418. 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMM 454 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~ 454 (510) .+.++ +..++.++ +.. .....+.+.. +...|..-++.++.+ ...+. +....++ T Consensus 372 ~l~~~--------~~li~~~~---~~~-~d~~~i~v~f-~~~~p~~~~e~a~~~-------~~~g~-------iS~et~l 424 (474) T protein:vir:97 372 AIQEL--------ISFIIDFN---NLK-TDVKDIEISF-NFNRMMNDAEQSQII-------AQSQY-------LSRETLV 424 (474) T ss_pred HHHHH--------HHHHHHHh---CCC-cccceeeEEe-ccCcccCHHHHHHHH-------HHcCC-------CCHHHHH Confidence 44432 22222222 111 1112233222 122221111111111 11111 1222233 Q ss_pred HHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 455 DTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 455 ~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) . .++ ++ -.++|++++.++++..++... 
....+....+....+- T Consensus 425 ~----~l~~v~-----D~~~E~eri~~E~~~~~~~~~----~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:97 425 K----SSPLVD-----DYKAELERIEQEQMEYNKQLP----NLDDGGADGAQQQEGS 468 (474) T ss_pred H----hCCCCC-----CHHHHHHHHHHHHHHHHhhcc----ccCCCCCCCcccCCCC Confidence 2 222 22 123566665554432222111 1111111111111111 No 56 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.98 E-value=8.1e-06 Score=48.49 Aligned_cols=414 Identities=10% Similarity=0.019 Sum_probs=181.7 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHHh--cccc-cC--CCCC-CccccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL-MV--DPMS-GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~~--lP~~-~~--~~~~-~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) ..+.+.+.+.+.+ |.....+++++|+=- +..+ .. .... .......++..+-+...++..++-|++ -| T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g--~p---- 101 (474) T protein:vir:94 28 QEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--KP---- 101 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhc--CC---- Confidence 3333444444332 334455556555421 1111 11 1111 111222356677777778877776654 22 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCC-CcEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~-~~~~~~pl 150 (510) +.++..|+. ..+.| ..+..+||.....++.++...+|.+.+++ +++. -++.+++. 
T Consensus 102 -~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:94 102 -VTYSCEDEN-------------VLKVI--------HDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred -ceeccCcHH-------------HHHHH--------HHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcc Confidence 123333322 12222 12235789999999999999999876554 4432 24666766 Q ss_pred ceEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceEEEEEEEecCee Q lcl|NC_012418. 151 RSYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 151 ~~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~~sv~~e~~~~~ 225 (510) .+.++..|. .+++.-++|.+... ....+++|+- .+.+..++..........++.. T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:94 160 EQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQ 219 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEEEEcCCccccccccCcCccc Confidence 664444443 57887777776521 1112333321 1111111111111111111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhh-hccC-C Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-YQDA-E 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~-~~~~-~ 303 (510) . .....++..+|++.++. +.+|.|=.....+-+..+|.+.-......+....|.+++.....-.... ..+. . 
T Consensus 220 ~-~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:94 220 S-HFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKY 293 (474) T ss_pred c-cccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhc Confidence 1 11112345688887654 4689998899999999999988888888888899887764211111111 1111 1 Q ss_pred CceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHH-------HHHHHHHHHHhch Q lcl|NC_012418. 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEV-------RITAEEAENTLGG 374 (510) Q Consensus 304 ~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi-------~~r~~E~~~~LGp 374 (510) .+.+.....++++.+.. ..+.......++.++..|...-+. +. ....+...|+..+ ..++.++...++. T Consensus 294 ~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~ 371 (474) T protein:vir:94 294 YKAINVDGDGGVETIQV--EVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATV 371 (474) T ss_pred cceeeccCCCceeEEee--cCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222222233444332 235667777788887777554332 11 1112234566543 2344445555554 Q ss_pred hHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHH Q lcl|NC_012418. 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMM 454 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~ 454 (510) .+.++ +..++.++ +.. .....+.+.. +...|..-++.++.+ ...+. 
+....++ T Consensus 372 ~l~~~--------~~li~~~~---~~~-~d~~~i~v~f-~~~~p~~~~e~a~~~-------~~~g~-------iS~et~l 424 (474) T protein:vir:94 372 AIQEL--------ISFIIDFN---NLK-TDVKDIEISF-NFNRMMNDAEQSQII-------AQSQY-------LSRETLV 424 (474) T ss_pred HHHHH--------HHHHHHHh---CCC-cccceeeEEe-ccCcccCHHHHHHHH-------HHcCC-------CCHHHHH Confidence 44432 22222222 111 1112233222 122221111111111 11111 1222233 Q ss_pred HHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 455 DTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 455 ~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) . .++ ++ -.++|++++.++++..++... ....+....+....+- T Consensus 425 ~----~l~~v~-----D~~~E~eri~~E~~~~~~~~~----~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:94 425 K----SSPLVD-----DYKAELERIEQEQMEYNKQLP----NLDDGGADGAQQQEGS 468 (474) T ss_pred H----hCCCCC-----CHHHHHHHHHHHHHHHHhhcc----ccCCCCCCCcccCCCC Confidence 2 222 22 123566665554432222111 1111111111111111 No 57 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=97.97 E-value=8.8e-06 Score=48.30 Aligned_cols=426 Identities=10% Similarity=-0.004 Sum_probs=171.7 Q ss_pred ChhHHHHHHHHH------h-------------hccchHHHHHHHHHhcccccCCCCC-Cccccc-cccccchHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKL------R-------------DGSVEQRAIEFAKTTLPYLMVDPMS-GSRGVV-EHDFQSAGALLVNNL 59 (510) Q Consensus 1 ~~~~~~~r~~~l------k-------------r~~~~~~w~e~~~~~lP~~~~~~~~-~~~~~~-~~~~dstg~~a~~~L 59 (510) +++.++.-+.++ + +......|+++|+=--+-....... .+.... 
.++--..+...++.+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~ 84 (496) T protein:vir:38 5 IIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYM 84 (496) T ss_pred HHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHH Confidence 222222222221 0 1123445666654211211111111 111111 122235556666666 Q ss_pred HHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE- Q lcl|NC_012418. 60 AAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR- 138 (510) Q Consensus 60 Aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~- 138 (510) |+-|.+- || .++.++. +..++|. +.+..++|...+.++..+...+|.+.+++ T Consensus 85 a~~l~~~--p~-----~i~~~d~-------------~~~e~l~-------~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~ 137 (496) T protein:vir:38 85 SKLLFNE--KV-----KINIDDK-------------AAEEFVL-------NVLKTNGFTKNMERYIEYGEAMGGFVIKVY 137 (496) T ss_pred hhhhhCC--cc-----eEeeCCh-------------HHHHHHH-------HHHhccCHHHHHHHHHHHHhhhCcEEEEEE Confidence 6544321 11 1233332 2333433 34567789999999999999999887654 Q ss_pred -eCCCC-cEEEEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEE----------EEEEe Q lcl|NC_012418. 139 -NSDEA-TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLY----------THVQR 206 (510) Q Consensus 139 -~~~~~-~~~~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~----------~~v~~ 206 (510) +.+.. ++..+|-.+++--.+..|++..+.+...++.+ .+....++.+ +.++. T Consensus 138 ~D~~~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~----------------~~~y~~le~h~~~~~~~~I~~~~y~ 201 (496) T protein:vir:38 138 HDGNKNVKVSFATADCMYPLSNDSENVDECVIANSFHKN----------------NKYYTLLEWNEWQGDVYTVTTELYQ 201 (496) T ss_pred EcCCCcEEEEEEcccceEEEEecCCcEEEEEEEEEEEeC----------------CeEEEEEEEEEEeCceEEEEEEEEe Confidence 44322 47778888877444556777665554443221 1111112211 11111 Q ss_pred ecCC-----CceEEEEEEEecCeeecccccc-ccccCceEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 
207 KKGT-----AMEYAELYHEIDGVRVGEEGRW-PIHLCPYIVPT----WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGL 276 (510) Q Consensus 207 ~~~~-----~~p~~sv~~e~~~~~~~~~~~y-~~~~~P~~~~R----w~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~ 276 (510) ..+. ..|...+|-.. .....+ +....||+..+ .+...++.||+|-..++++-+..|+..--.... T Consensus 202 ~~~~~~~g~~v~~~~~~~~~-----~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~ 276 (496) T protein:vir:38 202 SDDPNELGTKVSLTLLFDDI-----EPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQ 276 (496) T ss_pred cCCccccCcccccccccccc-----ccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHH Confidence 1110 11111111111 011111 11223443332 344667899999999999999999887666655 Q ss_pred HHHHhhCCceeeCCCcccchhhh--------ccCCCceeec--CCcc-cccccccCcccch--HHHHHHHHHHHHHHHHH Q lcl|NC_012418. 277 YELESLEVLNLVDEAKGAVVDDY--------QDAEMGDYVP--GGAE-AVRAYERGDYNKM--AAIQQSLQAVVVRLNQA 343 (510) Q Consensus 277 ~~~~a~~p~~l~~~~g~~~p~~~--------~~~~~g~~~p--g~~~-~v~~~~~~~~~~~--~~~~~~i~~~~~~I~~a 343 (510) ..+. .++.+.+++ .++....- .......+.. +... ....++.-. .++ ..-...++.+.+.|... T Consensus 277 ~~~~-~~~~i~v~~-~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~i~~e~~~~~l~~~l~~i~~~ 353 (496) T protein:vir:38 277 EFKL-GKKKVLVPS-SFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDIS-VEIRSTEFIESINAMLRIYAMQ 353 (496) T ss_pred HHhh-cccceecch-HHhhccCCCCCccccCCCCccceEEEeecCCCcccccceeec-cccCHHHHHHHHHHHHHHHHHh Confidence 5543 455555532 33322110 0000011111 1110 000111000 111 11223334443433322 Q ss_pred h-hc-cccCC-CCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh----cCCCCCCcccccceeee-- Q lcl|NC_012418. 344 F-MY-GANQR-DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD----ALLQGLITKQHKPAIET-- 414 (510) Q Consensus 344 f-~~-~~~~~-~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~----~~l~~~~~~~~~~~~v~-- 414 (510) . +. ..+.. .+...|||||..+.+.......- ..+.-...|..+++-.+.+... .+.. ..+..+....-. 
T Consensus 354 ~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~g~~-~~~~~i~v~f~d~i 431 (496) T protein:vir:38 354 VGLSAGTFTFDENGLKTATEVVSEKSETYQTKNS-HSQLIEQGIKEMIVSILEVGKFIEAYSGEV-VELDTITVDFDDSI 431 (496) T ss_pred hCCChhhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCC-CCccceEEEeCCCC Confidence 1 11 11221 23346999999888777766543 5555556666666656554322 2222 122223322211 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhH Q lcl|NC_012418. 415 GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQE 494 (510) Q Consensus 415 ~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~ 494 (510) ..+..+.++.... .++ +++ +-...+ +....|++ ++|++++.++.++ ++++.+.. T Consensus 432 ~~d~~~~~~~~~~------~~~--~Gi------iS~et~---l~~~~~~~-------d~ea~~el~ri~~--E~~~~~~~ 485 (496) T protein:vir:38 432 AQDEDTTINRYTN------AKN--QGM------IPLKIA---LQRAWNIT-------EAEADEWAEMLAK--EKQAEMPN 485 (496) T ss_pred CCCHHHHHHHHHH------HHh--cCC------CCHHHH---HHhcCCCC-------hHHHHHHHHHHHH--hhhccCcc Confidence 2222222222221 111 121 111111 33444543 3444333322221 11111111 Q ss_pred HHhhhhhhhhhc Q lcl|NC_012418. 495 TLLEGASDMTNA 506 (510) Q Consensus 495 ~~~~ga~~~~~~ 506 (510) . ..+....+.. T Consensus 486 ~-d~~~~~~~~e 496 (496) T protein:vir:38 486 N-DMNGIFGEEE 496 (496) T ss_pred c-cccCCCCCCC Confidence 1 1111111111 No 58 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.92 E-value=1.1e-05 Score=47.83 Aligned_cols=411 Identities=9% Similarity=-0.077 Sum_probs=160.3 Q ss_pred ChhHHHHHHHH----Hh-hccchHHHHHHHHH--hcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 
1 MKSTAAMLWEK----LR-DGSVEQRAIEFAKT--TLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~----lk-r~~~~~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) |...-...... .. +.++....+++|+- .+|... .......+..++..+-+...++.++..| +|.+ T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~--~~~~~~~~~~k~~~n~~~~ivd~~~~~l----~~~g-- 72 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLG--VAIPPELQRVQTVVSWPGIAVDALEERL----DWLG-- 72 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcC--cccchhhhhhhhhcchHHHHHHHHHhhh----cccc-- Confidence 44332222222 21 22223333344422 122211 0000111122445566666666666654 3332 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCCC-cEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA-TVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~~-~~~~~pl 150 (510) | ..++. ++++ +....+||.....++.++...+|.+.+++- ++.. +++.++. T Consensus 73 -~--~~~d~------------~~l~-----------~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 126 (441) T protein:vir:80 73 -W--TNGDG------------YGLD-----------GVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSP 126 (441) T ss_pred -c--cCCCh------------HHHH-----------HHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEcc Confidence 1 22221 1122 234568999999999999999998765544 3322 4666766 Q ss_pred ceEEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC-ee-ec Q lcl|NC_012418. 151 RSYAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG-VR-VG 227 (510) Q Consensus 151 ~~~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~-~~-~~ 227 (510) .+.++..| ..+++...++++.... +....+++|. ++. ...|++.++ .. .. 
T Consensus 127 ~~~~~i~d~~~~~~~~~~~~~~~~~------------------~~~~~~~vy~-----~~~----~~~~~~~~~~~~~~~ 179 (441) T protein:vir:80 127 KNCTGKFSADGSRLDAGLVVQQTCD------------------PEVVEAELLL-----PDV----IVQVERRGSREWVEV 179 (441) T ss_pred ceEEEEEeCCCCceeEEEEEEEEec------------------CceEEEEEEe-----cCe----EEEEEEcCCcceeec Confidence 66444444 4567776666654210 1111223331 111 000111111 11 12 Q ss_pred cccccccccCceEEEeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch---hhhccCC Q lcl|NC_012418. 228 EEGRWPIHLCPYIVPTWNLAPGEHYGRGHVE-DYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV---DDYQDAE 303 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~-~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p---~~~~~~~ 303 (510) ......++.+|++.+.-+...++.||+|-.. +..+-+..++...-......+....|...+. |.... ....... T Consensus 180 ~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--G~~~~~~~~~~~~~~ 257 (441) T protein:vir:80 180 DRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT--GVSADEFSQPGWVLS 257 (441) T ss_pred cccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee--cCCccccccchhhhc Confidence 2223345679999888788888899999543 4556667777777777777788888865552 21100 1000111 Q ss_pred Ccee--ecCCccc--ccccccCcccchHHHHHHHHHHHHHHHHHhhcc-c----cCCCCCCC-CHHHHHHHHHHHHHHhc Q lcl|NC_012418. 304 MGDY--VPGGAEA--VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A----NQRDAERV-TAEEVRITAEEAENTLG 373 (510) Q Consensus 304 ~g~~--~pg~~~~--v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~----~~~~~~~~-TAtEi~~r~~E~~~~LG 373 (510) .|.+ +++..+. +...+.. .++++. .+..++.-|...+... . +...+... ++.-+..+...... 
T Consensus 258 ~~~i~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~--- 330 (441) T protein:vir:80 258 MASVWAVDKDDDGDTPNVGSFP-VNSPTP---YSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVK--- 330 (441) T ss_pred ccccccCCCCCCCCcceeEecC-ccchHH---HHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHH--- Confidence 1222 2222111 2222222 123333 3344444444433221 1 11111111 33333222211111 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceeeecHHHH--HHHHHHHHHHHHHHHHHhhcChhhHhhccCH Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPAL--SRSAAVQSMLNASQVIAGLAPIAQLDPRISL 450 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v~~is~L--~raq~~~~~~~~~q~l~~~~~~~q~~~~id~ 450 (510) -.++.+.. +.+-+.+.+.++.. .+...-.+....-..+.+-.++ ..++.++.+. +..+. ++.. +.. T Consensus 331 -k~~~~~~~-f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~---kl~~~--g~~~----~s~ 399 (441) T protein:vir:80 331 -RAERRQTS-FGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVT---KLVGA--GILP----ADS 399 (441) T ss_pred -HHHHHHHH-HHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHH---HHHhc--Cccc----ccH Confidence 12222222 22233444443321 1111111122222222222121 1112222211 11221 1111 111 Q ss_pred HHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcc Q lcl|NC_012418. 451 PKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNAL 507 (510) Q Consensus 451 d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ 507 (510) +.+...+|. 
+++|++++.+++++++.+...+ .+..++ +...+ T Consensus 400 ----~~~~~~l~~-------~~~e~~~~~~e~~e~~~~~~~~-~~~~~~---~~~~~ 441 (441) T protein:vir:80 400 ----RTVLEMLGL-------DDVQVEAVMRHRAESSDPLAVL-AGAISR---QTNEV 441 (441) T ss_pred ----HHHHHhCCC-------CHHHHHHHHHHHHHHHHHHHHH-hhhhhc---ccccC Confidence 112344453 3455554433322221111111 111111 22222 No 59 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.92 E-value=1.1e-05 Score=47.81 Aligned_cols=416 Identities=9% Similarity=0.010 Sum_probs=178.1 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHH--hccccc-C--CCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKT--TLPYLM-V--DPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~--~lP~~~-~--~~~~~-~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) -++.+.+...+.+ |-.....+++++.- -++.+- . ..... ......++..+-+...++..++-|.+ .|+ T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g--~p~--- 102 (474) T protein:vir:95 28 QEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--KPV--- 102 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhcc--CCc--- Confidence 2222222222221 22333444554431 111211 1 11110 11122356667777777777766653 222 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-C--cEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-~--~~~~~pl 150 (510) .++..|+. +.+. +.. +..+||...+.++.++...+|.+.+++..+. + ++.+++. 
T Consensus 103 --~~~~~d~~-------------~~~~-------l~~-~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 159 (474) T protein:vir:95 103 --TYSCEDES-------------VLKI-------IHD-VLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred --eeccCchH-------------HHHH-------HHH-HHhccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEcc Confidence 23333322 1111 111 2236899999999999999998776554322 3 3555554 Q ss_pred ce-EEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceEEEEEEEecCee Q lcl|NC_012418. 151 RS-YAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 151 ~~-~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~~sv~~e~~~~~ 225 (510) .+ |.+--| ..|.+.-++|.+... ....+++|+. ++.+..++...........+.. T Consensus 160 ~~~~~v~d~~~~~~~~~~i~~~~~~--------------------~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:95 160 EQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQ 219 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEEc--------------------CeeEEEEEeCCeEEEEEEcCCccccccccCccccc Confidence 44 444433 357777666665421 1123444421 1111111110000000111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhh-hccC-C Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-YQDA-E 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~-~~~~-~ 303 (510) .......+..+|++.++. +.+|.|=.+...+-+..+|.+.-......+....|.+++.....-.... .... . 
T Consensus 220 -~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:95 220 -SHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKY 293 (474) T ss_pred -ccccccCCCccceEeecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhc Confidence 111222345789887754 4679998899999999999988888888888889877764211111111 1111 1 Q ss_pred CceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHH-------HHHHHHHHHHhch Q lcl|NC_012418. 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEV-------RITAEEAENTLGG 374 (510) Q Consensus 304 ~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi-------~~r~~E~~~~LGp 374 (510) .+.+.....++++.+.. ..+.......++.+.+.|...-.. +. ....+...|+..+ ..+++++...++. T Consensus 294 ~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~ 371 (474) T protein:vir:95 294 YKAINVDGDGGVETIQV--EVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATV 371 (474) T ss_pred cceeeccCCCceeEEee--cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222222233444332 245677778888888877654322 21 1112234566554 3344444444444 Q ss_pred hHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHH Q lcl|NC_012418. 375 TYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKM 453 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~ 453 (510) .+. +++.++.. -|. ......+.+.. 
++-.|-..++.++.+ ...+- +-...+ T Consensus 372 ~l~------------~~~~li~~~~g~-~~d~~~i~v~f-~~~~p~d~~e~a~~~-------~~~g~-------iS~et~ 423 (474) T protein:vir:95 372 AIQ------------ELIGFIIDFNNL-KMDVKDIEISF-NFNRMMNDAEQSQII-------AQSQY-------LSRETL 423 (474) T ss_pred HHH------------HHHHHHHHHhCC-CcccceeeEEe-ccCCCcCHHHHHHHH-------HhcCC-------CchHHH Confidence 433 33333221 111 11122222222 111121111111111 11111 111222 Q ss_pred HHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 454 MDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 454 ~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) + ..++ ++ -.++|++++.++++.++++++..... .......+....-- T Consensus 424 i----~~l~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~-~~d~~~~~~~~~~~ 471 (474) T protein:vir:95 424 V----KSSPLVD-----DYKAELERIEQEQMEYNKQLPNLDDG-GADGAQQQERSNDK 471 (474) T ss_pred H----HhCCCCC-----CHHHHHHHHHHHHHHHHhcccccccc-cCCCCcCCCCCccC Confidence 2 2232 22 12466776665554433332221110 00000000000000 No 60 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.88 E-value=1.3e-05 Score=47.34 Aligned_cols=425 Identities=11% Similarity=-0.011 Sum_probs=186.3 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHH--hcccc---cCCCCC---CccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKT--TLPYL---MVDPMS---GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~--~lP~~---~~~~~~---~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) +.+.+.....+- |.+...+++++++- -++.+ ...+.. 
.......++-.+.+...++..++-|++- |+ T Consensus 24 ~~~~i~~~~~~~-~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~--p~-- 98 (479) T protein:vir:79 24 LVKVIEHYILKH-RPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGN--PI-- 98 (479) T ss_pred HHHHHHHHHhhh-hHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhcC--Cc-- Confidence 222222222211 22223334444431 12221 001100 0111122455666777777777666542 22 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCC-CcEEEEE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDE-ATVVAWS 149 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~-~~~~~~p 149 (510) +++.+++. +.+.+ ..+..++|.....++.++..++|.+.+++. ++. -+++.++ T Consensus 99 ---~~~~~~~~-------------~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~ 154 (479) T protein:vir:79 99 ---VFNADDDN-------------LTKLL--------NDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIP 154 (479) T ss_pred ---eeccCCHH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEc Confidence 22333322 22222 234457899999999999999998765554 332 1355565 Q ss_pred eceEEEeeC--CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEE---EEEeecCCCceEEEE------- Q lcl|NC_012418. 150 LRSYAVRRD--ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYT---HVQRKKGTAMEYAEL------- 217 (510) Q Consensus 150 l~~~~i~~d--~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~---~v~~~~~~~~p~~sv------- 217 (510) ..+.+...| ..+++...+|.+...-.+ .+.-..+++|+ .++.+..++...... 
T Consensus 155 p~~~~~v~d~~~~~~~~~~ir~y~~~~~~---------------~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~ 219 (479) T protein:vir:79 155 AEEAIPIWDSKRQRELVAFIRFYYIEDID---------------GNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGK 219 (479) T ss_pred cceeEEEEeCCCCCceEEEEEEEEEeecC---------------CceEEEEEEEeCCcEEEEEecCCccccccccccccc Confidence 555433333 356677666665543111 11111233321 111111111100000 Q ss_pred EEEe-cCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc-ccc Q lcl|NC_012418. 218 YHEI-DGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAV 295 (510) Q Consensus 218 ~~e~-~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g-~~~ 295 (510) .... ++.........++..+|++..+- +.+|+|=.+...+-+..++.+.-......+...+|.+++.-.+ ... T Consensus 220 ~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~ 294 (479) T protein:vir:79 220 MTDIQEGHFRINNKEQGWGKVPFIPFKN-----NEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSL 294 (479) T ss_pred ccccccccccccccccCCCcccEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccc Confidence 0000 11111222233456789987654 4679998888999999999888788888888888877764211 111 Q ss_pred hhhhccC-CCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccCCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 296 VDDYQDA-EMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 296 p~~~~~~-~~g~~-~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~~TAtEi~~r~~E~~~~L 372 (510) .+..... ..+.+ .+++ ++++.+.. ..+.......++.+++.|...-.. +.........|++.+..+-.-... . 
T Consensus 295 ~~~~~~~~~~~~i~~~~~-~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~-k 370 (479) T protein:vir:79 295 QEFIDNIRYYKSIKVDGG-GGVDKLEI--NIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDL-K 370 (479) T ss_pred ccchhhhhhccceecCCC-CcceEEec--cCCHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHH-H Confidence 1111111 11222 2222 33444432 235677788888888888655432 221112234566665442111111 1 Q ss_pred chhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeec--HHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCH Q lcl|NC_012418. 373 GGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG--LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISL 450 (510) Q Consensus 373 Gpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~--is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~ 450 (510) ---..+.-.+.+.-+++.+..++...+.......++++..... .+....+ +.+. +..+. +.. T Consensus 371 ~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a---~~~~---kl~g~----------iS~ 434 (479) T protein:vir:79 371 CSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKI---DMAA---KSTGI----------VSD 434 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHH---HHHH---HHhcc----------CcH Confidence 1222333333333344444444433332333333444433222 2222222 2111 11111 122 Q ss_pred HHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhh Q lcl|NC_012418. 
451 PKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDM 503 (510) Q Consensus 451 d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~ 503 (510) ..++ ..++ ++ -.++|++++.++++.+.+..+.....-..+..++ T Consensus 435 et~l----~~l~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 435 ETIV----SNHPWVE-----DVNDELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred HHHH----HhCCCCC-----CHHHHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 2233 2333 21 1356777776665554444444432222233333 No 61 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.80 E-value=1.8e-05 Score=46.58 Aligned_cols=407 Identities=9% Similarity=0.016 Sum_probs=178.0 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHHhcc--cccCCCCC---C-ccccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTTLP--YLMVDPMS---G-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~~lP--~~~~~~~~---~-~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) -++.+.+..+..+ |......|+++|+=.-+ .+-..... . ......++..+.+...++..++-|.+ -|+. T Consensus 27 ~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g--~p~~-- 102 (468) T protein:vir:96 27 QEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVA--NPVT-- 102 (468) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhcc--CCce-- Confidence 2333333333332 43445556666543211 11000000 0 01112355566666666666655543 2222 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCCC-cEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA-TVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~~-~~~~~pl 150 (510) ++.+++. ..+.|. ..+ ..||...+.++.++...+|.+.+ |.+++.. ++.+++. 
T Consensus 103 ---~~~~d~~-------------~~~~l~-------~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 158 (468) T protein:vir:96 103 ---YGTEDEK-------------SLKTIQ-------EVL-NHKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPA 158 (468) T ss_pred ---eccCChH-------------HHHHHH-------HHH-hcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcc Confidence 2333322 222222 222 35888889999999999998775 4444321 3555554 Q ss_pred ce-EEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---E-EeecCCCceEEEEEEEecC- Q lcl|NC_012418. 151 RS-YAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---V-QRKKGTAMEYAELYHEIDG- 223 (510) Q Consensus 151 ~~-~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v-~~~~~~~~p~~sv~~e~~~- 223 (510) .+ |.+-.| ..|++.-++|.+...- ...+++|+. . +...+.. ..........+ T Consensus 159 ~~~~~v~~~~~~~~~~~~ir~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 217 (468) T protein:vir:96 159 EQAIPIWTNKERDELKAFIRLYELDG--------------------GERVEYWTANDVTFYELKDGQ-LIPDYYQGEEHV 217 (468) T ss_pred cceEEEEcCCCCCceEEEEEEEEecC--------------------ceEEEEEeCCeEEEEEEcCCc-eeeccccccccc Confidence 44 444333 3577766666654321 111222211 0 0111111 00001111111 Q ss_pred --eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhh-c Q lcl|NC_012418. 224 --VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY-Q 300 (510) Q Consensus 224 --~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~-~ 300 (510) ........+.+..+|++.++ ++.+|.|=-....+-+..++.+.-......+....|.+++.-...-+.... . 
T Consensus 218 ~~~~~~~~~~~~~~~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~ 292 (468) T protein:vir:96 218 QAHYYVGNKSMSWNRVPFIPFK-----NNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMY 292 (468) T ss_pred ccceeeccccccCCcccEEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhh Confidence 11111222344568888664 356799988899999999998888888888888888777642111111111 1 Q ss_pred c-CCCcee-ecCCc-ccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHHH-------HHHHH Q lcl|NC_012418. 301 D-AEMGDY-VPGGA-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRI-------TAEEA 368 (510) Q Consensus 301 ~-~~~g~~-~pg~~-~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~~-------r~~E~ 368 (510) . ...+.+ +++.. ++++.+.. ..+.+.....++.++..|...-.. +.. ...+...|+..+.. .+.++ T Consensus 293 ~~~~~~~i~~~~d~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k 370 (468) T protein:vir:96 293 NLKYYKAINVDGDGSGGVDTIQI--DVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKL 370 (468) T ss_pred hhhcCceEEecCCCCCcceEEee--cCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHH Confidence 1 112222 33322 22333332 234566677788887777655332 211 12233456665532 23344 Q ss_pred HHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHh Q lcl|NC_012418. 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLD 445 (510) Q Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~ 445 (510) ...++..+ ++.+.++.+ .|. ......+.+..- .+.+.+.. ++. +...+. 
T Consensus 371 ~~~~~~~l------------~~~~~li~~~~g~-~~d~~~i~i~f~~~~p~d~~e~---a~~-------~~~~g~----- 422 (468) T protein:vir:96 371 KNKTLTAL------------QELLQYIIDFYKL-SIKVQDVEITFNFNVMVNELEQ---SQI-------GVNSQY----- 422 (468) T ss_pred HHHHHHHH------------HHHHHHHHHHhCC-CcccceeeEEecCCCCcCHHHH---HHH-------HHhcCC----- Confidence 44444433 333333322 121 111222222221 11222111 111 111111 Q ss_pred hccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhc Q lcl|NC_012418. 446 PRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~ 506 (510) +.-..++..+ -+++ -.++|++++.+++++..+++. ..-|+...++| T Consensus 423 --iS~et~i~~l---~~v~-----D~~~E~~ri~~E~~~~~~~~~-----~~~~~~~~~~~ 468 (468) T protein:vir:96 423 --LSKETVVTNH---PWVD-----DPVAEMERIDQEELALPSIEE-----GLNGKENNEPT 468 (468) T ss_pred --CchHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHhh-----ccCCCCCCCCC Confidence 1122222221 1221 124666666655443332221 23456666666 No 62 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=97.80 E-value=1.9e-05 Score=46.52 Aligned_cols=386 Identities=10% Similarity=-0.006 Sum_probs=161.4 Q ss_pred hcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHH Q lcl|NC_012418. 28 TLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRK 107 (510) Q Consensus 28 ~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~ 107 (510) .+|.-..++... ..++..-+-+..++++++..|. +.+ |+ .+|... .. . 
T Consensus 1 ~l~~~~~~~~~~---~~~~~v~n~~~~ivd~~~~~l~----~~g---f~--~~d~~~---------~~-----------~ 48 (434) T protein:vir:98 1 MLPKNAEQAFLD---FQRKARTNFCGLIANASVHRLL----ALG---VT--GPDGEP---------DT-----------R 48 (434) T ss_pred CCCCCccHHHHH---hhhhhhccchHHHHHHHHhhhc----cCc---ee--cCCCch---------HH-----------H Confidence 344322221111 1112234566677777776553 333 33 222211 11 1 Q ss_pred HHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC----------cEEEEEece-EEEeeCCCCCeEEEEEEEeecHHH Q lcl|NC_012418. 108 ATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----------TVVAWSLRS-YAVRRDATGRWMDIVLKQRYKSKD 176 (510) Q Consensus 108 ~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~----------~~~~~pl~~-~~i~~d~~G~vd~i~r~~~~t~~~ 176 (510) +.+.+.+++|.....+++++..++|.+.+++..+.. .+++++-.+ +++--+..+++...++.+....+ T Consensus 49 ~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~- 127 (434) T protein:vir:98 49 ASRWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDID- 127 (434) T ss_pred HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccC- Confidence 223456789999999999999999998776543211 266676655 44444444666655554432211 Q ss_pred hhHHhhHhhhhhhhccCCCceEEEEEEEEe---e-cCC-CceEEEEEEEecCeeeccccccccccCceEEEeeeecCCCc Q lcl|NC_012418. 177 LDEAYKQDLMRAGRNLSGSGSVDLYTHVQR---K-KGT-AMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEH 251 (510) Q Consensus 177 l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~---~-~~~-~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~ 251 (510) ......+.+++.++. + ... .....+.-+...+. 
.......++..+|++.+.=+...++ T Consensus 128 ---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~h~~g~vPvv~f~N~~~~~~- 190 (434) T protein:vir:98 128 ---------------GFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGT-ADSGDVHDLGGMQLVEFARMPDLGE- 190 (434) T ss_pred ---------------CceEEEEEEeCcEEEEEEeeccccccccccccceeccc-ccccccCCCCccceEEeccCCCcCc- Confidence 111112222221111 1 111 00000000111111 1111112356799998876665544 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCce--------eecCC-----ccccccc Q lcl|NC_012418. 252 YGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGD--------YVPGG-----AEAVRAY 318 (510) Q Consensus 252 YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~--------~~pg~-----~~~v~~~ 318 (510) +|+|=.+..++-+..++...-..+..++..+.|...+. |.. +........+. ..+|. ..+.+.. T Consensus 191 ~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 267 (434) T protein:vir:98 191 DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIK--GHK-FAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFG 267 (434) T ss_pred CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCC-cccccccccccchhhhhhhccccccccCCCCCceEE Confidence 79998899999999999998888888888888865542 111 11000000000 01111 0122222 Q ss_pred ccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCCCCHHHHHH-------HHHHHHHHhchhHhHHHHHHHHH Q lcl|NC_012418. 319 ERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERVTAEEVRI-------TAEEAENTLGGTYSLLAENLQSP 386 (510) Q Consensus 319 ~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~P 386 (510) ++.. ++++. .+..++.-|........ +..+..+.++.-++. +.+.|...+|..+. 
T Consensus 268 q~~~-~~~~~---~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~--------- 334 (434) T protein:vir:98 268 QLDA-TDLSG---FLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLE--------- 334 (434) T ss_pred EecC-cchHH---HHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------- Confidence 2221 23333 33444444444332211 112223345554433 33344444444333 Q ss_pred HHHHHHHHHh-hcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCC Q lcl|NC_012418. 387 LAYVCLSEVD-DALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSV 463 (510) Q Consensus 387 li~r~~~il~-~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv 463 (510) +.+.++. -.|..+ ....+++..- .+-+.++.++-+.++ .+. +++ . + .+...+|. T Consensus 335 ---~~~rl~~~~~g~~~-~~~~~~v~w~~~~~~s~~~~ada~~kl------~~~--g~~-------~-e---~~~~~lg~ 391 (434) T protein:vir:98 335 ---SVLALAAAQAGVPE-DYTEAEVRWANPAHVTMAVKADAATKL------KSI--GYP-------L-D---VIAEELDE 391 (434) T ss_pred ---HHHHHHHHhcCCCh-hheeeeEEecCCCCCCHHHHHHHHHHH------Hhc--CCc-------H-H---HHHHhCCC Confidence 3333322 233321 1112222221 122223333322222 111 111 1 1 23345665 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhh--hhhh--hhhcccC Q lcl|NC_012418. 464 DTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLE--GASD--MTNALAG 509 (510) Q Consensus 464 p~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~--ga~~--~~~~~ag 509 (510) + ++|++++.+++++++..++........ +... 
...++=| T Consensus 392 ~-------~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 392 S-------PARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGAVDG 434 (434) T ss_pred C-------HHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCCCCC Confidence 4 456665554444333332222111111 0000 1111112 No 63 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.74 E-value=2.3e-05 Score=45.99 Aligned_cols=413 Identities=10% Similarity=0.009 Sum_probs=175.1 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHHh--cccc-cCCCCCC---ccccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL-MVDPMSG---SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~~--lP~~-~~~~~~~---~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) -++.+.+..++.+ |-+......++|.-. ++.+ ...+... ......++..+-+..-++..++-|++ -|+ T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g--~p~--- 102 (474) T protein:vir:96 28 QEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAG--KPV--- 102 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcc--cCc--- Confidence 2222222222221 222222333333321 1111 0011100 01112345566666666666666554 222 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~pl 150 (510) .++..+... ...++.| ..++|.....++.++...+|.+.+++ +++.. ++.+++. 
T Consensus 103 --~~~~~~~~~---------~~~l~~~------------~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:96 103 --TYAHDDDKV---------LDVIHQV------------LDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPA 159 (474) T ss_pred --eeccCChHH---------HHHHHHH------------HhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Confidence 233333211 1112222 23689999999999999999887654 44321 3555666 Q ss_pred ceEEEeeC--CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE---EeecCCCceEEEEEEEecCee Q lcl|NC_012418. 151 RSYAVRRD--ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV---QRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 151 ~~~~i~~d--~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v---~~~~~~~~p~~sv~~e~~~~~ 225 (510) .+.++-.| ..+++.-.+|.++.. ....+++|+.- +.....+. +.......+... T Consensus 160 ~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~vy~~~~i~~~~~~~~~-~~~~~~~~~~~~ 218 (474) T protein:vir:96 160 EQAIPIWTDKEREQLNAFIRIFTFN--------------------GETKVEYWTAETVTYYVYENGG-LIPDFYYGDEHI 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEeec--------------------CeeEEEEEeCCeEEEEEEcCCc-eeeccccccccc Confidence 55443333 357777666665421 11234444211 11111111 111111111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhcc--CC Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD--AE 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~--~~ 303 (510) .......++..+|++.++. +.+|.|=.+..++-+-.++.+.-......+....|.+++.-.+..+...... .. 
T Consensus 219 ~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:96 219 QTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKY 293 (474) T ss_pred cCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhc Confidence 1222223456788887653 4679998899999999999888888888888888876653211111111111 11 Q ss_pred CceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHH-------HHHHHHHHHhch Q lcl|NC_012418. 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVR-------ITAEEAENTLGG 374 (510) Q Consensus 304 ~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~-------~r~~E~~~~LGp 374 (510) .+.+..+..++++.+.. ..+.......++.++..|...-.. +.. ...+...|+..+. .++.+++..++. T Consensus 294 ~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~ 371 (474) T protein:vir:96 294 YKAINVSSDGGVETIQV--EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANV 371 (474) T ss_pred cceeeccCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22332233334444332 235566777888888777554422 111 1122334554443 233344444444 Q ss_pred hHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccccee--eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHH Q lcl|NC_012418. 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~ 452 (510) .+.++ ++.++.++ |.. .....+.+.. ..+.+.+..++-+ ...+. +--.. 
T Consensus 372 ~l~~~--------~~~i~~~~---g~~-~d~~~i~i~f~~~~p~~~~e~a~~~----------~~~gi-------iS~et 422 (474) T protein:vir:96 372 ALQEL--------MQFILDFN---KIK-LDAKEIEITFNFNVMVNDLEQSQIG----------AQSQY-------LSKET 422 (474) T ss_pred HHHHH--------HHHHHHHh---CCC-cccceeeEEecCCCccCHHHHHHHH----------HHcCC-------CChHH Confidence 33332 22222222 221 1222333322 1223333332211 11111 11112 Q ss_pred HHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 453 MMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 453 ~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ++ ..++. +--.++|++++.+++++.+++++.. ...+... +....+- T Consensus 423 ~~----~~lp~----v~D~~~E~eri~~E~~~~~~~~~~~---~~~~~~~-~~~~~~~ 468 (474) T protein:vir:96 423 LV----RHHPW----VDDPKAELERLDEEQLELNKQLPNL---DDGGADG-AQQQQQS 468 (474) T ss_pred HH----HhCCC----CCCHHHHHHHHHHHHHHHHhhcccc---ccccCCC-CCCcCCC Confidence 22 22321 1113466666665554433322211 1111111 1111111 No 64 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.74 E-value=2.3e-05 Score=45.99 Aligned_cols=413 Identities=10% Similarity=0.009 Sum_probs=175.1 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHHh--cccc-cCCCCCC---ccccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL-MVDPMSG---SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~~--lP~~-~~~~~~~---~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) -++.+.+..++.+ |-+......++|.-. ++.+ ...+... 
......++..+-+..-++..++-|++ -|+ T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g--~p~--- 102 (474) T protein:vir:95 28 QEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAG--KPV--- 102 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcc--cCc--- Confidence 2222222222221 222222333333321 1111 0011100 01112345566666666666666554 222 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~pl 150 (510) .++..+... ...++.| ..++|.....++.++...+|.+.+++ +++.. ++.+++. T Consensus 103 --~~~~~~~~~---------~~~l~~~------------~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:95 103 --TYAHDDDKV---------LDVIHQV------------LDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPA 159 (474) T ss_pred --eeccCChHH---------HHHHHHH------------HhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Confidence 233333211 1112222 23689999999999999999887654 44321 3555666 Q ss_pred ceEEEeeC--CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE---EeecCCCceEEEEEEEecCee Q lcl|NC_012418. 151 RSYAVRRD--ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV---QRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 151 ~~~~i~~d--~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v---~~~~~~~~p~~sv~~e~~~~~ 225 (510) .+.++-.| ..+++.-.+|.++.. ....+++|+.- +.....+. +.......+... 
T Consensus 160 ~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~vy~~~~i~~~~~~~~~-~~~~~~~~~~~~ 218 (474) T protein:vir:95 160 EQAIPIWTDKEREQLNAFIRIFTFN--------------------GETKVEYWTAETVTYYVYENGG-LIPDFYYGDEHI 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEeec--------------------CeeEEEEEeCCeEEEEEEcCCc-eeeccccccccc Confidence 55443333 357777666665421 11234444211 11111111 111111111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhcc--CC Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD--AE 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~--~~ 303 (510) .......++..+|++.++. +.+|.|=.+..++-+-.++.+.-......+....|.+++.-.+..+...... .. T Consensus 219 ~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:95 219 QTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKY 293 (474) T ss_pred cCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhc Confidence 1222223456788887653 4679998899999999999888888888888888876653211111111111 11 Q ss_pred CceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHH-------HHHHHHHHHhch Q lcl|NC_012418. 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVR-------ITAEEAENTLGG 374 (510) Q Consensus 304 ~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~-------~r~~E~~~~LGp 374 (510) .+.+..+..++++.+.. ..+.......++.++..|...-.. +.. ...+...|+..+. .++.+++..++. 
T Consensus 294 ~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~ 371 (474) T protein:vir:95 294 YKAINVSSDGGVETIQV--EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANV 371 (474) T ss_pred cceeeccCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22332233334444332 235566777888888777554422 111 1122334554443 233344444444 Q ss_pred hHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccccee--eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHH Q lcl|NC_012418. 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~ 452 (510) .+.++ ++.++.++ |.. .....+.+.. ..+.+.+..++-+ ...+. +--.. T Consensus 372 ~l~~~--------~~~i~~~~---g~~-~d~~~i~i~f~~~~p~~~~e~a~~~----------~~~gi-------iS~et 422 (474) T protein:vir:95 372 ALQEL--------MQFILDFN---KIK-LDAKEIEITFNFNVMVNDLEQSQIG----------AQSQY-------LSKET 422 (474) T ss_pred HHHHH--------HHHHHHHh---CCC-cccceeeEEecCCCccCHHHHHHHH----------HHcCC-------CChHH Confidence 33332 22222222 221 1222333322 1223333332211 11111 11112 Q ss_pred HHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 453 MMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 453 ~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ++ ..++. +--.++|++++.+++++.+++++.. ...+... 
+....+- T Consensus 423 ~~----~~lp~----v~D~~~E~eri~~E~~~~~~~~~~~---~~~~~~~-~~~~~~~ 468 (474) T protein:vir:95 423 LV----RHHPW----VDDPKAELERLDEEQLELNKQLPNL---DDGGADG-AQQQQQS 468 (474) T ss_pred HH----HhCCC----CCCHHHHHHHHHHHHHHHHhhcccc---ccccCCC-CCCcCCC Confidence 22 22321 1113466666665554433322211 1111111 1111111 No 65 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=97.74 E-value=2.3e-05 Score=45.97 Aligned_cols=452 Identities=13% Similarity=0.092 Sum_probs=177.3 Q ss_pred ChhHHHHHHHHH---------h---h----------ccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKL---------R---D----------GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNN 58 (510) Q Consensus 1 ~~~~~~~r~~~l---------k---r----------~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~ 58 (510) |-++++.-|.+. + + .....+|+.+|+=-.|.+....++....+..+.-=..+...++. T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~ 82 (517) T protein:vir:98 3 VIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADV 82 (517) T ss_pred hHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHHHH Confidence 444444444321 1 0 11344677776533343321111111111111111234445555 Q ss_pred HHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE-- Q lcl|NC_012418. 59 LAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL-- 136 (510) Q Consensus 59 LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l-- 136 (510) +|+-|.+-+.. +++++....+.. 
........++|++ .+..++|+..+.++..+..+.|.+++ T Consensus 83 ~A~Ll~~e~~~-------i~v~d~~~~~~~--~~~~~~~~e~l~~-------i~~~n~f~~~~~~~~e~a~a~G~~a~k~ 146 (517) T protein:vir:98 83 LSGLVFNEQCE-------VYVSDAKDEEKK--DNSFKTAHEFIQH-------VFQHNKFIKNLSDYLEPTFALGGLTVRP 146 (517) T ss_pred hhhhhcCCcce-------EEeccccccccc--ccchhHHHHHHHH-------HHHhccHHHHHHHHHHHHhhhCCEEEEE Confidence 55544332222 122222111100 0111223455544 47788999999999999999998764 Q ss_pred EEeCCCCcEEEEEeceEEE-eeCCCCCeEEEE-EEEeecHHHhhHHhhHhhhh--hhhccCCCceEEEEEEEEeec---- Q lcl|NC_012418. 137 YRNSDEATVVAWSLRSYAV-RRDATGRWMDIV-LKQRYKSKDLDEAYKQDLMR--AGRNLSGSGSVDLYTHVQRKK---- 208 (510) Q Consensus 137 ~~~~~~~~~~~~pl~~~~i-~~d~~G~vd~i~-r~~~~t~~~l~~~~~~~~~~--~~~~~~~~~~v~i~~~v~~~~---- 208 (510) |.+.+..++.+++-..|+- .-|..|.+..+| .++..+.+.=..-+. .+.. .......+.+..|-+.++... T Consensus 147 ~~d~~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt-~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~ 225 (517) T protein:vir:98 147 YVDNGEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYT-LLEFHEWEKTEEGESLYVITNELYKSDNEGE 225 (517) T ss_pred EEeCCeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEE-EEEEEecCceeccCCcEEEEEEEEecCCCcc Confidence 6776655677777777654 666666554432 333332211000000 0000 000000111222333333221 Q ss_pred -CCCceEEEEEEEecCeeeccccccccccCceEE----Eeee-ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 209 -GTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIV----PTWN-LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESL 282 (510) Q Consensus 209 -~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~----~Rw~-~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~ 282 (510) +...|...+|-++.. .....+. ..|.++ +-.+ +..++.||+|--..+++-++.||..--+...-... . 
T Consensus 226 lG~~v~L~~~~e~l~~--~~~~~g~---~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g 299 (517) T protein:vir:98 226 IGKRIPLEELYEGMQE--KTYIQGL---SRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-G 299 (517) T ss_pred ccccccccccccCCCc--ceeECCC---CcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-C Confidence 112233333211110 0111221 234222 1222 33378899999999999999999877666665444 4 Q ss_pred CCceeeCCCcccchhhhccCCCceeecCCccc----c-cccccC--c--c----cch--HHHHHHHHHHHHHHHHHh-hc Q lcl|NC_012418. 283 EVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEA----V-RAYERG--D--Y----NKM--AAIQQSLQAVVVRLNQAF-MY 346 (510) Q Consensus 283 ~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~----v-~~~~~~--~--~----~~~--~~~~~~i~~~~~~I~~af-~~ 346 (510) +....|++ .++.++. . .+| ..++..-+ + ..+..+ . . .++ ....+.++.+-+.|.... +. T Consensus 300 ~~~i~vp~-~~l~~~~-~--~~g-~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls 374 (517) T protein:vir:98 300 QRTVFVSD-VMLRTVP-D--ESG-MPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLS 374 (517) T ss_pred CcceecCh-hhhcccc-C--CCC-cccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCC Confidence 55555543 4443221 0 001 11111000 0 000000 0 0 111 122333444444343322 11 Q ss_pred -cccCCC-CCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCC-C--CcccccceeeecH--HHH Q lcl|NC_012418. 347 -GANQRD-AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQG-L--ITKQHKPAIETGL--PAL 419 (510) Q Consensus 347 -~~~~~~-~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~-~--~~~~~~~~~v~~i--s~L 419 (510) ..+.-+ ...-|||||..+.+...+...- +.+.-...|.-|++-+..+..-.++.+ . 
+...+.+..=.++ +.- T Consensus 375 ~~t~~~~~~~~kTATEi~s~~~~~~~t~~~-~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~ 453 (517) T protein:vir:98 375 VGTFSFDGRSMKTATEIVSENDLTYRTRND-HVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRS 453 (517) T ss_pred cccccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHH Confidence 112211 1235999999999988877654 233323333334333333322222221 1 1122222211222 222 Q ss_pred HHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhh Q lcl|NC_012418. 420 SRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEG 499 (510) Q Consensus 420 ~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~g 499 (510) +.+.... +.+++ |+ +-.. ..+.+.+|+ |++|.+++.++.+. +. +.+.-... T Consensus 454 ~~~~~~~------~~v~a--G~------ms~~---~~i~~~~g~-------~eeeA~~e~~~i~~--E~---~~~~~~~~ 504 (517) T protein:vir:98 454 ALLRFYG------QAKTF--GF------IPTV---EAIQRIFKV-------PKKTAEQWLEEIRK--DQ---IELDPVTI 504 (517) T ss_pred HHHHHHH------HHHhc--CC------CCHH---HHHHHhCCC-------ChHHHHHHHHHHHH--hc---cccCCCCc Confidence 2221111 11211 21 1111 224455664 34554443322211 11 11111111 Q ss_pred hhhhhhcccCC Q lcl|NC_012418. 500 ASDMTNALAGV 510 (510) Q Consensus 500 a~~~~~~~ag~ 510 (510) ....+..++|= T Consensus 505 ~~~~~~~~~gd 515 (517) T protein:vir:98 505 SQRAQKRMFGD 515 (517) T ss_pred cccccCCCCCC Confidence 12222222322 No 66 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=97.73 E-value=2.4e-05 Score=45.88 Aligned_cols=442 Identities=12% Similarity=0.051 Sum_probs=201.1 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc--cCCCCCCcccc-ccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL--MVDPMSGSRGV-VEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~--~~~~~~~~~~~-~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) |.+.=.++|. .+.+|..=.- ...--..+..+ ...++++.|...+.. .+-.+. |+..|+- T Consensus 28 ~d~~Rl~aY~------------l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~----~~~~~~-~g~~~~~- 89 (527) T protein:vir:10 28 FDKARLASYR------------LYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEA----KMRFLG-QGLKWEF- 89 (527) T ss_pred HHHHHHHHHH------------HHHHHhcCchhheeeecCCccccccceeeehhhHHhhCC----cceeec-cCccccc- Confidence 4443333443 3333332210 00000011111 124678888444332 232333 3344421 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--C-C--CCcEEE--EEe Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--S-D--EATVVA--WSL 150 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~-~--~~~~~~--~pl 150 (510) +.. - ++|+..+...+++.|++....++-.+.++.|-+++.+- + + .+|.++ +-. T Consensus 90 ---~~~----------~-------e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP 149 (527) T protein:vir:10 90 ---SKK----------D-------AKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDP 149 (527) T ss_pred ---cch----------h-------HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCc Confidence 111 0 12344555567889999999999999999998886653 2 1 235544 445 Q ss_pred ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHh-----hhhhhhccCCCceE-----EEEEEEEee-----cCCCce-- Q lcl|NC_012418. 151 RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQD-----LMRAGRNLSGSGSV-----DLYTHVQRK-----KGTAME-- 213 (510) Q Consensus 151 ~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~-----~~~~~~~~~~~~~v-----~i~~~v~~~-----~~~~~p-- 213 (510) +.|+..+|++| ++.+-+.+....=.+|++-.+. +.+-.++.++.... ..|+...+. 
+...-| T Consensus 150 ~~~f~~ed~d~-~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~ 228 (527) T protein:vir:10 150 STYFPYEDPRY-PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLE 228 (527) T ss_pred ceeeeeecCCC-CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccc Confidence 78888888876 4433333221111223222211 11111122222221 122222111 111111 Q ss_pred EEEEEEEecCeeeccc-cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc Q lcl|NC_012418. 214 YAELYHEIDGVRVGEE-GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK 292 (510) Q Consensus 214 ~~sv~~e~~~~~~~~~-~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g 292 (510) -.++-..+++..+..+ -.| .-.|++.++=...++++||+|=-.+.+.=+..||+..-.....+...-.|+...+ | T Consensus 229 ~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~t--g 304 (527) T protein:vir:10 229 PDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATD--S 304 (527) T ss_pred hhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeec--c Confidence 1222223455554322 223 2368887787888999999999999999999999888777777777777766553 4 Q ss_pred ccchhhhccCCC-ceeecCCcccc----cccccCcccchHHHHHHHHHHHHHHHHHhhcc---ccCCCCCCCCHHHHHHH Q lcl|NC_012418. 293 GAVVDDYQDAEM-GDYVPGGAEAV----RAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG---ANQRDAERVTAEEVRIT 364 (510) Q Consensus 293 ~~~p~~~~~~~~-g~~~pg~~~~v----~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~---~~~~~~~~~TAtEi~~r 364 (510) +...+. ..-.+ -.+-||.+... 
+...+...+++...+.-+..+..+|...==.- ....|..+ --+.++ T Consensus 305 ~~~vd~-~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG~A-- 380 (527) T protein:vir:10 305 APPRDS-RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAV-AESGIA-- 380 (527) T ss_pred cccccc-cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHHHH-- Confidence 443221 11100 11224433321 22223333455555555666665554432111 11123222 112221 Q ss_pred HHHHHHHhchhHhHHHHHH-HHHHHHHHHH--H----H--hhcCCCCCCccc-ccceee--ecHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 365 AEEAENTLGGTYSLLAENL-QSPLAYVCLS--E----V--DDALLQGLITKQ-HKPAIE--TGLPALSRSAAVQSMLNAS 432 (510) Q Consensus 365 ~~E~~~~LGpv~~rl~~E~-l~Pli~r~~~--i----l--~~~~l~~~~~~~-~~~~~v--~~is~L~raq~~~~~~~~~ 432 (510) +...|+|++.|.+..= +.-.+.|-|. + | +++ ...-+.+ .-+..+ .+.-|.-+++-++++... T Consensus 381 ---LeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~--v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL- 454 (527) T protein:vir:10 381 ---LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEG--VGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQL- 454 (527) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhh--cccCCCccccceEEEecccCCCCHHHHHHHHHHH- Confidence 2334555555555442 2222332221 0 1 111 1111111 111122 344455555555554322 Q ss_pred HHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhh------h--hh Q lcl|NC_012418. 433 QVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGAS------D--MT 504 (510) Q Consensus 433 q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~------~--~~ 504 (510) .+ +++ +-...+++.+.++-|. --.++|++++.+.+.+|+....-|....++.|+ . 
.| T Consensus 455 --~~--aGi------~S~~tAv~~L~~~~g~-----eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d 519 (527) T protein:vir:10 455 --WE--AGL------IPAKKLTEELSKIMGF-----ELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDD 519 (527) T ss_pred --HH--cCc------hhHHHHHHHHHhccCC-----CChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcc Confidence 11 121 2334577888887773 245678888888777766655444333221111 1 12 Q ss_pred hcccCC Q lcl|NC_012418. 505 NALAGV 510 (510) Q Consensus 505 ~~~ag~ 510 (510) ...-|+ T Consensus 520 ~~~~~~ 525 (527) T protein:vir:10 520 QALNGQ 525 (527) T ss_pred cccCCC Confidence 222222 No 67 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=97.72 E-value=2.5e-05 Score=45.80 Aligned_cols=442 Identities=12% Similarity=0.052 Sum_probs=201.3 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc--cCCCCCCcccc-ccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL--MVDPMSGSRGV-VEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~--~~~~~~~~~~~-~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) |.+.=.++|. .+.+|..=.- ...--..+..+ ...++++.|...+.. .+-.+. |+..|+- T Consensus 28 ~d~~Rl~aY~------------l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~----~~~~~~-~g~~~~~- 89 (527) T protein:vir:10 28 FDKARLASYR------------LYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEA----KMRFLG-QGLKWEF- 89 (527) T ss_pred HHHHHHHHHH------------HHHHHhcCchhheeeecCCccccccceeeehhhHHhhCC----cceeec-cCccccc- Confidence 4443333443 3333332210 00000011111 124678888444332 232333 3334421 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--C-C--CCcEEE--EEe Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--S-D--EATVVA--WSL 150 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~-~--~~~~~~--~pl 150 (510) +.. 
- ++|+..+...+++.|++....++-.+.++.|-+++.+- + + .+|.++ +-. T Consensus 90 ---~~~----------~-------e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP 149 (527) T protein:vir:10 90 ---SKK----------D-------AKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDP 149 (527) T ss_pred ---cch----------h-------HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCc Confidence 110 0 12344555678889999999999999999998886653 2 1 235544 445 Q ss_pred ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHh-----hhhhhhccCCCceE-----EEEEEEEee-----cCCCce-- Q lcl|NC_012418. 151 RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQD-----LMRAGRNLSGSGSV-----DLYTHVQRK-----KGTAME-- 213 (510) Q Consensus 151 ~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~-----~~~~~~~~~~~~~v-----~i~~~v~~~-----~~~~~p-- 213 (510) +.|+..+|++| ++.+-+.+....=.+|++-.+. +.+-.++.++.... ..|+...+. +...-| T Consensus 150 ~~~f~~ed~d~-~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~ 228 (527) T protein:vir:10 150 STYFPYEDPRY-PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLE 228 (527) T ss_pred ceeeeeecCCC-CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccc Confidence 78888888876 4433333221111223222211 11111122222221 122222111 111111 Q ss_pred EEEEEEEecCeeeccc-cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc Q lcl|NC_012418. 
214 YAELYHEIDGVRVGEE-GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK 292 (510) Q Consensus 214 ~~sv~~e~~~~~~~~~-~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g 292 (510) -.++-..+++..+..+ -.| .-.|++.++=...++++||+|=-.+.+.=+..||+..-.....+...-.|+...+ | T Consensus 229 ~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~t--g 304 (527) T protein:vir:10 229 PDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATD--S 304 (527) T ss_pred hhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeec--c Confidence 1222223455554322 223 2368887787888999999999999999999999888777777777777766553 4 Q ss_pred ccchhhhccCCC-ceeecCCcccc----cccccCcccchHHHHHHHHHHHHHHHHHhhcc---ccCCCCCCCCHHHHHHH Q lcl|NC_012418. 293 GAVVDDYQDAEM-GDYVPGGAEAV----RAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG---ANQRDAERVTAEEVRIT 364 (510) Q Consensus 293 ~~~p~~~~~~~~-g~~~pg~~~~v----~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~---~~~~~~~~~TAtEi~~r 364 (510) +...+. ..-.+ -.+-||.+... +...+...+++...+.-+..+..+|...==.- ....|..+ --+.++ T Consensus 305 ~~~vd~-~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG~A-- 380 (527) T protein:vir:10 305 APPRDS-RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAV-AESGIA-- 380 (527) T ss_pred cccccc-cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHHHH-- Confidence 443221 11100 11224433321 22223333455555555666665554432111 11123222 112221 Q ss_pred HHHHHHHhchhHhHHHHHH-HHHHHHHHHH--H----H--hhcCCCCCCccc-ccceee--ecHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 365 AEEAENTLGGTYSLLAENL-QSPLAYVCLS--E----V--DDALLQGLITKQ-HKPAIE--TGLPALSRSAAVQSMLNAS 432 (510) Q Consensus 365 ~~E~~~~LGpv~~rl~~E~-l~Pli~r~~~--i----l--~~~~l~~~~~~~-~~~~~v--~~is~L~raq~~~~~~~~~ 432 (510) +...|+|++.|.+..= +.-.+.|-|. + | +++ ...-+.+ .-+..+ .+.-|.-+++-++++.... 
T Consensus 381 ---LeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~--v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~ 455 (527) T protein:vir:10 381 ---LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEG--VGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELW 455 (527) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhh--cccCCCccccceEEEecccCCCCHHHHHHHHHHHH Confidence 2334555555555442 2222332221 0 1 111 1111111 111122 3444555555555543221 Q ss_pred HHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhh------h--hh Q lcl|NC_012418. 433 QVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGAS------D--MT 504 (510) Q Consensus 433 q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~------~--~~ 504 (510) + +++ +-...+++.+.++-|. --.++|++++.+.+.+|+....-|....++.|+ . .| T Consensus 456 ~-----aGi------iS~etAv~~L~~~~g~-----eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d 519 (527) T protein:vir:10 456 E-----AGL------IPAKKLTEELSKIMGF-----ELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDD 519 (527) T ss_pred H-----cCc------hhHHHHHHHHHhccCC-----CchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcc Confidence 1 121 2334577888887773 245678888888877766655444333221111 1 12 Q ss_pred hcccCC Q lcl|NC_012418. 505 NALAGV 510 (510) Q Consensus 505 ~~~ag~ 510 (510) ...-|+ T Consensus 520 ~~~~~~ 525 (527) T protein:vir:10 520 QALNGQ 525 (527) T ss_pred cccCCC Confidence 222222 No 68 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=97.68 E-value=3e-05 Score=45.40 Aligned_cols=416 Identities=12% Similarity=0.014 Sum_probs=159.9 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc-cCCCCCCcccccc--ccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MVDPMSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~-~~~~~~~~~~~~~--~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) +++.+.+.+.+. ..+.+.+.+|..=.. ...-+........ 
+..-+-+..++++++..| ++-+ |+ T Consensus 16 ~~~~l~~~~~~~-----~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g---~~- 82 (484) T protein:vir:77 16 AREEMLNLFTER-----TQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQ----ELEG---FR- 82 (484) T ss_pred HHHHHHHHHHHH-----HHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhh----ccCc---ee- Confidence 555555555422 122333334432110 0000000011111 122344455555555544 2222 22 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC-----------cEE Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA-----------TVV 146 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~-----------~~~ 146 (510) .++.. +.. +.+.+....++|.....++.++..++|.+.+++..+.. +++ T Consensus 83 -~~~~~------------~~~-------~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~ 142 (484) T protein:vir:77 83 -LGGAD------------KAD-------EQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIR 142 (484) T ss_pred -cCCcc------------hhH-------HHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEE Confidence 12111 011 12334466789999999999999999998765543321 356 Q ss_pred EEEece-EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEE-ecCe Q lcl|NC_012418. 147 AWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHE-IDGV 224 (510) Q Consensus 147 ~~pl~~-~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e-~~~~ 224 (510) +++-.+ |++--+..+++...++.+.-. ..+.-..+++|+ ++. .+++. .+|. T Consensus 143 ~~~p~~~~~~~D~~~~~~~~a~~~~~~~-----------------~~~~~~~~~~y~-----~~~-----~~~~~~~~~~ 195 (484) T protein:vir:77 143 VEPPTNLYAQIDPRTRQVMRAIRAIEDE-----------------EGNEVIGATLYL-----PNN-----TVIWNREDGQ 195 (484) T ss_pred EeccceeEEEecCCCCceEEEEEEEEee-----------------cCCcEEEEEEEe-----cCe-----EEEEEecCCc Confidence 665555 444433456666555544321 001111222221 110 11111 1221 Q ss_pred e-eccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCcc-cchhh Q lcl|NC_012418. 
225 R-VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKG-AVVDD 298 (510) Q Consensus 225 ~-~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~---~~g~-~~p~~ 298 (510) . .......++..+|++.++.+...++.+|+|-..+ ..+=+..++...-.+...++..+.|...+. ++-. ..+.+ T Consensus 196 ~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~ 275 (484) T protein:vir:77 196 WVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPET 275 (484) T ss_pred eEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccc Confidence 1 1122223356799999998888888999996654 334456666666666666676666654431 1100 00000 Q ss_pred ---hccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHHH-------H Q lcl|NC_012418. 299 ---YQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEV-------R 362 (510) Q Consensus 299 ---~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtEi-------~ 362 (510) +.....|.+......+.+..++.. +++ ...++.++.-|........ +.....+ -++.-+ . T Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~q~~~-~~~---e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~ 351 (484) T protein:vir:77 276 GQTLFDAYLARILAFEDHESKAQQFSA-AEL---RNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLV 351 (484) T ss_pred cchhhhhhhhhhcccCCCCceeEeecC-CCh---HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHH Confidence 000011111111111222223221 223 3445556665655432211 1111111 233333 2 Q ss_pred HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeecHHH--HHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_012418. 363 ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPA--LSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 363 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~is~--L~raq~~~~~~~~~q~l~~~~~ 440 (510) .++++|...+|..+.++- ..++.+. ++ ...+.+... 
..+.+-.+ -..++.++.+..+ .+.-.+ T Consensus 352 ~ka~~k~~~f~~~l~~~~--------~l~~~~~--~~-~~~~~~~~~-i~v~w~~~~~~s~~~~ad~~~kl---~~~g~g 416 (484) T protein:vir:77 352 KTVERKNKIFGGAWEQAM--------RVAYKVM--NG-GDIPPEYYR-MESIWRDPSTPTYAAKADAATKL---YNNGQG 416 (484) T ss_pred HHHHHHHHHHHHHHHHHH--------HHHHHHh--CC-CCccccccc-ceEEecCCCCCCHHHHHHHHHHH---HhccCC Confidence 344556666665554431 2222222 11 122222222 12222111 1222222222221 111111 Q ss_pred hhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHH-HHHHhhH--HHhhhhhh----------hhhcc Q lcl|NC_012418. 441 IAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAA-QAQAAQE--TLLEGASD----------MTNAL 507 (510) Q Consensus 441 ~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~-~~~~~~~--~~~~ga~~----------~~~~~ 507 (510) + +.- +.+...+|.... ..+|++++++++..+.+ ....+.. ....|... .+..+ T Consensus 417 i------~s~----et~~~~l~~~~~----~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (484) T protein:vir:77 417 V------IPK----ERARIDMGYSIT----EREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQPNPAEEA 482 (484) T ss_pred C------CCH----HHHHhcCCCChh----HHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccCCCcccc Confidence 1 111 123344554322 12344444333222111 1111100 11111111 11222 Q ss_pred cC Q lcl|NC_012418. 508 AG 509 (510) Q Consensus 508 ag 509 (510) || T Consensus 483 ~~ 484 (484) T protein:vir:77 483 AA 484 (484) T ss_pred CC Confidence 22 No 69 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.67 E-value=3e-05 Score=45.35 Aligned_cols=447 Identities=10% Similarity=0.012 Sum_probs=172.6 Q ss_pred ChhHHHHHHHHH-----h-------------hccchHHHHHHHHHhcccccCCCCCCcccccccccc--chHHHHHHHHH Q lcl|NC_012418. 
1 MKSTAAMLWEKL-----R-------------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALLVNNLA 60 (510) Q Consensus 1 ~~~~~~~r~~~l-----k-------------r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d--stg~~a~~~LA 60 (510) ||..+++...++ . +......|+.+|+=--+.+ . .....+....+-+. ..+...++.+| T Consensus 7 ~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~-~-~~~~~~~~~~~~~~slnl~~~i~~~~A 84 (522) T protein:vir:47 7 VKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDV-Q-YKNTDGDIKSRPMNHLPIARTASKKIA 84 (522) T ss_pred HHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccc-c-ccccCcchhcccceecchHHHHHHHHh Confidence 333333333221 1 0011224544443111111 0 00111111222233 34455555555 Q ss_pred HHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE--EEE Q lcl|NC_012418. 61 AKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYR 138 (510) Q Consensus 61 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~--l~~ 138 (510) +-+.+-.. .++++|+ ...++| .+.+..++|+..+.+++....+.|+++ .|. T Consensus 85 ~lv~~e~~-------~i~v~d~-------------~~~~~l-------~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~ 137 (522) T protein:vir:47 85 SLVYNEQA-------TITTKNE-------------ILQKFL-------DDMLTNDRFNKNFERYLESCLALGGLAMRPYI 137 (522) T ss_pred hhhcCCcc-------eeecCCh-------------HHHHHH-------HHHHhhcchHHHHHHHHHHhhccCCEEEEEEE Confidence 54443221 1222222 233444 444778899999999999999888866 466 Q ss_pred eCCCCcEEEEEeceEE-EeeCCCCCeEEEEE-EEeecHHHhh--------HHhhHhhhhhhhccCCCceEEEEEEEEeec Q lcl|NC_012418. 139 NSDEATVVAWSLRSYA-VRRDATGRWMDIVL-KQRYKSKDLD--------EAYKQDLMRAGRNLSGSGSVDLYTHVQRKK 208 (510) Q Consensus 139 ~~~~~~~~~~pl~~~~-i~~d~~G~vd~i~r-~~~~t~~~l~--------~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~ 208 (510) +.+.-++..++-..|+ +..|..|.+..++. +...+-.+-. .+|...-.........+....|-+..+... 
T Consensus 138 d~~~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~ 217 (522) T protein:vir:47 138 DGDKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSD 217 (522) T ss_pred cCCceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecC Confidence 6555567778887766 56777776554333 2221111100 000000000000000111112222222211 Q ss_pred -----CCCceEEEEEEEecCeeeccccccccccCceEE----Eeeee-cCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 209 -----GTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIV----PTWNL-APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYE 278 (510) Q Consensus 209 -----~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~----~Rw~~-~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~ 278 (510) ++..|...+....+-.......+. .-|.++ +.++. ..++.||+|--..+.+-++.||..--+...-. T Consensus 218 ~~~~lG~~v~l~~~~e~~~l~~~~~~~~~---~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~ 294 (522) T protein:vir:47 218 VNDVLGQRVNLSELDKYKNLEPVTVFENL---SRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEV 294 (522) T ss_pred CCcccCccccccccccccCCCCceEeCCC---CcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHH Confidence 111222222110110110011111 224322 23333 34788999999999999999998666655544 Q ss_pred HHhhCCceeeCCCcccchhhhc-----------cCCCceeecCC-----cccccccccCcccchHHHHHHHHHHHHHHHH Q lcl|NC_012418. 279 LESLEVLNLVDEAKGAVVDDYQ-----------DAEMGDYVPGG-----AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQ 342 (510) Q Consensus 279 ~~a~~p~~l~~~~g~~~p~~~~-----------~~~~g~~~pg~-----~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~ 342 (510) ...-. ...|+ ..+++...-. +.....+++.+ .+.+..++.. -........++.+-+.|.. 
T Consensus 295 ~~g~~-~i~v~-~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~--ir~e~~~~~~~~~l~~i~~ 370 (522) T protein:vir:47 295 RMGQR-RVIVP-EHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSP--IRANDYILAISEGLKLFEM 370 (522) T ss_pred Hhccc-eeecc-hHHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccc--cChHHHHHHHHHHHHHHHH Confidence 43222 22332 2222211100 00001122211 1112211111 1112223344444444433 Q ss_pred Hh-hc-cccCC-CCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCC-CCccc--ccceeeec- Q lcl|NC_012418. 343 AF-MY-GANQR-DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQG-LITKQ--HKPAIETG- 415 (510) Q Consensus 343 af-~~-~~~~~-~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~-~~~~~--~~~~~v~~- 415 (510) .. |. ..+.- .....|||||..+.+...+...-.- +.-..-|.-|+.-++.++.-.++.. .+++. +.+..=.+ T Consensus 371 ~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~-~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i 449 (522) T protein:vir:47 371 QIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIV-ALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGV 449 (522) T ss_pred HhCCCccccCccccccccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCC Confidence 32 11 11211 1233699999999999888866633 3333444556666665554333321 22222 22222122 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhH Q lcl|NC_012418. 416 -LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQE 494 (510) Q Consensus 416 -is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~ 494 (510) .+..++.+... +.++ +|+ +-.. ..+.+..|++ ++|.+++.++.++ ++..+... T Consensus 450 ~~D~~~~~~~~~------~~v~--aG~------~s~e---~~i~~~~g~~-------eeea~~el~ri~~--E~~~~~~~ 503 (522) T protein:vir:47 450 FTDRHAELDYWA------KMVA--AGF------STKK---RAIGKTLNIS-------GVEAEKELNAINS--ELLPMNDA 503 (522) T ss_pred CCCHHHHHHHHH------HHHh--cCC------CCHH---HHHHhcCCCC-------hHHHHHHHHHHHH--hhccCCCC Confidence 22222221111 1111 121 1111 2234455643 4554444333222 11111111 Q ss_pred HHhhhhh-hhhhcccCC Q lcl|NC_012418. 
495 TLLEGAS-DMTNALAGV 510 (510) Q Consensus 495 ~~~~ga~-~~~~~~ag~ 510 (510) ....+.+ .......|= T Consensus 504 ~~~~~~~~~~~~~~~d~ 520 (522) T protein:vir:47 504 ELAIYGMHDQNEEKADD 520 (522) T ss_pred CCCCCCCCCcccccCCC Confidence 1111111 111111111 No 70 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.64 E-value=3.4e-05 Score=45.11 Aligned_cols=418 Identities=13% Similarity=0.013 Sum_probs=166.6 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc-cCCCC--CCccccccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~-~~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) =+..+.....++.+ ...+.+.+.+|..-.. ....+ .....+..++..+-+..+++.+++.| ++.+ |.. T Consensus 4 ~~~~i~~L~~~~~~--~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g---~~~ 74 (480) T protein:vir:78 4 YHEHVERLQGLLAR--DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIEG---FRI 74 (480) T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhh----ccCc---eec Confidence 23334444443321 1223333444432211 00000 00001111334455666666666654 3332 222 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeC------C-CC--cEEEE Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS------D-EA--TVVAW 148 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~------~-~~--~~~~~ 148 (510) .- |.. ..+ .+...+..++|.....+++++...+|.+.+++.. 
+ .+ +++.+ T Consensus 75 ~~-d~~-------------~~~-------~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~ 133 (480) T protein:vir:78 75 SE-DSE-------------GLE-------ELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVE 133 (480) T ss_pred CC-Cch-------------hHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEE Confidence 21 111 111 2234467789999999999999999998766542 1 12 46667 Q ss_pred EeceEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCe-- Q lcl|NC_012418. 149 SLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGV-- 224 (510) Q Consensus 149 pl~~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~-- 224 (510) +..+.++..|. .+++...+|.+.-. . +.+....+++|+ ++. ...|...++. T Consensus 134 ~p~~~~~~~D~~~~~~~~~~i~~~~~~-~---------------~~~~~~~~~~y~-----~~~----~~~~~~~~~~~~ 188 (480) T protein:vir:78 134 SPLYMYAELDPRNTRRVTRAVRLYTTR-D---------------DVAVPDRATLYL-----PDE----TVPLRRNGGLND 188 (480) T ss_pred cccceEEEEcCCCccceEEEEEEEEee-c---------------CCCceEEEEEEe-----CCe----EEEEEecCCCcc Confidence 76676666664 57777666555310 0 011112233331 110 0001111110 Q ss_pred ---eeccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhc Q lcl|NC_012418. 225 ---RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ 300 (510) Q Consensus 225 ---~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~ 300 (510) ........++..+|++.++.+...+..||+|=..+ ..+-+-.++...-.....++..+.|...+. |........ 
T Consensus 189 ~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~ 266 (480) T protein:vir:78 189 QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTN 266 (480) T ss_pred ccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--cCCcccccc Confidence 01111123346799999998888898999997665 456677777777677777777777754442 221100000 Q ss_pred -------cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHHHH----- Q lcl|NC_012418. 301 -------DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVR----- 362 (510) Q Consensus 301 -------~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtEi~----- 362 (510) ....|.+..-...+.+..++.. ++++. .++.++.-|...+.... +...... -++.-++ T Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~ 342 (480) T protein:vir:78 267 DGENTTLDIYYGRILTLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSR 342 (480) T ss_pred ccccchhhhhhhhhccCCCCCceEEecCc-cCHHH---HHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHH Confidence 0011111110111233333332 23433 34445555544433221 1111111 1333222 Q ss_pred --HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceeeecHHHH--HHHHHHHHHHHHHHHHHh Q lcl|NC_012418. 363 --ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPAL--SRSAAVQSMLNASQVIAG 437 (510) Q Consensus 363 --~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v~~is~L--~raq~~~~~~~~~q~l~~ 437 (510) ..++++...+++ -+.+.+.++.. .|- ..+ .+.....+++-.+. ..++.++.+. 
+++++ T Consensus 343 l~~ka~~~~~~f~~------------~l~~~~~l~~~~~g~-~~~-~~~~~i~v~f~~~~~~s~~~~ad~~~---kl~~~ 405 (480) T protein:vir:78 343 IVKMAERKGRIFGG------------AWERAMRIAMQIMGR-EVT-EEYTRLETVWRDPSTPTVAAKADAVS---KLYAN 405 (480) T ss_pred HHHHHHHHHHHHHH------------HHHHHHHHHHHHcCC-Ccc-ccceeeeEEecCCCCCCHHHHHHHHH---HHHHh Confidence 222333333333 33444443321 111 111 12222223332221 2222233332 22222 Q ss_pred hcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHH----------hhhhhhhhhcc Q lcl|NC_012418. 438 LAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETL----------LEGASDMTNAL 507 (510) Q Consensus 438 ~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~----------~~ga~~~~~~~ 507 (510) ..++ +..+. +...+|..+.. .++++++++++.+....+....... +..+.+..+++ T Consensus 406 g~~~------~s~et----~~~~lg~~~d~----~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (480) T protein:vir:78 406 GQGP------IPKEQ----ARIDLGYTATQ----REQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSP 471 (480) T ss_pred cccc------CCHHH----HHhcCCCCHhH----HHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCcccccc Confidence 2221 11111 33346644321 1333333322222111111110000 00112233344 Q ss_pred cCC Q lcl|NC_012418. 508 AGV 510 (510) Q Consensus 508 ag~ 510 (510) .|- T Consensus 472 ~~~ 474 (480) T protein:vir:78 472 SGF 474 (480) T ss_pred CCC Confidence 444 No 71 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.60 E-value=3.9e-05 Score=44.77 Aligned_cols=418 Identities=13% Similarity=0.089 Sum_probs=174.7 Q ss_pred ChhHHHHHHHHH----------h------hc-------cchHHHHHHHHHhcccccCCCCCCcccccccccc--chHHHH Q lcl|NC_012418. 
1 MKSTAAMLWEKL----------R------DG-------SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALL 55 (510) Q Consensus 1 ~~~~~~~r~~~l----------k------r~-------~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d--stg~~a 55 (510) |...++..+.++ + |- .-...|+.+|+=--|.+..... .+....+.+. ..+... T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~--~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 3 FWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNS--YGDTQKHELQSVNVTKLA 80 (505) T ss_pred hHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcccccccc--CCCccccceeecchHHHH Confidence 444444333331 1 11 1123466665422222111111 1111122233 345555 Q ss_pred HHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEE Q lcl|NC_012418. 56 VNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL 135 (510) Q Consensus 56 ~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~ 135 (510) ++.+|+-|.+- || .+++++. +..++|+ +.+..++|+..+.++..+..+.|.++ T Consensus 81 ~~~~A~ll~~e--~~-----~i~~~d~-------------~~~e~l~-------~i~~~n~f~~~~~~~~e~a~a~G~~~ 133 (505) T protein:vir:79 81 SAKLASLIFNE--QC-----QVTVSDE-------------TANDFLD-------DVFQQNDFYTTFEEKLEEWIALGSGC 133 (505) T ss_pred HHHHHhhhcCC--Cc-----eeecCCh-------------HHHHHHH-------HHHHhccHHHHHHHHHHHHhhcCCeE Confidence 66655544333 11 1333332 1334443 34677889999999999999999876 Q ss_pred E--EEeCCCCcEEEEEeceEE-EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCc Q lcl|NC_012418. 136 L--YRNSDEATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAM 212 (510) Q Consensus 136 l--~~~~~~~~~~~~pl~~~~-i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~ 212 (510) + |.|.+..++..++-..++ +..|..+ +..+.....++..+ ++ +-.+|+.++.....+. 
T Consensus 134 ~k~~~D~~~~~i~~v~ad~~~P~~~d~~~-~~~~a~~~~~~~~~---------------~~---~~~~yt~lE~h~~~~~ 194 (505) T protein:vir:79 134 VRPYVDSGKIKLAWATADQVYPLQADTNQ-VNELAIASRTTEVE---------------NH---RTIYYTLLEFHQWDHG 194 (505) T ss_pred EEEEEeCCceEEEEEcCCeeEEEEEcCCC-eEEEEEEEEEEEec---------------CC---cceEEEEEEEEEecCc Confidence 5 666555567778887865 5566544 54443333221100 01 1123333333211111 Q ss_pred eEEE---EEEEec----Ceee-----------ccc---cccccccCceEEEe---ee-ecCCCccccchHHHHHHHHHHH Q lcl|NC_012418. 213 EYAE---LYHEID----GVRV-----------GEE---GRWPIHLCPYIVPT---WN-LAPGEHYGRGHVEDYIGDFAKL 267 (510) Q Consensus 213 p~~s---v~~e~~----~~~~-----------~~~---~~y~~~~~P~~~~R---w~-~~~g~~YGrgp~~~~L~d~r~L 267 (510) ++.. +|..-+ |..+ ..+ .++ ..-+|..++ ++ ...++.+|+|--..+.+-++.| T Consensus 195 ~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~--~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~l 272 (505) T protein:vir:79 195 DYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGL--KHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAI 272 (505) T ss_pred eEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCC--CcceEEEecCCcccccccCCccCCchhhhhHHHHHHH Confidence 1221 111111 1111 011 112 112232222 22 3346789999999999999999 Q ss_pred HHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCC------------Cceeec--C--CcccccccccCcccch--HHH Q lcl|NC_012418. 268 SLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE------------MGDYVP--G--GAEAVRAYERGDYNKM--AAI 329 (510) Q Consensus 268 ~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~------------~g~~~p--g--~~~~v~~~~~~~~~~~--~~~ 329 (510) +..--+.....+. .+....|++ .++.+.....+. .-.+.+ + +...+..++ .++ ... 
T Consensus 273 D~~~s~~~~e~~~-g~~~i~v~~-~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~----~~ir~e~~ 346 (505) T protein:vir:79 273 NRTHDQFVDEVKK-GQRRLIVPA-EWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDAT----SPIRVADY 346 (505) T ss_pred HHHHHHHHHHHHh-cccceeech-HHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEec----ccCCHHHH Confidence 9876666665543 333333322 333222110000 000111 0 001111111 112 122 Q ss_pred HHHHHHHHHHHHHHhhcc--ccCC-CCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCC----- Q lcl|NC_012418. 330 QQSLQAVVVRLNQAFMYG--ANQR-DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQ----- 401 (510) Q Consensus 330 ~~~i~~~~~~I~~af~~~--~~~~-~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~----- 401 (510) ...++.+-++|....=+. .+.- .....|||||..+.+.......-. .+.-..-|..|++.++.+..-.++. T Consensus 347 ~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~-~~~~~~al~~li~~i~~~~~~~~~~~~g~~ 425 (505) T protein:vir:79 347 QATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSY-ITQVEKTIKALTYAILELASVPSFYADGQA 425 (505) T ss_pred HHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 333444444433222111 1111 223359999999999888887753 3333555677777777665432221 Q ss_pred ----CCCcccccceeeec--HHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHH Q lcl|NC_012418. 402 ----GLITKQHKPAIETG--LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEEL 475 (510) Q Consensus 402 ----~~~~~~~~~~~v~~--is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev 475 (510) ++++.++.+..=.+ .+.-+.. +. ..+.++. ++ .+ .. ..+....|++ ++|+ T Consensus 426 ~~~~~~~~~~i~v~f~d~i~~d~~~~~---~~---~~~~v~~--Gi---~s---~e---~~l~~~~~~~-------eeea 481 (505) T protein:vir:79 426 RWTGDVDSLDITINFNDGVFVDQESKR---AA---DLQAVQA--QV---MP---KK---QFLMRNYGLD-------EEEA 481 (505) T ss_pred cccCCCCceeEEEEeCCCCCCCHHHHH---HH---HHHHHHc--CC---CC---HH---HHHHhcCCCC-------hHHH Confidence 12222222222122 2221111 11 1122211 21 10 11 2234455544 3555 Q ss_pred HHHHHHHHHHHHHHHHhhHHHhhhhh Q lcl|NC_012418. 
476 QAEAEQRRQQAAQAQAAQETLLEGAS 501 (510) Q Consensus 476 ~~~r~q~~q~~~~~~~~~~~~~~ga~ 501 (510) +++.++-+. ++...+...-..|+. T Consensus 482 ~~el~ri~~--E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 482 DEWLAQIDA--ENSTAEPEFNQFGGD 505 (505) T ss_pred HHHHHHHHH--hccccCCCchhccCC Confidence 444433222 111111222222222 No 72 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.53 E-value=5e-05 Score=44.18 Aligned_cols=428 Identities=10% Similarity=-0.024 Sum_probs=181.1 Q ss_pred ChhHHHHHHHHHh---------------------------hccchHHHHHHHHHhcc-----cccCCCCCCccccccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLP-----YLMVDPMSGSRGVVEHDF 48 (510) Q Consensus 1 ~~~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~lP-----~~~~~~~~~~~~~~~~~~ 48 (510) ++.++..+|.... ...-.++++++.+|..- .+.... ........++. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~ 91 (511) T protein:vir:96 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVA 91 (511) T ss_pred hhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC-cccccCcceee Confidence 3333333332211 01112244444554432 111111 11111123555 Q ss_pred cchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH Q lcl|NC_012418. 49 QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLL 128 (510) Q Consensus 49 dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl 128 (510) .+.+...++..++-|.+ -|+. ++.+++. +. ..+...+..++|.....++.++. 
T Consensus 92 ~n~~k~Iv~~~~~yl~g--~p~~-----~~~~~~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~ 144 (511) T protein:vir:96 92 HDYASYISDFINGYFLG--NPIQ-----YQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDL 144 (511) T ss_pred cchHHHHHHHHHhhhcc--CCce-----eecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHH Confidence 66667777776655543 1211 2333221 11 23445567789999999999999 Q ss_pred HhhCeEEEEE--eCCCC-cEEEEEece-EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE Q lcl|NC_012418. 129 IVTGNALLYR--NSDEA-TVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH 203 (510) Q Consensus 129 ~~~G~~~l~~--~~~~~-~~~~~pl~~-~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~ 203 (510) .++|.+.+++ +++.. +++.++..+ |++--|. .+++...+|.+.....+ ....-+++++ T Consensus 145 ~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d-----------------~~~~~~~~~~ 207 (511) T protein:vir:96 145 SIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-----------------KTDEDEVFTV 207 (511) T ss_pred HhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc-----------------ccccceEEEE Confidence 9999876554 44322 355555544 4444333 46676666665442111 0011112222 Q ss_pred EEeecCCCceEEEEEEEecCe------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 204 VQRKKGTAMEYAELYHEIDGV------RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLY 277 (510) Q Consensus 204 v~~~~~~~~p~~sv~~e~~~~------~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~ 277 (510) -...+++- + .|...++. ........++..+|++.++- ..+|+|-.+..++-+..++.+.-..... 
T Consensus 208 ~iyt~~~i---~-~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~ 278 (511) T protein:vir:96 208 DLFTSHGV---Y-RYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred EEEeCCcE---E-EEEecCCCcccccccccccccccCCceeeEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 11112210 0 01111111 11122233456788887653 4579999999999999999988888888 Q ss_pred HHHhhCCceeeCCCcccchhhhccCCCceee--------------cCCcccccccccCcccchHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 278 ELESLEVLNLVDEAKGAVVDDYQDAEMGDYV--------------PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQA 343 (510) Q Consensus 278 ~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~--------------pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 343 (510) .....+|.+++.-.+......+..-..+... .+...+++.+ ....+.+.....++.+.+.|... T Consensus 279 ~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~e~~~~~L~~~I~~~ 356 (511) T protein:vir:96 279 MSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMF 356 (511) T ss_pred HHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHH Confidence 8888888776542222222222211111111 0111122222 22234566677778887777554 Q ss_pred hhc-ccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcc--cccceeee--cHH Q lcl|NC_012418. 344 FMY-GAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK--QHKPAIET--GLP 417 (510) Q Consensus 344 f~~-~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~--~~~~~~v~--~is 417 (510) -+. +.. ..-+...|+..+...-.- +.+......++-.+.+.-+++.++.++...+-.....+ ++++..-. 
+.+ T Consensus 357 s~~p~~~~~~~~~n~Sg~Al~~~~~~-l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n 435 (511) T protein:vir:96 357 TNTPNMKDDNFSGTQSGEAMKYKLFG-LEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKS 435 (511) T ss_pred hCCcccccccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCC Confidence 322 211 111234566665333221 11112222233223333333333344433222211222 23333321 223 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_012418. 418 ALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETL 496 (510) Q Consensus 418 ~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~ 496 (510) ....++.+.+ ..+.+ -...++ ..++ |+ -.++|++++.++++.++.+++...... T Consensus 436 ~~e~~~~~~k------l~G~i----------S~et~l----~~l~~v~-----D~~~E~~ri~~E~~~~~~~~~~~~~~~ 490 (511) T protein:vir:96 436 LIEELKAYID------SGGKI----------SQTTLM----SLFSFFQ-----DPELEVKKIEEDEKESIKKAQKGIYKD 490 (511) T ss_pred HHHHHHHHHH------HhccC----------ChHHHH----HhCCCCC-----CHHHHHHHHHHHHHHHHHHHhhccccC Confidence 2223222111 11111 111222 2332 22 135677777766554333332221111 Q ss_pred hhhhhh--hhhcccCC Q lcl|NC_012418. 497 LEGASD--MTNALAGV 510 (510) Q Consensus 497 ~~ga~~--~~~~~ag~ 510 (510) ..+... .+.....+ T Consensus 491 ~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:96 491 PRDINDDEQDDDTKDT 506 (511) T ss_pred CCCCCCCCCCCccccc Confidence 111111 11111111 No 73 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=97.48 E-value=5.9e-05 Score=43.75 Aligned_cols=437 Identities=12% Similarity=0.015 Sum_probs=183.4 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc-----cc---CCCCCC----ccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY-----LM---VDPMSG----SRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~-----~~---~~~~~~----~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) ..........++-++.-..+++.+.+|..-. +. .+.... ......++..+-+...++..++-|.+ T Consensus 26 ~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g--- 102 (503) T protein:vir:59 26 IAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVG--- 102 (503) T ss_pred ccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhc--- Confidence 1111111222221111124455555555421 11 111000 01111244455566666666665532 Q ss_pred CcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-C--cE Q lcl|NC_012418. 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TV 145 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-~--~~ 145 (510) - | +.++..++. +.+.+ ..+..++|-....++.++...+|.+.+++..+. + ++ T Consensus 103 -~--~-~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i 157 (503) T protein:vir:59 103 -E--P-VTFTSDNKT-------------LLEYV--------NELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDY 157 (503) T ss_pred -C--C-eeeccCcHH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEE Confidence 1 1 123333322 22222 223346899999999999999999876554322 2 46 Q ss_pred EEEEeceEEEeeC-C-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE---EeecCCCceEE-EEEE Q lcl|NC_012418. 146 VAWSLRSYAVRRD-A-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV---QRKKGTAMEYA-ELYH 219 (510) Q Consensus 146 ~~~pl~~~~i~~d-~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v---~~~~~~~~p~~-sv~~ 219 (510) ++++..+.+.-.| . .+++..++|.++..-. +.+....+++|+.- +....++.+.. ..+. 
T Consensus 158 ~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~---------------~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~ 222 (503) T protein:vir:59 158 VIFPAEEMIVVYKDNTRRDILFALRYYSYKGI---------------MGEETQKAELYTDTHVYYYEKIDGVYQMDYSYG 222 (503) T ss_pred EEEccceeEEEEeCCCCCceEEEEEEEEEecC---------------CCceEEEEEEEeCCcEEEEEEcCCccccccccc Confidence 6666655444333 3 4777766666553200 11111223333211 00100000000 0000 Q ss_pred Eec--CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch- Q lcl|NC_012418. 220 EID--GVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV- 296 (510) Q Consensus 220 e~~--~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p- 296 (510) +.. .........+++..+|++.++- ..+|.|=...+.+-+..+|.+.-......+....|.+++.-...-+. T Consensus 223 ~~~~~~~~~~~~~~~~~~~vPiv~~~n-----n~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~ 297 (503) T protein:vir:59 223 ENNPRPHMTKGGQAIGWGRVPIIPFKN-----NEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPK 297 (503) T ss_pred ccccccceeecceeccCCccceEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccc Confidence 000 0000111123345688887753 45799988899999999998888888888888999877642111111 Q ss_pred hhhc-cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc-c-cCCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 297 DDYQ-DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A-NQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 297 ~~~~-~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~-~~~~~~~~TAtEi~~r~~E~~~~LG 373 (510) +... ....+.+..+..++++.+... .+.+.....++.++..|...-..- + ....+...|+..+..+..-.... 
- T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k-~ 374 (503) T protein:vir:59 298 EFTANLRYHSVIKVSGDGGVDTLRAE--IPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLK-A 374 (503) T ss_pred hhhhhhhcccceeccCCCcceeEecc--CCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHH-H Confidence 1111 112223332333345544432 345667778888888776654322 1 12223456777765443322222 1 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcCCCC-CCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHH Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~-~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~ 452 (510) --..+.-.+.|.-+++.++.++...+... .....+.+..-..+ |-..++.++.+. +.++. +-+++ .. T Consensus 375 ~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~-p~d~~~~~~~~~---kl~~~-GiiS~-------et 442 (503) T protein:vir:59 375 NMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTR-IQNDSEIVQSLV---QGVTG-GIMSK-------ET 442 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCC-CCCHHHHHHHHH---HHHhC-CCCch-------HH Confidence 22333334444444455555554322211 11223333322111 111222222221 11111 11111 11 Q ss_pred HHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhh----------hhhhcccCC Q lcl|NC_012418. 453 MMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGAS----------DMTNALAGV 510 (510) Q Consensus 453 ~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~----------~~~~~~ag~ 510 (510) ++. .++ ++ -.++|++++.+++++.+++.+..... ..|.. 
+.++..+|= T Consensus 443 ~l~----~l~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~ 501 (503) T protein:vir:59 443 AVA----RNPFVQ-----DPEEELARIEEEMNQYAEMQGNLLDD-EGGDDDLEEDDPNAGAAESGGAGQ 501 (503) T ss_pred HHH----hCCCCC-----CHHHHHHHHHHHHHHHHhhhccccCc-cCCCCCCCcCCCCCCcccCCCCCC Confidence 222 222 21 12466666655444333322222111 01111 111111111 No 74 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=97.46 E-value=6.4e-05 Score=43.59 Aligned_cols=417 Identities=10% Similarity=-0.019 Sum_probs=162.9 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc-cCCCCCCccc--cccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MVDPMSGSRG--VVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~-~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) +-+.|.+.|.. + ..+.+.+.+|..=.. ...-+..-.. +..+...+-+..++++++..| +|.+ |.+ T Consensus 17 ~~~~l~~~~~~--~---~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l----~~~g---~~~ 84 (486) T protein:vir:42 17 VREEMISAFED--A---SKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQ----AVEG---FRL 84 (486) T ss_pred HHHHHHHHHHH--H---HHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhh----cccc---eec Confidence 22233333321 1 123333444432110 0000000000 111223445566666655544 3433 222 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-----------CcEE Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-----------ATVV 146 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-----------~~~~ 146 (510) ++... ... .+.+.+..++|.....++.++..++|.+.+++..+. 
-+++ T Consensus 85 --~~~~~--------~~~-----------~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~ 143 (486) T protein:vir:42 85 --GDADE--------ADE-----------ELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIR 143 (486) T ss_pred --CCCch--------hHH-----------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEE Confidence 21110 001 123345668899999999999999999877664321 1355 Q ss_pred EEEece-EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCee Q lcl|NC_012418. 147 AWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 147 ~~pl~~-~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~ 225 (510) +++-.+ |++--+..+++...+|.+.- . +.+.-..+++|+ ++. ...|...+|.. T Consensus 144 ~~~p~~~~~i~d~~~~~~~~~~~~~~~---~--------------~~~~~~~~~~y~-----~~~----~~~~~~~~~~~ 197 (486) T protein:vir:42 144 VEPPTRMHAEIDPRINRVSKAIRVAYD---K--------------EGNEIQAATLYT-----PME----TIGWFRADGEW 197 (486) T ss_pred EecccceEEEEeCCCCCeEEEEEEEEe---c--------------CCCeEEEEEEEc-----CCc----EEEEEecCCcE Confidence 565544 44544467777766665431 0 001111122221 111 00111122222 Q ss_pred ec-cccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCccc-chh-- Q lcl|NC_012418. 226 VG-EEGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKGA-VVD-- 297 (510) Q Consensus 226 ~~-~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~---~~g~~-~p~-- 297 (510) .. ......++.+|++.++.+...+..+|+|=... ..+-+..++...-.+...++..+.|...+. ++... ... 
T Consensus 198 ~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~ 277 (486) T protein:vir:42 198 AEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETG 277 (486) T ss_pred EeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccc Confidence 11 12223456799999999888899999996654 335556666665566666676666654442 11110 000 Q ss_pred -hhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHHHH-------H Q lcl|NC_012418. 298 -DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVR-------I 363 (510) Q Consensus 298 -~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtEi~-------~ 363 (510) .+.....|.+......+++..+... ++ +...++.++.-|++...... +...... -++.-+. . T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~q~~~-~~---~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ 353 (486) T protein:vir:42 278 QTLFDAYLARILAFEDAEGKIQQFSA-AE---LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIK 353 (486) T ss_pred cchhhhhhchhcccCCCCceEEeecc-cC---HHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHH Confidence 0000111111110011222223221 22 33455666666655443221 1111111 2333332 2 Q ss_pred HHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceeeecHHH--HHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_012418. 364 TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPA--LSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 364 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v~~is~--L~raq~~~~~~~~~q~l~~~~~ 440 (510) +++++...+|+.+. +++.++.. .+....+.+... ..+++-.+ -..++.++.+. 
++++...+ T Consensus 354 ka~~~~~~f~~~l~------------~~~~l~~~~~~~~~~~~d~~~-i~v~w~~~~~~s~~~~ad~~~---kl~~~~~g 417 (486) T protein:vir:42 354 KVERKNLMFGGAWE------------EAMRIAYRIMKGGDVPPDMLR-METVWRDPSTPTYAAKADAAT---KLYGNGQG 417 (486) T ss_pred HHHHHHHHHHHHHH------------HHHHHHHHHhcCCCcccccee-eeEEecCCCCCCHHHHHHHHH---HHHhcccC Confidence 33444445554443 33333211 111222222222 22222222 12222222222 22222212 Q ss_pred hhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHH-hh-HH-HhhhhhhhhhcccCC Q lcl|NC_012418. 441 IAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQA-AQ-ET-LLEGASDMTNALAGV 510 (510) Q Consensus 441 ~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~-~~-~~-~~~ga~~~~~~~ag~ 510 (510) + +.- +.+ ...+|+... ..+|+++++++++.+.++... +. +. ...|..+.+..+++- T Consensus 418 ~------~s~-et~---~~~lg~~~d----~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (486) T protein:vir:42 418 V------IPR-ERA---RIDMGYSVK----EREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQ 476 (486) T ss_pred C------CCH-HHH---HhcCCCChh----HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCC Confidence 1 111 122 234564332 124555554443332222111 10 00 011111111122221 No 75 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.44 E-value=6.7e-05 Score=43.46 Aligned_cols=421 Identities=10% Similarity=-0.005 Sum_probs=175.5 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~--~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) =.+.+.+..++.+ .-..+++.+.+|.... 
..............++-.+.+...++..++-|.+- |+ +++ T Consensus 17 ~~~~i~~~i~~~~--~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~--p~-----~~~ 87 (499) T protein:vir:10 17 NIEAINYAIRELQ--NRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMNVGFMTGN--PV-----KYV 87 (499) T ss_pred CHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHHhhhhccc--Cc-----eee Confidence 0111222223332 1233455555554432 11111111112233555566666677666655431 22 223 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEE--EeCCCC------------- Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY--RNSDEA------------- 143 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~--~~~~~~------------- 143 (510) ..++.. .. .+...+..++|.....++.++...+|.+.++ .+++.. T Consensus 88 ~~~~~~---------~~-----------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~ 147 (499) T protein:vir:10 88 AEKGKN---------ID-----------DILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLT 147 (499) T ss_pred cCChhH---------HH-----------HHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccc Confidence 322211 11 2334456678999999999999999987654 444321 Q ss_pred -----cEEEE-EeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceE Q lcl|NC_012418. 144 -----TVVAW-SLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEY 214 (510) Q Consensus 144 -----~~~~~-pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~ 214 (510) ++.++ |..-|.+--|..++....+.++..+... ........+++|+. ......... 
T Consensus 148 ~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~-------------~~~~~~~~~~iyt~~~i~~~~~~~~~-- 212 (499) T protein:vir:10 148 PNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDL-------------EGNTNGYSITVYMPQRIVEYRTKTTM-- 212 (499) T ss_pred cccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeec-------------CCCceEEEEEEEeCCeEEEEEecCCc-- Confidence 23444 3344666555555443333333211100 00011112333320 000001000 Q ss_pred EEEEEEecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCccc Q lcl|NC_012418. 215 AELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGA 294 (510) Q Consensus 215 ~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~ 294 (510) ...++.........++..+|++.++- +.+|.|=.....+-+..++.+.-......+....|.+++.-...- T Consensus 213 ----~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~ 283 (499) T protein:vir:10 213 ----EVSANDPIVYDGENLFGAVPIIEFRN-----NEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLG 283 (499) T ss_pred ----cccCcceecccccCCCCccceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccc Confidence 01111111222222346789887653 467899889999999999998888888888888998776421111 Q ss_pred -chhhhccCCC-ceeecC-Cc-ccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHHH----- Q lcl|NC_012418. 295 -VVDDYQDAEM-GDYVPG-GA-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI----- 363 (510) Q Consensus 295 -~p~~~~~~~~-g~~~pg-~~-~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~~----- 363 (510) ..+....... +.+..+ .. .+++.+ ....+.......++.+.+.|.+.-+. +. ...-+...|+..+.. 
T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~d~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l 361 (499) T protein:vir:10 284 DDKDDIQRLKRGAIEAPPREEGADIEWL--TKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGL 361 (499) T ss_pred cccchhhhhhhcceeccCCCCCCcceEE--eccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHH Confidence 1111111111 122221 11 223332 23345677788888888888664322 11 111223456666543 Q ss_pred --HHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 364 --TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLA 439 (510) Q Consensus 364 --r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~ 439 (510) +..++...++..+.+ +++-++.++...+. ......+.+... .+.+....++.+.++ + T Consensus 362 ~~k~~~k~~~~~~~l~~--------~~~li~~~~~~~~~-~~d~~~i~i~f~~~~p~n~~e~~~~~~kl----------~ 422 (499) T protein:vir:10 362 ENLLSIKQRYFFDGLRR--------RLKLIQTIVNIKGA-NDDASGCKISLVANIPSNLSDVVNNVKNA----------D 422 (499) T ss_pred HHHHHHHHHHHHHHHHH--------HHHHHHHHHhccCC-ccccccceEEeCCCCCCCHHHHHHHHHHH----------h Confidence 334444444443332 23333344432221 111122332221 122222222222221 1 Q ss_pred ChhhHhhccCHHHHHHHHHHHc-CCCHhHccCCHHHHHHHHHHHHHHHHHHHHh-hHHH-hh----hhhhhhhcccCC Q lcl|NC_012418. 440 PIAQLDPRISLPKMMDTIWAAF-SVDTSQFYKSEEELQAEAEQRRQQAAQAQAA-QETL-LE----GASDMTNALAGV 510 (510) Q Consensus 440 ~~~q~~~~id~d~~~~~~a~~~-Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~-~~~~-~~----ga~~~~~~~ag~ 510 (510) ++ +-...+++ .+ +++ -.++|++++.++++..+...+.. .... .. 
+..+.+....|- T Consensus 423 g~------iS~et~~~----~l~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (499) T protein:vir:10 423 GI------IPRKYTYS----WLPDVD-----NPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKE 485 (499) T ss_pred cc------CChHHHHH----hCCCCC-----CHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCC Confidence 10 11122222 22 222 13466666655544322222211 1000 00 000011111111 No 76 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=97.40 E-value=7.7e-05 Score=43.15 Aligned_cols=416 Identities=11% Similarity=-0.001 Sum_probs=157.7 Q ss_pred ChhH----HHHHHHHHhhccchHHHHHHHHHhc-----ccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKST----AAMLWEKLRDGSVEQRAIEFAKTTL-----PYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~----~~~r~~~lkr~~~~~~w~e~~~~~l-----P~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) |..+ |.+.|.. ++ ++.+.+.+|.. +.+-. . .....+..+..-+-+...+++++..| ++. T Consensus 13 ~~~~~~~~L~~~~~~-~~----~r~~~~~~YY~G~~~i~~~~~-~-~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~- 80 (485) T protein:vir:24 13 DPAIARDEMVSAFED-QN----QNLRSNTSYYEAERRPEAIGV-T-VPVQMQSLLAHVGYPRLYVDSIAERQ----AVE- 80 (485) T ss_pred chHHHHHHHHHHHHH-HH----HHHHHHHHHHhccCchhhcCc-c-cchhhhhhhhccchHHHHHHHHhhhh----ccC- Confidence 4333 2233311 11 12222233322 22100 0 00011112233455566666666554 333 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC-------- Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA-------- 143 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~-------- 143 (510) .|+ .++..- ..+ .+.+.+..++|.....+..++..++|.+.+++..+.. 
T Consensus 81 -g~~---~~~~~~------------~~~-------~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~ 137 (485) T protein:vir:24 81 -GFR---LGDADE------------ADE-------ELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDP 137 (485) T ss_pred -cee---cCCCch------------hHH-------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCC Confidence 222 222110 011 1223356678999999999999999998876643321 Q ss_pred ---cEEEEEeceEEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEE Q lcl|NC_012418. 144 ---TVVAWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYH 219 (510) Q Consensus 144 ---~~~~~pl~~~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~ 219 (510) +++.++-.+.++..| ..+++...++.+.-. +.+.-..+++|+ ++. .-.|. T Consensus 138 ~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~y~-----~~~----~~~~~ 191 (485) T protein:vir:24 138 NVPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDA-----------------EGNEIQAATLYT-----PNE----TFGWF 191 (485) T ss_pred CcceEEEeccceeEEEeeCCcCceeEEEEEEEee-----------------cCCeEEEEEEEc-----CCc----EEEEE Confidence 466666556544455 456666555554310 001111222221 111 00111 Q ss_pred EecCeeec-cccccccccCceEEEeeeecCCCccccchHHHH-HHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCccc Q lcl|NC_012418. 220 EIDGVRVG-EEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDY-IGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKGA 294 (510) Q Consensus 220 e~~~~~~~-~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~-L~d~r~L~~l~~~~l~~~~~a~~p~~l~~---~~g~~ 294 (510) ..+|.... .....+++.+|++.++.+...+..||.|-..+. .+=+..++...-.+...++..+.|...+. ++... 
T Consensus 192 ~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~ 271 (485) T protein:vir:24 192 RAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIG 271 (485) T ss_pred ecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccc Confidence 22332221 111233567999999988888889999976543 34456666666566666777777754442 11110 Q ss_pred -ch---hhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc-c----cCCCCCC-CCHHHHH-- Q lcl|NC_012418. 295 -VV---DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A----NQRDAER-VTAEEVR-- 362 (510) Q Consensus 295 -~p---~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~----~~~~~~~-~TAtEi~-- 362 (510) .. ..+....+|.+..-...+++..+... +++ ...++.++.-|.+..... + +..+... .++.-+. T Consensus 272 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~-~~~---e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~ 347 (485) T protein:vir:24 272 VDPETGQTLFDAYLARILAFEDAEGKIQQFSA-AEL---ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAA 347 (485) T ss_pred cccccccchhhhcccceeccCCCCceEEeecc-cch---HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHH Confidence 00 00011111221110011222222221 222 344555555555443221 1 1111111 2333222 Q ss_pred -----HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeecHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_012418. 363 -----ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPAL--SRSAAVQSMLNASQVI 435 (510) Q Consensus 363 -----~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~is~L--~raq~~~~~~~~~q~l 435 (510) .+++++...++..+.++ +..++.++...+. +. +..-..+++-.++ ..++.++.+. 
+++ T Consensus 348 ~~~l~~ka~~~~~~f~~~l~~~--------~~l~~~~~~~~~~---~~-d~~~i~v~f~~~~~~s~~~~ad~~~---kl~ 412 (485) T protein:vir:24 348 ESRLIKKVERKNAIFGGAWEEA--------MRLAYRLMKGGDV---PP-DMLRMETVWRDPSTPTYAAKADAAT---KLY 412 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcCCCC---cc-ccceeeEEecCCCCCCHHHHHHHHH---HHH Confidence 22233333333333322 2222222221111 11 2222223332222 2222222222 112 Q ss_pred HhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHH-HHhhH--HHhhhhhhhhhcccCC Q lcl|NC_012418. 436 AGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQA-QAAQE--TLLEGASDMTNALAGV 510 (510) Q Consensus 436 ~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~-~~~~~--~~~~ga~~~~~~~ag~ 510 (510) +...++ +.-+. +...+|.....+ +|+++++++++.+.++. ..+.. ....+..+......+. T Consensus 413 ~~g~~~------~s~et----~~~~l~~~~d~~----~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 476 (485) T protein:vir:24 413 GNGQGV------IPRER----ARKDMGYSIAER----EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQ 476 (485) T ss_pred hccccc------CCHHH----HHhhCCCCHhHH----HHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCc Confidence 211111 11111 234466543211 34444333322211111 11111 0111111111111111 No 77 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.39 E-value=7.7e-05 Score=43.13 Aligned_cols=425 Identities=10% Similarity=-0.012 Sum_probs=180.8 Q ss_pred Ch---hHHHHHHHHHhhccchHHHHHHHHHhccc-----ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MK---STAAMLWEKLRDGSVEQRAIEFAKTTLPY-----LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~---~~~~~r~~~lkr~~~~~~w~e~~~~~lP~-----~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) .. +.+.+.-++. .....++++++.+|..-. +.... 
........++..+.+...++.+++-|++ -|+ T Consensus 38 ~~~~~~~i~~~i~~~-~~~~~~r~~~l~~YY~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~Ivd~~~~yl~g--~p~-- 111 (512) T protein:vir:97 38 LLQNINEVSKYIEHH-MDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYFLG--NPI-- 111 (512) T ss_pred hhhhHHHHHHHHHHH-HHhhHHHHHHHHHHhcccCccccccCcc-cccccCcceeecchHHHHHHHHhhhhcc--cCc-- Confidence 00 1111111111 111223455555554421 11111 1111222356667777777777766654 121 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWS 149 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~p 149 (510) +++..++. +.+ .+...+..++|.....++.++..++|.+.+++ +++.. +++.++ T Consensus 112 ---~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~ 168 (512) T protein:vir:97 112 ---QCQDDDKD-------------VLE-------AIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSD 168 (512) T ss_pred ---eeccCChH-------------HHH-------HHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEc Confidence 12333322 111 33444667789999999999999999876554 43322 455565 Q ss_pred ece-EEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceEEEEEEEecCe Q lcl|NC_012418. 150 LRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEYAELYHEIDGV 224 (510) Q Consensus 150 l~~-~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~~sv~~e~~~~ 224 (510) ..+ |++--| ..+++...+|.+++...+- ...+.-..+++|+. ++...+...+.. ... 
T Consensus 169 p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~------------~~~~~~~~~~vyt~~~i~~~~~~~~~~~~-----~~~- 230 (512) T protein:vir:97 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDK------------TDEDEVFTVDLFTSHGVYRYLTSRTNGLK-----LTP- 230 (512) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeeccc------------cccceEEEEEEEeCCcEEEEEecCCCccc-----ccc- Confidence 555 444333 2367776666664421110 00111112233321 011111100000 000 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCC Q lcl|NC_012418. 225 RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEM 304 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~ 304 (510) ........+++.+|++.++ ++..|+|-.+..++-+..++.+.-......+...+|.+++.-....++..+..... T Consensus 231 ~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~ 305 (512) T protein:vir:97 231 RENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKE 305 (512) T ss_pred cccccccccCcccceEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhh Confidence 1112223345678887654 34678998899999999999887777777888888877653222222332222211 Q ss_pred ceee-c------------CCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHHH------ Q lcl|NC_012418. 305 GDYV-P------------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI------ 363 (510) Q Consensus 305 g~~~-p------------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~~------ 363 (510) +..+ . +..+......+....+.......++.++..|...-+. +. ...-+...|+..+.. 
T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~ 385 (512) T protein:vir:97 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) T ss_pred cccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHH Confidence 1111 0 0011111111222234566677777777777443221 11 111123456665543 Q ss_pred -HHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCc--ccccceeee--cHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 364 -TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT--KQHKPAIET--GLPALSRSAAVQSMLNASQVIAGL 438 (510) Q Consensus 364 -r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~--~~~~~~~v~--~is~L~raq~~~~~~~~~q~l~~~ 438 (510) ++.+++..++..+. -+++.++.++...+-...+. .++.+.... +.+.+..++.+.++ .+.+ T Consensus 386 ~ka~~k~~~f~~~l~--------~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl------~gii 451 (512) T protein:vir:97 386 QRTKTKEGLFTKGLR--------RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS------GGKI 451 (512) T ss_pred HHHHHHHHHHHHHHH--------HHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH------hccC Confidence 34444444444333 23344444443322222122 233333321 22222222222221 1211 Q ss_pred cChhhHhhccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 439 APIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 439 ~~~~q~~~~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) +. 
..++ ..++ ++ ..++|++++.++++.++++++........+.........+= T Consensus 452 S~----------et~~----~~l~~v~-----d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (512) T protein:vir:97 452 SQ----------TTLM----SLFSFFQ-----DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTK 505 (512) T ss_pred ch----------HHHH----HhCCCCC-----CHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcc Confidence 11 1222 2232 21 23567777766655444333322111111111111111111 No 78 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.32 E-value=9.7e-05 Score=42.59 Aligned_cols=400 Identities=11% Similarity=0.003 Sum_probs=179.1 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~--~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) -++.+.+..++.+. -..+++++.+|..-. ..............++..+.+...++.+++-|++ -| +.++ T Consensus 18 ~~~~l~~~i~~~~~--~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~-----~~~~ 88 (453) T protein:vir:39 18 TNEVVTKFMEKHRL--EVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFTGYFNG--IP-----VKKS 88 (453) T ss_pred CHHHHHHHHHHHHH--HHHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHhhhhcc--cC-----ceec Confidence 23333333333321 123445555554321 1101111111223455667777778877776643 11 2223 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-C--cEEEEEece-EE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSLRS-YA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-~--~~~~~pl~~-~~ 154 (510) ..++. .. ..+.+.+..++|.....++.++...+|.+.+++..+. 
+ ++++++..+ |+ T Consensus 89 ~~d~~-------------~~-------~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (453) T protein:vir:39 89 HSDKE-------------TL-------SKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFM 148 (453) T ss_pred cCChH-------------HH-------HHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEE Confidence 22221 11 2345557778999999999999999999876654332 2 355665544 45 Q ss_pred EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCe--eecccccc Q lcl|NC_012418. 155 VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGV--RVGEEGRW 232 (510) Q Consensus 155 i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~--~~~~~~~y 232 (510) +--|..++....+.++... .+....+++|+ ++ ..++++.++. .+...... T Consensus 149 v~d~~~~~~~~~~ir~~~~------------------~~~~~~~~~yt-----~~-----~i~~~~~~~~~~~~~~~~~~ 200 (453) T protein:vir:39 149 VYDDTIKQEPLFAVRYGYD------------------DDYKLYGEVYT-----KE-----TTYALNGTMGFYNMTEQAPN 200 (453) T ss_pred EecCCCCCeEEEEEEEEEe------------------CCeEEEEEEEe-----CC-----eEEEEEecCCceeeeccccc Confidence 5444455444444443311 01111223321 11 1112222221 12222223 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccC-CCcee-ecC Q lcl|NC_012418. 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA-EMGDY-VPG 310 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~-~~g~~-~pg 310 (510) ++..+|++.++. +.+|+|=.+...+-+-.++.+.-......+....|.+++.- ..+..+.+... 
.++.+ +++ T Consensus 201 ~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~~~~~~~~~~~~~ 274 (453) T protein:vir:39 201 PFDDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLKNIRSNRVINYYG 274 (453) T ss_pred CCCceeEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec-CCCCchhhhhhhhcceeeecC Confidence 345789887653 45799988899999999999988888888899998776631 11222222111 11222 222 Q ss_pred C-----cccccccccCcccchHHHHHHHHHHHHHHHHHhh-ccccCCCCCCCCHHHHHH-------HHHHHHHHhchhHh Q lcl|NC_012418. 311 G-----AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGANQRDAERVTAEEVRI-------TAEEAENTLGGTYS 377 (510) Q Consensus 311 ~-----~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~-~~~~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~ 377 (510) . ..+++.+. ...+.+.....++.++..|...-. .+.....-.+.|+.-+.. ++.++...+|..+. T Consensus 275 ~~~~~~~~~~~~lt--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~ 352 (453) T protein:vir:39 275 ESSEAKNVDVKFLE--KPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLN 352 (453) T ss_pred CCCCCCCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 12233332 224567777778888877744322 121111112345554433 33344444444333 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHH Q lcl|NC_012418. 378 LLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMD 455 (510) Q Consensus 378 rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~ 455 (510) + +++-+..++...+.. ....++.+..- .+.+.++.++ .+. 
+ ++++ |-...++ T Consensus 353 ~--------~~~li~~~~~~~~~~-~~~~~i~v~f~~~~p~~~~~~a~---~~~---k----l~g~------is~et~l- 406 (453) T protein:vir:39 353 S--------RYKLYCELSTNVSNK-EAWKDIEYTFTRNEPKDIKEQAE---TAN---I----LMGI------TSQETAL- 406 (453) T ss_pred H--------HHHHHHHHHhccCCc-cccccceEEeCCCCCcCHHHHHH---HHH---H----Hhcc------CChHHHH- Confidence 3 333333444333221 11122333221 1222222222 221 1 1121 1112222 Q ss_pred HHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 456 TIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 456 ~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ..+| ++ -.++|++++.++.+...+..+.. .+...|+ T Consensus 407 ---~~l~~v~-----D~~~E~~ri~~E~~~~~~~~~~~-----------~~~~~~~ 443 (453) T protein:vir:39 407 ---SVISVIP-----DVQAEMEKIKKEEASTAIFDKDK-----------QPSEKGT 443 (453) T ss_pred ---HhCCCCC-----CHHHHHHHHHHHHHHHHHHHHhc-----------cCCCCCC Confidence 3333 22 12466666655444322222111 1111222 No 79 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.24 E-value=0.00012 Score=42.06 Aligned_cols=417 Identities=13% Similarity=0.021 Sum_probs=164.5 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc-cCCCC--CCccccccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~-~~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) =...+...+.++.+ ...+...+.+|..-.. ...-+ .....+..++..+-+..+++.+++.| ++.+ |.. 
T Consensus 4 ~~d~i~~L~~~~~~--~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g---~~~ 74 (480) T protein:vir:78 4 YHEHVERLQGLLAR--DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIEG---FRI 74 (480) T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhh----ccCc---eec Confidence 33444444444421 2234444445533211 00000 00111112334556666677766655 3332 222 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeC-------CCC--cEEEE Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS-------DEA--TVVAW 148 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~-------~~~--~~~~~ 148 (510) . .|.. ..+ .+.+.+..++|.....++.++...+|.+.+++.. +.+ +++++ T Consensus 75 ~-~d~~-------------~~~-------~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~ 133 (480) T protein:vir:78 75 S-EDSE-------------GLE-------ELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) T ss_pred C-CCch-------------hHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEE Confidence 1 1111 111 2334456789999999999999999998766542 112 36667 Q ss_pred EeceEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceEEEEEEEecC Q lcl|NC_012418. 149 SLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEYAELYHEIDG 223 (510) Q Consensus 149 pl~~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~~sv~~e~~~ 223 (510) +..+.++..|+ .+++...+|.+.-. . +.+....+++|+. ++.+....+... +..+ T Consensus 134 ~p~~~~~i~D~~~~~~~~~~i~~~~~~-d---------------~~~~~~~~~~y~~~~~~~~~~~~~~~~~---~~~~- 193 (480) T protein:vir:78 134 SPLYMYAELDPRNTRRVTRAVRLYTTR-D---------------DVAVPDRATLYLPDETVPLRRNGGLNDQ---WVVD- 193 (480) T ss_pred cccceEEEEcCCCccceEEEEEEEEee-c---------------CCcceEEEEEEeCCeEEEEEecCCCccc---cccc- Confidence 76665555664 45676555554211 0 1111122333321 111100000000 0001 Q ss_pred eeeccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccC Q lcl|NC_012418. 
224 VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA 302 (510) Q Consensus 224 ~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~ 302 (510) ......++..+|++.+..+...+..||+|=..+ ..+=+..++...-.....++..+.|...+. |.......... T Consensus 194 ---~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~ 268 (480) T protein:vir:78 194 ---GDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDG 268 (480) T ss_pred ---ccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--CCCcccccccc Confidence 111123346799999998888888999996654 356677777777666677777777754442 22111000000 Q ss_pred -------CCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCCC-CHHHHH------- Q lcl|NC_012418. 303 -------EMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERV-TAEEVR------- 362 (510) Q Consensus 303 -------~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~~-TAtEi~------- 362 (510) ..|.+..-...+++..++.. ++++ ..++.++.-|...+.... +...+... ++.-+. T Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~ 344 (480) T protein:vir:78 269 ENTTLDIYYGRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIV 344 (480) T ss_pred ccchhhhhhhhhccCCCCCceEEecCc-cCHH---HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHH Confidence 01111100011222223221 2333 334445555544443221 11121222 332222 Q ss_pred HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh--cCCCCCCcccccceeeecHHHH--HHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 363 ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD--ALLQGLITKQHKPAIETGLPAL--SRSAAVQSMLNASQVIAGL 438 (510) Q Consensus 363 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~--~~l~~~~~~~~~~~~v~~is~L--~raq~~~~~~~~~q~l~~~ 438 (510) .+++++...++.. +.+.+.++.. ++. .+ .+.....+++-.+. ..++.+..+. +++++. 
T Consensus 345 ~k~~~~~~~f~~~------------l~~~~rl~~~~~~~~--~~-~~~~~i~v~w~~~~~~s~~~~ad~~~---kl~~~g 406 (480) T protein:vir:78 345 KMAERKGRIFGGA------------WERAMRIAMQIMGRE--VT-EEYTRLETVWRDPSTPTVAAKADAVS---KLYANG 406 (480) T ss_pred HHHHHHHHHHHHH------------HHHHHHHHHHHcCCC--cc-ccceeeeEEecCCCCCCHHHHHHHHH---HHHHhc Confidence 2233333333333 3333433321 111 11 12222223332222 2222233332 222222 Q ss_pred cChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhh------------hhhc Q lcl|NC_012418. 439 APIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASD------------MTNA 506 (510) Q Consensus 439 ~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~------------~~~~ 506 (510) .++ +.-+ .+...+|..+. ..+|++.+++++.+.... + +.++....+.+ +..+ T Consensus 407 ~~~------~s~e----t~~~~lg~~~d----~~~e~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (480) T protein:vir:78 407 QGP------IPKE----QARIDLGYTAT----QREQMRDWDKQETEDMID-T-LYSTTKAQADATPKPTVTETKTETQTS 470 (480) T ss_pred ccC------CCHH----HHHhcCCCCHh----HHHHHHHHHHHHHHHHHH-H-hhccccCCCccccCCCCCCCCCccCCC Confidence 221 1111 12334665432 112222222222211111 1 11111111111 1122 Q ss_pred ccCC Q lcl|NC_012418. 507 LAGV 510 (510) Q Consensus 507 ~ag~ 510 (510) +.|- T Consensus 471 ~~~~ 474 (480) T protein:vir:78 471 PSGF 474 (480) T ss_pred cccC Confidence 2222 No 80 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=97.22 E-value=0.00013 Score=41.98 Aligned_cols=431 Identities=10% Similarity=-0.026 Sum_probs=180.4 Q ss_pred ChhHHHHHHHHHhhc---------------------------cchHHHHHHHHHhcc-----cccCCCCCCccccccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDG---------------------------SVEQRAIEFAKTTLP-----YLMVDPMSGSRGVVEHDF 48 (510) Q Consensus 1 ~~~~~~~r~~~lkr~---------------------------~~~~~w~e~~~~~lP-----~~~~~~~~~~~~~~~~~~ 48 (510) ++.++..+|....+. .-.++++.+.+|..- .+.... ........++. 
T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~-~~~~~~~~ki~ 91 (511) T protein:vir:93 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVA 91 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC-cccccCcceee Confidence 333444443322110 011233334444332 111111 11111223566 Q ss_pred cchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH Q lcl|NC_012418. 49 QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLL 128 (510) Q Consensus 49 dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl 128 (510) .+.+...++..++-|++ -|+ +++.+++. +.+ .+...+..++|.....++.++. T Consensus 92 ~n~~k~Iv~~~~~yl~g--~p~-----~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~ 144 (511) T protein:vir:93 92 HDYASYISDFINGYFLG--NPI-----QYQDDDKD-------------VLE-------VIEAFNDLNDVESHNRSLGLDL 144 (511) T ss_pred cchHHHHHHHHhhhhcc--cCe-----eeccCChH-------------HHH-------HHHHHHhhcCHhHHHHHHHHHH Confidence 66777777777765543 222 12333322 122 3334456778999999999999 Q ss_pred HhhCeEEEEE--eCCCC-cEEEEEece-EEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE Q lcl|NC_012418. 129 IVTGNALLYR--NSDEA-TVVAWSLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH 203 (510) Q Consensus 129 ~~~G~~~l~~--~~~~~-~~~~~pl~~-~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~ 203 (510) .++|.+.+++ +++.. +++.++..+ |++--| ..+++...+|.+.....+ ....+.-..+++|+. T Consensus 145 ~~~G~ay~~vy~de~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~iyt~ 212 (511) T protein:vir:93 145 SIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTS 212 (511) T ss_pred HhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEeC Confidence 9999976555 43322 355565555 444333 246776666655432111 000011112233310 Q ss_pred ---EEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 
204 ---VQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) Q Consensus 204 ---v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~ 280 (510) ++...++..+.. .... .......+++.+|++.++- +..|+|=.+..++-+..++.+.-......+. T Consensus 213 ~~i~~~~~~~~~~~~-----~~~~-~~~~~~~~~g~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~ 281 (511) T protein:vir:93 213 HGVYRYLTSRTNGLK-----LTPR-ENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSD 281 (511) T ss_pred CcEEEEEecCCCccc-----cccc-cccccccCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Confidence 011111100000 0001 1111222345789887653 4578998899999999999887777777887 Q ss_pred hhCCceeeCCCcccchhhhccCCCce-e-------ec------CCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 281 SLEVLNLVDEAKGAVVDDYQDAEMGD-Y-------VP------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY 346 (510) Q Consensus 281 a~~p~~l~~~~g~~~p~~~~~~~~g~-~-------~p------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~ 346 (510) ..+|.+++.-......+.+..-..+. + .. +...+++.+ ....+.......++.+++.|...-+. T Consensus 282 ~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~L~~~I~~~s~~ 359 (511) T protein:vir:93 282 LNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNT 359 (511) T ss_pred hhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhCC Confidence 88887665321112222221111111 1 00 111122222 22235566677778887777544322 Q ss_pred -ccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcc--cccceee--ecHHHHH Q lcl|NC_012418. 347 -GAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK--QHKPAIE--TGLPALS 420 (510) Q Consensus 347 -~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~--~~~~~~v--~~is~L~ 420 (510) +.. ..-+...|+..+...-. .+........+.-.+.+.-+++.++.++...+-...+.+ .+.+..- .+.+... 
T Consensus 360 P~~~~~~~~~n~Sg~Al~~~~~-~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e 438 (511) T protein:vir:93 360 PNMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIE 438 (511) T ss_pred cccccccccccchHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHH Confidence 111 11223456665543222 111122222333233333333444444433222222222 2333331 1233222 Q ss_pred HHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhh Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEG 499 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~g 499 (510) .++.+.++ .+. +-...++. .++ |+ -.++|++++.++++.++.+++........+ T Consensus 439 ~~~~~~kl------~g~----------iS~et~~~----~l~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 493 (511) T protein:vir:93 439 ELKAYIDS------GGK----------ISQTTLMS----LFSFFQ-----DPELEVKKIEEDEKESIKKAQKGIYKDPRD 493 (511) T ss_pred HHHHHHHH------hcc----------CchHHHHH----hCCCCC-----CHHHHHHHHHHHHHHHHHHHhhhcccCCCC Confidence 22221111 111 11122222 232 22 235677777766554443333221111112 Q ss_pred hhhhhhcccCC Q lcl|NC_012418. 500 ASDMTNALAGV 510 (510) Q Consensus 500 a~~~~~~~ag~ 510 (510) ..+......+- T Consensus 494 ~~~~~~~~~~~ 504 (511) T protein:vir:93 494 INDDEQDDDTK 504 (511) T ss_pred CCCCCCCCccc Confidence 22222222222 No 81 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.21 E-value=0.00013 Score=41.90 Aligned_cols=412 Identities=9% Similarity=0.019 Sum_probs=173.8 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc-----c---cCCCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY-----L---MVDPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~-----~---~~~~~~~-~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) ..+.+.+..++.+ .-..+++.+.+|..=. + ....... ......++..+-+...+++.++-|.+ -| T Consensus 45 ~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G--~p-- 118 (492) T protein:vir:94 45 LEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KP-- 118 (492) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcc--cC-- Confidence 3333333333332 1123445555554321 0 0000000 11122356677788888888776643 12 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~ 148 (510) +.++..|+. +.+.|. .+..++|-....++.++...+|.+.+++ +++.. +++++ T Consensus 119 ---~~~~~~d~~-------------~~~~l~--------~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~ 174 (492) T protein:vir:94 119 ---IAFKHTDDE-------------VVKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 174 (492) T ss_pred ---ceeccCchH-------------HHHHHH--------HHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEE Confidence 112333221 122221 1223578888899999999999987655 43322 35556 Q ss_pred Eece-EEEee-CCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE---Ee-ecCCCceEEEEEEEec Q lcl|NC_012418. 149 SLRS-YAVRR-DATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV---QR-KKGTAMEYAELYHEID 222 (510) Q Consensus 149 pl~~-~~i~~-d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v---~~-~~~~~~p~~sv~~e~~ 222 (510) +..+ |++-- +..+++.-.+|.+... ....+++|+-. +. ..+... 
...+-.+.+ T Consensus 175 ~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~~~-~~~~~~~~~ 233 (492) T protein:vir:94 175 PAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGSL-IPDYSNNLE 233 (492) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEEecCee-eeccccccc Confidence 5544 44433 3567787666655421 11123333211 11 111100 000000011 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhh-c- Q lcl|NC_012418. 223 GVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY-Q- 300 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~-~- 300 (510) +..+ ......+..+|++.++- +.+|.|=.+..++-+..++.+.-.+....+....|.+++.--........ . T Consensus 234 ~~~~-~~~~~~~g~vPvv~~~n-----n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~ 307 (492) T protein:vir:94 234 NSKT-HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRL 307 (492) T ss_pred cccc-cccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHH Confidence 1111 11222345788876654 45799988999999999998887778788888888766531011100110 0 Q ss_pred cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHHH-------HHHHHHHH Q lcl|NC_012418. 301 DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRI-------TAEEAENT 371 (510) Q Consensus 301 ~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~~-------r~~E~~~~ 371 (510) ....+.+.-+..++++.+.. ..+.......++.+++.|...-.. ++. ..-+...|+.-+.. ++.++... 
T Consensus 308 ~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~ 385 (492) T protein:vir:94 308 LRYYGAIKVSDNGGVDTIQV--EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARK 385 (492) T ss_pred HhhccceecCCCCcceeEec--cCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHH Confidence 01112222222333444332 234566677778887777554432 111 12223345543322 23333333 Q ss_pred hchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHH Q lcl|NC_012418. 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLP 451 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d 451 (510) ++..+.+ +++.++.++. .. ....++.+.. ++..|-..+..++.+ .+. .++ +--. T Consensus 386 f~~~l~~--------~~~li~~~~~---~~-~~~~~i~v~f-~~~~p~~~~e~~~~~---~kl----~gi------iS~e 439 (492) T protein:vir:94 386 AKVAIQE--------LLWFVFEHFD---IK-GEHKDVDISF-NYNKVANTELQVQTA---QQS----MGI------VSHE 439 (492) T ss_pred HHHHHHH--------HHHHHHHHhc---CC-cccceeeEEe-cCCCCCCHHHHHHHH---HHH----hcc------CchH Confidence 3333332 2222222222 11 1112233222 111111111111111 111 121 1111 Q ss_pred HHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHH------hhhhhhhhhc Q lcl|NC_012418. 452 KMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETL------LEGASDMTNA 506 (510) Q Consensus 452 ~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~------~~ga~~~~~~ 506 (510) . +...+| ++ -.++|++++.+++++.+++.+...... ..+..+.++. 
T Consensus 440 t----~~~~l~~v~-----d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 440 T----VLENHPFVE-----DLQAELERIEQEQMEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred H----HHHhCCCCC-----CHHHHHHHHHHHHHHHHhhccccccccCCCCccccCCccccCC Confidence 1 223333 22 134677776665544444332221110 0111111111 No 82 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.18 E-value=0.00014 Score=41.73 Aligned_cols=432 Identities=10% Similarity=-0.032 Sum_probs=180.8 Q ss_pred ChhHHHHHHHHHh---------------------------hccchHHHHHHHHHhcc-----cccCCCCCCccccccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLP-----YLMVDPMSGSRGVVEHDF 48 (510) Q Consensus 1 ~~~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~lP-----~~~~~~~~~~~~~~~~~~ 48 (510) +..++..+|.... .....++++++.+|..- ...... ........++. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~ 91 (511) T protein:vir:99 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVA 91 (511) T ss_pred hhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-cccccCcceee Confidence 2222222222110 11112344555555432 111111 11111223566 Q ss_pred cchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH Q lcl|NC_012418. 49 QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLL 128 (510) Q Consensus 49 dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl 128 (510) .+.+...+++.++-|.+ -|+. ++.+++. +. ..+...+..++|.....++.++. 
T Consensus 92 ~n~~k~Iv~~~~~yl~g--~p~~-----~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~ 144 (511) T protein:vir:99 92 HDYASYISDFINGYFLG--NPIQ-----YQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDL 144 (511) T ss_pred cchHHHHHHHHHhhhcc--cCce-----eecCchH-------------HH-------HHHHHHHhhcCHhHHHHHHHHHH Confidence 77777777777765543 2222 2333221 11 23444566778999999999999 Q ss_pred HhhCeEEEEE--eCCC-CcEEEEEeceEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE Q lcl|NC_012418. 129 IVTGNALLYR--NSDE-ATVVAWSLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH 203 (510) Q Consensus 129 ~~~G~~~l~~--~~~~-~~~~~~pl~~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~ 203 (510) .++|.+.+++ +++. -++++++..+.++..|. .+++...+|.+.....+ ....+.-..+++|+. T Consensus 145 ~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~vyt~ 212 (511) T protein:vir:99 145 SIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTS 212 (511) T ss_pred HhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------cCccceEEEEEEEeC Confidence 9999876555 4332 24666666554443443 46776666665442111 000011112233321 Q ss_pred --E-EeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 204 --V-QRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) Q Consensus 204 --v-~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~ 280 (510) + ..+.++..+.. .+... ......++..+|++.++- ...|+|-.+..++-+..++.+.-......+. 
T Consensus 213 ~~i~~~~~~~~~~~~-----~~~~~-~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~ 281 (511) T protein:vir:99 213 HGVYRYLTSRTNGLK-----LTPRE-NGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSD 281 (511) T ss_pred CcEEEEEecCCcccc-----ccccc-cccccCCCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHH Confidence 0 01111111100 01111 112222345789887754 3579999999999999999888888777777 Q ss_pred hhCCceeeCCCcccchhhhccCCC-ceee-------------cCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 281 SLEVLNLVDEAKGAVVDDYQDAEM-GDYV-------------PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY 346 (510) Q Consensus 281 a~~p~~l~~~~g~~~p~~~~~~~~-g~~~-------------pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~ 346 (510) ..+|.+++.-.+......+..... +.+. .+...+++.+. ...+.......++.+++.|...-+. T Consensus 282 ~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~--~~~~~~~~e~~~~~L~~~I~~~s~~ 359 (511) T protein:vir:99 282 LNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNT 359 (511) T ss_pred hhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCC Confidence 778766653222222222211111 1110 01111222222 2234556667777777777443221 Q ss_pred -cc-cCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcc--cccceeee--cHHHHH Q lcl|NC_012418. 347 -GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK--QHKPAIET--GLPALS 420 (510) Q Consensus 347 -~~-~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~--~~~~~~v~--~is~L~ 420 (510) +. ...-+...|+..+..+-.- +.+......+.-.+.+.-+++.++.++...+-...+.+ .+.+.... +.+.+. 
T Consensus 360 P~~~~~~~~gn~Sg~Alk~~~~~-l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e 438 (511) T protein:vir:99 360 PNMKDDNFSGTQSGEAMKYKLFG-LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIE 438 (511) T ss_pred cccccccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHH Confidence 11 1111234566655433221 12222223333333333333444444444332222222 33333321 222222 Q ss_pred HHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhh-- Q lcl|NC_012418. 421 RSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLE-- 498 (510) Q Consensus 421 raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~-- 498 (510) .++.+.++ .+. |....+++.+ -+|+ -.++|++++.++++.++.+++...-.... T Consensus 439 ~~~~~~kl------~Gi----------iS~et~l~~l---~~v~-----D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:99 439 ELKAYIDS------GGK----------ISQTTLMSLF---SFFQ-----DPELEVKKIEEDEKESIKKAQKNMYQDPRNI 494 (511) T ss_pred HHHHHHHH------hcc----------CCHHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHhhcccccCCCC Confidence 22221111 111 1112222221 1222 12567777776655433333322111100 Q ss_pred hhhhhhhcccCC Q lcl|NC_012418. 499 GASDMTNALAGV 510 (510) Q Consensus 499 ga~~~~~~~ag~ 510 (510) ...+.+...-.- T Consensus 495 ~~~~~~~~~~~~ 506 (511) T protein:vir:99 495 NDDEQDDSTKDS 506 (511) T ss_pred CCCCCCCCCcCc Confidence 000111111111 No 83 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.17 E-value=0.00014 Score=41.65 Aligned_cols=412 Identities=9% Similarity=0.009 Sum_probs=176.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc--ccC------CCCC-CccccccccccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMV------DPMS-GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~--~~~------~~~~-~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) ..+.+.+..++.+ .-..+++.+.+|..-. ... .... .......++..+-+...++..++-|.+ .| T Consensus 36 ~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G--~p-- 109 (483) T protein:vir:12 36 LEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KP-- 109 (483) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcc--cC-- Confidence 3333333333332 1123455556654331 000 0000 011122356677777788887776643 22 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEE--EeCCCC-cEEEE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY--RNSDEA-TVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~--~~~~~~-~~~~~ 148 (510) +.++..|+. ..+.| . .+...+|.....++.++...+|.+.++ .+++.. +++++ T Consensus 110 ---~~~~~~d~~-------------~~~~l-------~-~~~~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~ 165 (483) T protein:vir:12 110 ---IAFKHTDDE-------------VVKRI-------D-EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 165 (483) T ss_pred ---ceeccCChH-------------HHHHH-------H-HHHhccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEE Confidence 122333322 11111 1 123457888899999999999987654 444322 46666 Q ss_pred EeceEEEeeC--CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE--E-E-eecCCCceEEEEEEEec Q lcl|NC_012418. 149 SLRSYAVRRD--ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH--V-Q-RKKGTAMEYAELYHEID 222 (510) Q Consensus 149 pl~~~~i~~d--~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~--v-~-~~~~~~~p~~sv~~e~~ 222 (510) +..+.++..| ..+++...+|.+... ....+++|+- + + ...+.. 
.......+.+ T Consensus 166 ~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~~-~~~~~~~~~~ 224 (483) T protein:vir:12 166 PAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGS-LIPDYSNNLE 224 (483) T ss_pred cccceEEEEcCCCCCceEEEEEEEEee--------------------cceEEEEEecCeEEEEEEeCCe-eeeccccccc Confidence 6655443333 457887666665431 0112333321 1 0 011110 0000111111 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhc-- Q lcl|NC_012418. 223 GVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ-- 300 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~-- 300 (510) ...+ .....++..+|++.++- +.+|+|=.+...+-+..+|.+.-......+....|.+++.-.+........ T Consensus 225 ~~~~-~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~ 298 (483) T protein:vir:12 225 NSKT-HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL 298 (483) T ss_pred cccc-ccccCCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHh Confidence 2222 22222345688877653 457999889999999999988778888888888887766421111111110 Q ss_pred cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHH-------HHHHHHHHH Q lcl|NC_012418. 301 DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVR-------ITAEEAENT 371 (510) Q Consensus 301 ~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~-------~r~~E~~~~ 371 (510) ....+.+.....++++.+.. ..+.......++.+++.|...-.. +. ...-+...|+.-+. .++.++... 
T Consensus 299 ~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~ 376 (483) T protein:vir:12 299 LRYYGAIKVSDNGGVDTIQV--EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARK 376 (483) T ss_pred hhhccccccCCCCcceEEee--cCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHH Confidence 11112332222334444332 234566677777777777554322 11 11222334655432 233444444 Q ss_pred hchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccC Q lcl|NC_012418. 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id 449 (510) ++..+.+ +++.++.++. .. ....++.+... .+.+....++-+.+ ++++ +- T Consensus 377 f~~~l~~--------~~~li~~~~~---~~-~~~~~i~v~f~~~~p~~~~~~a~~~~k----------l~Gi------iS 428 (483) T protein:vir:12 377 AKVAIQE--------LLWFVFEHFD---IK-GEHKDVDISFNYNKVANTELQVQTAQQ----------SMGI------VS 428 (483) T ss_pred HHHHHHH--------HHHHHHHHhc---CC-CccceeeEEeCCCCCCCHHHHHHHHHH----------Hhcc------Cc Confidence 4443333 2222223322 11 11122332221 12222222221111 1121 11 Q ss_pred HHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 
450 LPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) -..+ ...++ ++ -.++|++++.+++++.+++++.......-+..+.+ -.+- T Consensus 429 ~et~----~~~~~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~--~~~~ 479 (483) T protein:vir:12 429 HETV----LENHPFVE-----DLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQE--RSNN 479 (483) T ss_pred hHHH----HHhCCCCC-----CHHHHHHHHHHHHHHHHhhcccccccccCCcccCC--CCCc Confidence 1122 22332 22 12467776666555433333221100000000000 0011 No 84 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=97.16 E-value=0.00015 Score=41.59 Aligned_cols=420 Identities=11% Similarity=0.021 Sum_probs=169.1 Q ss_pred Ch---hHHHHHHHHHhhccchHHHHHHHHHhccc---ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcc Q lcl|NC_012418. 1 MK---STAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~~---~~~~~r~~~lkr~~~~~~w~e~~~~~lP~---~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 74 (510) ++ ..+.+..++.+. .-..+++.+.+|..=. ..............++..+.+...++..++-|.+ -|+. T Consensus 13 ~~~~~~~~~~~i~~~~~-~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g--~~~~--- 86 (489) T protein:vir:99 13 SKLWIDQLKNYISRFKA-EQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLG--VPVE--- 86 (489) T ss_pred CCCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhcc--CCce--- Confidence 22 122222222211 1112344444443311 0001111111122356667777777777766653 2222 Q ss_pred cccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeC---CCC--cEEE Q lcl|NC_012418. 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNS---DEA--TVVA 147 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~---~~~--~~~~ 147 (510) ++..|+. +.++|. 
..+...+|.....++.++..++|.+.+ |+.+ ..+ ++.+ T Consensus 87 --~~~~d~~-------------~~~~l~-------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~ 144 (489) T protein:vir:99 87 --YKNENKD-------------LQAAID-------LMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQ 144 (489) T ss_pred --eecCChh-------------HHHHHH-------HHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEE Confidence 2333321 233333 335567888899999999999998764 4422 122 3566 Q ss_pred EEeceEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCee Q lcl|NC_012418. 148 WSLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 148 ~pl~~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~ 225 (510) ++..++++..|. .+++...+|.+...-.+ ......+++|+ ++.-..|.....+.++.. T Consensus 145 ~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~---------------~~~~~~~~~y~-----~~~i~~~~~~~~~~~~~~ 204 (489) T protein:vir:99 145 LPAEQTFVIYDDTYQRNSLMAVHFYDIDYGS---------------GKRKQIIKAYT-----SDTIYTYEDYNLETKGMR 204 (489) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEEecCC---------------CceEEEEEEEe-----CCcEEEEEecCCCcccce Confidence 666665444443 34555555544321000 01111222321 111000000001122222 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch----hh--- Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV----DD--- 298 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p----~~--- 298 (510) +......++..+|++.++. ...|+|-.....+-+-.++.+.-...........|.+++. |...+ .. 
T Consensus 205 ~~~~~~~~~g~vPvv~~~n-----~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~--g~~~~~~~~~~~~~ 277 (489) T protein:vir:99 205 LKDYEGHFFKGVPVNEYAN-----NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIA--GNAYTGADENDYLD 277 (489) T ss_pred ecccccccCCceeEEEeec-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhc--cCCcccccchhhhh Confidence 3223233456789988764 3568888888888888999888888877777777766652 11110 00 Q ss_pred -hccCCCc-----------eeec--------CCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccC-CCCCCC Q lcl|NC_012418. 299 -YQDAEMG-----------DYVP--------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQ-RDAERV 356 (510) Q Consensus 299 -~~~~~~g-----------~~~p--------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~-~~~~~~ 356 (510) ....+++ .+.. |...+++ .+....+.......++.+.+.|...-.. +... ..+... T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~ 355 (489) T protein:vir:99 278 DGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAY--FLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQ 355 (489) T ss_pred hcccccccccccccccccceeeeeccccCcccccccee--eeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccc Confidence 0000000 0000 0000111 1112223445556666666666432211 1111 112334 Q ss_pred CHHHHHH-------HHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcc---cccceee--ecHHHHHHHHH Q lcl|NC_012418. 357 TAEEVRI-------TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK---QHKPAIE--TGLPALSRSAA 424 (510) Q Consensus 357 TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~---~~~~~~v--~~is~L~raq~ 424 (510) |+..+.. +++++...++..+.+ +++.++.++...+...-... 
++.+..- .+.+..+.+ T Consensus 356 Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~--------~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~-- 425 (489) T protein:vir:99 356 SGESMKYKLMASDNYREKQERLFKKGLMR--------RLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIV-- 425 (489) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHH-- Confidence 6655433 245555555554443 33333444432221111111 2332221 122222222 Q ss_pred HHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhh-hhh Q lcl|NC_012418. 425 VQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGA-SDM 503 (510) Q Consensus 425 ~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga-~~~ 503 (510) +.+. +..+.+ -...++..+ -+|+.. ..++|++++++++++.+... +....... .+. T Consensus 426 -~~~~---kl~gii----------s~et~~~~l---~~v~~~---d~~~E~~ri~~E~~~~~~~~---~~~~~~~~~~~~ 482 (489) T protein:vir:99 426 -TAAQ---NLYGIV----------SDQTIFEIL---NTVTGV---DAEAELKRLKEEADKKQSLP---EPRLVGDASGQE 482 (489) T ss_pred -HHHH---HHhccC----------CHHHHHHhc---CCCCch---hHHHHHHHHHHHHHHHhccc---cccccCCCCCCc Confidence 2222 111211 112222221 123211 12345555544433322211 11111111 111 Q ss_pred hhcccCC Q lcl|NC_012418. 504 TNALAGV 510 (510) Q Consensus 504 ~~~~ag~ 510 (510) +++...= T Consensus 483 ~~~~~~p 489 (489) T protein:vir:99 483 EPTAEKP 489 (489) T ss_pred CCCCCCC Confidence 1111111 No 85 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.16 E-value=0.00015 Score=41.57 Aligned_cols=428 Identities=10% Similarity=-0.027 Sum_probs=180.1 Q ss_pred ChhHHHHHHHHHh---------------------------hccchHHHHHHHHHhccc---ccCCCC-CCcccccccccc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLPY---LMVDPM-SGSRGVVEHDFQ 49 (510) Q Consensus 1 ~~~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~lP~---~~~~~~-~~~~~~~~~~~d 49 (510) +++++..+|.... +....++++.+.+|..-. +..... ........++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:78 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeec Confidence 3333333332211 011122344445554321 100010 111112235666 Q ss_pred chHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|NC_012418. 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~ 129 (510) +.+...++..++-|++ -|+. ++.+++. . ...+...+..++|.....++.++.. T Consensus 93 n~~k~Iv~~~~~yl~g--~p~~-----~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:78 93 DYASYISDFINGYFLG--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHhhhhcc--cCce-----eecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHH Confidence 7777777777766543 2221 2333221 1 1234445667789999999999999 Q ss_pred hhCeEEEEE--eCCC-CcEEEEEece-EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE Q lcl|NC_012418. 130 VTGNALLYR--NSDE-ATVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~-~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v 204 (510) .+|.+.+++ +++. -+++.++..+ |++--|. 
.+++...+|.+.....+ ....+.-..+++|+ T Consensus 146 ~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~vyt-- 211 (511) T protein:vir:78 146 IYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFT-- 211 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEe-- Confidence 999876554 4432 2455565544 4443332 46666555555332111 00001111222322 Q ss_pred EeecCCCceEEEEEEEec-Ce------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 205 QRKKGTAMEYAELYHEID-GV------RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLY 277 (510) Q Consensus 205 ~~~~~~~~p~~sv~~e~~-~~------~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~ 277 (510) +++ .+++..+ +. ........+++.+|++.++- ..+|+|=.+..++-+..++.+.-..... T Consensus 212 ---~~~-----i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~ 278 (511) T protein:vir:78 212 ---SHG-----VYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred ---CCc-----EEEEEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 111 0111111 11 11122333456788876643 4579998899999999999887777777 Q ss_pred HHHhhCCceeeCCCcccchhhhccCCCceee--------c------CCcccccccccCcccchHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 278 ELESLEVLNLVDEAKGAVVDDYQDAEMGDYV--------P------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQA 343 (510) Q Consensus 278 ~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~--------p------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 343 (510) .+...+|.+++.-......+.+.....+... . +...+++.+. ...+.......++.+++.|... 
T Consensus 279 ~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~e~~~~~L~~~I~~~ 356 (511) T protein:vir:78 279 MSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMF 356 (511) T ss_pred HHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHH Confidence 7777888766543222333332221111111 1 0111222221 2234556667777777776543 Q ss_pred hhc-ccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCc--ccccceeee--cHH Q lcl|NC_012418. 344 FMY-GAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT--KQHKPAIET--GLP 417 (510) Q Consensus 344 f~~-~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~--~~~~~~~v~--~is 417 (510) -+. +.. ..-+...|+..+...-. .+........++-.+.+.-+++.++.++...+-...+. .++++.... +.+ T Consensus 357 s~~P~~~~~~~~~n~Sg~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n 435 (511) T protein:vir:78 357 TNTPNMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS 435 (511) T ss_pred hCCccccccccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcC Confidence 221 111 11123456665543322 12222233333333444444444455554332221122 233333321 223 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_012418. 418 ALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETL 496 (510) Q Consensus 418 ~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~ 496 (510) .+..++ .+. ...+.++ ...++. .++ |+ -.++|++++.++++.+..+++...... T Consensus 436 ~~e~~d---~~~---kl~G~iS----------~et~l~----~l~~v~-----d~~~El~ri~~E~~~~~~~~~~~~~~~ 490 (511) T protein:vir:78 436 LIEELK---AYI---DSGGKIS----------QTTLMS----LFSFFQ-----DPELEVKKIEEDEKESIKKAQKGIYKD 490 (511) T ss_pred HHHHHH---HHH---HHhccCC----------hHHHHH----hCCCCC-----CHHHHHHHHHHHHHHHHHHHhhccccC Confidence 222222 221 1112111 112222 222 22 124667766665544333332221111 Q ss_pred hhhhhhhhhcccCC Q lcl|NC_012418. 
497 LEGASDMTNALAGV 510 (510) Q Consensus 497 ~~ga~~~~~~~ag~ 510 (510) ..+.........+- T Consensus 491 ~~~~~~~~~~~~~~ 504 (511) T protein:vir:78 491 PRDINDDEQDDDTK 504 (511) T ss_pred CCCCCCCCCCCCcc Confidence 11111111111111 No 86 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.16 E-value=0.00015 Score=41.57 Aligned_cols=428 Identities=10% Similarity=-0.027 Sum_probs=180.1 Q ss_pred ChhHHHHHHHHHh---------------------------hccchHHHHHHHHHhccc---ccCCCC-CCcccccccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLPY---LMVDPM-SGSRGVVEHDFQ 49 (510) Q Consensus 1 ~~~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~lP~---~~~~~~-~~~~~~~~~~~d 49 (510) +++++..+|.... +....++++.+.+|..-. +..... ........++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:96 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeec Confidence 3333333332211 011122344445554321 100010 111112235666 Q ss_pred chHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|NC_012418. 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~ 129 (510) +.+...++..++-|++ -|+. ++.+++. . ...+...+..++|.....++.++.. 
T Consensus 93 n~~k~Iv~~~~~yl~g--~p~~-----~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:96 93 DYASYISDFINGYFLG--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHhhhhcc--cCce-----eecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHH Confidence 7777777777766543 2221 2333221 1 1234445667789999999999999 Q ss_pred hhCeEEEEE--eCCC-CcEEEEEece-EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE Q lcl|NC_012418. 130 VTGNALLYR--NSDE-ATVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~-~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v 204 (510) .+|.+.+++ +++. -+++.++..+ |++--|. .+++...+|.+.....+ ....+.-..+++|+ T Consensus 146 ~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~vyt-- 211 (511) T protein:vir:96 146 IYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFT-- 211 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEe-- Confidence 999876554 4432 2455565544 4443332 46666555555332111 00001111222322 Q ss_pred EeecCCCceEEEEEEEec-Ce------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 205 QRKKGTAMEYAELYHEID-GV------RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLY 277 (510) Q Consensus 205 ~~~~~~~~p~~sv~~e~~-~~------~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~ 277 (510) +++ .+++..+ +. ........+++.+|++.++- ..+|+|=.+..++-+..++.+.-..... 
T Consensus 212 ---~~~-----i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~ 278 (511) T protein:vir:96 212 ---SHG-----VYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred ---CCc-----EEEEEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 111 0111111 11 11122333456788876643 4579998899999999999887777777 Q ss_pred HHHhhCCceeeCCCcccchhhhccCCCceee--------c------CCcccccccccCcccchHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 278 ELESLEVLNLVDEAKGAVVDDYQDAEMGDYV--------P------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQA 343 (510) Q Consensus 278 ~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~--------p------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 343 (510) .+...+|.+++.-......+.+.....+... . +...+++.+. ...+.......++.+++.|... T Consensus 279 ~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~e~~~~~L~~~I~~~ 356 (511) T protein:vir:96 279 MSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMF 356 (511) T ss_pred HHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHH Confidence 7777888766543222333332221111111 1 0111222221 2234556667777777776543 Q ss_pred hhc-ccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCc--ccccceeee--cHH Q lcl|NC_012418. 344 FMY-GAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT--KQHKPAIET--GLP 417 (510) Q Consensus 344 f~~-~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~--~~~~~~~v~--~is 417 (510) -+. +.. ..-+...|+..+...-. .+........++-.+.+.-+++.++.++...+-...+. .++++.... 
+.+ T Consensus 357 s~~P~~~~~~~~~n~Sg~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n 435 (511) T protein:vir:96 357 TNTPNMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS 435 (511) T ss_pred hCCccccccccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcC Confidence 221 111 11123456665543322 12222233333333444444444455554332221122 233333321 223 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_012418. 418 ALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETL 496 (510) Q Consensus 418 ~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~ 496 (510) .+..++ .+. ...+.++ ...++. .++ |+ -.++|++++.++++.+..+++...... T Consensus 436 ~~e~~d---~~~---kl~G~iS----------~et~l~----~l~~v~-----d~~~El~ri~~E~~~~~~~~~~~~~~~ 490 (511) T protein:vir:96 436 LIEELK---AYI---DSGGKIS----------QTTLMS----LFSFFQ-----DPELEVKKIEEDEKESIKKAQKGIYKD 490 (511) T ss_pred HHHHHH---HHH---HHhccCC----------hHHHHH----hCCCCC-----CHHHHHHHHHHHHHHHHHHHhhccccC Confidence 222222 221 1112111 112222 222 22 124667766665544333332221111 Q ss_pred hhhhhhhhhcccCC Q lcl|NC_012418. 497 LEGASDMTNALAGV 510 (510) Q Consensus 497 ~~ga~~~~~~~ag~ 510 (510) ..+.........+- T Consensus 491 ~~~~~~~~~~~~~~ 504 (511) T protein:vir:96 491 PRDINDDEQDDDTK 504 (511) T ss_pred CCCCCCCCCCCCcc Confidence 11111111111111 No 87 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.15 E-value=0.00015 Score=41.54 Aligned_cols=408 Identities=12% Similarity=0.041 Sum_probs=171.4 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCC-CCCcccccc--ccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDP-MSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~-~~~~~~~~~--~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) |-+.-....+.|. -.....+.+.+.+|..-..-... +..-...+. +..-+-...+++.||..|.- -+ | T Consensus 12 l~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~----~G---f 84 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNL----EG---F 84 (474) T ss_pred CChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcc----cc---e Confidence 4444333333331 11122233444444332110000 000001111 12334555666666654431 11 2 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeC--CCC---cEEEEEe Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DEA---TVVAWSL 150 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~--~~~---~~~~~pl 150 (510) +. ++... ... .+++...++++.....+++++..++|.+.+++.. +.. .+++++- T Consensus 85 ~~--~d~~~----------~~~---------~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp 143 (474) T protein:vir:81 85 VW--PDGDL----------DSL---------GGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDA 143 (474) T ss_pred EC--CCCCc----------cch---------HHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEecc Confidence 22 22110 000 1334467889999999999999999999877743 322 3667766 Q ss_pred ceEEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCC-ceEEEE--EEEE--eecCCCceEEEEEEEecCe Q lcl|NC_012418. 151 RSYAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGS-GSVDLY--THVQ--RKKGTAMEYAELYHEIDGV 224 (510) Q Consensus 151 ~~~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~-~~v~i~--~~v~--~~~~~~~p~~sv~~e~~~~ 224 (510) .+..+..|+ .+++...++... + +.+.+ ....+| ..++ .+++.+.. |..+. 
T Consensus 144 ~~~~~~~D~~~~~~~~al~~~~---~---------------~~~g~~~~~~ly~~~~~~~~~~~~~~~~-----w~~~~- 199 (474) T protein:vir:81 144 SEATGEWNRRRRGLNNLLSIID---K---------------DKEGKVLSLALYLDNETVTAQRDKATLK-----WQVDR- 199 (474) T ss_pred ceEEEEEeCCCCcceeeeEEEE---E---------------cCCCcEEEEEEEeCCcEEEEEEcCccce-----eeecc- Confidence 553333343 333332222110 0 01111 112222 0111 11111100 11111 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh------ Q lcl|NC_012418. 225 RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD------ 297 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~------ 297 (510) .+..+ . +|++.+..+..-++.+|+|-. +.+++=+..+|+..-..+..++..+.|...+- |+..++ T Consensus 200 ---~~~~~--g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~--G~~~~~~~d~d~ 271 (474) T protein:vir:81 200 ---DEHVY--G-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL--GADESALKNADG 271 (474) T ss_pred ---CCCCC--C-cceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee--cCChhhcccccc Confidence 12232 2 799999988888899999965 57778889999999888999999999865552 221111 Q ss_pred ---hhccCCCcee--ecCCccccc-------ccccCcccchHHHHHHHHHHHHHHHHHhhcccc-------CCCCCCCCH Q lcl|NC_012418. 298 ---DYQDAEMGDY--VPGGAEAVR-------AYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-------QRDAERVTA 358 (510) Q Consensus 298 ---~~~~~~~g~~--~pg~~~~v~-------~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-------~~~~~~~TA 358 (510) .......+.+ +|+..+... .-++. .++++. -++.++.-|......... 
.....+-+| T Consensus 272 ~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~-~a~l~~---~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~Sa 347 (474) T protein:vir:81 272 TIKSVWEARLGRIKGLPDDADADIPQLARADVKQFP-AASPDA---HWSDINGLAKLFAREASLPDTAVAISGLSNPTSA 347 (474) T ss_pred cccchhhhhHHHHhcCCCcccccccccccccccccC-CCChhH---HHHHHHHHHHHHHhhhCCCHHHhcccccccccHH Confidence 0000000111 222222111 11111 123332 344444444443322211 111111234 Q ss_pred HH-------HHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccccee----eecHHHHHHHHHHHH Q lcl|NC_012418. 359 EE-------VRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI----ETGLPALSRSAAVQS 427 (510) Q Consensus 359 tE-------i~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~----v~~is~L~raq~~~~ 427 (510) .- ...++++|...+|.-+.++ ++.++.+......-..+++..+... ...-|..++|..+.+ T Consensus 348 eAi~a~~~~l~~kae~k~~~fg~~l~~~--------~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~K 419 (474) T protein:vir:81 348 ESYDASQYELIAEAEGAVDDFTPALRKA--------FIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMK 419 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHH Confidence 32 2456777888888766654 2333333332222233344333222 233344333333333 Q ss_pred HHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhh--HHHhhhhhhh Q lcl|NC_012418. 428 MLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQ--ETLLEGASDM 503 (510) Q Consensus 428 ~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~--~~~~~ga~~~ 503 (510) +. +...+++.- ++ ....+|+. ++|+++..+.++++..++.... 
+....++.++ T Consensus 420 l~------~a~~~~~~~-------~~---~~~~lg~t-------~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 420 QL------AAVPWLAET-------EV---GLELIGLT-------PQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred HH------hcccCCCcH-------HH---HHhhcCCC-------HHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 22 222122111 11 22335644 4566554433322222222211 1122233333 No 88 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=97.10 E-value=0.00017 Score=41.25 Aligned_cols=404 Identities=10% Similarity=-0.009 Sum_probs=182.8 Q ss_pred ChhH-HHHHHHHHh-hccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKST-AAMLWEKLR-DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~-~~~r~~~lk-r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) |.+. +.+..++.. +.+....++++|+-.-+-+- ...........++..+.+...++..++-|.+- |+ .+. T Consensus 17 ~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~-~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~~-----~~~ 88 (453) T protein:vir:73 17 ITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISS-QKAKDSWKPDNRLTNNFAKYIVDTFVGYFNGI--PI-----KKT 88 (453) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc-CCCCCccCccceeecchHHHHHHHhhhhhccc--Cc-----eee Confidence 3332 333333332 33444445555553322111 11111122234566777888888877666431 21 123 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEE-eceEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWS-LRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~p-l~~~~ 154 (510) ..++. ..+ .+...+..++|.....++.++...+|.+.+++ +++.. 
++++++ ..-|+ T Consensus 89 ~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~ 148 (453) T protein:vir:73 89 HDDKS-------------VLE-------AMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFM 148 (453) T ss_pred cCChH-------------HHH-------HHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEE Confidence 32221 112 23333566789999999999999999987655 43322 355554 45567 Q ss_pred EeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCe--eecccccc Q lcl|NC_012418. 155 VRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGV--RVGEEGRW 232 (510) Q Consensus 155 i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~--~~~~~~~y 232 (510) +-.|..++....+.++.... +....++||+. + ..++++.++. .+...... T Consensus 149 v~dd~~~~~~~~~i~~~~~~------------------~~~~~~~vyt~-----~-----~i~~~~~~~~~~~~~~~~~~ 200 (453) T protein:vir:73 149 VYDDSIKQKPLFAVYYGFDE------------------EGNLSGTVYTL-----L-----ETISITGKAGEVKFGESTYN 200 (453) T ss_pred EEeCCCCceeEEEEEEEEec------------------CceEEEEEEeC-----C-----eEEEEEecCCceEEccceec Confidence 77676676655555444321 11223444431 1 1112222211 11222223 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCCcee----- Q lcl|NC_012418. 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDY----- 307 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~g~~----- 307 (510) .++.+|++.++ ++.+|+|=.+...+-+-.++.+.-......+....|.+++.- .....+.......+.. 
T Consensus 201 ~~g~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g-~~~~~~~~~~~~~~~~~~~~~ 274 (453) T protein:vir:73 201 VYSDLPIVEYN-----FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLG-AEVDEEDAKNIKDNRLINFFD 274 (453) T ss_pred cCCceeEEEec-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeec-CCCCchhhhcccccccccccc Confidence 34578988654 346799988888888889998888888888888898776631 1111111111111100 Q ss_pred -ecC------CcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccCCCCCCCCHHHHHH-------HHHHHHHHh Q lcl|NC_012418. 308 -VPG------GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRI-------TAEEAENTL 372 (510) Q Consensus 308 -~pg------~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~~TAtEi~~-------r~~E~~~~L 372 (510) .++ ...+++.+. ...+.......++.+++.|...-.. +.........|+.-+.. +++++...+ T Consensus 275 ~~~~~~~~~~~~~d~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~ 352 (453) T protein:vir:73 275 KNSNGQGTNAAKVDVKFLD--KPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKF 352 (453) T ss_pred cccccccccccCceeEEee--ecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHH Confidence 111 111122221 2224455677777787777543321 21111113346655432 333333333 Q ss_pred chhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeee--cHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCH Q lcl|NC_012418. 373 GGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISL 450 (510) Q Consensus 373 Gpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~--~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~ 450 (510) |..+. -+++.+..++...+.. .....+.+..-. +.+.++.++-+.++ .+.+ .. T Consensus 353 ~~~l~--------~~~~li~~~~~~~~~~-~~~~~i~v~f~~~~p~~~~~~a~~~~k~------~gii----------s~ 407 (453) T protein:vir:73 353 QSALN--------RRYSLWSSLSTNASNK-DAWKDIEYTFTRNEPKDIKEQAETANIL------KGIT----------SE 407 (453) T ss_pred HHHHH--------HHHHHHHHHHhccCCc-cccccceEEeCCCCCCCHHHHHHHHHHH------hccC----------cH Confidence 33333 2333334444333321 111233332211 22222222221111 1111 11 Q ss_pred HHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhh---HHHhhhhh Q lcl|NC_012418. 
451 PKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQ---ETLLEGAS 501 (510) Q Consensus 451 d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~---~~~~~ga~ 501 (510) +. +...++. +--.++|++++.+++++++.+++... ..+..|-. T Consensus 408 et----~~~~~~~----~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 408 ET----ALSVISV----IPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HH----HHHhCCC----CCCHHHHHHHHHHHHHHHHHHHHhccCCcchhhhcCC Confidence 11 2233321 11235677777666555444444321 22222333 No 89 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.10 E-value=0.00017 Score=41.22 Aligned_cols=400 Identities=11% Similarity=0.016 Sum_probs=178.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccccCCC-CCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccCC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDP-MSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSEL 79 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~-~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~ 79 (510) .+++-.+||++|+ ++|+=.-+...... .........++..+-+...+++.++-|++- |+ .| .. T Consensus 5 ~~~~~~~r~~~l~---------~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~~--~~---~~ 68 (440) T protein:vir:95 5 FLGSQKQRLAILA---------SYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGN--PV--SI---GV 68 (440) T ss_pred HHHHHHHHHHHHH---------HHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheecc--Cc--eE---ee Confidence 3333333443332 22221111111111 011112223556666666666666555331 22 12 22 Q ss_pred ChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEEeceEEEe Q lcl|NC_012418. 80 TDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSLRSYAVR 156 (510) Q Consensus 80 ~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~pl~~~~i~ 156 (510) .++.. ++..+ .+...+..++|.....++.++..++|.+.+++ +++.. +++.++..+.++. 
T Consensus 69 ~~~~~----------~~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~ 131 (440) T protein:vir:95 69 MEGGS----------ADQLS-------TIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVI 131 (440) T ss_pred CCCcc----------HHHHH-------HHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEE Confidence 22211 11111 23455778899999999999999999987655 44322 3666766665555 Q ss_pred eCCC--CCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEE------EEEeecCCCceEEEEEEEecCeeecc Q lcl|NC_012418. 157 RDAT--GRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYT------HVQRKKGTAMEYAELYHEIDGVRVGE 228 (510) Q Consensus 157 ~d~~--G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~------~v~~~~~~~~p~~sv~~e~~~~~~~~ 228 (510) .|.. +++.-.+|.+... ....++||+ +.....+. ++..... T Consensus 132 ~d~~~~~~~~~~i~~~~~~--------------------~~~~~~vyt~~~~~~~~~~~~~~-----------~~~~~~~ 180 (440) T protein:vir:95 132 RDLTVEQNIIAAVHLPIYA--------------------DKVNMTVYTKDKVITYKPYSNNS-----------VRLVVDD 180 (440) T ss_pred EcCCCCCceEEEEEEEEec--------------------CceEEEEEeCCeEEEEEEecCCc-----------cceeecc Confidence 5654 4566555544211 111233332 11111100 0111111 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC--C-cccchhhhccC-CC Q lcl|NC_012418. 229 EGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE--A-KGAVVDDYQDA-EM 304 (510) Q Consensus 229 ~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~--~-g~~~p~~~~~~-~~ 304 (510) ....++..+|++.++. +.+|.|=.+...+-+..++.+.-......+....|.+++.- . ....++..... .. T Consensus 181 ~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~ 255 (440) T protein:vir:95 181 VKKHSYNDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDA 255 (440) T ss_pred eeeccCceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhc Confidence 1122345789987654 45799999999999999999998888888888888766521 0 01111211111 01 Q ss_pred cee-ec--------CCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHH-------HHHH Q lcl|NC_012418. 
305 GDY-VP--------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVR-------ITAE 366 (510) Q Consensus 305 g~~-~p--------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~-------~r~~ 366 (510) +.+ .+ |...+++.+. ...+.+.....++.++..|...-.. +.. ..-+...|+..+. .+++ T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~lt--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 333 (440) T protein:vir:95 256 NMLFLKTGISTTGQQTTADASYIY--KQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRK 333 (440) T ss_pred cceecccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHH Confidence 111 11 1112233322 2234566677788887777543321 111 1112345776653 3455 Q ss_pred HHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhH Q lcl|NC_012418. 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQL 444 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~ 444 (510) ++...+|..+.++- +..+.++....-.......+.+..- .+.+....++-+.++ .+. T Consensus 334 ~k~~~~~~~l~~~~--------~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl------~g~------- 392 (440) T protein:vir:95 334 DKETYFTKALRRRY--------ELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA------GGE------- 392 (440) T ss_pred HHHHHHHHHHHHHH--------HHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH------hcc------- Confidence 56666665544331 2222222211111111223333221 223333332222221 111 Q ss_pred hhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhh Q lcl|NC_012418. 
445 DPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMT 504 (510) Q Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~ 504 (510) |-...+++ .++ ++-.++|++++.++++.++...+........+....+ T Consensus 393 ---iS~et~~~----~l~-----~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 393 ---ISQETLME----NAS-----FTDYKTEHSRILKQGGSSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred ---CcHHHHHH----hCC-----CCCcHHHHHHHHHHHHHhhhhHHhhccCCCCCCcCCC Confidence 11122222 232 2233567777776655544433332222323333333 No 90 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.08 E-value=0.00018 Score=41.10 Aligned_cols=409 Identities=11% Similarity=0.011 Sum_probs=175.1 Q ss_pred ChhH-HHHHHHHHhhccchHHHHHHHHHhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKST-AAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~-~~~r~~~lkr~~~~~~w~e~~~~~lP~--~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) |... +.+..++- .....++++.+.+|..-. ..... ........++..+-+...++..++-|++- |+ ++ T Consensus 25 ~~~~~i~~~i~~~-~~~~~~~~~~l~~Yy~g~~~i~~~~-~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~--p~-----~~ 95 (470) T protein:vir:99 25 LTSNELLGFIAYN-ETVLKPRYRENMKLYLGKHKILTAP-EKETGADNRIVVNSAKYVVDVYNGYFCGI--EP-----KL 95 (470) T ss_pred cCHHHHHHHHHHH-HHhhHHHHHHHHHHhccccccccCc-ccccCCcceeecchHHHHHHHHhhhhccC--Ce-----eE Confidence 2222 22211111 122223444455554421 01011 11112223555566666777666655432 21 12 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEEeceEE Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSLRSYA 154 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~pl~~~~ 154 (510) +..++. +..+ .+.+.+..++|.....++.++...+|.+.+++ +++.. 
+++.++..+.+ T Consensus 96 ~~~~d~------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~ 156 (470) T protein:vir:99 96 ALLNDS------------SKID-------EIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAF 156 (470) T ss_pred eeCCch------------hHHH-------HHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeE Confidence 222211 0011 23345677899999999999999999876555 44321 35666666655 Q ss_pred EeeCCCCC--eEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEe-cCe---eecc Q lcl|NC_012418. 155 VRRDATGR--WMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEI-DGV---RVGE 228 (510) Q Consensus 155 i~~d~~G~--vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~-~~~---~~~~ 228 (510) +..|..+. +...+|.+... .+.....|..++-. +. .+++.. ++. .... T Consensus 157 ~i~d~~~~~~~~~~vr~~~~~--------------------~~~~~~~~~~~~~~-~~-----~~~~~~~~~~~~~~~~~ 210 (470) T protein:vir:99 157 IIYDDTVQRQPLAFVHYQIDN--------------------SNNWTDAYGVIQYA-DK-----FYKFKGYDIEEDTNAAG 210 (470) T ss_pred EEEcCCCCcceEEEEEEEEEe--------------------cCCeeEEEEEEEec-Ce-----EEEEEeccccccccccc Confidence 55555432 33334333321 01111122222211 11 011111 111 1122 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhh-----hcc-C Q lcl|NC_012418. 229 EGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-----YQD-A 302 (510) Q Consensus 229 ~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~-----~~~-~ 302 (510) ....++..+|++..+ +..+|+|=....++-+..++.+.-......+....|.+.+. |+..++. +.. . 
T Consensus 211 ~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~g~~~~~~~ 283 (470) T protein:vir:99 211 YAINPYGLVPAVEFF-----ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMI--GFKLPEDDEGNPKFDFK 283 (470) T ss_pred ccccCCCccceEeec-----CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcccccccchhhhhh Confidence 222345578887654 35689999999999999999988888888888889887764 2221111 110 1 Q ss_pred CCcee-ecC----CcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHHHH-------HHHH Q lcl|NC_012418. 303 EMGDY-VPG----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRIT-------AEEA 368 (510) Q Consensus 303 ~~g~~-~pg----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~~r-------~~E~ 368 (510) .++.+ +++ ...+++.+. ...+.......++.+.+.|...-.. +. +...+...|+..+..+ ++++ T Consensus 284 ~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~ 361 (470) T protein:vir:99 284 NNRVLYVSQLDPDTNPQIGFIA--KPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSK 361 (470) T ss_pred hcceeeecCCCCCCCCcceEEe--ecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHH Confidence 11111 221 112233222 2234455566677777766443221 11 1112234577666543 3333 Q ss_pred HHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh Q lcl|NC_012418. 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~ 446 (510) ...++..+. 
-+++.++.++...+........+.+..- .+.+.+..++.+.++ .+.++ T Consensus 362 ~~~~~~~l~--------~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl------~giis------- 420 (470) T protein:vir:99 362 ERKFDKSLM--------QLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNA------EGIVS------- 420 (470) T ss_pred HHHHHHHHH--------HHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHH------hccCC------- Confidence 333333333 2333333444333322222233343331 233444444433332 11111 Q ss_pred ccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHH---Hhhhhhhhhh Q lcl|NC_012418. 447 RISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQET---LLEGASDMTN 505 (510) Q Consensus 447 ~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~---~~~ga~~~~~ 505 (510) ...++..+ -+|+ .++|++++.++++..++.++..... ........+- T Consensus 421 ---~et~l~~l---~~vd------~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 421 ---KKTQLGMI---PDIE------PDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred ---HHHHHHhC---CCCC------HHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccCC Confidence 11222221 2233 2356666655443322222211111 1111111111 No 91 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.06 E-value=0.00019 Score=40.98 Aligned_cols=415 Identities=13% Similarity=0.065 Sum_probs=174.8 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc---ccCCC---CCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDP---MSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~---~~~~~---~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 74 (510) |.......+=+-.+.....+|+.+.+|.... ..... .........++..+.+...++..++-|.+ .|. 
T Consensus 30 ~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g------~~~ 103 (481) T protein:vir:10 30 LKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTG------NPI 103 (481) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhcc------CCc Confidence 2222222211111223334566666665432 11111 01111122345566666777766654432 222 Q ss_pred cccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEEec Q lcl|NC_012418. 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSLR 151 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~pl~ 151 (510) .++..++. ..+ .+...+..++|.....++.++..++|.+.+++ +++.. ++++++.. T Consensus 104 -~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~ 162 (481) T protein:vir:10 104 -TITHQDNQ-------------TND-------KIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPK 162 (481) T ss_pred -eEecCChh-------------HHH-------HHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEccc Confidence 12222221 111 33445677789999999999999999876544 44322 35667666 Q ss_pred eEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCee--ec Q lcl|NC_012418. 152 SYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVR--VG 227 (510) Q Consensus 152 ~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~--~~ 227 (510) +.++..|. .+++...+|.++..-. ++..-..+++| .++ ..++++.++.. .. T Consensus 163 ~~~~v~d~~~~~~~~~~i~~~~~~~~---------------~~~~~~~~~~y-----~~~-----~i~~~~~~~~~~~~~ 217 (481) T protein:vir:10 163 STFVVYDQTLDKKVVAGVRYFEKQDK---------------DKVPVQHVEVY-----TTD-----KIYYIEIKGGTYHRV 217 (481) T ss_pred ceEEEEcCCCCCceEEEEEEEEEeeC---------------CCceEEEEEEE-----ecC-----eEEEEEecCCceeec Confidence 65444443 3566665555442210 01111122222 111 11233333322 12 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCC-Cce Q lcl|NC_012418. 
228 EEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-MGD 306 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~-~g~ 306 (510) ......+..+|++..+ ++.+|+|=.....+-+..++.+.-......+....|.+++.-......+...... .+. T Consensus 218 ~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 292 (481) T protein:vir:10 218 EEVEHYYNDVPIIEYL-----NDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANM 292 (481) T ss_pred ccccccCCceeEEEee-----cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccc Confidence 2222234568887654 2467999888888888889888777777778788887766421111122111110 111 Q ss_pred e-ec--------CCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHHHHHHHHHHHhchh Q lcl|NC_012418. 307 Y-VP--------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGT 375 (510) Q Consensus 307 ~-~p--------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~~r~~E~~~~LGpv 375 (510) + .+ +...+++.+.. ..+.+.....++.++..|...-.. +. +...+...|+..+..+..-.... T Consensus 293 ~~~~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k---- 366 (481) T protein:vir:10 293 IHLEPGTNANGSEGKAEVKYVYK--QYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQV---- 366 (481) T ss_pred eeccccccccCCCCCcceeEEee--cCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHH---- Confidence 1 11 11112222211 123455566666666666443221 11 11222334665443322211111 Q ss_pred HhHHHHHHHHHHHHHHHHH----HhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccC Q lcl|NC_012418. 376 YSLLAENLQSPLAYVCLSE----VDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 376 ~~rl~~E~l~Pli~r~~~i----l~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id 449 (510) .++.+ ..+...+.+.+.+ +...+..+.....+.+..- .+.+....++.+.++ .+. |. 
T Consensus 367 ~~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl------~g~----------is 429 (481) T protein:vir:10 367 RAIKE-RLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNAL------SGG----------VS 429 (481) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHH------hcc----------CC Confidence 22221 2233333444443 3222222222223333321 122222232222211 111 11 Q ss_pred HHHHHHHHHHHcCCCHhHccC-CHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 450 LPKMMDTIWAAFSVDTSQFYK-SEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~Gvp~~~i~r-s~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) -..+++ .++ ++. .++|+++++++.++.++..+ ..+..+..+...+| T Consensus 430 ~et~~~----~l~-----~i~d~~~E~~ri~~E~~~~~~~~~------~~~~~~~~~~~~~~ 476 (481) T protein:vir:10 430 ESTRLS----LLD-----FIDNPKEELEKMQEEEAQREKQAD------KRGYGEAFENHLNV 476 (481) T ss_pred hHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHhhhh------hccCCccCCCCCCC Confidence 122222 332 111 24666666655443322221 12222333333333 No 92 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=96.98 E-value=0.00023 Score=40.57 Aligned_cols=435 Identities=10% Similarity=0.010 Sum_probs=185.1 Q ss_pred Ch---------hHHHHHHHHHhhc-cchHHHHHHHHHhcccccCCCCCCcc-----ccccc-cccchHHHHHHHHHHHHH Q lcl|NC_012418. 
1 MK---------STAAMLWEKLRDG-SVEQRAIEFAKTTLPYLMVDPMSGSR-----GVVEH-DFQSAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~~---------~~~~~r~~~lkr~-~~~~~w~e~~~~~lP~~~~~~~~~~~-----~~~~~-~~dstg~~a~~~LAa~l~ 64 (510) |- ..+..+|+..++- .=...|++..+-.||..-..+.+..+ .++.+ .|-+.-.+.++.++ T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~---- 107 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMM---- 107 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHh---- Confidence 21 2233445433311 01234455555556653211111111 11111 23344444555544 Q ss_pred HhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC- Q lcl|NC_012418. 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA- 143 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~- 143 (510) +.+|-- .|.+. ++ ..++.++++| -....+++.-+..++.+...+|-+.+++|-+.. T Consensus 108 G~vfrk-~p~~~--~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~ 164 (535) T protein:vir:80 108 GQVFSR-DPIRQ--LP--------------PALEAIVEDI------DGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVG 164 (535) T ss_pred chhhcC-Cccee--cc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCC Confidence 444421 23332 21 2244555544 245667888888899999999999998984321 Q ss_pred --------------c-EEEEEeceEE-Eee---CCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE Q lcl|NC_012418. 144 --------------T-VVAWSLRSYA-VRR---DATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 144 --------------~-~~~~pl~~~~-i~~---d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v 204 (510) + +..|+-.+.. +.. |..+++.-+..++..+.+. +.|. 
.+.++.|.++ T Consensus 165 ~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~d--d~f~------------~~~~~q~RvL 230 (535) T protein:vir:80 165 RPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQD--DGFE------------TTYVQQWRVL 230 (535) T ss_pred CcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecC--CCcc------------cceeEEEEEE Confidence 1 4445443321 222 3344565566666654332 2332 3344556666 Q ss_pred EeecCCCceEEEEEE-EecC---------eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHH---H Q lcl|NC_012418. 205 QRKKGTAMEYAELYH-EIDG---------VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLL---S 271 (510) Q Consensus 205 ~~~~~~~~p~~sv~~-e~~~---------~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l---~ 271 (510) .+..+++ +.++. ..++ ..+...++ .+.+++|++.|.-..+..++.|. --|=|+..||.- + T Consensus 231 ~~~~~G~---y~v~~~~~~~~~~~~~~~~~~~~~~~g--~~~l~~IPfv~~~~~~~~~~~~~--pPLl~LA~lni~Hy~~ 303 (535) T protein:vir:80 231 QLNAEGN---YQVERWRRETQEEMYYSYSKHVPTDGN--GNPFKEIPFQFIGPLDNNADIDH--PPLLDLCEVNIGHYRN 303 (535) T ss_pred EecCCce---EEEEEEEeecCCccccccceeecccCC--CcccCeeEEEEeecCCCCCCCCc--cchHHHHHHHHHHhhc Confidence 6644332 22211 1111 12222222 14577888888765555444431 123355555543 2 Q ss_pred HHHHHH-HHHhhCCceeeC------CCcccchhhhccCCCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 272 EKLGLY-ELESLEVLNLVD------EAKGAVVDDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQA 343 (510) Q Consensus 272 ~~~l~~-~~~a~~p~~l~~------~~g~~~p~~~~~~~~g~~-~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 343 (510) .+-++. +-.+..|.+.+. ++....+..+..+.+..+ .|- ..+.+.++... ..+. .+.++++++++++. 
T Consensus 304 ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~-~~~~~~~e~~~-~~~a--~~~l~~~e~qM~~l 379 (535) T protein:vir:80 304 SADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQ-GATAGILQITP-NSVP--FEAMTHKESQMIAM 379 (535) T ss_pred hhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccccCCC-CCCcceeeecc-chhH--HHHHHHHHHHHHHH Confidence 333333 344445543332 111222223334333333 232 22234444322 2222 45677777777663 Q ss_pred hhccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhc-CCCCCCccccccee-eecHHHHHH Q lcl|NC_012418. 344 FMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAI-ETGLPALSR 421 (510) Q Consensus 344 f~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~-~l~~~~~~~~~~~~-v~~is~L~r 421 (510) = ..+........||+|...+.+..-.+|.-+..++.+- +.+++.++.+- |. .+.++.++..+ ..++..--- T Consensus 380 G-a~ll~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~a-----l~~aL~~~A~w~G~-~~~~~~~~i~~n~dF~~~~ld 452 (535) T protein:vir:80 380 G-ANLLVKSGGNRTFGEAQQEEASEQSILSACTKNVSMA-----FRKALRWANQFQTG-IVNDETVEYNLNTDFPAARLT 452 (535) T ss_pred H-HHhhccCcccccHHHHHHHHHHHhHHHHHHHHHHHHH-----HHHHHHHHHHHcCC-ccCCCceEEEeccccccccCC Confidence 2 2223344456899999988888888888888877654 34455554331 21 12233333222 112111111 Q ss_pred HHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhh Q lcl|NC_012418. 422 SAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGAS 501 (510) Q Consensus 422 aq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~ 501 (510) ++. +..+..... ++ .|..+.+..++ ...||... -+..++|...+..+...... .+......+.+ T Consensus 453 ~~~---~~all~~~~--~G------~Is~et~~~~L-~r~gvl~~-~~~~eee~~ri~~E~~~~~~---~~g~~~d~~~~ 516 (535) T protein:vir:80 453 PNE---RAELILEWQ--QG------AITFKEMRAGL-RRAGVASE-DDAKAETEGKATVEFIAKTA---AAGKVGDAASG 516 (535) T ss_pred HHH---HHHHHHHHh--cC------CCCHHHHHHHH-HhCCCCCc-ccchHHHHHHHHhhhhhccc---cCCCCCCCCCC Confidence 122 222222222 11 24444555554 55676432 12233332222221111000 11111111111 Q ss_pred hh-----hhcccCC Q lcl|NC_012418. 
502 DM-----TNALAGV 510 (510) Q Consensus 502 ~~-----~~~~ag~ 510 (510) .+ ++.-+|- T Consensus 517 g~~~~~~~~~~~~~ 530 (535) T protein:vir:80 517 GTNKAKLNNGNGGG 530 (535) T ss_pred CCCcCcccCCcccc Confidence 11 1111111 No 93 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=96.86 E-value=0.0003 Score=39.93 Aligned_cols=408 Identities=9% Similarity=0.013 Sum_probs=178.1 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc--ccC-C-----CCC-CccccccccccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMV-D-----PMS-GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~--~~~-~-----~~~-~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++.+.+..++.+ .-..+++.+.+|..-. .+. . ... .......++..+-+...++.+++-|.+ .| T Consensus 25 ~~~~i~~~i~~~~--~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g--~~-- 98 (472) T protein:vir:93 25 LEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KP-- 98 (472) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcc--cC-- Confidence 2333333333322 1123555556654331 000 0 000 011122356678888888888877643 12 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-C--cEEEE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-~--~~~~~ 148 (510) +.+...|+. +.+.| ..+..++|-..+.++.++...+|.+.+++..+. 
+ ++.++ T Consensus 99 ---~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~ 154 (472) T protein:vir:93 99 ---IAFKHTDDE-------------VVKRI--------DEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 154 (472) T ss_pred ---eeeccCChH-------------HHHHH--------HHHHhccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEE Confidence 222333321 12211 122346899999999999999998876554332 2 45666 Q ss_pred Eece-EEEee-CCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE----EeecCCCceEEEEEEEec Q lcl|NC_012418. 149 SLRS-YAVRR-DATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV----QRKKGTAMEYAELYHEID 222 (510) Q Consensus 149 pl~~-~~i~~-d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v----~~~~~~~~p~~sv~~e~~ 222 (510) +..+ |++-- +..+++...+|.+...- ...+++|+-. +...+... ...+..+.+ T Consensus 155 ~p~~~~~i~d~~~~~~~~~~ir~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 213 (472) T protein:vir:93 155 PAEQGIPIWTDKEHEELEAFIRMYKLEN--------------------ETKVEYWDKVTVNYYVYENGSL-IPDYSNNLE 213 (472) T ss_pred cccceEEEEcCCCCCceEEEEEEEEeec--------------------ceeEEEEecCeEEEEEEecCee-eeccccccc Confidence 6555 44432 34677776666654311 1123333210 11111100 000001111 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhc-- Q lcl|NC_012418. 223 GVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ-- 300 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~-- 300 (510) ...+ .....++..+|++.++. +.+|+|=.....+-+..++.+.-......+....|.+++.-.......... 
T Consensus 214 ~~~~-~~~~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 287 (472) T protein:vir:93 214 NSKT-HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL 287 (472) T ss_pred cccc-ccccCCCCCcceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHH Confidence 2222 22223346789887764 458999999999999999988888888888888887766311111111110 Q ss_pred cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc-c-cCCCCCCCCHHHH-------HHHHHHHHHH Q lcl|NC_012418. 301 DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A-NQRDAERVTAEEV-------RITAEEAENT 371 (510) Q Consensus 301 ~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~-~~~~~~~~TAtEi-------~~r~~E~~~~ 371 (510) ....+.+..+..++++.+... .+.......++.+++.|...-..- . ....+...|+.-+ ..+++++... T Consensus 288 ~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~ 365 (472) T protein:vir:93 288 LRYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARK 365 (472) T ss_pred HhhccccccCCCCcceeEeec--CCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHH Confidence 011123322223344444322 345666777777777775543321 1 1122233455443 2244444555 Q ss_pred hchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccccee--eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccC Q lcl|NC_012418. 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id 449 (510) +|..+.++ ++.++.++ +.. ....++.+.. ..+.+..+.++-+.+ ..+.+ . 
T Consensus 366 ~~~~l~~~--------~~li~~~~---~~~-~~~~~i~v~f~~~~p~~~~~~~~~~~k------~~gii----------s 417 (472) T protein:vir:93 366 AKVAIQEL--------LWFVFEHF---DIK-GEHKDVDISFNYNKVANTELQVQTAQQ------SMGIV----------S 417 (472) T ss_pred HHHHHHHH--------HHHHHHHh---CCC-cccceeeEEeCCCCCCCHHHHHHHHHH------HhccC----------c Confidence 55444332 22222222 211 1112222221 112222222221111 11111 1 Q ss_pred HHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 450 LPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) -.. +...++ ++ -.++|++++.++++..+++++.... ...++..-+- T Consensus 418 ~et----~l~~l~~~~-----d~~~E~~ri~~E~~~~~~~~~~~~~------~~~d~~~~~~ 464 (472) T protein:vir:93 418 HET----VLENHPFVE-----DLQAELERIEQEQMEYNKQLPNLDD------GGADGAQQQE 464 (472) T ss_pred hHH----HHHhCCCCC-----CHHHHHHHHHHHHHHHHHhccCcCc------ccCCCCCCCC Confidence 111 222332 22 1346777666655443333322110 0111111111 No 94 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=96.80 E-value=0.00033 Score=39.66 Aligned_cols=410 Identities=12% Similarity=0.024 Sum_probs=156.9 Q ss_pred ChhHHH-HHHHHHh-hccchHHHHHHHHHhccc-----ccCCCCCCcccccc-ccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAA-MLWEKLR-DGSVEQRAIEFAKTTLPY-----LMVDPMSGSRGVVE-HDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~-~r~~~lk-r~~~~~~w~e~~~~~lP~-----~~~~~~~~~~~~~~-~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) +++.+. ..+.++. +. ++++.+.+|..=. +..........++. ++.-+-+..+++.++.. 
++|.+ T Consensus 14 ~~~~~~~~l~~~~~~~~---~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~----l~~~g- 85 (479) T protein:vir:99 14 LAKYLETKVFPKMNTEC---ERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQ----LIVDG- 85 (479) T ss_pred HHHHHHHHHHHHHHHHh---HHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhh----ccccc- Confidence 444343 2333332 22 2344444443311 11000000011111 11224445555555443 34544 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeC-----CC-C--c Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS-----DE-A--T 144 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~-----~~-~--~ 144 (510) |+ .++... .+.+ .+.+..++|....+++.++...+|.+.+++.. +. + + T Consensus 86 --f~--~~d~~~---------~~~~-----------~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~ 141 (479) T protein:vir:99 86 --YR--KTGTNE---------NAKG-----------WDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVAR 141 (479) T ss_pred --cc--CCCchh---------hHHH-----------HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceE Confidence 33 222221 1112 23346678999999999999999998776642 11 1 3 Q ss_pred EEEEEece-EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC Q lcl|NC_012418. 145 VVAWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG 223 (510) Q Consensus 145 ~~~~pl~~-~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~ 223 (510) +++++..+ +++-.|.......+| .++. +.+..+.+|+- .. ..+|....+ T Consensus 142 i~~~~p~~~~~iydd~~~~~~~~~---~~~~------------------~~~~~~~~~~~-----~~----~~~~~~~~~ 191 (479) T protein:vir:99 142 IKCIDPRDAFAIWEDPYWDEWPKY---LLER------------------QPNGQYWWWTE-----ED----YSIFEFKQG 191 (479) T ss_pred EEEechhheEEEecCCcccceeeE---EEee------------------cCceeEEEEec-----ce----EEEEEecCC Confidence 55655444 444444432222222 1111 11112222210 00 000111111 Q ss_pred ee-eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch------ Q lcl|NC_012418. 
224 VR-VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV------ 296 (510) Q Consensus 224 ~~-~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p------ 296 (510) .. .......++..+|++.++-+... +.+|+|=.+..++-+-.++...-.....++..+.|.+.+. |...+ T Consensus 192 ~~~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~ 268 (479) T protein:vir:99 192 KFIYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT--GLMLPEGANAD 268 (479) T ss_pred ceeeccccccCCCCcceEEeecCCCc-CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc--CCCcccccccc Confidence 11 11111122457999998877665 4589998888999889988888888888888888865442 22111 Q ss_pred -hhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccccC----CCCCCCCHHHHHHHHHHHHHH Q lcl|NC_012418. 297 -DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQ----RDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 297 -~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~----~~~~~~TAtEi~~r~~E~~~~ 371 (510) .......++.+... ..+++..++.. +++ ...++.++.-|...+...... ......++.-+.....-+... T Consensus 269 ~~~~~~~~~~i~~~~-~~~~~~~q~~~-~~~---~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~k 343 (479) T protein:vir:99 269 QEKMRFAQESMLISQ-NEKASFGAIPA-APL---DGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQK 343 (479) T ss_pred hhccccccccceeec-CCCceEEEecc-cch---HHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHH Confidence 11111122222211 11223333321 233 334444444444433322111 112234554443322222111 Q ss_pred hchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceee----ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh Q lcl|NC_012418. 372 LGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE----TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v----~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~ 446 (510) .++.+. .+.+-+.+++.++.. .+... +.......+. ..-+..+.++-+.++ ++. 
+.+++ T Consensus 344 ----a~~~~~-~f~~al~~~~~l~~~~~~~~~-~~~~~~i~~~w~~~~~~s~~~~ad~~~kl------~~a-g~is~--- 407 (479) T protein:vir:99 344 ----LFEKQA-TWKASHNQTMRLVNKIEGRTE-EATDLDFTITWQDVTIQSLAQFADAWAKM------VES-LKIPA--- 407 (479) T ss_pred ----HHHHHH-HHHHHHHHHHHHHHHHcCCCc-cccceeeeEEecCCCCCCHHHHHHHHHHH------Hhc-CCCCH--- Confidence 112211 222333444443321 12222 1222222111 122222322222222 111 11111 Q ss_pred ccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHH-hhhhhh----------------hhhcccC Q lcl|NC_012418. 447 RISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETL-LEGASD----------------MTNALAG 509 (510) Q Consensus 447 ~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~-~~ga~~----------------~~~~~ag 509 (510) ..+ +....|++. ++++.+++..+.+..+.+.+.+.. .....+ ....+|+ T Consensus 408 ----et~---l~~l~gv~~-------~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (479) T protein:vir:99 408 ----EGV---WDMIPNLDQ-------STVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEPAS 473 (479) T ss_pred ----HHH---HHhcCCCCH-------HHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcchhc Confidence 112 222236553 333333322222111111111100 000011 1123444 Q ss_pred C Q lcl|NC_012418. 510 V 510 (510) Q Consensus 510 ~ 510 (510) | T Consensus 474 ~ 474 (479) T protein:vir:99 474 L 474 (479) T ss_pred c Confidence 4 No 95 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=96.78 E-value=0.00034 Score=39.57 Aligned_cols=424 Identities=10% Similarity=-0.029 Sum_probs=177.7 Q ss_pred ChhHHHHHHHHH--hhccchHHHHHHHHHhc-------c---cccCCC--C---CCccccccccccchHHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTL-------P---YLMVDP--M---SGSRGVVEHDFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~~~~~~~r~~~l--kr~~~~~~w~e~~~~~l-------P---~~~~~~--~---~~~~~~~~~~~dstg~~a~~~LAa~l 63 (510) -.+.+.+..++- ++.++...++.+-.+.. | ....-. . 
........|+..+-+...+++.++-| T Consensus 16 ~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl 95 (474) T protein:vir:94 16 LPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYL 95 (474) T ss_pred CHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhhe Confidence 111122222211 12223333222211111 1 000000 0 00111123455556666666665554 Q ss_pred HHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCC Q lcl|NC_012418. 64 ARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSD 141 (510) Q Consensus 64 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~ 141 (510) ++- |+. ++..+.. ....++.++| .+.+..++|.....++.++..++|.+.+++ +++ T Consensus 96 ~g~--pv~-----~~~~~~~--------~~~e~~~~~l-------~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~ 153 (474) T protein:vir:94 96 HGV--PVT-----YDLDENA--------EKNEKLKKFI-------TNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTN 153 (474) T ss_pred ecc--cee-----EeeCCCC--------cchHHHHHHH-------HHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCC Confidence 431 332 2221110 1122344444 334666789999999999999999887655 433 Q ss_pred CC-cEEEEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEE Q lcl|NC_012418. 142 EA-TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHE 220 (510) Q Consensus 142 ~~-~~~~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e 220 (510) .. ++++++..+.++-.|..+.....+|.+...- ......+++...+.+.. .+++. 
T Consensus 154 ~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~-------------------~~~~~~~~~~~~y~~~~-----~~~~~ 209 (474) T protein:vir:94 154 GDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKD-------------------DDNGTDYVYAEFYDNAY-----YYVFR 209 (474) T ss_pred CeeEEEEEcccceEEEEcCCCceEEEEEEEEEee-------------------CCCceEEEEEEEEcCce-----EEEEe Confidence 22 4555655554444466777654444443210 11111222222222221 12222 Q ss_pred ecCe---eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh Q lcl|NC_012418. 221 IDGV---RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 221 ~~~~---~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~ 297 (510) .++. ........++..+|++.++ ++.+|.|=.+...+-+..++.+.-......+...+|.+++.-.+ +..+ T Consensus 210 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-~~~~ 283 (474) T protein:vir:94 210 GEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-MSEE 283 (474) T ss_pred ecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-CCch Confidence 2221 1122222334568887653 46789998999999999999988888888888888877664211 1112 Q ss_pred hhc-cCCCceee-cCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 298 DYQ-DAEMGDYV-PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~-~~~~g~~~-pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~~r~~E~~~~LG 373 (510) ... ....|.+. .+...+++.+.. ..+.......++.+++.|...-.. +.. ..-+...|+..+..+-.-.. +.. 
T Consensus 284 ~~~~~~~~~~i~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~-~k~ 360 (474) T protein:vir:94 284 MIQETQKSGAFELFDKDMDVKYLTK--DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALE-NKC 360 (474) T ss_pred hhhhhhhcceeEecCCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHH-HHH Confidence 211 11223432 232333443332 234566777888888877554322 211 11223456666544322111 111 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcC--CCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccC Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDAL--LQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~--l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id 449 (510) -...+.-.+.+.-+++-++.++...+ ..+....++.+... .+.+.+..++-+..+ .+.++ T Consensus 361 ~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl------~g~iS---------- 424 (474) T protein:vir:94 361 MTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL------KGQVS---------- 424 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH------hccCc---------- Confidence 22222222233333344444443322 12211123333322 133333333322222 11111 Q ss_pred HHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhH-HH--hhhhhhhh Q lcl|NC_012418. 450 LPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQE-TL--LEGASDMT 504 (510) Q Consensus 450 ~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~-~~--~~ga~~~~ 504 (510) ...++ ..++ ++ -.++|++++.++.+..++....... .. 
..-..+.+ T Consensus 425 ~et~~----~~l~~v~-----d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 425 ERTRL----GQSQLVD-----DVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred hHHHH----HhCCCCC-----CHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 11122 2232 21 1235666655444332222111100 00 01111122 No 96 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=96.78 E-value=0.00034 Score=39.57 Aligned_cols=424 Identities=10% Similarity=-0.029 Sum_probs=177.7 Q ss_pred ChhHHHHHHHHH--hhccchHHHHHHHHHhc-------c---cccCCC--C---CCccccccccccchHHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTL-------P---YLMVDP--M---SGSRGVVEHDFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~~~~~~~r~~~l--kr~~~~~~w~e~~~~~l-------P---~~~~~~--~---~~~~~~~~~~~dstg~~a~~~LAa~l 63 (510) -.+.+.+..++- ++.++...++.+-.+.. | ....-. . ........|+..+-+...+++.++-| T Consensus 16 ~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl 95 (474) T protein:vir:10 16 LPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYL 95 (474) T ss_pred CHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhhe Confidence 111122222211 12223333222211111 1 000000 0 00111123455556666666665554 Q ss_pred HHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCC Q lcl|NC_012418. 64 ARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSD 141 (510) Q Consensus 64 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~ 141 (510) ++- |+. ++..+.. 
....++.++| .+.+..++|.....++.++..++|.+.+++ +++ T Consensus 96 ~g~--pv~-----~~~~~~~--------~~~e~~~~~l-------~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~ 153 (474) T protein:vir:10 96 HGV--PVT-----YDLDENA--------EKNEKLKKFI-------TNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTN 153 (474) T ss_pred ecc--cee-----EeeCCCC--------cchHHHHHHH-------HHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCC Confidence 431 332 2221110 1122344444 334666789999999999999999887655 433 Q ss_pred CC-cEEEEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEE Q lcl|NC_012418. 142 EA-TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHE 220 (510) Q Consensus 142 ~~-~~~~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e 220 (510) .. ++++++..+.++-.|..+.....+|.+...- ......+++...+.+.. .+++. T Consensus 154 ~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~-------------------~~~~~~~~~~~~y~~~~-----~~~~~ 209 (474) T protein:vir:10 154 GDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKD-------------------DDNGTDYVYAEFYDNAY-----YYVFR 209 (474) T ss_pred CeeEEEEEcccceEEEEcCCCceEEEEEEEEEee-------------------CCCceEEEEEEEEcCce-----EEEEe Confidence 22 4555655554444466777654444443210 11111222222222221 12222 Q ss_pred ecCe---eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchh Q lcl|NC_012418. 221 IDGV---RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 221 ~~~~---~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~ 297 (510) .++. 
........++..+|++.++ ++.+|.|=.+...+-+..++.+.-......+...+|.+++.-.+ +..+ T Consensus 210 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-~~~~ 283 (474) T protein:vir:10 210 GEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-MSEE 283 (474) T ss_pred ecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-CCch Confidence 2221 1122222334568887653 46789998999999999999988888888888888877664211 1112 Q ss_pred hhc-cCCCceee-cCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_012418. 298 DYQ-DAEMGDYV-PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~-~~~~g~~~-pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~~r~~E~~~~LG 373 (510) ... ....|.+. .+...+++.+.. ..+.......++.+++.|...-.. +.. ..-+...|+..+..+-.-.. +.. T Consensus 284 ~~~~~~~~~~i~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~-~k~ 360 (474) T protein:vir:10 284 MIQETQKSGAFELFDKDMDVKYLTK--DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALE-NKC 360 (474) T ss_pred hhhhhhhcceeEecCCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHH-HHH Confidence 211 11223432 232333443332 234566777888888877554322 211 11223456666544322111 111 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcC--CCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccC Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDAL--LQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~--l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id 449 (510) -...+.-.+.+.-+++-++.++...+ ..+....++.+... 
.+.+.+..++-+..+ .+.++ T Consensus 361 ~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl------~g~iS---------- 424 (474) T protein:vir:10 361 MTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL------KGQVS---------- 424 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH------hccCc---------- Confidence 22222222233333344444443322 12211123333322 133333333322222 11111 Q ss_pred HHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhH-HH--hhhhhhhh Q lcl|NC_012418. 450 LPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQE-TL--LEGASDMT 504 (510) Q Consensus 450 ~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~-~~--~~ga~~~~ 504 (510) ...++ ..++ ++ -.++|++++.++.+..++....... .. ..-..+.+ T Consensus 425 ~et~~----~~l~~v~-----d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 425 ERTRL----GQSQLVD-----DVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred hHHHH----HhCCCCC-----CHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 11122 2232 21 1235666655444332222111100 00 01111122 No 97 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=96.75 E-value=0.00036 Score=39.43 Aligned_cols=418 Identities=12% Similarity=0.015 Sum_probs=166.1 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHHhcccc----cCCCCCCcccc-ccccccchHHHHHHHHHHHHHHhhcCcCCcc Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTTLPYL----MVDPMSGSRGV-VEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~~lP~~----~~~~~~~~~~~-~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 74 (510) ....+.+.+..+. +. ++.+.+.+|..-.. .+.......+. 
..+..-+-+..+++.++..| ++.+ T Consensus 28 ~~~l~~~l~~~~~~~~---~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l----~~~g--- 97 (501) T protein:vir:25 28 LGALVADMWRLHISER---QWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNL----SVVG--- 97 (501) T ss_pred HHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhh----cccc--- Confidence 3444555554443 22 24444455533210 00000000000 11122345555555555543 3443 Q ss_pred cccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCCcEEEEEece Q lcl|NC_012418. 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEATVVAWSLRS 152 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~~~~~~pl~~ 152 (510) |++ +|... .+. ++.....++|....+++.++..++|.+.+++ +++...+++++-.+ T Consensus 98 f~~--~d~~~---------~~~-----------l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~~i~~~sp~~ 155 (501) T protein:vir:25 98 YRN--ALAKE---------NDP-----------AWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGPVFRTRSPRQ 155 (501) T ss_pred eec--CCccc---------hHH-----------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCCeEEEecccc Confidence 332 22211 111 2234567889999999999999999987654 44444567776544 Q ss_pred -EEEeeCCC--CCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEE--EEEEeecCCCce-------EEEE-E- Q lcl|NC_012418. 153 -YAVRRDAT--GRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLY--THVQRKKGTAME-------YAEL-Y- 218 (510) Q Consensus 153 -~~i~~d~~--G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~--~~v~~~~~~~~p-------~~sv-~- 218 (510) +++-.|+. .++.-.+|.+..... .+....+++| ++++.....+.. .++. . 
T Consensus 156 ~~~iy~D~~~~~~~~~ai~~~~~~~~----------------~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (501) T protein:vir:25 156 ILAVYADPSVDAWPQYALETWVAQKD----------------AKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPV 219 (501) T ss_pred EEEEEecCCCCcceeEEEEEEeeccc----------------cCcceeEEEecCeeEEEEecCceeeeeccccccccccc Confidence 55655543 234433433321111 0111122222 122211111000 0000 0 Q ss_pred --EEecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcc-cc Q lcl|NC_012418. 219 --HEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKG-AV 295 (510) Q Consensus 219 --~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~-~~ 295 (510) +++.+.........++..||++.+.=+.. .+.+|+|=.+..++=+..+|...-..+..++..+.|...+. |+ .. T Consensus 220 ~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~-~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~--G~~~~ 296 (501) T protein:vir:25 220 NVREVTDVIEHGATFEGKPVCPVVRFVNGRD-ADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVIS--GWTGS 296 (501) T ss_pred cccccccccccccccCCccceeeEeccCccc-cCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh--CCCCC Confidence 00111111111222345688887554433 45689997777888888888887777778887777754432 22 11 Q ss_pred hhhhccCCCcee-e-cCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCCCCHHHH------- Q lcl|NC_012418. 296 VDDYQDAEMGDY-V-PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERVTAEEV------- 361 (510) Q Consensus 296 p~~~~~~~~g~~-~-pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~~TAtEi------- 361 (510) .........|.+ . +|. +.+..++. .++++.... .++.-|+....... 
+.....+.++.-+ T Consensus 297 ~~~~~~~~~~~i~~~~~~--~~~~~q~~-~~~~~~~~~---~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l 370 (501) T protein:vir:25 297 KAEVLKASALRVWTFEDP--EVKAQAFP-PASVEPYNL---ILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQ 370 (501) T ss_pred ccchhhhcccceeccCCC--CceEEEec-ccChHHHHH---HHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHH Confidence 111111122222 2 332 22222332 234544333 34444433332211 1111223355433 Q ss_pred HHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccccee--eecHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 362 RITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLA 439 (510) Q Consensus 362 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~ 439 (510) ..+++.|...+|..+.++ ++.++.+.. +.-+.....+++.. ..+-+..+.|+-+.++. ++ T Consensus 371 ~~ka~~k~~~f~~~l~~~--------~rl~~~~~~--~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~------~~-- 432 (501) T protein:vir:25 371 QRKLAAKRESFGESWEQL--------LRLAAEMDD--DPDTAADSGAEVLWRDTEARSFGAVVDGITKLA------SA-- 432 (501) T ss_pred HHHHHHHHHHHHHHHHHH--------HHHHHHHhC--CCccccceeeeEEecCCCCCCHHHHHHHHHHHH------hc-- Confidence 345566666666666543 122222222 11111111222221 11223333333222221 11 Q ss_pred ChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHH---HHHhh-HH-----HhhhhhhhhhcccCC Q lcl|NC_012418. 440 PIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQ---AQAAQ-ET-----LLEGASDMTNALAGV 510 (510) Q Consensus 440 ~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~---~~~~~-~~-----~~~ga~~~~~~~ag~ 510 (510) +++. + ..+....|+++ ++++++.++++++..+ .++.. .. 
........+...+|+ T Consensus 433 gis~--------e--t~~~~~~g~~~-------~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (501) T protein:vir:25 433 GIPI--------E--HLLSMVPGMTQ-------QTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGV 495 (501) T ss_pred CCCH--------H--HHHHHcCCCCH-------HHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccC Confidence 1111 1 11334556653 3444433332222111 11111 00 000111122233333 No 98 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=96.67 E-value=0.00042 Score=39.07 Aligned_cols=407 Identities=9% Similarity=0.025 Sum_probs=173.8 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc--ccC--CCCC-----CccccccccccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMV--DPMS-----GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~--~~~--~~~~-----~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) .++.+.+..++.+ .-..+++.+.+|..-. ... .... ...+...++..+-+...++..++-|.+ .|+ T Consensus 45 ~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g--~p~- 119 (492) T protein:vir:97 45 LEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KPI- 119 (492) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcc--cCc- Confidence 3333333333332 1123455555554321 000 0000 011122356677778888887776643 221 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~ 148 (510) .++..|+. +.+.| . .+..++|.....++.++...+|.+.+++ +++.. 
+++++ T Consensus 120 ----~~~~~d~~-------------~~~~l-------~-~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~ 174 (492) T protein:vir:97 120 ----AFKHTDDE-------------VVKRI-------D-EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 174 (492) T ss_pred ----eeccCchH-------------HHHHH-------H-HHHhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEE Confidence 22333322 11111 1 1234688889999999999999876554 43322 46666 Q ss_pred Eece-EEEee-CCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEE------EEEeecCCCceEEEEEEE Q lcl|NC_012418. 149 SLRS-YAVRR-DATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYT------HVQRKKGTAMEYAELYHE 220 (510) Q Consensus 149 pl~~-~~i~~-d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~------~v~~~~~~~~p~~sv~~e 220 (510) +..+ |++-- +..+++...+|.+... ....+++|+ .+.. .+.- ......+ T Consensus 175 ~p~~~~~i~d~~~~~~~~~~vr~~~~~--------------------~~~~~~~y~~~~v~~~~~~-~~~~--~~~~~~~ 231 (492) T protein:vir:97 175 PAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYE-NGSL--IPDYSNN 231 (492) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEEe-cCee--eeccccc Confidence 6555 44433 3467887776665421 111233332 1111 1110 0000001 Q ss_pred ecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhh- Q lcl|NC_012418. 221 IDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY- 299 (510) Q Consensus 221 ~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~- 299 (510) .+...+ .....++..+|++.++- +.+|+|=.+..++-+..++.+.-......+....|.+++.-......... 
T Consensus 232 ~~~~~~-~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 305 (492) T protein:vir:97 232 LENSKT-HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFK 305 (492) T ss_pred cccccc-ccccCCCCCcceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHH Confidence 111111 12222345788887654 35789988999999999998877777778888888766531111111111 Q ss_pred cc-CCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHHH-------HHHHHH Q lcl|NC_012418. 300 QD-AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI-------TAEEAE 369 (510) Q Consensus 300 ~~-~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~~-------r~~E~~ 369 (510) .. ...+.+.-+..++++.+... .+.......++.+++.|...-.. +. ...-+...|+.-+.. ++.++. T Consensus 306 ~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~ 383 (492) T protein:vir:97 306 RLLRYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLA 383 (492) T ss_pred HHHhhccceecCCCCcceeEecc--CCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHH Confidence 11 11122222222334443322 34566677777777777554332 11 112223345543322 223333 Q ss_pred HHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh Q lcl|NC_012418. 370 NTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~ 446 (510) ..++. .+++++.++-+ .++. ....++.+... 
.+.+....++-+.++ +++ T Consensus 384 ~~f~~------------~l~~~~~li~~~~~~~-~~~~~i~v~f~~~~p~~~~e~a~~~~kl----------~G~----- 435 (492) T protein:vir:97 384 RKAKV------------AIQELLWFVFEHFDIK-GEHKDVDISFNYNKVANTELQVQTAQQS----------MGI----- 435 (492) T ss_pred HHHHH------------HHHHHHHHHHHHhcCC-cccceeeEEecCCCCCCHHHHHHHHHHH----------hcc----- Confidence 33333 33333333321 1221 11122332221 122222222211111 121 Q ss_pred ccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 447 RISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 447 ~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) |--.. +...++ ++ -.++|++++.++.+..++..+... .+.........+- T Consensus 436 -iS~et----~l~~l~~v~-----d~~~Eleri~~E~~~~~~~~~~~~----~~~~~~~~~~~~~ 486 (492) T protein:vir:97 436 -VSHET----VLENHPFVE-----DLQAELERIEQEQTEYNKQLPNLD----DGGADSAQQQERS 486 (492) T ss_pred -CchHH----HHHhCCCCC-----CHHHHHHHHHHHHHHHHHhhhccc----cCCCCCCcccccc Confidence 11111 222332 22 124677766655544333322211 1111111111111 No 99 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=96.59 E-value=0.00048 Score=38.75 Aligned_cols=412 Identities=9% Similarity=0.011 Sum_probs=173.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcc-----cc-cCCCCC---CccccccccccchHHHHHHHHHHHHHHhhcCcC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLP-----YL-MVDPMS---GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP-----~~-~~~~~~---~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) -++.+.+...+.+ .-..+++.+.+|..= .+ ...... 
.......++..+.+...++..++-|++ -|+ T Consensus 27 ~~~~i~~~i~~~~--~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g--~p~- 101 (478) T protein:vir:10 27 QEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVA--NPV- 101 (478) T ss_pred hHHHHHHHHHHHH--HHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcc--cCc- Confidence 2222222222221 112334444444321 11 000000 011111244456666677776666654 222 Q ss_pred CcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCC-CcEEEE Q lcl|NC_012418. 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~-~~~~~~ 148 (510) .++.+++. ..+ .+...+ .++|.....++.++...+|.+.+++ +++. -++.++ T Consensus 102 ----~~~~~~~~-------------~~~-------~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~ 156 (478) T protein:vir:10 102 ----TFGVDNDK-------------ALK-------QIQHTL-NHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRV 156 (478) T ss_pred ----eeecCChH-------------HHH-------HHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEE Confidence 22333322 111 122222 3688999999999999999877655 4332 135555 Q ss_pred Eece-EEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceEEEEEEEecC Q lcl|NC_012418. 149 SLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEYAELYHEIDG 223 (510) Q Consensus 149 pl~~-~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~~sv~~e~~~ 223 (510) +..+ |.+-.| ..|++...+|.+...- ...+++|+. 
.+....++...........+ T Consensus 157 ~p~~~~~v~d~~~~~~~~~~ir~~~~~~--------------------~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~ 216 (478) T protein:vir:10 157 PAEQAVPIWTNKERDELQAFIRVYELDG--------------------AERVEYWTKDDVTFYELKEGQLIPDFYRSEDH 216 (478) T ss_pred cccceEEEEcCCCCCceEEEEEEEeeeC--------------------ceEEEEEeCCcEEEEEecCCeeeccccccccc Confidence 5555 444333 4688876666654321 112333310 11111222222111111111 Q ss_pred e---eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccc----h Q lcl|NC_012418. 224 V---RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV----V 296 (510) Q Consensus 224 ~---~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~----p 296 (510) . .......+++..+|++.++. +.+|.|-.+...+-+..++.+.-......+....|.+++.--..-. . T Consensus 217 ~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 291 (478) T protein:vir:10 217 IQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFM 291 (478) T ss_pred cccceecccccccCCcceEEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchh Confidence 1 11222234456789887765 4579999999999999999988888888888888877663111111 1 Q ss_pred hhhccCCCceeecCC-cccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHH-------HHHH Q lcl|NC_012418. 297 DDYQDAEMGDYVPGG-AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVR-------ITAE 366 (510) Q Consensus 297 ~~~~~~~~g~~~pg~-~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~-------~r~~ 366 (510) ..+.. .....+++. ..+++.+.. ..+.......++.+++.|...-.. +. ....+...|+..+. .... 
T Consensus 292 ~~~~~-~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~ 368 (478) T protein:vir:10 292 HNLKY-YKAISVAGESGSGVDTIKV--EVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKAN 368 (478) T ss_pred hhhhh-CceeEecCCCCCcceEEee--cCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHH Confidence 11111 112223332 233444332 235667777888888877554322 11 11122345665443 2334 Q ss_pred HHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhh Q lcl|NC_012418. 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQ 443 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q 443 (510) ++...++..+ ++.+.++.. .+. .....++.+... .+.+.+..++.+. ..+++ T Consensus 369 ~~~~~~~~~l------------~~~~~li~~~~~~-~~d~~~i~i~f~~~~p~~~~e~~~~~~----------~~~g~-- 423 (478) T protein:vir:10 369 KLKNKTLTAL------------QELLQYIIDFYRL-DVRVQDIEITFNFNVMVNELENSQIAM----------NSTGL-- 423 (478) T ss_pred HHHHHHHHHH------------HHHHHHHHHHhCC-CcccccceEEeCCCCCCCHHHHHHHHH----------HHhCC-- Confidence 4444444443 333333221 111 111122332221 1122222211111 11111 Q ss_pred HhhccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHH-Hhh-hhhhhhhccc Q lcl|NC_012418. 444 LDPRISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQET-LLE-GASDMTNALA 508 (510) Q Consensus 444 ~~~~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~-~~~-ga~~~~~~~a 508 (510) +.-..++ ..++ |+ -.++|++++.++.+..+++....... ... 
.....+.-.- T Consensus 424 ----iS~et~i----~~~~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 424 ----LSKETIL----GNHSWVQ-----DPVAEMERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQSE 478 (478) T ss_pred ----CChHHHH----HhCCCCC-----CHHHHHHHHHHHHHHHHHhccccCCCCcccccccCcCCCCC Confidence 1111222 2222 21 12466666665554433322111100 000 0000000011 No 100 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=96.55 E-value=0.00052 Score=38.58 Aligned_cols=427 Identities=9% Similarity=0.031 Sum_probs=173.0 Q ss_pred ChhHHHHHHHHHh----hccchHHHHHHHHHhcc--------cccCCCCCCccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 1 MKSTAAMLWEKLR----DGSVEQRAIEFAKTTLP--------YLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~~~~~~~r~~~lk----r~~~~~~w~e~~~~~lP--------~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) ||.-++.-|. -+ .-.....+..++.=..+ .+|............++--+.+...++.+|+-|.+-.. T Consensus 7 ~~~~i~~w~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll~~e~~ 85 (518) T protein:vir:78 7 MTRFIKGWLN-GKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYISGKPL 85 (518) T ss_pred HHHHHHHhhc-CCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhhcCCCc Confidence 7765544442 11 00112222211110000 11222111111111222223467777777776644322 Q ss_pred CcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCCCcEE Q lcl|NC_012418. 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEATVV 146 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~~~~~ 146 (510) . +.++..+.. +.+.++++| .+.+..++|+..+.+...+..+.|.+++ |.+....++. 
T Consensus 86 ~-----i~v~~~~~~---------d~e~~~~~l-------~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~ 144 (518) T protein:vir:78 86 S-----IDVTGVNGS---------KDENLTKQL-------KEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGRPSIS 144 (518) T ss_pred e-----EEecCcccc---------CcHHHHHHH-------HHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCeeEEE Confidence 1 222211110 112234444 4457889999999999999999998774 6665544677 Q ss_pred EEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCC--------CceEEE-- Q lcl|NC_012418. 147 AWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGT--------AMEYAE-- 216 (510) Q Consensus 147 ~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~--------~~p~~s-- 216 (510) .++-..|+... .+|++..+.+-......+ +-++|+.+++..-+ ..++.. T Consensus 145 ~v~ad~~~P~~-~~g~~~~~~f~~~~~~~~--------------------k~~~y~~lE~he~~~~~~~~~~~~~~~I~n 203 (518) T protein:vir:78 145 VHSSSQFWIDF-KNNEPFRFNFFEEIPTSN--------------------KADIYYLVESREIKQWDKEGKKLSGGFVTY 203 (518) T ss_pred EEcCCeeEEEe-ecCcEEEEEEEEEeecCC--------------------cceeEEEEEeeccccccceeecccceeEEE Confidence 77777776654 357776655543322110 11233333322100 001110 Q ss_pred -EE----------------------EEecCeee--ccccccccccCceEEEeeeec-----CCCccccchHHHHHHHHHH Q lcl|NC_012418. 217 -LY----------------------HEIDGVRV--GEEGRWPIHLCPYIVPTWNLA-----PGEHYGRGHVEDYIGDFAK 266 (510) Q Consensus 217 -v~----------------------~e~~~~~~--~~~~~y~~~~~P~~~~Rw~~~-----~g~~YGrgp~~~~L~d~r~ 266 (510) +| ++.++... ....+ ...|+++...+.. .++.||+|-...+.+-++. T Consensus 204 ~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg---~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~ 280 (518) T protein:vir:78 204 SVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIG---LKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFA 280 (518) T ss_pred EEeeecCcccccccccccccccccccccccCccceeeccC---CccceEEeeccccccccccCCCcCcchHhhhhHHHHH Confidence 11 01111110 00111 2357777766544 3677899999999999999 Q ss_pred HHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCCC----------cee--ecCCcc----c---ccccccCcccch- Q lcl|NC_012418. 
267 LSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEM----------GDY--VPGGAE----A---VRAYERGDYNKM- 326 (510) Q Consensus 267 L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~~----------g~~--~pg~~~----~---v~~~~~~~~~~~- 326 (510) ||..--+...-... .++...|++ .++..+. ..... -.+ +.|..+ . +..++ .++ T Consensus 281 lD~~~s~~~~e~~~-g~~~i~v~~-~~l~~~~-~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~----~~Ir 353 (518) T protein:vir:78 281 VDYFFTVYMREGEK-TKTKIAASE-RMFRKKV-NKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQ----GDFR 353 (518) T ss_pred HHHHHHHHHHHHHh-CCceeeech-hHhccCC-CCCCCccccccCCCCceEEEecCcCCCCCccccceeeee----cccC Confidence 99988777776654 666656643 3332111 00000 001 111110 0 11111 112 Q ss_pred -HHHHHHHHHHHHHHHHHh-hc-cccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhc-CC-C Q lcl|NC_012418. 327 -AAIQQSLQAVVVRLNQAF-MY-GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LL-Q 401 (510) Q Consensus 327 -~~~~~~i~~~~~~I~~af-~~-~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~-~l-~ 401 (510) ..-...++.+-+.|.... +. ..+.-++...|||||..+.+...+.+--.-..+ ...|.-|+.-.+.++.-. +. . T Consensus 354 ~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~-e~al~~l~~~i~~l~~~~~~~~~ 432 (518) T protein:vir:78 354 DGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKRLI-QNVYEQMLWDFLYLLTGGTNNKE 432 (518) T ss_pred hHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcCccc Confidence 112223333333332222 11 112223445899999988888766643211111 111222333333333221 11 1 Q ss_pred -CCCcccccce--eee--cHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHH Q lcl|NC_012418. 402 -GLITKQHKPA--IET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQ 476 (510) Q Consensus 402 -~~~~~~~~~~--~v~--~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~ 476 (510) ..+.++..+. .=. 
..+..++++....+ ++ +++ +-.+.+++.+ ..| -+++|++ T Consensus 433 ~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~------v~--aGi------mS~e~~i~~~--~~~-------~~deea~ 489 (518) T protein:vir:78 433 KAIMRDEIRVIIEFPDPMSVNLNELSSTLNNM------NS--ALA------MSVEEKVKLI--HPK-------WEDEEIQ 489 (518) T ss_pred cccCCCceeEEEEeCCCCCCCHHHHHHHHHHH------Hh--cCC------CCHHHHHHHh--CCC-------CCHHHHH Confidence 1122222222 211 22222332222211 11 121 1122333322 112 2556555 Q ss_pred HHHHHHHHHHHHHHHh-hHHHhhhhhhhhhcccC Q lcl|NC_012418. 477 AEAEQRRQQAAQAQAA-QETLLEGASDMTNALAG 509 (510) Q Consensus 477 ~~r~q~~q~~~~~~~~-~~~~~~ga~~~~~~~ag 509 (510) ++.++-+++ ++.+. +.+...+++. +.-| T Consensus 490 ~e~~ri~~E--~~~~~~~~p~~~~g~~---~~~g 518 (518) T protein:vir:78 490 AEVKRIYLE--NAIGEVPDPEAIGGME---TKGG 518 (518) T ss_pred HHHHHHHHH--hcccCCCCCccccCCC---CCCC Confidence 443332221 11111 1111111111 1111 No 101 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=96.46 E-value=0.0006 Score=38.23 Aligned_cols=402 Identities=11% Similarity=-0.001 Sum_probs=178.5 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~--~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) -.+.+.+..++.+. -..+.+.+.+|..-. 
..............++..+.+...+++.++-|++ .| +.++ T Consensus 18 ~~~~i~~~i~~~~~--~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g------~~-~~~~ 88 (452) T protein:vir:36 18 TVEVVTKFMEKHKL--EVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNG------IP-VKKS 88 (452) T ss_pred CHHHHHHHHHHHHH--HHHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhhcc------cC-ceee Confidence 22334443343321 123455556665532 1111111111223356667777777777766653 11 2233 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEEeceEEE Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSLRSYAV 155 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~pl~~~~i 155 (510) ..++. .. ..+...+..++|....+++.++...+|.+.+++ +++.. +++.++..+.+. T Consensus 89 ~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (452) T protein:vir:36 89 HSDKE-------------IL-------TKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFM 148 (452) T ss_pred cCChh-------------HH-------HHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEE Confidence 33321 11 234455667899999999999999999987655 43321 355565554333 Q ss_pred eeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC--eeeccccc Q lcl|NC_012418. 156 RRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG--VRVGEEGR 231 (510) Q Consensus 156 ~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~--~~~~~~~~ 231 (510) ..|. .+.+...+|.+.- .+....+++|+. + ..++++.++ ..+..... 
T Consensus 149 v~d~~~~~~~~~~i~~~~~-------------------~~~~~~~~vyt~-----~-----~i~~~~~~~~~~~~~~~~~ 199 (452) T protein:vir:36 149 VYDDTVKQEPLFAVRYGVD-------------------EDKKLQGEVYTL-----L-----ETIKISGENDEISFGEGTY 199 (452) T ss_pred EEcCCCCCceEEEEEEEEe-------------------cCceEEEEEEec-----C-----eEEEEEEcCCceEEeccee Confidence 3333 3445444443321 011123344421 1 112222221 12222222 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCC-Ccee-ec Q lcl|NC_012418. 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-MGDY-VP 309 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~-~g~~-~p 309 (510) .++..+|++..+. +..|+|=.....+-+..++.+.-......+...+|.+++.- .....+...... ++.+ ++ T Consensus 200 ~~~g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~~~~~~~~~~~~ 273 (452) T protein:vir:36 200 NPYPDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLKNIRSNRVINYY 273 (452) T ss_pred ccCCcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec-CCcCchhhhhhhhcceEEec Confidence 2345688877644 34688888888888999999888888888899999777642 222223322221 1221 22 Q ss_pred CCc----ccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccCCCCCCCCHHHHHHH-------HHHHHHHhchhHh Q lcl|NC_012418. 310 GGA----EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRIT-------AEEAENTLGGTYS 377 (510) Q Consensus 310 g~~----~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~~TAtEi~~r-------~~E~~~~LGpv~~ 377 (510) ... .+++.+. ...+.......++.+++.|...-.. +.........|+..+..+ +.++...++..+. 
T Consensus 274 ~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 351 (452) T protein:vir:36 274 ADGEGKNVDVKFLE--KPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLN 351 (452) T ss_pred CCCCccCCcceeEe--ecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 1222222 2234566677777777777443322 221112134566655432 3333333443333 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHH Q lcl|NC_012418. 378 LLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMD 455 (510) Q Consensus 378 rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~ 455 (510) .+++-++.++...+.. ....++.+..- .+.+.+..++-+.+ ..+. +....++ T Consensus 352 --------~~~~li~~~~~~~~~~-~~~~~i~i~f~~~~p~d~~~~a~~~~k------~~g~----------iS~et~~- 405 (452) T protein:vir:36 352 --------SRYKLFCELSTNVSNK-DSWKDIEYTFTRNEPKDIKEQAETANI------LMGI----------TSQETAL- 405 (452) T ss_pred --------HHHHHHHHHHhccCCc-cccccceEEeCCCCCcCHHHHHHHHHH------Hhcc----------CChHHHH- Confidence 3334444444433221 11122332221 12222222221111 1111 1112222 Q ss_pred HHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 456 TIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 456 ~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ..+| ++ -.++|++++.+++++..+..+... . 
.+.-.-++ T Consensus 406 ---~~~~~~~-----d~~~E~~ri~~E~~~~~~~~~~~~----~----~~~~~~~~ 445 (452) T protein:vir:36 406 ---SVISVIP-----DVQAEMEKIKKEEASTAIFDKDKQ----P----SEKGTDTV 445 (452) T ss_pred ---HhCCCCC-----CHHHHHHHHHHHHHHHHHHHhhcc----C----CCCccccc Confidence 3333 21 124666666654433222111110 0 00111111 No 102 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=96.45 E-value=0.00061 Score=38.20 Aligned_cols=412 Identities=12% Similarity=0.037 Sum_probs=172.4 Q ss_pred ChhHHHHHHHHHh-hccchHHHHHHHHH--hcccc---cCCCCCCcc-ccccccccchHHHHHHHHHHHHHHhhcCcCCc Q lcl|NC_012418. 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKT--TLPYL---MVDPMSGSR-GVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~~~~~~~r~~~lk-r~~~~~~w~e~~~~--~lP~~---~~~~~~~~~-~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) ..+.+.+..++.+ |.....+.+++|.- -++.+ ......... ....|+..+-+...++..++-|++ -|+ T Consensus 27 ~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g--~p~--- 101 (474) T protein:vir:96 27 QEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVA--NPV--- 101 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhhcc--cCc--- Confidence 2222333333322 22222233333221 11111 111111111 112245556666666666665544 222 Q ss_pred ccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCC-CcEEEEEe Q lcl|NC_012418. 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~-~~~~~~pl 150 (510) .++.+++.. ...+++| + ..||.....++.++...+|.+.+ |.+++. -++.+++. 
T Consensus 102 --~~~~~d~~~---------~~~l~~~-----------~-~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p 158 (474) T protein:vir:96 102 --TFSSDDDKS---------LKTIQEV-----------L-NHKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPA 158 (474) T ss_pred --eeecCchHH---------HHHHHHH-----------H-hcCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcc Confidence 123333221 1122222 2 35788888999999999998765 454432 23555665 Q ss_pred ceEEEeeC--CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEE--EE-Ee-ecCCCceEEEEEEE--e- Q lcl|NC_012418. 151 RSYAVRRD--ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYT--HV-QR-KKGTAMEYAELYHE--I- 221 (510) Q Consensus 151 ~~~~i~~d--~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~--~v-~~-~~~~~~p~~sv~~e--~- 221 (510) .+.++..| ..+++...+|.++.. ....+++|+ .| +. ..++......++.. . T Consensus 159 ~~~~~v~d~~~~~~~~~~vr~~~~~--------------------~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:96 159 EQAIPIWTNKERDTLKAFIRYYRLD--------------------GAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQ 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEeec--------------------CceEEEEEeCCeEEEEEecCCceeecccccccccc Confidence 56544444 367776666665421 111233331 00 01 11111100001100 0 Q ss_pred cCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhh-c Q lcl|NC_012418. 222 DGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY-Q 300 (510) Q Consensus 222 ~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~-~ 300 (510) .+..+. ....++..+|++.++. +.+|+|=.+...+-+..++.+.-......+...+|.+++.-.+.-..... . 
T Consensus 219 ~~~~~~-~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~ 292 (474) T protein:vir:96 219 SHYYVG-NKRVSWGRVPFIPFKN-----NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMR 292 (474) T ss_pred cccccc-ccccCCCceeEEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhh Confidence 111111 2223456799987775 45799988999999999998888888888888888776532111111111 1 Q ss_pred c-CCCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHHH-------HHHHHH Q lcl|NC_012418. 301 D-AEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI-------TAEEAE 369 (510) Q Consensus 301 ~-~~~g~~-~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~~-------r~~E~~ 369 (510) . ...+.+ .+|...+++.+.. ..+.+.....++.+++.|...-.. +. +...+...|+.-+.. .+.++. T Consensus 293 ~~~~~~~i~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 370 (474) T protein:vir:96 293 NLKYYKAINVDGDGSGVDTIQI--EVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLK 370 (474) T ss_pred hhhcCceEEecCCCCceeEEee--cCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHH Confidence 1 111222 3444444544432 235667777788777777554321 11 111123345554432 223333 Q ss_pred HHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhc Q lcl|NC_012418. 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR 447 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~ 447 (510) ..++..+.+ +++.+..++ +. ......+.+... .+.+.+..++ .+... + . 
T Consensus 371 ~~~~~~l~~--------~~~~i~~~~---~~-~~~~~~i~i~f~~~~p~~~~e~~~----------~~~~a-g------~ 421 (474) T protein:vir:96 371 NKTLTALQE--------LLQYIIDFY---KL-NIKVQDVEITFNFNVMVNELEQSQ----------IGVQS-Q------Y 421 (474) T ss_pred HHHHHHHHH--------HHHHHHHHh---CC-CcccceeeEEeccCCCcCHHHHHH----------HHHhc-C------C Confidence 333333332 333333332 11 111222332221 1222221111 11111 1 1 Q ss_pred cCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhh--hhhc Q lcl|NC_012418. 448 ISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASD--MTNA 506 (510) Q Consensus 448 id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~--~~~~ 506 (510) +--..++.. ++ |+ -.++|++++.++++...+....+......+... .++- T Consensus 422 iS~et~~~~----~~~v~-----d~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 422 LSKETVVTN----HPWVD-----DPVAELERIEQDNIDFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred CchHHHHHh----CCCCC-----CHHHHHHHHHHHHHHHHhcccccccccccccCCCcccCC Confidence 222223322 22 21 124566666554443332221111111111111 1111 No 103 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=96.33 E-value=0.00074 Score=37.76 Aligned_cols=413 Identities=10% Similarity=0.003 Sum_probs=176.7 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhccc-----c------cCCCC--CCccccccccccchHHHHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPY-----L------MVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLAR 65 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~-----~------~~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~ 65 (510) -.+++.+.-++.. ++.-..+.+.+.+|..-. + ...+. 
...+....++..+-+..-++..++-|.+ T Consensus 2 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~G 81 (470) T protein:vir:10 2 ELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVAS 81 (470) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhheec Confidence 2222222223322 222334555566654431 0 00000 0011222355566666666666655444 Q ss_pred hhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCC- Q lcl|NC_012418. 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE- 142 (510) Q Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~- 142 (510) -|+ .++..+... ...+++| +. .||...+.++.++...+|.+.+ |.+++. T Consensus 82 --~p~-----~~~~~d~~~---------~~~l~~~-----------~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~ 133 (470) T protein:vir:10 82 --VFP-----DIDVGKDAD---------NKKIIDV-----------LG-DDRALTLNGLLVDSSNAGRAWLHYWIDEDGN 133 (470) T ss_pred --cce-----eeecCchHH---------HHHHHHH-----------Hh-hhHHHHHHHHHHHHhhcCeeEEEEEecCCCc Confidence 222 233333221 1123332 22 3677778888899999998765 455442 Q ss_pred CcEEEEEeceEEEeeC-C-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEE---EEEeec--CCC---- Q lcl|NC_012418. 143 ATVVAWSLRSYAVRRD-A-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYT---HVQRKK--GTA---- 211 (510) Q Consensus 143 ~~~~~~pl~~~~i~~d-~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~---~v~~~~--~~~---- 211 (510) -++..++..+.++-.| . .|++..++|.+...-.+ ....-..+++|+ ..+... .+. 
T Consensus 134 ~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~--------------~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~ 199 (470) T protein:vir:10 134 FRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPD--------------SGKYFTVHEYWTDKEAQFFRTNATDSTVIE 199 (470) T ss_pred eEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecC--------------CceEEEEEEEEcCCcEEEEEeecCcceecc Confidence 1355565555444443 3 47776666555432111 000111223331 111111 100 Q ss_pred --ceEEEEEE--EecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCcee Q lcl|NC_012418. 212 --MEYAELYH--EIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNL 287 (510) Q Consensus 212 --~p~~sv~~--e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l 287 (510) ....+... ..++..+ .....++..+|++.++= +.+|.|=.+...+-+-.++.+.-..........+|.++ T Consensus 200 ~~~~~~~~~~~~~~~~~~~-~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 273 (470) T protein:vir:10 200 PYNIITSYDLSAGYETGQS-NTLKHNFGRVPFIEFSK-----NKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (470) T ss_pred ccccccccccccccccccc-cccccCCCeeeEEEeec-----CCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCccee Confidence 00000000 0111111 11122335677776553 46899999999999999999999999999999999888 Q ss_pred eCCCcccch-hhhccCC-Ccee-ecC----CcccccccccCcccchHHHHHHHHHHHHHHHHHhh-ccccCCCCCCCCHH Q lcl|NC_012418. 288 VDEAKGAVV-DDYQDAE-MGDY-VPG----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGANQRDAERVTAE 359 (510) Q Consensus 288 ~~~~g~~~p-~~~~~~~-~g~~-~pg----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~-~~~~~~~~~~~TAt 359 (510) +.-.+..+. +...... .+.+ ++. ...+++.+. ...+.......++.+++.|-+.-+ .+...-.....|+. 
T Consensus 274 l~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt--~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~ 351 (470) T protein:vir:10 274 LTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQ--IDIPVEARDDALKITRKNIFLFGQGIDPANFESSNASGV 351 (470) T ss_pred eecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEe--ecCChHHHHHHHHHHHHHHHHHhCCCCCCccccccchHH Confidence 753222221 1111111 1222 221 112233332 334567778888888888865432 22222222345665 Q ss_pred HHHHH-------HHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCccccccee--eecHHHHHHHHHHHHHH Q lcl|NC_012418. 360 EVRIT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAI--ETGLPALSRSAAVQSML 429 (510) Q Consensus 360 Ei~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~ 429 (510) .+..+ +.+++..++.. +++++.++.+ -++-......+.+.. ..+.+.+..++-+.. T Consensus 352 Alk~~~~~l~~k~~~~~~~~~~~------------l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~-- 417 (470) T protein:vir:10 352 AIKMLYSHLELKAAKTQTYFEHA------------INELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST-- 417 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHHHHHHH-- Confidence 55333 33333333333 3333333211 111111222333322 123333333222111 Q ss_pred HHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccC-CHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhccc Q lcl|NC_012418. 430 NASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYK-SEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALA 508 (510) Q Consensus 430 ~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~r-s~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~a 508 (510) +++ .+.-..++. .++ ++. .++|++++.++++..+...+.. .+.... T Consensus 418 --------~~g------~iS~et~l~----~~p-----~v~D~~~E~eri~~E~~e~~~~~~~~----------~~~~~~ 464 (470) T protein:vir:10 418 --------VAN------YSSKEAVAK----ANP-----IVDDWQQELKDLAKDKEENDPYSNQA----------DELNGK 464 (470) T ss_pred --------Hhc------cCcHHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHHhhccc----------cccCCC Confidence 112 111222222 232 122 2456666665544333322221 122223 Q ss_pred CC Q lcl|NC_012418. 
509 GV 510 (510) Q Consensus 509 g~ 510 (510) || T Consensus 465 ~~ 466 (470) T protein:vir:10 465 GV 466 (470) T ss_pred CC Confidence 44 No 104 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=96.31 E-value=0.00076 Score=37.68 Aligned_cols=421 Identities=10% Similarity=-0.009 Sum_probs=176.3 Q ss_pred ChhHHHHHHHHHh---------------------------hccchHHHHHHHHHhccc-----ccCCCCCCccccccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLPY-----LMVDPMSGSRGVVEHDF 48 (510) Q Consensus 1 ~~~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~lP~-----~~~~~~~~~~~~~~~~~ 48 (510) ++.++..+|..-. +..-.++++++.+|..-. +.... ........++. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~ 91 (511) T protein:vir:10 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVA 91 (511) T ss_pred hhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-cccccCcceee Confidence 3333333332211 000112334444443321 11111 11111223455 Q ss_pred cchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH Q lcl|NC_012418. 49 QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLL 128 (510) Q Consensus 49 dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl 128 (510) .+.+...++..++-|++ -|+ +++.+++. +. 
..+...+..++|.....++.+++ T Consensus 92 ~n~~k~Iv~~~~~yl~g--~p~-----~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~ 144 (511) T protein:vir:10 92 HDYASYISDFINGYFLG--NPI-----QYQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDL 144 (511) T ss_pred cchHHHHHHHHhhhhcc--cCc-----eeecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHH Confidence 66666667766655543 121 12333222 11 23444567778999999999999 Q ss_pred HhhCeEEEEE--eCCCC-cEEEEEece-EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE Q lcl|NC_012418. 129 IVTGNALLYR--NSDEA-TVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH 203 (510) Q Consensus 129 ~~~G~~~l~~--~~~~~-~~~~~pl~~-~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~ 203 (510) ..+|.+.+++ +++.. +++.++..+ |++--|. .+++...+|.+.....+- ...+.-..+++|+ T Consensus 145 ~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~------------~~~~~~~~~~iyt- 211 (511) T protein:vir:10 145 SIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK------------TDEDEVFTVDLFT- 211 (511) T ss_pred HhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc------------CccceEEEEEEEe- Confidence 9999876554 44322 455555544 4443332 356665555554321110 0001111222332 Q ss_pred EEeecCCCceEEEEEEEecCe------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 204 VQRKKGTAMEYAELYHEIDGV------RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLY 277 (510) Q Consensus 204 v~~~~~~~~p~~sv~~e~~~~------~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~ 277 (510) ++. .+ .|...++. ........++..+|++.++- ..+|.|-.+..++-+..++...-..... 
T Consensus 212 ----~~~---i~-~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~ 278 (511) T protein:vir:10 212 ----SHG---VY-RYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANY 278 (511) T ss_pred ----CCc---EE-EEEecCCCcccccccccccccccCcceeEEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 111 00 01111111 11122223456788877653 3578998899999999999877777777 Q ss_pred HHHhhCCceeeCCCcccchhhhccCCCce-e-------ec------CCcccccccccCcccchHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 278 ELESLEVLNLVDEAKGAVVDDYQDAEMGD-Y-------VP------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQA 343 (510) Q Consensus 278 ~~~a~~p~~l~~~~g~~~p~~~~~~~~g~-~-------~p------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 343 (510) .....+|.+++.-........+..-..+. + .. +...+++.+ ....+.......+..+++.|... T Consensus 279 ~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l--~~~~~~~~~e~~~~~L~~~I~~~ 356 (511) T protein:vir:10 279 MSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMF 356 (511) T ss_pred HHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHH Confidence 77778887665321222222221111111 1 11 111122222 22234566677777777777543 Q ss_pred hhc-ccc-CCCCCCCCHHHHHHH-------HHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcc--ccccee Q lcl|NC_012418. 344 FMY-GAN-QRDAERVTAEEVRIT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK--QHKPAI 412 (510) Q Consensus 344 f~~-~~~-~~~~~~~TAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~--~~~~~~ 412 (510) -+. +.. ..-+...|+..+..+ ..+++..++..+.+ +++-++.++...+-...+.+ ++.+.. 
T Consensus 357 s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~--------~~~li~~~~~~~~~~~~~~d~~~i~i~f 428 (511) T protein:vir:10 357 TNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR--------RAKLLETILKNTRSIDANKDFNTVRYVY 428 (511) T ss_pred hCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhhCCcccccccceeeEEe Confidence 221 111 111234577665444 44555555544433 23333344433222222222 233322 Q ss_pred ee--cHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 413 ET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQA 489 (510) Q Consensus 413 v~--~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~ 489 (510) -. +.+....++.+.++ .+.++ ...++ ..++ |+ -.++|++++.++++.+.+++ T Consensus 429 ~~~~p~d~~~~~~~~~kl------~G~iS----------~et~~----~~l~~v~-----d~~~E~~ri~~E~~~~~~~~ 483 (511) T protein:vir:10 429 NRNLPKSLIEELKAYIDS------GGKIS----------QTTLM----SLFSFFQ-----DPELEVKKIEEDEKESIKKA 483 (511) T ss_pred CCCCCcCHHHHHHHHHHH------hccCc----------HHHHH----HhCCCCC-----CHHHHHHHHHHHHHHHHHHH Confidence 21 22222222222211 12111 11222 2232 22 12467777766655433333 Q ss_pred HHhhHHHhhhhhhhhhcccC--C Q lcl|NC_012418. 490 QAAQETLLEGASDMTNALAG--V 510 (510) Q Consensus 490 ~~~~~~~~~ga~~~~~~~ag--~ 510 (510) +........+.........+ + T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:10 484 QKGIYKDPRDINDDEQDDDTKDT 506 (511) T ss_pred hhhcccCCCCCCCCCCCCcccCc Confidence 22111010011110000000 0 No 105 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=96.25 E-value=0.00082 Score=37.50 Aligned_cols=403 Identities=9% Similarity=-0.002 Sum_probs=177.2 Q ss_pred Chh-HHHHHHHHHh-hccchHHHHHHHHHhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 
1 MKS-TAAMLWEKLR-DGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~-~~~~r~~~lk-r~~~~~~w~e~~~~~lP~--~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) |-+ .+.+..++++ |. .+.+.+.+|..=. ..............++-.+.+...++..++-|++ .| +. T Consensus 1 l~~~~l~~~i~~~~~~~---~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~-----~~ 70 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFN---LSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIG--VP-----VQ 70 (429) T ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcc--cC-----ce Confidence 322 2333333332 22 3333334443211 0000001111222356667777777777776654 12 22 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-C--cEEEEEece- Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSLRS- 152 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-~--~~~~~pl~~- 152 (510) ++.+++. +.+ .+...+..++|.....++.++...+|.+.+++..+. + ++++++..+ T Consensus 71 ~~~~~~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~ 130 (429) T protein:vir:98 71 TSHENKQ-------------VSN-------YLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEA 130 (429) T ss_pred eecCChH-------------HHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccce Confidence 2332221 222 233445667899999999999999999876554322 2 355564444 Q ss_pred EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEec-Ceeecccc Q lcl|NC_012418. 153 YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEID-GVRVGEEG 230 (510) Q Consensus 153 ~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~-~~~~~~~~ 230 (510) |.+--|. .+++...+|.+.- .+ .+++.+....+. .-+|.+.+ +..+.... 
T Consensus 131 ~~v~dd~~~~~~~~~i~~~~~-------------------~~-----~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 182 (429) T protein:vir:98 131 FIVYDDSIRQKPLFAVRYFYN-------------------KG-----GVLEGSYSDASN----ITYFKDGEKGIEIGESE 182 (429) T ss_pred EEEEeCCCCCceEEEEEEEEe-------------------cC-----ceEEEEEEeCce----EEEEEecCCceEecccc Confidence 4443333 3445444443310 00 112222222221 11122211 12222333 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCC-Ccee-e Q lcl|NC_012418. 231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-MGDY-V 308 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~-~g~~-~ 308 (510) ..++..+|++.++ ++.+|+|=.+..++-+..++.+.-......+....|.+++.. .....+...... .+.+ . T Consensus 183 ~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g-~~~~~~~~~~~~~~~~~~~ 256 (429) T protein:vir:98 183 PHPFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILG-AELDDETLKSLRDTRIINL 256 (429) T ss_pred cccCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-CCCCcchhhhHhhCceeec Confidence 3345678887643 456899999999999999999988888888899998776642 111112122111 1222 2 Q ss_pred cCC---cccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccCCCCCCCCHHHHHH-------HHHHHHHHhchhHh Q lcl|NC_012418. 309 PGG---AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRI-------TAEEAENTLGGTYS 377 (510) Q Consensus 309 pg~---~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~ 377 (510) ++. ..+++.+. ...+.+.....++.+.+.|...-.. +...-.....|+.-+.. +.+++...+|..+. 
T Consensus 257 ~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 334 (429) T protein:vir:98 257 KDTDAQQLTVEFLQ--KPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMN 334 (429) T ss_pred cCCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 221 11233332 2235566677778887777554322 21111112346655433 34444444444333 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHH Q lcl|NC_012418. 378 LLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMD 455 (510) Q Consensus 378 rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~ 455 (510) + +++.+..++...+. +....++.+..- .+.+..+. ++.+. +. ++ + +..+. T Consensus 335 ~--------~~~li~~~~~~~~~-~~d~~~i~v~f~~~~p~~~~~~---a~~~~---kl----~g---~---is~et--- 386 (429) T protein:vir:98 335 R--------RYKLIASYPTSKIG-PKDWIGIKYKFTRNLPANLLEE---SQIAG---NL----AG---I---VSEET--- 386 (429) T ss_pred H--------HHHHHHHHhccCCC-ccccccceEEeCCCCCcCHHHH---HHHHH---HH----hc---c---CchHH--- Confidence 2 22333333332221 111122222221 12222222 22221 11 11 1 11122 Q ss_pred HHHHHcC-CCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhh Q lcl|NC_012418. 456 TIWAAFS-VDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMT 504 (510) Q Consensus 456 ~~a~~~G-vp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~ 504 (510) +...+| ++ -.++|++++.++++...+.++.. 
.-.+.+-...+ T Consensus 387 -~~~~l~~v~-----d~~~E~~ri~~E~~~~~~~~~~~-~~~~~~~~~~~ 429 (429) T protein:vir:98 387 -QVGVLSIVE-----NPQKEIERKNSDKSTLISRQAGG-LNGQNTTTILE 429 (429) T ss_pred -HHHhCCCCC-----CHHHHHHHHHHHHHHHHHHHHhh-hcCCCCCCCCC Confidence 233443 22 12466777666555433322221 11111222222 No 106 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=96.08 E-value=0.001 Score=36.94 Aligned_cols=416 Identities=9% Similarity=0.013 Sum_probs=173.5 Q ss_pred ChhHHHHH-HHHHhhccchHHHHHHHHHhccc-----ccC-CCCC---CccccccccccchHHHHHHHHHHHHHHhhcCc Q lcl|NC_012418. 1 MKSTAAML-WEKLRDGSVEQRAIEFAKTTLPY-----LMV-DPMS---GSRGVVEHDFQSAGALLVNNLAAKLARSLFPT 70 (510) Q Consensus 1 ~~~~~~~r-~~~lkr~~~~~~w~e~~~~~lP~-----~~~-~~~~---~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp 70 (510) ..+++..+ ..+.+ .-..+++.+.+|.... +.. .... .......++..+.+...+++.++-|.+ -|+ T Consensus 26 ~~~~~i~~~i~~~~--~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~~ 101 (478) T protein:vir:10 26 TQEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVA--NPV 101 (478) T ss_pred CcHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhcc--CCe Confidence 11111111 11111 1223455555555431 100 0000 011112245566677777777766553 122 Q ss_pred CCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-C--cEEE Q lcl|NC_012418. 71 GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVA 147 (510) Q Consensus 71 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-~--~~~~ 147 (510) . ++..++. ..+. +...+ ..+|.....++.++...+|.+.+++..+. 
+ ++++ T Consensus 102 ~-----~~~~~d~-------------~~~~-------l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~ 155 (478) T protein:vir:10 102 T-----FGVDNDK-------------ALKQ-------IQHTL-NHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFR 155 (478) T ss_pred e-----eecCChH-------------HHHH-------HHHHH-hcCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEE Confidence 1 2333221 1111 11222 35889999999999999999876554322 3 3445 Q ss_pred EEece-EEEee-CCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE---EEeecCCCceEEEEEEEec Q lcl|NC_012418. 148 WSLRS-YAVRR-DATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH---VQRKKGTAMEYAELYHEID 222 (510) Q Consensus 148 ~pl~~-~~i~~-d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~~~~~p~~sv~~e~~ 222 (510) ++-.+ |.+-- +..|.+...+|.++..- .+.+++|+. .+....++........... T Consensus 156 ~~p~~~~~i~d~~~~~~~~~~v~~~~~~~--------------------~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~ 215 (478) T protein:vir:10 156 VPAEQAVPIWTNKERDELQAFIRVYELDG--------------------AERVEYWTKDDVTYYELKEGQLIPDFYRSDD 215 (478) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEecC--------------------ceEEEEEeCCeEEEEEEcCCeeecccccccc Confidence 55444 44433 34677877777664221 112333321 1111122222211111111 Q ss_pred Cee---eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhh- Q lcl|NC_012418. 223 GVR---VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD- 298 (510) Q Consensus 223 ~~~---~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~- 298 (510) +.. ......+++..+|++.++. ..+|+|=.....+-+..++.+.-......+...+|.+++.-.+..+... 
T Consensus 216 ~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~ 290 (478) T protein:vir:10 216 HIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDF 290 (478) T ss_pred ccccceecccccccCCccceEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchh Confidence 111 1112223446789887754 4689998888999999999888888888888888876653111111111 Q ss_pred hcc-CCCcee-ecCCc-ccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHHHH-------HH Q lcl|NC_012418. 299 YQD-AEMGDY-VPGGA-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRIT-------AE 366 (510) Q Consensus 299 ~~~-~~~g~~-~pg~~-~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~~r-------~~ 366 (510) ... ...+.+ ++|.. .+++.+. ...+.......++.+++.|...-.. +.. ...+...|+..+..+ +. T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 368 (478) T protein:vir:10 291 MHNLKYYKAISVAGESGSGVDTIK--VEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKAN 368 (478) T ss_pred hhhhhhcceEEecCCCCCcceEEe--ecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHH Confidence 111 111222 33322 2333332 2235566677777777777554322 111 122344566655332 33 Q ss_pred HHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhH Q lcl|NC_012418. 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQL 444 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~ 444 (510) ++...++..+.+ ++..++.++ |. 
......+.+..- .+.+.+..++-+.+ ++++ T Consensus 369 ~~~~~~~~~l~~--------~~~li~~~~---g~-~~~~~~i~i~f~~~~p~d~~e~a~~~~k----------l~g~--- 423 (478) T protein:vir:10 369 KLKNKTLTALQE--------LLQYIIDFY---RL-DVKVQDIEITFNFNVMVNELENSQIAMN----------STGL--- 423 (478) T ss_pred HHHHHHHHHHHH--------HHHHHHHHh---CC-CcccccceEEecCCCCCCHHHHHHHHHH----------HhCC--- Confidence 333344433332 222222222 11 111122332221 12233333222111 1221 Q ss_pred hhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHH-hhhhhhhhhcccC Q lcl|NC_012418. 445 DPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETL-LEGASDMTNALAG 509 (510) Q Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~-~~ga~~~~~~~ag 509 (510) +-...++ +.+|. +--.++|++++.++++.+++......... ...-.+.++...= T Consensus 424 ---iS~et~~----~~l~~----v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 424 ---LSKETIL----SNHAW----VEDPVAEMERIEQENIELNQQLPDIEEGLNGEQQRQSENNQPE 478 (478) T ss_pred ---CChHHHH----HhCCC----CCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCCCCCCCCC Confidence 1122233 33331 11124566666655433222211111100 0000001111111 No 107 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=96.01 E-value=0.0011 Score=36.76 Aligned_cols=417 Identities=8% Similarity=-0.053 Sum_probs=175.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc---ccCCCC-CCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPM-SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~---~~~~~~-~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) +=+++..+|. ..-.++++++.+|.... ...... 
........++..+.+...++..++-|.+- |+ + T Consensus 43 ~i~~~i~~~~----~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~--p~-----~ 111 (501) T protein:vir:96 43 LLKNFINHHK----LRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGN--PI-----R 111 (501) T ss_pred HHHHHHHHHH----HHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhccc--Ce-----e Confidence 1111222221 11123455555654431 111111 11112234567777777888777666531 11 2 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEEEEEeceE Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSLRSY 153 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~~~pl~~~ 153 (510) ++..+.. ...++.. .+...+..++|.....++.++...+|.+.+++ +++.. +++.++..+. T Consensus 112 ~~~~~~~---------~~~~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~ 175 (501) T protein:vir:96 112 VEYDDND---------DNSQNDD-------AIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLET 175 (501) T ss_pred EeeCCcc---------chhHHHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccee Confidence 2322221 1122333 34445677899999999999999999987655 43322 3566665554 Q ss_pred EEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC-eeecccc Q lcl|NC_012418. 154 AVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG-VRVGEEG 230 (510) Q Consensus 154 ~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~-~~~~~~~ 230 (510) ++-.|. .|++...+|.+..... +..+.+++.+. ++ ..+++..++ ....... 
T Consensus 176 ~~v~d~~~~~~~~~~v~~~~~~~~-------------------~~~~~~~~vyt--~~-----~i~~~~~~~~~~~~~~~ 229 (501) T protein:vir:96 176 FVIYDNSLEDNSIAAVRYYNRGTL-------------------QSAKDVVEIYT--DE-----HIYTLDASDDFNEISVT 229 (501) T ss_pred EEEEcCCCCCceEEEEEEEEeecC-------------------CCcEEEEEEEc--CC-----cEEEEeeCCCceecccc Confidence 433443 4677666555432111 11112222211 11 112222222 1112222 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccc----hhhhccCCCce Q lcl|NC_012418. 231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV----VDDYQDAEMGD 306 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~----p~~~~~~~~g~ 306 (510) ...+..+|++.++ ++..|+|-.....+-+..++.+.-......+....|.+++.-..... ...+. ..+. T Consensus 230 ~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~--~~~~ 302 (501) T protein:vir:96 230 THAFGTVPITEYL-----NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMK--RTRL 302 (501) T ss_pred ccCCCccceEEec-----CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhh--hcCe Confidence 2234578877653 45689999999999999999988888888888888877663111000 01111 1122 Q ss_pred eec---C----CcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cc-cCCCCCCCCHHHHHHH-------HHHHHH Q lcl|NC_012418. 307 YVP---G----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRIT-------AEEAEN 370 (510) Q Consensus 307 ~~p---g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~~TAtEi~~r-------~~E~~~ 370 (510) +.. + ....+++-.+....+.......++.+++.|...-.. +. +...+...|+..+..+ +.++.. 
T Consensus 303 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~ 382 (501) T protein:vir:96 303 MQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQS 382 (501) T ss_pred eeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHH Confidence 211 1 111122211222223445566667666666443221 11 1112234566665433 333334 Q ss_pred HhchhHhHHHHHHHHHHHHHHHHHHhhcCC-CCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhc Q lcl|NC_012418. 371 TLGGTYSLLAENLQSPLAYVCLSEVDDALL-QGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR 447 (510) Q Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~~~~l-~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~ 447 (510) .++..+. -+++.++.++...+. .......+.+... .+.+.+..++-+.++ .+.+ T Consensus 383 ~~~~~l~--------~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl------~g~i--------- 439 (501) T protein:vir:96 383 QFTKGLK--------RRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL------GGQV--------- 439 (501) T ss_pred HHHHHHH--------HHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHH------hccC--------- Confidence 4443333 233444444433221 1111122333221 122222222221111 1111 Q ss_pred cCHHHHHHHHHHHcCCCHhHccC-CHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhc--ccCC Q lcl|NC_012418. 448 ISLPKMMDTIWAAFSVDTSQFYK-SEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNA--LAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~r-s~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~--~ag~ 510 (510) -...++. .++ ++. .++|++++.++++.+.......+.-...|....... 
.+=- T Consensus 440 -S~et~~~----~l~-----~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~ 495 (501) T protein:vir:96 440 -SQETALS----LSG-----LVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDD 495 (501) T ss_pred -chHHHHH----hCC-----CCCCHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCc Confidence 1122222 222 122 245666655444322111111110011111111100 0000 No 108 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=95.86 E-value=0.0013 Score=36.33 Aligned_cols=428 Identities=8% Similarity=-0.065 Sum_probs=179.5 Q ss_pred Ch-------hHHHHHHHHHhhccchHHHHHHHHHhccc---ccCCC-CCCccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_012418. 1 MK-------STAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDP-MSGSRGVVEHDFQSAGALLVNNLAAKLARSLFP 69 (510) Q Consensus 1 ~~-------~~~~~r~~~lkr~~~~~~w~e~~~~~lP~---~~~~~-~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltp 69 (510) .. ..+.+..++- +..-.++.+++.+|.... ..... .........++..+.+...++..++-|.+. T Consensus 34 ~~~~~~~~~~~i~~~i~~h-~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~--- 109 (502) T protein:vir:48 34 LEELMVNNWELLKNFINHH-KLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGN--- 109 (502) T ss_pred hhhhccccHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhccc--- Confidence 00 1011111110 111123445555554431 11111 011111223455566666666666544421 Q ss_pred cCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCCC-cEE Q lcl|NC_012418. 70 TGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVV 146 (510) Q Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~~-~~~ 146 (510) . ++++..+... ...+.+ .+...+..++|....+++.+++..+|.+.+++ +++.. 
+++ T Consensus 110 -p---~~~~~~d~~~---------~~~~~~-------~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~ 169 (502) T protein:vir:48 110 -P---IRVEYDDNED---------NSQNDD-------AIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIK 169 (502) T ss_pred -C---eeEecCCccc---------hhHHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEE Confidence 1 1223322211 112222 33445677899999999999999999987655 44322 456 Q ss_pred EEEece-EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCe Q lcl|NC_012418. 147 AWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGV 224 (510) Q Consensus 147 ~~pl~~-~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~ 224 (510) .++..+ |.+-.|. .+++...+|.+..... .+....+++|+ ++ ..++++.++. T Consensus 170 ~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~----------------~~~~~~~~iyt-----~~-----~i~~~~~~~~ 223 (502) T protein:vir:48 170 RLSPLETFVIYDNSLEDNSIAAVRYYNRGTL----------------QNAKDVVEIYT-----NQ-----HIYTLDASDS 223 (502) T ss_pred EEcccceEEEEcCCCCCceEEEEEEEEEeec----------------CCcEEEEEEEe-----CC-----eEEEEEeCCc Confidence 665544 5554433 5667666665543211 11112233332 11 1123333321 Q ss_pred -eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccc-hhhhcc- Q lcl|NC_012418. 225 -RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV-VDDYQD- 301 (510) Q Consensus 225 -~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~-p~~~~~- 301 (510) .........+..+|++.++ +...|.|-.+.+++-+..++.+.-......+....|.+.+.-.+... ...... 
T Consensus 224 ~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~ 298 (502) T protein:vir:48 224 FNEISVTPHAFGTVPITEFL-----NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDM 298 (502) T ss_pred eeeccceecCCCccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhh Confidence 2222222234678987664 34579999999999999999988888888888888877664211111 000100 Q ss_pred CCCceeec-------CCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 302 AEMGDYVP-------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 302 ~~~g~~~p-------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~~r~~E~~~~L 372 (510) ...+.+.. |.....++-.+....+.+.....++.+.+.|...=.. +.. ...+...|+..+.....- +... T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~-l~~k 377 (502) T protein:vir:48 299 KRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFG-LDQD 377 (502) T ss_pred hhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHH-HHHH Confidence 01122211 1111122222222234556667777777777543221 111 111234577666533221 1111 Q ss_pred chhHhHHHHHHHHHHHHHHHHHHhhcCC-CCCCcccccceeee--cHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccC Q lcl|NC_012418. 373 GGTYSLLAENLQSPLAYVCLSEVDDALL-QGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 373 Gpv~~rl~~E~l~Pli~r~~~il~~~~l-~~~~~~~~~~~~v~--~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id 449 (510) .....++-.+.+.-+++.++.++...+- .......+++.... +.+....++ .+. 
+..+.+ + T Consensus 378 ~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~---~~~---kl~g~i---S------- 441 (502) T protein:vir:48 378 RVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVS---ILN---DLGGQV---S------- 441 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHH---HHH---HHhccC---c------- Confidence 1222222222233333344444432221 11112223332211 222222222 111 111111 1 Q ss_pred HHHHHHHHHHHcCCCHhHccC-CHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 450 LPKMMDTIWAAFSVDTSQFYK-SEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~Gvp~~~i~r-s~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) -+.+ ...+| ++. .++|++++.+++++...... .......++...+....++ T Consensus 442 ~et~----l~~l~-----~v~D~~~E~~ri~~E~~~~~~~~~-~~~~~~~~~~~~d~~~e~~ 493 (502) T protein:vir:48 442 QETA----LSLSG-----LVENPTEELDKINEESSKIDFKGY-PSYFYDNVGKYTDEVKETH 493 (502) T ss_pred HHHH----HHhCC-----CCCCHHHHHHHHHHHHHhhhhhcc-cccccccccccCCCccCCC Confidence 1222 23333 122 24566666654433211111 1111222233344444455 No 109 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=95.64 E-value=0.0017 Score=35.77 Aligned_cols=422 Identities=10% Similarity=0.013 Sum_probs=176.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccc---cCCCC--CCccccccccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL---MVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~---~~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) ....+.+..++- .....++|+++.+|....- +.... 
........++-.+.+...++..++-|++- |+ T Consensus 23 ~~~~i~~li~~~-~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~--p~----- 94 (506) T protein:vir:94 23 TPNKIMKFITHH-FNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGN--PI----- 94 (506) T ss_pred CHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhccc--Cc----- Confidence 122222222111 1122346677777765421 11110 11111223556677777777777666542 22 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC-C--cEEEEEece Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~-~--~~~~~pl~~ 152 (510) .++..++. .. ..+...+..++|.....++.++...+|.+.+++..+. + ++.+++..+ T Consensus 95 ~~~~~d~~-------------~~-------~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~ 154 (506) T protein:vir:94 95 NVKLPDDG-------------SN-------SGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLD 154 (506) T ss_pred eeecCcch-------------HH-------HHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccc Confidence 22333221 11 1234446678999999999999999999876554322 2 355555544 Q ss_pred -EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEe-cCeeeccc Q lcl|NC_012418. 153 -YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEI-DGVRVGEE 229 (510) Q Consensus 153 -~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~-~~~~~~~~ 229 (510) |++--|. .+++.-.+|.+...-.. .+.......++.++ .+.. ...|-.. .+..+... 
T Consensus 155 ~~~v~dd~~~~~~~~~v~~~~~~~~~---------------~~~~~~~~~~~~~y-t~~~----~~~~~~~~~~~~~~~~ 214 (506) T protein:vir:94 155 TFVIYSTDVDPKPIMAVRYHQIELVD---------------DNQVSTINYVPETW-TADT----YTLYNPTPIMGKMQVD 214 (506) T ss_pred eEEEecCCCCCceEEEEEEEeeeecc---------------CCceeEEEEEEEEE-eCce----EEEeccccCccceecc Confidence 4444333 45665555555432111 11111111122222 1111 1111110 11111111 Q ss_pred cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccch------------- Q lcl|NC_012418. 230 GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV------------- 296 (510) Q Consensus 230 ~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p------------- 296 (510) ...++..+|++.++= ...|.|--+...+-+-.++.+.-..+...+...+|.+++.-...... T Consensus 215 ~~~~~g~vPvv~~~n-----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 289 (506) T protein:vir:94 215 TTKPITTFPVVEFKN-----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPND 289 (506) T ss_pred ccccCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccc Confidence 222345688876532 34577888888888888888777777776666666555421100000 Q ss_pred ------------h---hhccC-----CCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCC Q lcl|NC_012418. 297 ------------D---DYQDA-----EMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAE 354 (510) Q Consensus 297 ------------~---~~~~~-----~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~ 354 (510) . .+... ..+....|...+..+-.+....+.+.....++.+.+.|...-.. +.. ...+. 
T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 369 (506) T protein:vir:94 290 EDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFAS 369 (506) T ss_pred cccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc Confidence 0 00000 00000111111122222223345667777787777777543221 111 11123 Q ss_pred CCCHHHHHH-------HHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceee--ecHHHHHHHHH Q lcl|NC_012418. 355 RVTAEEVRI-------TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE--TGLPALSRSAA 424 (510) Q Consensus 355 ~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v--~~is~L~raq~ 424 (510) ..|+..+.. ++.++...++..+. .+++.++.++.. .+........+.+..- .+.+.++.++- T Consensus 370 n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~--------~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~ 441 (506) T protein:vir:94 370 NSSGVAMQYKVLGTVELASTKRRMFERGLY--------ARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKA 441 (506) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHH Confidence 456665543 34444555444333 344444455432 2222222223333321 22333333332 Q ss_pred HHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhh Q lcl|NC_012418. 425 VQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMT 504 (510) Q Consensus 425 ~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~ 504 (510) +.++ +++ |....++..+ -+++ -.++|++++.++++.+.... ......+..... T Consensus 442 ~~kl----------~g~------iS~et~~~~l---p~v~-----d~~~E~~ri~~E~~~~~~~~---~~~~~~~~~~~~ 494 (506) T protein:vir:94 442 LVQA----------GAT------LPQKYLYQQL---PGVT-----NPQDIVDMMKEQSANGDYSF---DQNGVISNDGQT 494 (506) T ss_pred HHHH----------hcc------CChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHhhcc---hhhcCCCcccCc Confidence 2221 110 1112222221 1222 12355565554443222211 111112222233 Q ss_pred hcccCC Q lcl|NC_012418. 505 NALAGV 510 (510) Q Consensus 505 ~~~ag~ 510 (510) ..+++. 
T Consensus 495 ~~~~~~ 500 (506) T protein:vir:94 495 NTTATQ 500 (506) T ss_pred cccccc Confidence 334444 No 110 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=95.24 E-value=0.0025 Score=34.88 Aligned_cols=412 Identities=10% Similarity=0.041 Sum_probs=170.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcc-----cc-cCCCC-----------CCccccccccccchHHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLP-----YL-MVDPM-----------SGSRGVVEHDFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP-----~~-~~~~~-----------~~~~~~~~~~~dstg~~a~~~LAa~l 63 (510) +++.+.....+.+ .-..++.++.+|..= .+ ...+. ........++..+.+...++..++-| T Consensus 6 ~~~~i~~~~~~~~--~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl 83 (471) T protein:vir:10 6 IKKIISSQMVKHG--KFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKKAYA 83 (471) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhhhhh Confidence 3333333322221 122344444444321 00 00000 00111122455666666666666555 Q ss_pred HHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCC Q lcl|NC_012418. 64 ARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSD 141 (510) Q Consensus 64 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~ 141 (510) .+ -|+. ++..++. ..+.| ..+..++|.....++.++...+|.+.+ |.++. 
T Consensus 84 ~G--~p~~-----~~~~~~~-------------~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~ 135 (471) T protein:vir:10 84 LT--YPPT-----FDVDDKK-------------VNDMI--------VDVLGDDYERISKQLCVNAGNAGIAWLHVWKDAS 135 (471) T ss_pred cc--cCce-----eccCChH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHhhCCeEEEEEEeeCC Confidence 43 2322 2333221 22222 112346888999999999999998765 44533 Q ss_pred CC--cEEEEEeceEEEeeCC--CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEE------EEEEeecCCC Q lcl|NC_012418. 142 EA--TVVAWSLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLY------THVQRKKGTA 211 (510) Q Consensus 142 ~~--~~~~~pl~~~~i~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~------~~v~~~~~~~ 211 (510) .+ ++.+++..+.++-.|. .+++...+|.+...... ..+....+++| +.+.... T Consensus 136 ~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~--------------~~~~~~~~~vy~~~~~~~y~~~~~--- 198 (471) T protein:vir:10 136 DNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDET--------------DGKNYTVYEYWNDKECSFYRHEKE--- 198 (471) T ss_pred CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccC--------------CCceeEEEEEEeCCcEEEEEecCC--- Confidence 34 3555655554433333 45676666665432211 01111122333 2111111 Q ss_pred ceEEEEEE--------EecCe-eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 212 MEYAELYH--------EIDGV-RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESL 282 (510) Q Consensus 212 ~p~~sv~~--------e~~~~-~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~ 282 (510) .....+.. -.+|. ........++..+|++.++. +.+|.|=.+...+-+-.++.+.-......+... 
T Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 273 (471) T protein:vir:10 199 KPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQ 273 (471) T ss_pred cccccccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 10010000 00111 11111122345688876654 457888888999999999988888888888889 Q ss_pred CCceeeCCC-cccchhhhccC-CCcee-ecCC--cccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccCCCCCCC Q lcl|NC_012418. 283 EVLNLVDEA-KGAVVDDYQDA-EMGDY-VPGG--AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERV 356 (510) Q Consensus 283 ~p~~l~~~~-g~~~p~~~~~~-~~g~~-~pg~--~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~~ 356 (510) +|.+++.-. +....+..... ..+.+ .++. ....++-.+....+.+.....++.+++.|-..-.. +...-..... T Consensus 274 ~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~ 353 (471) T protein:vir:10 274 EVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNS 353 (471) T ss_pred CceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCc Confidence 987666321 11112211111 11222 2211 11112222222345677788888888887554322 1111111234 Q ss_pred CHHHHHHH-------HHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee--ecHHHHHHHHHHHH Q lcl|NC_012418. 357 TAEEVRIT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQS 427 (510) Q Consensus 357 TAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v--~~is~L~raq~~~~ 427 (510) |+.-+..+ +.++...++..+.++ ++.+..++. ... ..++.+... 
.+.+....++-+.+ T Consensus 354 Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~--------~~li~~~~~---~~d--~~~i~i~f~~~~p~n~~e~~~~~~k 420 (471) T protein:vir:10 354 SGVALKFLYSLLELKAGNMETQFRSGYATL--------VKMILKHLG---LSD--KLKIKQTWTRNSINNDTEMAQVVST 420 (471) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhc---cCC--CceeEEEeCCCCCCCHHHHHHHHHH Confidence 55444322 444444444443332 222222221 111 123333221 12222222221111 Q ss_pred HHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhc Q lcl|NC_012418. 428 MLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-EEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 428 ~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs-~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~ 506 (510) ++++ |.-..++ ..++ .+.+ ++|++++.++++.++++.. .+ +....+.. T Consensus 421 ----------l~g~------iS~et~~----~~~p-----~v~D~~~E~eri~~E~~~~~~~~~----~~--~~~~~~~e 469 (471) T protein:vir:10 421 ----------LATI------TSRENVA----KSNP-----IVEDWQDELRLQKAEQEGRSEKLY----DM--EEVEHESE 469 (471) T ss_pred ----------Hhcc------CchHHHH----HhCC-----CCCCHHHHHHHHHHHHHHHHhccc----cc--CCCCCccc Confidence 1110 1111222 2222 1222 4566665554433222111 11 11111111 Q ss_pred cc Q lcl|NC_012418. 507 LA 508 (510) Q Consensus 507 ~a 508 (510) +- T Consensus 470 ~~ 471 (471) T protein:vir:10 470 VE 471 (471) T ss_pred cC Confidence 11 No 111 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=95.05 E-value=0.0029 Score=34.51 Aligned_cols=469 Identities=13% Similarity=0.071 Sum_probs=178.6 Q ss_pred ChhHHHHHHHHH-h--hccchHHHHHHHHHhcc-ccc-CCCCCCccccccccccchHHHHHHHHHHHHHHh-hcCcCCcc Q lcl|NC_012418. 1 MKSTAAMLWEKL-R--DGSVEQRAIEFAKTTLP-YLM-VDPMSGSRGVVEHDFQSAGALLVNNLAAKLARS-LFPTGIPF 74 (510) Q Consensus 1 ~~~~~~~r~~~l-k--r~~~~~~w~e~~~~~lP-~~~-~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~-ltpp~~~W 74 (510) ==+++.++|.+. + |..|+ .|++-.+.+-= +.. ....+....+ -++|-|+-. 
++--.|.+- =-|.-+|= T Consensus 12 tpe~la~~W~~~I~~a~~~~~-~~h~r~~~~~k~y~~~~~~~~~~~~r-~nl~~sni~----~i~P~iYar~P~p~V~~r 85 (663) T protein:vir:34 12 TPQGWAQRWQEEMSAAREPLE-KWHTQGKEIVKRYRDERDSAHDAETR-WNLFSTNIQ----TQMASLYGQTPKVSVSRR 85 (663) T ss_pred cchhHHHHHHHHHHHHHhccc-hHHHHHHHHHHHhhccccCCCccccc-cchhhhhHH----HHhhhhhcCCCcceeeec Confidence 224688899754 3 44444 44444443332 211 1111111111 122222211 000011100 01111222 Q ss_pred cccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHH--HhcCCHHHHHHHHHHHHh--hCeE-EEEEeC--------- Q lcl|NC_012418. 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRL--FQNASLAVLTQVIKLLIV--TGNA-LLYRNS--------- 140 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l--~~snfy~~~~~~~~dl~~--~G~~-~l~~~~--------- 140 (510) |.-. |. .-.+..-+.+|+.+...| +..+|+..+..+..+.+. ||++ +.|..+ T Consensus 86 f~d~--d~------------~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~ 151 (663) T protein:vir:34 86 FADA--DD------------DVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDA 151 (663) T ss_pred ccCc--cc------------chhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccccc Confidence 2211 10 013444455566555556 447799999999998654 4543 445321 Q ss_pred ----CCC----------------c--EEEEEeceEEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhh-------- Q lcl|NC_012418. 141 ----DEA----------------T--VVAWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAG-------- 189 (510) Q Consensus 141 ----~~~----------------~--~~~~pl~~~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~-------- 189 (510) ..+ + +..+.-.+|.+.-- .--.|+=|.++-.||-+++.+.|+.+.-+.. 
T Consensus 152 ~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~~~~~ 231 (663) T protein:vir:34 152 ILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFDADGSRNLWASVPKVG 231 (663) T ss_pred cCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhcCChhhhhhhhccCcC Confidence 000 1 11121122211110 0126888999999999999998865442110 Q ss_pred --hc-c-----CCCceEEEEEEEEeecCCCceEEEEEEEecCeee-c-----cccccccccCceEEEeeeecCCCccccc Q lcl|NC_012418. 190 --RN-L-----SGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-G-----EEGRWPIHLCPYIVPTWNLAPGEHYGRG 255 (510) Q Consensus 190 --~~-~-----~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~-----~~~~y~~~~~P~~~~Rw~~~~g~~YGrg 255 (510) .+ + ++..+..|+.-+..+..+ |||-++|..+ + .++--+|.-||+...-....++-+=+-. T Consensus 232 ~~~~~~~~~~~~~~~~a~VwEIWdK~~~~------V~w~~eg~~~~L~~~~p~lgl~~ffPcPrpl~~~~~~ds~ipvpd 305 (663) T protein:vir:34 232 KPKDGKDGQSCHPWDRAEVWEIWDKGGRK------VDWYVEGYSAVLDTQPDPLGLESFFPCPKPLLANWTTDKVVPRPD 305 (663) T ss_pred CccccCCCCCcchhcCcceeEEEecCCcE------EEEEEcCcceecccCCCCCCCCCCCCCcccccceecCCCeecCCc Confidence 00 0 112355666665555443 5555555432 1 1222233458888776666665444444 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccc-hhhhccCCCceeecC-------Cc----ccccccccCcc Q lcl|NC_012418. 256 HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV-VDDYQDAEMGDYVPG-------GA----EAVRAYERGDY 323 (510) Q Consensus 256 p~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~-p~~~~~~~~g~~~pg-------~~----~~v~~~~~~~~ 323 (510) .+ -+=.=++.+|.++..+-.. ..+++|.++++-+.... -+.+..+..+.++|= .. ..+.-+++.. 
T Consensus 306 ~~-~y~~~~~E~n~~t~Rin~l-~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~- 382 (663) T protein:vir:34 306 FV-LAQDLYKEIDLVSTRITLL-ERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPLEP- 382 (663) T ss_pred HH-HHHHHHHHHHHHHHHHHHH-HhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcCccchhhcccchh- Confidence 44 6667778888887766554 56789999986332211 133444555566551 11 1121122111 Q ss_pred cchHHHHHHHHHHHHHHHHHhh-cccc---CCCC----CCCCHHHHHHHHHHHHHHhchhHhHHHHHHH---HHHHHHHH Q lcl|NC_012418. 324 NKMAAIQQSLQAVVVRLNQAFM-YGAN---QRDA----ERVTAEEVRITAEEAENTLGGTYSLLAENLQ---SPLAYVCL 392 (510) Q Consensus 324 ~~~~~~~~~i~~~~~~I~~af~-~~~~---~~~~----~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l---~Pli~r~~ 392 (510) ...+...+-+.+..|+...+ .+++ .|+. +..||.+|.. +.++.-+...++|+. .-++.-.- T Consensus 383 --~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKs------q~gS~RIqe~qdevqR~arDi~ql~A 454 (663) T protein:vir:34 383 --VVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKA------KFGSIRLQRLQDEVARFASDIQRLKA 454 (663) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHH------HHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222333444444443 3332 2322 3345555522 233333333333311 11111111 Q ss_pred HHH------------hhcCCCCC---Cc-------cccccee--ee-----cHHHHHHHHHH-HHHHHHHHHHHhhcChh Q lcl|NC_012418. 393 SEV------------DDALLQGL---IT-------KQHKPAI--ET-----GLPALSRSAAV-QSMLNASQVIAGLAPIA 442 (510) Q Consensus 393 ~il------------~~~~l~~~---~~-------~~~~~~~--v~-----~is~L~raq~~-~~~~~~~q~l~~~~~~~ 442 (510) .|| ....++-. .+ ..++... |. --..++.-+.. +-+..+..+++.+.++. 
T Consensus 455 EIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~ 534 (663) T protein:vir:34 455 EVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLA 534 (663) T ss_pred HHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 121 11112210 00 1122211 11 11223332322 22233333343333322 Q ss_pred hHhhc-------------------cCHHHHHHHHHHHcC------CCHhHccCCHHHHHHHH--HHHHHHHHHHHHhhHH Q lcl|NC_012418. 443 QLDPR-------------------ISLPKMMDTIWAAFS------VDTSQFYKSEEELQAEA--EQRRQQAAQAQAAQET 495 (510) Q Consensus 443 q~~~~-------------------id~d~~~~~~a~~~G------vp~~~i~rs~~ev~~~r--~q~~q~~~~~~~~~~~ 495 (510) +.-|. .+.+.+++.+.++.- .++. . .++.-++.+ +|...+... +.++.+ T Consensus 535 ~q~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~-p--a~~~~~~k~~~~q~k~q~~~-aeAq~e 610 (663) T protein:vir:34 535 QQVPGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQS-P--APQQPDPKVVAQAMKGQQEM-AKVQAE 610 (663) T ss_pred HhhhhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCC-c--ccchhhHHHHHHHHHHHHHH-HHHHHH Confidence 21111 133333333332221 1110 0 111111111 110000000 000000 Q ss_pred Hh--hhhhhhh------------------hcccCC Q lcl|NC_012418. 496 LL--EGASDMT------------------NALAGV 510 (510) Q Consensus 496 ~~--~ga~~~~------------------~~~ag~ 510 (510) ++ .-..+.. ..-+++ T Consensus 611 ~q~~~~~~ql~~~~~~~k~~~~a~~~~~~a~q~~~ 645 (663) T protein:vir:34 611 VQGDLLRIQAETQANETKERQQAEWNVREAAQKNL 645 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH Confidence 00 0000001 111111 No 112 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=94.43 E-value=0.0044 Score=33.48 Aligned_cols=444 Identities=11% Similarity=0.008 Sum_probs=152.5 Q ss_pred ChhHHHHHHHHHh----------------------------hcc-chHHHHHHHHHhcccccCCCCCCccc--cc-cccc Q lcl|NC_012418. 
1 MKSTAAMLWEKLR----------------------------DGS-VEQRAIEFAKTTLPYLMVDPMSGSRG--VV-EHDF 48 (510) Q Consensus 1 ~~~~~~~r~~~lk----------------------------r~~-~~~~w~e~~~~~lP~~~~~~~~~~~~--~~-~~~~ 48 (510) =|+...++|..+- |+. +.+...+..+...|+++.... .+.. +. .... T Consensus 39 ~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~-p~~~wf~~~p~~~ 117 (641) T protein:vir:94 39 KRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAYFKGATF-PSDDWFDLKGMVP 117 (641) T ss_pred hhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhHHhhhhc-CCCceEEEecCCC Confidence 2333334443221 111 334445566666665433211 0100 00 0011 Q ss_pred -cchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHH------ Q lcl|NC_012418. 49 -QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVL------ 121 (510) Q Consensus 49 -dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~------ 121 (510) |..+++.++.. +...+. -..|++.++..+...+...|.+..+ T Consensus 118 ed~~~A~~~~~~----~~~~l~---------------------------~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~ 166 (641) T protein:vir:94 118 ELADAARVVKQL----TKTKLE---------------------------AASIRDIFETYVRNLVLYGVSTYRLGWDTSM 166 (641) T ss_pred ChHHHHHHHHHH----HHHHHh---------------------------hcchHHHHHHHHHHHhhcCceEEEeehhhHH Confidence 11222211110 000000 0123444444555555555532221 Q ss_pred HHHHHHHHhhCeE--------------------------EEEEeCCCC----cEEEEE-----eceEEEeeCCCC--CeE Q lcl|NC_012418. 122 TQVIKLLIVTGNA--------------------------LLYRNSDEA----TVVAWS-----LRSYAVRRDATG--RWM 164 (510) Q Consensus 122 ~~~~~dl~~~G~~--------------------------~l~~~~~~~----~~~~~p-----l~~~~i~~d~~G--~vd 164 (510) .+...+ +..|+. -+|.|+... .|..+- +.+. 
+.+...| .++ T Consensus 167 ~~~~~~-~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l-~~eg~~~~d~v~ 244 (641) T protein:vir:94 167 ERQFKR-TFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHEL-VTSGYYDLDLTQ 244 (641) T ss_pred HHhhhh-hcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHH-HhcCCCChhhcc Confidence 112122 111211 124443221 122111 1111 1110011 111 Q ss_pred EE-EEEEeecHHHhhHHhh-Hh-----hhhhhhc-cCCCceEEEEEEEE-----ee-cC----CCceEEEEEEEecCeee Q lcl|NC_012418. 165 DI-VLKQRYKSKDLDEAYK-QD-----LMRAGRN-LSGSGSVDLYTHVQ-----RK-KG----TAMEYAELYHEIDGVRV 226 (510) Q Consensus 165 ~i-~r~~~~t~~~l~~~~~-~~-----~~~~~~~-~~~~~~v~i~~~v~-----~~-~~----~~~p~~sv~~e~~~~~~ 226 (510) .. ...++.+-.+-..++. .+ +...+.+ ...+..+.-||... .+ .+ ..+||....|..... T Consensus 245 ~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~-- 322 (641) T protein:vir:94 245 VEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRD-- 322 (641) T ss_pred hhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCC-- Confidence 00 0011111101001110 00 0000000 00111222222211 11 11 235776665553322 Q ss_pred ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH-HHHH-----HHhhCCc-eeeCCCcccchhhh Q lcl|NC_012418. 227 GEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKL-GLYE-----LESLEVL-NLVDEAKGAVVDDY 299 (510) Q Consensus 227 ~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~-l~~~-----~~a~~p~-~l~~~~g~~~p~~~ 299 (510) +.||. -|.. . --|-...|-.+.....-.-.+ +... +..++|. +-..|+++...... 
T Consensus 323 ---~~YG~--gp~~----------~--~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~ 385 (641) T protein:vir:94 323 ---SVYGM--SVLH----------P--NLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQH 385 (641) T ss_pred ---cccCC--ChHH----------H--HHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCC Confidence 22331 1210 0 114444444433322211111 1111 1122343 22346665433211 Q ss_pred ccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhh------ccccCCCCCC--CCHHHHH-------HH Q lcl|NC_012418. 300 QDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM------YGANQRDAER--VTAEEVR-------IT 364 (510) Q Consensus 300 ~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~------~~~~~~~~~~--~TAtEi~-------~r 364 (510) ...+-+.+|..+. ...++.+...-..++.+..-.++ -++....+.- +-..|.. .+ T Consensus 386 --~~v~pl~~~~~~~--------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~ 455 (641) T protein:vir:94 386 --GSLQPIDMGRQDF--------VVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTH 455 (641) T ss_pred --CcceeecCCcccc--------chhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHH Confidence 1112223343322 11233333333445555544432 2222112211 1122222 12 Q ss_pred -HHHHHH-HhchhHhHHHHHHHHHHHHHHHHH----HhhcCC-CCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_012418. 365 -AEEAEN-TLGGTYSLLAENLQSPLAYVCLSE----VDDALL-QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAG 437 (510) Q Consensus 365 -~~E~~~-~LGpv~~rl~~E~l~Pli~r~~~i----l~~~~l-~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~ 437 (510) .+|.+. +|+++|++++.+++.|++.|+|++ -.--.+ +.-...+++...+.....+.++++++++..+.+.++. 
T Consensus 456 l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~ 535 (641) T protein:vir:94 456 IEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGR 535 (641) T ss_pred HHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhc Confidence 234444 799999999999999999999986 111111 1122346666789999999999999999999999887 Q ss_pred hcChhhH-hhccCHHHHHHH-----HHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHH-hh-----------HHHhhh Q lcl|NC_012418. 438 LAPIAQL-DPRISLPKMMDT-----IWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQA-AQ-----------ETLLEG 499 (510) Q Consensus 438 ~~~~~q~-~~~id~d~~~~~-----~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~-~~-----------~~~~~g 499 (510) ..++.+. +.......+++. .+..+..+ . - .-+...++.+++++++.+++|. ++ +.+..+ T Consensus 536 ~P~v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~-~-~-~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~ 612 (641) T protein:vir:94 536 VPQIGQSLDYALILEDLLRQMRFTDPMRYIKKA-E-A-PPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSD 612 (641) T ss_pred ChhhhhcCCHHHHHHHHHHHhCCCCchhhccCc-c-C-chhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHH Confidence 7765432 111111112211 12222222 1 0 1111111122122221111111 00 000001 Q ss_pred hh---------hhhhcccCC Q lcl|NC_012418. 500 AS---------DMTNALAGV 510 (510) Q Consensus 500 a~---------~~~~~~ag~ 510 (510) +. .+..+.|.- T Consensus 613 ~~~~~~~~~~~~~~~~~~~~ 632 (641) T protein:vir:94 613 LASRIGIDTSDVAPEAMAAA 632 (641) T ss_pred HHHhhcCCchhhhHHHHhcc Confidence 11 111111111 No 113 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=93.44 E-value=0.0075 Score=32.22 Aligned_cols=386 Identities=11% Similarity=0.005 Sum_probs=173.5 Q ss_pred ChhHHHHHHH-HHhhccchHHHHHHHHHhcccc-cCCCCCCcccccc---ccccchHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 
1 MKSTAAMLWE-KLRDGSVEQRAIEFAKTTLPYL-MVDPMSGSRGVVE---HDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~~~~~~~r~~-~lkr~~~~~~w~e~~~~~lP~~-~~~~~~~~~~~~~---~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) |-.++..+.. ++.+ ..++.+.+.+|..-.. ...-+..-..... +..-+-...+++.||..|. .-+ | T Consensus 1 m~~~~i~~L~~~~~~--~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~----~~G---f 71 (422) T protein:vir:97 1 MNYMGMGYLRRKLAL--FKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRII----FRE---F 71 (422) T ss_pred CChHHHHHHHHHHHH--HHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccc----cce---e Confidence 7665554443 3321 1223344455543221 0000000011111 1222344555555554321 111 1 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCCC--cEEEEEec Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA--TVVAWSLR 151 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~~--~~~~~pl~ 151 (510) ..+|. + +++...++++....+++.++..++|.+.+++. ++.+ +++++|.. T Consensus 72 --~~~d~-------------~-----------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~ 125 (422) T protein:vir:97 72 --TNDDF-------------N-----------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEAS 125 (422) T ss_pred --eCCch-------------h-----------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechh Confidence 11111 1 23446778999999999999999999888774 3222 36666655 Q ss_pred eEEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccc Q lcl|NC_012418. 152 SYAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEG 230 (510) Q Consensus 152 ~~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~ 230 (510) +..+..|+ .+++...++++.. +.+......+ +++ +. ..+|+..++.....+. 
T Consensus 126 ~~~~i~D~~~~~~~~a~~~~~~--------------------~~~~~~~~~~-~~~-~~-----~~~~~~~~~~~~~~~~ 178 (422) T protein:vir:97 126 KATGILDPTTFLLTEGYAILES--------------------DSNGNPTLEA-YFT-DK-----DIWYYPKKGKPYNIKN 178 (422) T ss_pred hEEEEEeCCCCcceeeEEEEEe--------------------cCCCcEEEEE-EEc-Cc-----eEEEEcCCCccccccC Confidence 54333343 3433333322211 0111111111 111 11 1112222222222334 Q ss_pred ccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCcccchhhhccCCCce Q lcl|NC_012418. 231 RWPIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMGD 306 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~---~~~g~~~p~~~~~~~~g~ 306 (510) ++ +.+|++.+..+...++.||+|-. +..++=+..+|...-..+..++..+.|...+ +++|... +..+ ...+. T Consensus 179 ~~--g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~-~~~~-~~~~~ 254 (422) T protein:vir:97 179 PT--GHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPM-EKWR-ATVST 254 (422) T ss_pred CC--CCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccC-chhh-hhhhh Confidence 44 45899999999999999999976 5688889999999888889999988886554 2333211 1111 11111 Q ss_pred e--ecCCcc--cccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHHH-------HHHHHHHH Q lcl|NC_012418. 307 Y--VPGGAE--AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEV-------RITAEEAE 369 (510) Q Consensus 307 ~--~pg~~~--~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtEi-------~~r~~E~~ 369 (510) + +|...+ .++.-++. .++++. -++.++.-|+....... +..+..+ -+|.-| ..+.++|. 
T Consensus 255 i~~~~~de~~~~~~v~q~~-~~~l~~---~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~ 330 (422) T protein:vir:97 255 LLEISKDEDGDKPTVGQFT-TASMAP---FMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQ 330 (422) T ss_pred hhccCCCCCCCcceeeecC-CCChhH---HHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 2 232211 12222222 234443 34455555544333221 1111111 234332 44567777 Q ss_pred HHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceee-----ecHHHHHHHHHHHHHHHHHHHHHhhcChhhH Q lcl|NC_012418. 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE-----TGLPALSRSAAVQSMLNASQVIAGLAPIAQL 444 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v-----~~is~L~raq~~~~~~~~~q~l~~~~~~~q~ 444 (510) ..+|..+.++. +.++.+. ++.... +.......+ ...+....||.+..+.-+.+. ..++ T Consensus 331 ~~fg~~l~~~~--------rla~~~~--~~~~~~-~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a---~~~~--- 393 (422) T protein:vir:97 331 RSFSSGFLNVA--------YIAVCLR--DEFPYL-RNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQA---IPGF--- 393 (422) T ss_pred HHHHHHHHHHH--------HHHHHHh--cCCccc-chhhccceEEEccCCCCChHHHHHHHHHHHHHHhh---cccc--- Confidence 88887776542 2223232 333333 233222222 245666666666555543332 2121 Q ss_pred hhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHH Q lcl|NC_012418. 445 DPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQ 485 (510) Q Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~ 485 (510) .+.+. +.+.+|+.. .+++.....++.+.. 
T Consensus 394 ---~~~~~----~~~~lg~~~-----~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 394 ---MDADV----IRDLTGVKG-----ADKPIPAITEVTTDG 422 (422) T ss_pred ---ccHHH----HHHHcCCCc-----hhHHHHHHHhhhccC Confidence 22222 334456542 122222222221111 No 114 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=93.14 E-value=0.0086 Score=31.90 Aligned_cols=427 Identities=10% Similarity=0.022 Sum_probs=170.0 Q ss_pred ChhHHHHHHHHHhhccchH--HHHHHHHHhcccccCCCC-CCccccccc-cccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQ--RAIEFAKTTLPYLMVDPM-SGSRGVVEH-DFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~--~w~e~~~~~lP~~~~~~~-~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) -=..+..+|+..+.. +.. .|..=..| +|+.-...+ +.-..++.+ .|=+.-.+.++.++ +.+|- ..|++. T Consensus 17 ~y~a~~~~W~~ird~-~~G~~~~~~r~~y-l~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~----G~vfr-k~p~~~ 89 (489) T protein:vir:78 17 EWLHYAPKWQKVRHA-LAGELVSYLRNVG-LNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMV----GSVMR-KEPEIN 89 (489) T ss_pred HHHHHHHHHHHHHHH-hcCcccccccCCC-CCCCCCCCChHHHHHHHhccccCChHHHHHHHHh----chhhc-CCccee Confidence 111233445443311 000 00000011 222100010 111112222 23333344555444 33332 234442 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCc------------ Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEAT------------ 144 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~------------ 144 (510) +++ .++.++++| -....+.+.-+...+.+...+|-+.+++|-+... 
T Consensus 90 --~p~--------------~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~ 147 (489) T protein:vir:78 90 --IPK--------------ELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLL 147 (489) T ss_pred --ccH--------------HHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcC Confidence 221 244555555 2456778888899999999999999999865331 Q ss_pred ---EEEEEeceE---EEe-eCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEE Q lcl|NC_012418. 145 ---VVAWSLRSY---AVR-RDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAEL 217 (510) Q Consensus 145 ---~~~~pl~~~---~i~-~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv 217 (510) +..|+-.+. -.. .|..+++.-+..+++..+++=+..|+ .+.++.|.++.+..++.. ...+ T Consensus 148 rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~------------~~~~~q~RvL~~~~~g~~-~~~~ 214 (489) T protein:vir:78 148 NPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFE------------TKYGEQYRVLDIDSDGNY-RQRL 214 (489) T ss_pred CcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCcc------------ceeEEEEEEEecCCCcce-EEEE Confidence 555655443 122 24444566666677655544223332 334455555554333211 1122 Q ss_pred EEE-ecCee------eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHH---HHHHHHHH-HHhhCCce Q lcl|NC_012418. 218 YHE-IDGVR------VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLL---SEKLGLYE-LESLEVLN 286 (510) Q Consensus 218 ~~e-~~~~~------~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l---~~~~l~~~-~~a~~p~~ 286 (510) |.+ .+|.. 
+....+ .+-+++|++.|--..+..+..| .--|=|+..||.- +.+-++.+ -.+.-|.+ T Consensus 215 ~r~~~~g~~~~~~~~~~~~~g--~~~l~~IPfv~~~~~~~~~~~~--~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l 290 (489) T protein:vir:78 215 FRFDAEGGAQEDVVEIYPDLG--ESLRGVIPFTFIGATNNDATID--DAPLLPLAELNIGHYRNSADNEESSFVVGQPTL 290 (489) T ss_pred EEeecCCcccceeeEEeccCC--CCccCeeeEEEEecCCCCCCCC--cCchHHHHHHHHHHhhhhhHHHHHHHHccccee Confidence 222 22211 111222 2457888888887666655543 1113355555543 33334433 44444544 Q ss_pred eeCC-C-------cccchhhhccCCC-ceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCC Q lcl|NC_012418. 287 LVDE-A-------KGAVVDDYQDAEM-GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVT 357 (510) Q Consensus 287 l~~~-~-------g~~~p~~~~~~~~-g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~T 357 (510) .+.. + +..++..+..+.+ +...| ..++.+.++... ...+.+.+.+++++..++=. .+.... .+.| T Consensus 291 ~i~G~d~~~~~~~~~~~~~~i~~g~~~~~~lp-~~~~~~~ie~~~---~~~~r~~l~~le~qm~~lGa-~l~~~~-~~~T 364 (489) T protein:vir:78 291 FIYPGENLTPQAFKEANPNGIKFGSRRGHNLG-YGGSAQLIQAGE---NNLARQNMLDKEQQAIQIGA-QLITPT-QQIT 364 (489) T ss_pred eeecCccCCcccccccCccceeeCCcccccCC-CCCCcceeccCc---chHHHHHHHHHHHHHHHHhh-hhccCC-cchh Confidence 3321 1 0111111211111 11111 111222333221 12346667777766654321 122322 3589 Q ss_pred HHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCccccccee-eec-HHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 358 AEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAI-ETG-LPALSRSAAVQSMLNASQV 434 (510) Q Consensus 358 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~-v~~-is~L~raq~~~~~~~~~q~ 434 (510) ||+...+...--..|..+...+.+- +.+++.++.+ -|+.. +..+...+ ..+ +..+ -++.+..+ ... 
T Consensus 365 a~~~~~~~~~~~S~L~~~a~~~e~a-----l~~~l~~~a~w~G~~~--~~~~~i~~n~dF~~~~~-d~~~~~al---~~~ 433 (489) T protein:vir:78 365 AQSARIQRGADTSVMATIARNVSQA-----YTDALRWVAVMLGKPE--DTEVEFRLNMDFFLEPM-TAQDRAAW---MAD 433 (489) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHH-----HHHHHHHHHHHcCCCC--CCceEEEeecccCcccC-CHHHHHHH---HHH Confidence 9999999999999999888877653 3455555433 22221 12222111 111 1111 11222221 111 Q ss_pred HHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhh Q lcl|NC_012418. 435 IAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTN 505 (510) Q Consensus 435 l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~ 505 (510) .. ++ .|..+.+.+++ ...||.. .++++++.+-+.+.. -....-.-..-+++-+++. T Consensus 434 ~~--~G------~is~~t~~~~L-~~~gv~d----~~~e~~~~ei~~~~~--~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 434 IN--AG------LLPATAYYAAL-RKAGVTD----WTDADIKDAVADQPL--PVATEVQGEIPQSAQQQEK 489 (489) T ss_pred Hh--cC------CCCHHHHHHHH-HhCCCCC----ccHHHHHHHHhhcCC--CcccCCcccCCCCcccccC Confidence 11 11 12222223322 3334431 122222211111000 0000000000011111111 No 115 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=93.13 E-value=0.0087 Score=31.89 Aligned_cols=427 Identities=8% Similarity=-0.061 Sum_probs=177.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc---ccCCC-CCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDP-MSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~---~~~~~-~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) -...+.+..++- +..-.++++++.+|.... ..... .........++..+.+...++..++-|++- | +. 
T Consensus 40 ~~~~l~~~i~~~-~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~------p-~~ 111 (501) T protein:vir:27 40 NWELLKNFINHH-KLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGN------P-IR 111 (501) T ss_pred cHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhccc------C-ee Confidence 001111111100 111123455555665431 11111 111112223566677777777777666532 1 12 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCCC-cEEEEEece- Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA-TVVAWSLRS- 152 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~~-~~~~~pl~~- 152 (510) ++..+... ...+. ..+......++|.....++.++..++|.+.+++. ++.. +++.++..+ T Consensus 112 ~~~~d~~~---------~~~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~ 175 (501) T protein:vir:27 112 VEYDDNDN---------NSQND-------DTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLET 175 (501) T ss_pred EecCCccc---------hHHHH-------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEcccee Confidence 23322211 11122 2344456778999999999999999999876654 3322 356665544 Q ss_pred EEEeeCC-CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC-eeecccc Q lcl|NC_012418. 153 YAVRRDA-TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG-VRVGEEG 230 (510) Q Consensus 153 ~~i~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~-~~~~~~~ 230 (510) |++--|. .+++...+|.+...... +....++||+ ++ ..+++..++ ....... 
T Consensus 176 ~~v~d~~~~~~~~~~ir~~~~~~~~----------------~~~~~~~vyt-----~~-----~v~~~~~~~~~~~~~~~ 229 (501) T protein:vir:27 176 FVIYDNSLEDNSIAAVRYYNRGTLQ----------------NAKDVVEIYT-----NE-----HIYTLDASDDFNEISVT 229 (501) T ss_pred EEEecCCCCCceEEEEEEEEeeecC----------------CcEEEEEEEe-----CC-----eEEEEEeCCceeecccc Confidence 5554443 45665555555321110 1111222321 11 112233222 1121222 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCccc-chhhhc-cCCCceee Q lcl|NC_012418. 231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGA-VVDDYQ-DAEMGDYV 308 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~-~p~~~~-~~~~g~~~ 308 (510) ..++..+|++.++ +...|+|-....++-+..++.+.-......+...+|.+++.-.... ..+... ....+.+. T Consensus 230 ~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~ 304 (501) T protein:vir:27 230 THAFGTVPITEFL-----NNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQ 304 (501) T ss_pred ccCCCcccEEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCcee Confidence 2234678988764 3467999999999999999998888888888888887765311110 000000 00112221 Q ss_pred c-------CCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-ccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHH Q lcl|NC_012418. 309 P-------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) Q Consensus 309 p-------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl 379 (510) . |....+++-.+....+.+.....++.+++.|...-+. +.. ...+...|+..+.....- +....-...+. 
T Consensus 305 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~-l~~ka~~~~~~ 383 (501) T protein:vir:27 305 LKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFG-LDQDRVDTQSQ 383 (501) T ss_pred ecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHH-HHHHHHHHHHH Confidence 1 1111112211222223455566677777766543322 111 111234566554432211 11112223333 Q ss_pred HHHHHHHHHHHHHHHHhhcCC-CCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHH Q lcl|NC_012418. 380 AENLQSPLAYVCLSEVDDALL-QGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDT 456 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~~~l-~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~ 456 (510) -.+.+.-+++.++.++...+. .......+.+..- .+.+.+..++-+.++ .+.+ -...+ T Consensus 384 ~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl------~g~i----------S~et~--- 444 (501) T protein:vir:27 384 FTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL------GGQV----------SQETA--- 444 (501) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH------hccC----------cHHHH--- Confidence 333333344444455432221 1111222333221 122222222211111 1111 11122 Q ss_pred HHHHc-CCCHhHccCCHHHHHHHHHHHHHHHHHHHHh--hHHHhhhhhh--------hhhccc Q lcl|NC_012418. 457 IWAAF-SVDTSQFYKSEEELQAEAEQRRQQAAQAQAA--QETLLEGASD--------MTNALA 508 (510) Q Consensus 457 ~a~~~-Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~--~~~~~~ga~~--------~~~~~a 508 (510) ...+ +|+ -.++|++++.++++....++++. ....+.+... .+.++. 
T Consensus 445 -l~~l~~v~-----D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 445 -LSLSGLVE-----SPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred -HHhCCCCC-----CHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccccCC Confidence 2223 222 12466766665544322222211 1111111111 111111 No 116 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=92.87 E-value=0.0097 Score=31.63 Aligned_cols=375 Identities=10% Similarity=-0.007 Sum_probs=171.0 Q ss_pred ChhHHHHHHHH-Hh-hccchHHHHHHHHH--hcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEK-LR-DGSVEQRAIEFAKT--TLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~-lk-r~~~~~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) |-..+..+..+ +. +.+...+..++|+- -+|++...- ...-....+..-+-+..++++||..|. ..+ | T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~-p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G---f- 71 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITI-PQALSQQYRSILGWCAKGVDSLADRLV----FRE---F- 71 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhh-hHHHHHHHhhhcchhHHHHHHhHhhcc----cCc---c- Confidence 65555444432 22 22222222333331 111111000 000000112233455666666655432 222 1 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEeceE Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSY 153 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~~ 153 (510) ...|. + +++...+++|.....++.++..++|.+.+++..+.. ++++++..+. 
T Consensus 72 -~~~d~-------------~-----------l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~ 126 (409) T protein:vir:94 72 -ENDDF-------------T-----------VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNA 126 (409) T ss_pred -cCCch-------------H-----------HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceE Confidence 11111 1 234457788999999999999999998877654322 4677766665 Q ss_pred EEeeCCC-CCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEE-ecCeeeccccc Q lcl|NC_012418. 154 AVRRDAT-GRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHE-IDGVRVGEEGR 231 (510) Q Consensus 154 ~i~~d~~-G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e-~~~~~~~~~~~ 231 (510) ++..|+. +++...++... + +... ...... + ..++. .+++. .++.......+ T Consensus 127 ~~i~D~~~~~~~~a~~~~~---~---------------d~~~--~~~~~~-~-~~~~~-----~~~~~~~~~~~~~~~n~ 179 (409) T protein:vir:94 127 TGIIDPITGLLTEGYAVLE---R---------------DENN--NVVLEA-H-FLPDR-----TDYYYRDSRNNISIANP 179 (409) T ss_pred EEEEecCCCceeeeEEEEE---e---------------cCCC--ceEEEE-E-EecCc-----EEEEEecCceeEeeeCC Confidence 4545543 45544443321 0 0001 111111 1 11111 11111 12222222334 Q ss_pred cccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCcccchhhhccCCCcee Q lcl|NC_012418. 232 WPIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMGDY 307 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~---~~~g~~~p~~~~~~~~g~~ 307 (510) + +.+|++.+..+...++.||+|-. +..++=+..+|+..-..+..++..+.|...+ ++|+.. .+.++... 
+.+ T Consensus 180 ~--g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~-~~~~~~~~-~~i 255 (409) T protein:vir:94 180 T--GHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEP-METWKATV-SSM 255 (409) T ss_pred C--CCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcc-cchhhhhH-HHh Confidence 4 46999999999888999999965 5677888899999988999999999985544 333321 11222211 112 Q ss_pred --ecCCccc--ccccccCcccchHHHHHHHHHHHHHHHHHhhcccc-----CCCCCC-CCHHHH-------HHHHHHHHH Q lcl|NC_012418. 308 --VPGGAEA--VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-----QRDAER-VTAEEV-------RITAEEAEN 370 (510) Q Consensus 308 --~pg~~~~--v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-----~~~~~~-~TAtEi-------~~r~~E~~~ 370 (510) +|...++ ++.-++. .++++. -++.++.-|+...+.... .....+ -+|.-| ..++++|.. T Consensus 256 ~~~~~d~dg~~~~v~q~~-~~~l~~---~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~ 331 (409) T protein:vir:94 256 LQFTKDEDGDKPTLGQFT-QPSMSP---FTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQR 331 (409) T ss_pred hcCCCCCCCCCceEEecC-CCChhH---HHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHH Confidence 2322111 2222222 234543 345555555544433221 111112 233322 346677777 Q ss_pred HhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccccee----eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh Q lcl|NC_012418. 371 TLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI----ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~----v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~ 446 (510) .+|.-+.++ ++.++.+.. +....+.+..+... +..-+....|+.+..+.-+.+ ...+++ T Consensus 332 ~fg~~~~~~--------~rla~~i~~--~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~---ag~~~~---- 394 (409) T protein:vir:94 332 SLGAGLLNV--------AYLAACLRD--DAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQ---AIPEFI---- 394 (409) T ss_pred HHHHHHHHH--------HHHHHHHhC--CCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHH---hccccc---- Confidence 788766654 233333332 33333333222211 223344445555555443332 111111 Q ss_pred ccCHHHHHHHHHHHcCCCHhH Q lcl|NC_012418. 
447 RISLPKMMDTIWAAFSVDTSQ 467 (510) Q Consensus 447 ~id~d~~~~~~a~~~Gvp~~~ 467 (510) +. +.+.+.+|.+... T Consensus 395 --~~----~~~~~~lG~~~~d 409 (409) T protein:vir:94 395 --NK----DTIRDLTGIEGGE 409 (409) T ss_pred --ch----hHHHHHcCCCCCC Confidence 11 2244555554432 No 117 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=92.58 E-value=0.011 Score=31.36 Aligned_cols=462 Identities=13% Similarity=0.062 Sum_probs=193.1 Q ss_pred ChhHHHHHHHHHh------hccch--------HHHHHHHHHhcccccCCC-CCCccccccccccchHHHHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKLR------DGSVE--------QRAIEFAKTTLPYLMVDP-MSGSRGVVEHDFQSAGALLVNNLAAKLAR 65 (510) Q Consensus 1 ~~~~~~~r~~~lk------r~~~~--------~~w~e~~~~~lP~~~~~~-~~~~~~~~~~~~dstg~~a~~~LAa~l~~ 65 (510) |-. =+++|.--+ ...|. .+.+.+.+|..=.-..-. .-.++. ...++++.|..-+++++. T Consensus 1 m~~-~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~d-r~~~~~ps~r~~V~~~~~---- 74 (563) T protein:vir:74 1 MPY-NHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDD-SVPILMPSGRKIVEAVHR---- 74 (563) T ss_pred CCc-cccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCc-eeeeccchHHHHHHHHHH---- Confidence 111 111111000 11121 122233333221100000 011111 234678888888888553 Q ss_pred hhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--C--- Q lcl|NC_012418. 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--S--- 140 (510) Q Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~--- 140 (510) .| +....|+- +...-.+ ..... 
+++.+.....+.|+.....++-.+.++.|-+++++- + T Consensus 75 ~L-g~~~~~~V---e~~~~de-----~~~~a-------vq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~ 138 (563) T protein:vir:74 75 FL-GVGFDYLV---EPDMGDE-----GIRQS-------LNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKK 138 (563) T ss_pred hc-CCCcEEec---CccccCc-----chHHH-------HHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccc Confidence 33 44455543 2211100 11111 455666678889999999999999999999887653 2 Q ss_pred CCCcEEEEEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhh--h-hhccCCCceE--EEEEEEE-eecC--C Q lcl|NC_012418. 141 DEATVVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMR--A-GRNLSGSGSV--DLYTHVQ-RKKG--T 210 (510) Q Consensus 141 ~~~~~~~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~--~-~~~~~~~~~v--~i~~~v~-~~~~--~ 210 (510) ..+|.++.++ +.|+-..|++ .|-.+|-..-...=.++++..+.+-+ + ....+++..+ .+.+-.+ +..+ . T Consensus 139 ~g~R~rv~~vDP~~~fp~~dpd-~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd 217 (563) T protein:vir:74 139 AGERISVDEVDPRQIFLIEDGS-TVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWD 217 (563) T ss_pred cCCCceEeecCCceeeeccCCC-CcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhcccccc Confidence 1235665554 6777777764 45455522211111123333222211 1 1111222210 1111111 1110 0 Q ss_pred CceEEEEEE--EecCee----eccc----cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 211 AMEYAELYH--EIDGVR----VGEE----GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) Q Consensus 211 ~~p~~sv~~--e~~~~~----~~~~----~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~ 280 (510) ..+.-+.-+ +.++.. .+++ --|+ -.||+.++=...++++||+|--.+.+.=++.||.--...-..+.. 
T Consensus 218 ~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~--~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~ 295 (563) T protein:vir:74 218 DRGAISDEQARRKEQVRSAQHDEEEEELPEPIS--QLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVF 295 (563) T ss_pred ccCccchhhhcccchhhhhhhhchhhhcccccc--CccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHh Confidence 111111111 222211 1111 1222 268888777788899999999999999999999755444444444 Q ss_pred hhCCceeeCCCcccchhhhccCCCce--eecCCcc-------cccccccCcccchHHHHHHHHHHHHHHHHHhhcc---- Q lcl|NC_012418. 281 SLEVLNLVDEAKGAVVDDYQDAEMGD--YVPGGAE-------AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG---- 347 (510) Q Consensus 281 a~~p~~l~~~~g~~~p~~~~~~~~g~--~~pg~~~-------~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~---- 347 (510) .=+|+...+ +....+.. .+..-. +-||... .-....+...++++.++..+.++..|. ..-. T Consensus 296 tG~pi~vl~--~~~p~d~~-~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~era---l~~~s~tP 369 (563) T protein:vir:74 296 QGLGMYVTN--ASAPVDPN-TGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKG---IAEGSGTP 369 (563) T ss_pred cCCCeEEec--cccccccc-cccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHH---HHhhccCc Confidence 444544332 22211211 100000 1233331 112334445567777777777776532 2110 Q ss_pred --cc-CCCCCCC---CHHH-----HHHHHHHHHHHhchhHhHHHHHHH---HHHHHHHHHHHhh---cCCCCCCcccccc Q lcl|NC_012418. 348 --AN-QRDAERV---TAEE-----VRITAEEAENTLGGTYSLLAENLQ---SPLAYVCLSEVDD---ALLQGLITKQHKP 410 (510) Q Consensus 348 --~~-~~~~~~~---TAtE-----i~~r~~E~~~~LGpv~~rl~~E~l---~Pli~r~~~il~~---~~l~~~~~~~~~~ 410 (510) ++ ..|..+. .|=| +..+.+||+..|=.++-+.-.++. .|..+|.+-.-.- .|.-+.|.. 
..+ T Consensus 370 avA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~-~~v 448 (563) T protein:vir:74 370 EVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNE-CSV 448 (563) T ss_pred ceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCc-eEE Confidence 01 1122221 2222 245555666544444444333322 2222332221110 111111111 112 Q ss_pred ee-eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 411 AI-ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQA 489 (510) Q Consensus 411 ~~-v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~ 489 (510) .+ -...-|.-+++-++.+....+ ++ -|-...+++.+.++ |-| |=--++|++++..++=+.|+.| T Consensus 449 ~ivf~p~~P~d~~~vv~~~~tl~~-----aG------iiSretAv~~L~~~-g~~---~pdae~e~~~ie~~~i~~~~~a 513 (563) T protein:vir:74 449 VCIFADPMPVNKTQVTQDTLLLQQ-----AH------LILRKMAVAKLRSI-GWE---YPEVDDQGNALTDDDIADMLLA 513 (563) T ss_pred EEEeCCCCCccHHHHHHHHHHHHH-----cC------chhHHHHHHHHHhC-CCC---CCcHHHHHhhcCHHHHHHHHHH Confidence 22 234556566665555542211 12 13445677777776 754 1112456655544433333333 Q ss_pred HHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 490 QAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 490 ~~~~~~~~~ga~~~~~~~ag~ 510 (510) |..+ ++--|+.+...--++= T Consensus 514 ~a~a-d~~~~~~a~~~~g~~~ 533 (563) T protein:vir:74 514 EAEA-DASLGLSAMDNGGAGE 533 (563) T ss_pred Hhhc-cCcccceecccCCCCc Confidence 2221 1111222111111111 No 118 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=92.19 E-value=0.012 Score=31.03 Aligned_cols=437 Identities=11% Similarity=-0.021 Sum_probs=180.7 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhccc--------ccCC-CCCC---ccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--------LMVD-PMSG---SRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~--------~~~~-~~~~---~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) +++.+.+.+.....+.-..+-+...+|..-. .... .+.. ......|+..+.+...++..++-|++. T Consensus 13 ~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~yl~G~-- 90 (537) T protein:vir:78 13 LGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQYLLSN-- 90 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhhhhccc-- Confidence 2222222222221111123334445553321 0000 0000 011223567777788888877777654 Q ss_pred CcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCCC-cE Q lcl|NC_012418. 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA-TV 145 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~~-~~ 145 (510) |+. ++..+.. .+++.+ .+...+ ..+|.....++.+++..+|.+.+ |.+++.. ++ T Consensus 91 Pv~-----~~~~d~~----------~~e~~~-------~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~ 147 (537) T protein:vir:78 91 GVE-----VKVKDED----------NTQLDE-------ILQEYF-DEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKF 147 (537) T ss_pred Cce-----eecCcch----------hHHHHH-------HHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEE Confidence 332 2332221 112222 222223 46788888999999999998754 5555432 45 Q ss_pred EEEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE----EeecCCC---------- Q lcl|NC_012418. 146 VAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV----QRKKGTA---------- 211 (510) Q Consensus 146 ~~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v----~~~~~~~---------- 211 (510) ..++-.+.+.-.|..|....++|.+.....+-... ..+.-..+++|+.- +...... 
T Consensus 148 ~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~----------~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~ 217 (537) T protein:vir:78 148 QTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQ----------STETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEA 217 (537) T ss_pred EEEccceeEEEEcCCCCceeEEEEEeeeecccccc----------CcceEEEEEEEcCCcEEEEEecCCccccccccccc Confidence 55665665555677888888888777654331100 11111123333110 0011100 Q ss_pred ---ceEEEEEEEecC-------eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_012418. 212 ---MEYAELYHEIDG-------VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) Q Consensus 212 ---~p~~sv~~e~~~-------~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a 281 (510) .|.-.++...++ .........+|..+|++.++= +.+|.|=-++..+-+-.++.+.-......+.. T Consensus 218 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n-----n~~~~sd~e~v~~LiDayd~~~S~~an~~~~~ 292 (537) T protein:vir:78 218 YNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN-----NKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDF 292 (537) T ss_pred ccccccceeeeccccccccccccccccccccCCcceeEEEecc-----CccCCCchhhhHHHHHHHHHHHHhhhhHHHHh Confidence 011111110000 001111223456788876654 45788988999999999999888888888888 Q ss_pred hCCceeeCCCcccc-hhhhccC-CCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccCCCCCCCC Q lcl|NC_012418. 282 LEVLNLVDEAKGAV-VDDYQDA-EMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVT 357 (510) Q Consensus 282 ~~p~~l~~~~g~~~-p~~~~~~-~~g~~-~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~~T 357 (510) .+|.+++.-.+..+ .+..... ..+.+ +.|...++..+.. ..+.......++.+++.|.+.-+. 
+......+..| T Consensus 293 ~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~~~v~~l~~--~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~S 370 (537) T protein:vir:78 293 SEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDNAGMEIQTV--SIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVT 370 (537) T ss_pred cCceeeeecCCCccchhHHHHHhhcCceeecCCCCceeEEEe--cCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCc Confidence 89877764222211 1111111 12322 4454455554433 245677788888888888654322 22222334445 Q ss_pred HHHHHH-------HHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeee--cHHHHHHHHHHHHH Q lcl|NC_012418. 358 AEEVRI-------TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSM 428 (510) Q Consensus 358 AtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~--~is~L~raq~~~~~ 428 (510) ..-+.. ++.+++..++..+. -+++-++.++...+.......++.+.+.. +.+.+..++-+..+ T Consensus 371 GvAlk~~~~~l~~ka~~ke~~f~~~l~--------~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l 442 (537) T protein:vir:78 371 NVVIKSRYTLLAMKARKMETSLRKVLR--------WCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTE 442 (537) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHH Confidence 433322 33333333333332 23334444443333222222333333221 22222222211111 Q ss_pred HHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHH------------HHHHhh--H Q lcl|NC_012418. 429 LNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAA------------QAQAAQ--E 494 (510) Q Consensus 429 ~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~------------~~~~~~--~ 494 (510) . ..+.++. ..++ ..++ ++-++++...+.++.++... ...... . T Consensus 443 ~----~~giiS~----------eT~l----~~~p-----~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (537) T protein:vir:78 443 A----ETEALKI----------GNIM----TVAP-----RIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQ 499 (537) T ss_pred H----hcCcchH----------HHHH----HhCC-----CCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchh Confidence 0 0011111 0011 1111 11222211111111111000 000000 0 Q ss_pred HHhhhhhh-hhhcc-------cCC Q lcl|NC_012418. 
495 TLLEGASD-MTNAL-------AGV 510 (510) Q Consensus 495 ~~~~ga~~-~~~~~-------ag~ 510 (510) +...|... .++.+ ++= T Consensus 500 ~~~~~~~~~~~~~~~d~~~~~~~~ 523 (537) T protein:vir:78 500 AMLDGLPVNANQPPVDPNQPVADP 523 (537) T ss_pred hhcCCCCCCCCCCCCCccCCCCCC Confidence 00011111 01111 000 No 119 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=92.16 E-value=0.013 Score=31.00 Aligned_cols=419 Identities=13% Similarity=0.076 Sum_probs=176.9 Q ss_pred Ch--------hHHHHHHHHHhhc-cchHHHHHHHHHhcccccCCCCCCccccccc-cccchHHHHHHHHHHHHHHhhcCc Q lcl|NC_012418. 1 MK--------STAAMLWEKLRDG-SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEH-DFQSAGALLVNNLAAKLARSLFPT 70 (510) Q Consensus 1 ~~--------~~~~~r~~~lkr~-~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ltpp 70 (510) |= .....+|+..+.. .=...+++...-.||..-..+.+.-..++.+ .|-+.-.+.++.++ +.+|.- T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~----G~vf~k 76 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALS----GMVLDQ 76 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHh----chhhcC Confidence 21 1223344333211 0012333333333443211111111222222 23444445555544 444431 Q ss_pred CCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCc----EE Q lcl|NC_012418. 71 GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEAT----VV 146 (510) Q Consensus 71 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~----~~ 146 (510) ++ .++.++. ++.+. .-....+.+.-+...+.+...+|-+.+++|-+..+ +. 
T Consensus 77 --~p-~~~~p~~--------------l~~~~--------~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~ 131 (452) T protein:vir:94 77 --PP-VITHPDA--------------MSKYF--------EDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYIS 131 (452) T ss_pred --Cc-eecccHH--------------HHHHH--------hcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEE Confidence 21 2232221 22221 12567788888899999999999999999855332 45 Q ss_pred EEEece-EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEE-EecCe Q lcl|NC_012418. 147 AWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYH-EIDGV 224 (510) Q Consensus 147 ~~pl~~-~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~-e~~~~ 224 (510) .|+-.+ .=+..|..|+..-+..+++..+++-++.|+... ++.|.+..-.++. +.++. +.++. T Consensus 132 ~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~------------~~~yRvL~l~~g~----~~v~~~~~~~~ 195 (452) T protein:vir:94 132 VYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQNI------------RVRYRCLELVDGL----LQITVHETQDG 195 (452) T ss_pred EechhhhcCccccccCCeeEEEEEEEEEEecCCCccccee------------EEEEEEEEEeCCe----EEEEEEEccCC Confidence 565444 335556678777677777766666455555433 2333333322221 22211 11111 Q ss_pred --------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHH---HHHHHH-HHHHhhCCceeeCCCc Q lcl|NC_012418. 225 --------RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLL---SEKLGL-YELESLEVLNLVDEAK 292 (510) Q Consensus 225 --------~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l---~~~~l~-~~~~a~~p~~l~~~~g 292 (510) .....++ +-+++|++.|--..+.... ++.--|=|+..||.- ..+-++ .+..+..|...+. 
| T Consensus 196 ~~~~~~~~~~~~~~~---~~l~~IP~v~~~~~~~~~~--~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~--g 268 (452) T protein:vir:94 196 KVWELAKTSTIQNVG---VTMDYIPFFCITPSGLSMT--PAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWIT--G 268 (452) T ss_pred ceeeeccceeecCCC---cccceeEEEEEcCCCCCCC--CCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEee--c Confidence 1112222 3466777777665554332 222224466666543 222223 3445555544432 2 Q ss_pred ccchhhhccCCCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHH-HHHHHHHHHH Q lcl|NC_012418. 293 GAVVDDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEE-VRITAEEAEN 370 (510) Q Consensus 293 ~~~p~~~~~~~~g~~-~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtE-i~~r~~E~~~ 370 (510) ..+-+.+..+++..+ .|......+-++. .+..+......|+++++.++++=- .+........|++| ...+....-. T Consensus 269 ~~~~~~i~iG~~~~~~lpe~~~~~~yie~-~g~~i~~~~~~l~~le~~m~~~Ga-~ll~~~~~~~~s~ea~~~~~~~~~s 346 (452) T protein:vir:94 269 AESQSTMHIGSTKAWVIPEVAAKVGFLEF-TGQGLQSLEKALSEKQAQLASLSA-RLIDNSTRGSEATETVKLRYMSETA 346 (452) T ss_pred CcCCCceEecccccccCCCCCCcceEEcc-CchhHHHHHHHHHHHHHHHHHHHH-HhhccCCCcchHHHHHHHHHHHhhH Confidence 322233333333222 3421222333332 345577778888888888766321 22333222344554 4455555567 Q ss_pred HhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceee-ecHH-HHHHHHHHHHHHHHHHHHHhhcChhhHhhc Q lcl|NC_012418. 371 TLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE-TGLP-ALSRSAAVQSMLNASQVIAGLAPIAQLDPR 447 (510) Q Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v-~~is-~L~raq~~~~~~~~~q~l~~~~~~~q~~~~ 447 (510) .|.-+..++.+-+ ++++.++.+ -|.- .+++..+- +++. .+ -.+.+..+. +.. .++ . 
T Consensus 347 ~L~~~a~~~e~al-----~~~l~~~a~w~g~~----~~~~v~~n~dF~~~~~-~~~~~~al~---~~~--~~G------~ 405 (452) T protein:vir:94 347 SLKSVTRAVEALL-----NKAYSCIMDMESMG----GTLNIKLNSAFLDSKL-TAAELKAWV---EAY--LSG------G 405 (452) T ss_pred HHHHHHHHHHHHH-----HHHHHHHHHHcCCC----CceEEEeccccccccC-CHHHHHHHH---HHH--hcC------C Confidence 8888877776543 566666543 2321 12222111 1111 11 112222222 211 111 1 Q ss_pred cCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhh Q lcl|NC_012418. 448 ISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDM 503 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~ 503 (510) |..+.+.++ ....||+. .++|......++.. +.......+. .+++.+ T Consensus 406 is~~t~~~~-L~~~gvl~-----~~~e~~~i~~E~~~--~~~~~~~~~~-~~~~~~ 452 (452) T protein:vir:94 406 ISKEIYIHA-LKVGKVLP-----PPGESMGVIPDPPA--PEPSPSNTPP-NPSSKA 452 (452) T ss_pred CcHHHHHHH-HHhCCCCC-----CccCHHHHHHHhhc--cCcccCCCCC-CCccCC Confidence 333334443 34456652 22222222211110 1000011111 111111 No 120 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=91.87 E-value=0.014 Score=30.77 Aligned_cols=430 Identities=12% Similarity=0.040 Sum_probs=181.6 Q ss_pred Ch---------hHHHHHHHHHhhc-cchHHHHHHHHHhcccccCCCCCCc-----cccccc-cccchHHHHHHHHHHHHH Q lcl|NC_012418. 1 MK---------STAAMLWEKLRDG-SVEQRAIEFAKTTLPYLMVDPMSGS-----RGVVEH-DFQSAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~~---------~~~~~r~~~lkr~-~~~~~w~e~~~~~lP~~~~~~~~~~-----~~~~~~-~~dstg~~a~~~LAa~l~ 64 (510) |- ..+..+|+..++- .=...|++...-.||..-..+.... 
..++.+ .|=+.-.+.++ +|+ T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~----~l~ 76 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLF----GLV 76 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHH----HHh Confidence 32 2234455433311 1123455555555665321111111 111122 23333344444 444 Q ss_pred HhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC- Q lcl|NC_012418. 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA- 143 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~- 143 (510) +.+|- .+-.++.+ ..++.++++| -....+.+.-+..+..+...+|-+.+++|-+.. T Consensus 77 G~vf~---k~p~~~~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~ 133 (501) T protein:vir:95 77 GQVFM---RDPVVKVP--------------ALLNPLVANA------TGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTE 133 (501) T ss_pred hhhhc---CCcceeCc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCC Confidence 44442 11122221 2244555544 245667888888899999999999999985421 Q ss_pred ----------------c-EEEEEeceE-EEe---eCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEE Q lcl|NC_012418. 144 ----------------T-VVAWSLRSY-AVR---RDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYT 202 (510) Q Consensus 144 ----------------~-~~~~pl~~~-~i~---~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~ 202 (510) + +..|+-.+. =+. .|...++.-+..++..+.+. +.|.. +.++.|. T Consensus 134 ~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d--~~f~~------------~~~~q~R 199 (501) T protein:vir:95 134 AEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAAD--DGFEM------------KTSGQFR 199 (501) T ss_pred CcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecC--CCccc------------ceeEEEE Confidence 1 444544332 122 23333455555565555333 33332 3344555 Q ss_pred EEEeecCCCceEEEEEEEecC-----------------eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHH Q lcl|NC_012418. 
203 HVQRKKGTAMEYAELYHEIDG-----------------VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFA 265 (510) Q Consensus 203 ~v~~~~~~~~p~~sv~~e~~~-----------------~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r 265 (510) ++.+..++ ...+.+|.+-+. ......++ .+.+++|++.|.-..+...+.| .-.|=|+. T Consensus 200 vL~~~~~g-~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~l~~IPfv~~~~~~~~~~~~--~pPLl~lA 274 (501) T protein:vir:95 200 VLRLDEEG-YYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQ--GKRLTEIPFMFIGSENNDSNPD--NPNFYDLA 274 (501) T ss_pred EEeeCCCc-eEEEEEEEecCCcccCcceecCCcccccceeeeeccC--CCcCCeeeEEEEecCCCCCCCC--ccchHHHH Confidence 55543222 222333332111 11111222 2467888888876666544443 22233555 Q ss_pred HHHHH---HHHHHH-HHHHhhCCceeeCCCcccc-------hhhhccCCCce-eecCCcccccccccCcccchHHHHHHH Q lcl|NC_012418. 266 KLSLL---SEKLGL-YELESLEVLNLVDEAKGAV-------VDDYQDAEMGD-YVPGGAEAVRAYERGDYNKMAAIQQSL 333 (510) Q Consensus 266 ~L~~l---~~~~l~-~~~~a~~p~~l~~~~g~~~-------p~~~~~~~~g~-~~pg~~~~v~~~~~~~~~~~~~~~~~i 333 (510) .||.- +.+-.+ .+-.+..|.+.+. |... ...+..+.+.. ..| ...+.+.++.. +..+ ....+ T Consensus 275 ~lni~hy~~ssd~~~~l~~~~~P~l~i~--G~~~~~~~~~~~~~i~~G~~~~~~lP-~~~~~~~ie~~-~~~i--~~~~l 348 (501) T protein:vir:95 275 SLNMAHYRNSADYEESCYIVGQPTPVLI--GLTEEWVTNVLKGSVNFGSRGGIPLP-VGADAKLLQAS-ENTM--LKEAM 348 (501) T ss_pred HHHHHHHhhhhHHHHHHHHcccceeeee--CCcccccccCCCCceeecccccccCC-CCCceeEEecC-hhhH--HHHHH Confidence 55543 222233 3344455543332 1111 11122222111 122 11222333321 2223 35667 Q ss_pred HHHHHHHHHHhhccccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCccccccee Q lcl|NC_012418. 334 QAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAI 412 (510) Q Consensus 334 ~~~~~~I~~af~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~ 412 (510) ++++++++++= ..+.+......||++...+.......|.-+..++.+-+ .+++.++.+ -|+. 
++.+++.+ T Consensus 349 ~~l~~~m~~~G-a~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al-----~~~l~~~a~w~g~~---~~~~~v~i 419 (501) T protein:vir:95 349 DTKERQMVALG-AKLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAF-----EWALKWAARWVGQA---DSGVKFEL 419 (501) T ss_pred HHHHHHHHHHH-HhhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHH-----HHHHHHHHHHcCCC---CCceEEEE Confidence 77777776643 23344444558999999999999999999888886543 444444432 1222 23333222 Q ss_pred e-ecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 413 E-TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQA 491 (510) Q Consensus 413 v-~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~ 491 (510) - .+...--.++.++.+ ..... + ..|..+.+.+.+ ...||+.. ..+++.++++.....+-...+. T Consensus 420 ~~df~~~~~~~~~~~al---~~~~~--~------G~is~~t~~~~L-~~~~v~~~---~~~~e~e~i~~~~~~~~~~~~~ 484 (501) T protein:vir:95 420 NTDFDIARMTPDERRSL---VEEWQ--K------GAITFEEMRTGL-RKAGVATE---DDSKAKEKIAKDTAEAMALATP 484 (501) T ss_pred ecccccccCCHHHHHHH---HHHHh--C------CCCcHHHHHHHH-HhCCCCCh---hHHHHHHHHHhhhcCccccccc Confidence 1 111110011111111 11111 1 124444454544 45687742 1233333333322210000000 Q ss_pred hh-HHHhhhhhh-hhhc Q lcl|NC_012418. 492 AQ-ETLLEGASD-MTNA 506 (510) Q Consensus 492 ~~-~~~~~ga~~-~~~~ 506 (510) +. ..-..|+.. ..+. T Consensus 485 ~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 485 ANVPGDGSGGDNVGNSE 501 (501) T ss_pred CCCCCCCcccccccCCC Confidence 00 000111111 1111 No 121 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=91.82 E-value=0.014 Score=30.73 Aligned_cols=420 Identities=11% Similarity=0.022 Sum_probs=168.6 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccccCC-CCCC------------ccccccccc--cchHHHHHHHHHHHHHH Q lcl|NC_012418. 
1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVD-PMSG------------SRGVVEHDF--QSAGALLVNNLAAKLAR 65 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~-~~~~------------~~~~~~~~~--dstg~~a~~~LAa~l~~ 65 (510) |-+-=.++-.+..+ .......|.. ....+ +.+. +-.....+| +..+-.++++.|..+ T Consensus 1 ~~~~~~a~~~~~~~-----~a~~~~~~~~-~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~-- 72 (461) T protein:vir:80 1 MYSIDKAKQAKIDS-----KIVNRNDFMV-GHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDM-- 72 (461) T ss_pred Cccchhhhhhhhhh-----hhhhhhHHHh-hcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHh-- Confidence 32211111111111 0111112221 10000 0000 000011122 233344444444444 Q ss_pred hhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC-- Q lcl|NC_012418. 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA-- 143 (510) Q Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~-- 143 (510) | +.|+.+.-.++.. ...++.|++ +-++...+.++++.--.+|.+.+++.-... T Consensus 73 --~---r~g~~i~~~~~~~---------~~~~~~~~~-----------~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~ 127 (461) T protein:vir:80 73 --V---RAGWSLKTDNKEM---------KKNIESKWR-----------KLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR 127 (461) T ss_pred --h---cCCeeeecCCHHH---------HHHHHHHHH-----------HhhHHHHHHHHHHhhcccccEEEEEEeecCCc Confidence 2 3678877544322 112333332 236788899999998899987777642111 Q ss_pred --cEEEEEeceEEEeeCCCCCe--EEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEE Q lcl|NC_012418. 144 --TVVAWSLRSYAVRRDATGRW--MDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYH 219 (510) Q Consensus 144 --~~~~~pl~~~~i~~d~~G~v--d~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~ 219 (510) ....-||. ...-+.+ ..+|.+..++...... +... ..+.+.+.|+........ . +. 
T Consensus 128 ~~~~~~~pl~-----~~~~~~~~~l~~~~~~~i~~~~~~~----dp~s-----p~fg~P~~y~i~~~~~~~---~---~~ 187 (461) T protein:vir:80 128 EQADLSTAID-----PKTIKSIPYINTFNTQKVTQLYLNQ----DMFS-----EHFGEVEFFEVNRVSQLG---E---EI 187 (461) T ss_pred cccCccCCcc-----cccccceeEEEeccccccchhhhcc----cCcC-----cccccceEEEEecccccc---c---cc Confidence 11122221 1111111 2233333333322211 1110 011122222222211110 0 00 Q ss_pred EecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC------Cc- Q lcl|NC_012418. 220 EIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE------AK- 292 (510) Q Consensus 220 e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~------~g- 292 (510) +.+ . ........|..+++.+.=...++..||+|..+..++.++..........+.+..+.-..+-.+. +. T Consensus 188 -~~~-~-~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~ 264 (461) T protein:vir:80 188 -LSG-T-TASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDK 264 (461) T ss_pred -ccc-c-cCccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHH Confidence 000 0 0000011234455555555666778899999999999999988887777666555444443320 00 Q ss_pred --cc-chhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhh---cccc-CCCCCCCCHHHHHHHH Q lcl|NC_012418. 293 --GA-VVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM---YGAN-QRDAERVTAEEVRITA 365 (510) Q Consensus 293 --~~-~p~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~---~~~~-~~~~~~~TAtEi~~r~ 365 (510) .. ..+.++ ..+|..+-+..+++..+.. 
++.-+...+....+.|.-+-= .-++ +..+..=|.++ T Consensus 265 ~~~~~~~~~~~-~~~g~~~~d~~e~~e~~~~----~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~----- 334 (461) T protein:vir:80 265 ANLTAMLDFMF-RTEALAIIKGDEQLTKEST----NVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQY----- 334 (461) T ss_pred HHHHHHHHHhc-CCceEEEEcCCcceEEEec----CcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchH----- Confidence 00 011111 2334444444444433322 344445566666666665541 1111 22233223221 Q ss_pred HHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhc--CC-CCCCcc--cccceeeecHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_012418. 366 EEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA--LL-QGLITK--QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 366 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~--~l-~~~~~~--~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~ 440 (510) -....---+.+++.-.+.|.+++++.+|.+. +. +++.|. ++.... .++..+....+++-.....+.+....+ T Consensus 335 --D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f-~~L~~~s~kekAe~~~~~a~a~~~~~~ 411 (461) T protein:vir:80 335 --DVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEF-NPLWNLDSKTDAEVRKLTAEADQIYIV 411 (461) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEe-CCCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 1222223355666667889999999987542 22 222332 333222 223222333333322222222222111 Q ss_pred hhhHhhccCHHHHHHHHHHHcCCCHhHccC-CHHHHHHHHHHHHHHHHHHHHhhHHHhhh Q lcl|NC_012418. 441 IAQLDPRISLPKMMDTIWAAFSVDTSQFYK-SEEELQAEAEQRRQQAAQAQAAQETLLEG 499 (510) Q Consensus 441 ~~q~~~~id~d~~~~~~a~~~Gvp~~~i~r-s~~ev~~~r~q~~q~~~~~~~~~~~~~~g 499 (510) ...|+.+++.+.+...+|+++..... 
.+.+.....++..+..+.+ ...| T Consensus 412 ----~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e------~~~g 461 (461) T protein:vir:80 412 ----NGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKK------NADG 461 (461) T ss_pred ----cCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcccccccc------CCCC Confidence 12488888888887777764332222 2222222211111111111 0111 No 122 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=90.93 E-value=0.018 Score=30.11 Aligned_cols=421 Identities=12% Similarity=0.038 Sum_probs=171.1 Q ss_pred ChhHHHHHHHHHhhccc-hHHHHHHHHHhcccccCCCC-CCccccccc-cccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSV-EQRAIEFAKTTLPYLMVDPM-SGSRGVVEH-DFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~-~~~w~e~~~~~lP~~~~~~~-~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) -=..+..+|+..+..-- +..|.- .+--+|+.-...+ +.-..++.+ .|=+.-.+.++.++. .+|- ..|.+ T Consensus 17 ~y~a~~~~W~~ird~~~G~~~~~~-r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G----~vfr-k~p~~-- 88 (491) T protein:vir:95 17 EWLHYAPKWQKVRHALAGDLVGYL-RNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVG----SVMR-KEPEI-- 88 (491) T ss_pred HHHHHHHHHHHHHHHhcCcchhhc-ccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhc----hhhc-CCcee-- Confidence 11123344543331100 000100 0011222100011 111122222 233444455555443 3332 12333 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC-------------- Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA-------------- 143 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~-------------- 143 (510) ++++ .++.++++| -....+.+.-+...+.+...+|-+.+++|-+.. 
T Consensus 89 ~~p~--------------~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~r 148 (491) T protein:vir:95 89 NIPK--------------ELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLN 148 (491) T ss_pred eccH--------------HHHHHHhcc------CCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCC Confidence 2221 244555554 245677888888999999999999999986532 Q ss_pred c-EEEEEeceE----EEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEE Q lcl|NC_012418. 144 T-VVAWSLRSY----AVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELY 218 (510) Q Consensus 144 ~-~~~~pl~~~----~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~ 218 (510) + +..|+-.+. .=..|..+++.-+..+++..+++=+..|+. +.++.|.++.+..++. ....+| T Consensus 149 Py~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~------------~~~~qyRvL~l~~~g~-~~~~v~ 215 (491) T protein:vir:95 149 PTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFET------------KYGEQYRVLDIDTDGN-YRQRLF 215 (491) T ss_pred cEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCccc------------ceEEEEEEEeecCCCc-eEEEEE Confidence 1 555655443 122355667777777776655553344432 3345555555532221 111222 Q ss_pred E-EecCe-------eeccccccccccCceEEEeeeecCCCcccc--chHHHHHHHHHHHHHH---HHHHHHHH-HHhhCC Q lcl|NC_012418. 219 H-EIDGV-------RVGEEGRWPIHLCPYIVPTWNLAPGEHYGR--GHVEDYIGDFAKLSLL---SEKLGLYE-LESLEV 284 (510) Q Consensus 219 ~-e~~~~-------~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGr--gp~~~~L~d~r~L~~l---~~~~l~~~-~~a~~p 284 (510) . ..+|. .+..+++ +.+++|++.|--..+..+.. 
.| |=|+..||.- +.+-++.+ -.+.-| T Consensus 216 r~~~~g~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pP----Ll~LA~lni~Hy~~ssd~~~~l~~~~~P 288 (491) T protein:vir:95 216 RFDAEGGAQEEVVEIYPDLGE---SLRGVIPFTFIGATNNDATIDDAP----LLPLAELNIGHYRNSADNEESSFVVGQP 288 (491) T ss_pred EEcCCCcceeeeeeeeecCCC---cccCeeEEEEEecCCCCCCCCcCc----hHHHHHHHHHHhhhhhHHHHHHHHcccc Confidence 1 11121 1122333 34677777777655555544 34 3355555543 33334433 344444 Q ss_pred ceeeCC-C-------cccchhhhccCCC-ceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCC Q lcl|NC_012418. 285 LNLVDE-A-------KGAVVDDYQDAEM-GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAER 355 (510) Q Consensus 285 ~~l~~~-~-------g~~~p~~~~~~~~-g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~ 355 (510) .+.+.. + ..+++..+.-+.+ +...| ...+.+.++... . ..+...+.+++.+.+.+=. .+.... .+ T Consensus 289 ~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~~lP-~~~~~~~ie~~~-~--~~~~~~l~~~e~qm~~~Ga-~l~~~~-~~ 362 (491) T protein:vir:95 289 TLFIYPGDNLTPQSFKEANPNGIKFGSRCGHNLG-YGGSAQLIQAGE-N--NLARQNMLDKEQQAIQIGA-QLITPS-QQ 362 (491) T ss_pred eeeeecCcccCcchhhccCcceeEecCcCCcCCC-CCCccceeecCc-c--hHHHHHHHHHHHHHHHHHH-HhccCC-cc Confidence 433321 1 1111222222221 11112 112223333321 1 2346667777666655421 222333 35 Q ss_pred CCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccc----ee-eecHHHHHHHHHHHHHH Q lcl|NC_012418. 356 VTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKP----AI-ETGLPALSRSAAVQSML 429 (510) Q Consensus 356 ~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~----~~-v~~is~L~raq~~~~~~ 429 (510) .||++...+...--..|..+...+.+-+ .+++.++.+ -|+.. +..+.. .+ +..+. ++.+..+. 
T Consensus 363 ~Ta~~~~~~~~~~~S~L~~~a~~~e~al-----~~~l~~~a~w~G~~~--~~~v~i~~n~dF~~~~~~----~~~~~all 431 (491) T protein:vir:95 363 ITAESARIQRGADTSVMATIARNVSQAY-----TDALRWVAMMLGKPE--DSEVEFQLNMDFFLQPMT----AQDRAAWM 431 (491) T ss_pred hhHHHHHHHHHHhhHHHHHHHHHHHHHH-----HHHHHHHHHHcCCCC--CCceEEEeecccccccCC----HHHHHHHH Confidence 8999999999999999998888876543 444555432 23221 222221 11 12222 12222221 Q ss_pred HHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHH-HHHHHHHHHHHHHHHHHHhhHHHhhh--hhhhhhc Q lcl|NC_012418. 430 NASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEE-ELQAEAEQRRQQAAQAQAAQETLLEG--ASDMTNA 506 (510) Q Consensus 430 ~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~-ev~~~r~q~~q~~~~~~~~~~~~~~g--a~~~~~~ 506 (510) .... ++ .|....+.+ ..+..||+. .+.| +..++.++- .--.+..+.+..+ +.++... T Consensus 432 ---~~~~--~G------~is~~t~~~-~L~~~~vl~----~~~e~~~~~ie~~~----~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 432 ---ADIN--AG------LLPATAYYA-ALRKAGVTD----WTDEDILNAIEDAP----LPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred ---HHHh--cC------CCCHHHHHH-HHHhCCCCC----ccHHHHHHHHHhcC----CCCCccccccccchhhhhhccC Confidence 1111 11 121122222 223445541 1222 222221110 0000000000000 0000000 No 123 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=90.12 E-value=0.023 Score=29.61 Aligned_cols=327 Identities=11% Similarity=0.025 Sum_probs=121.1 Q ss_pred Hhcccc--cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHH Q lcl|NC_012418. 27 TTLPYL--MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARV 104 (510) Q Consensus 27 ~~lP~~--~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~ 104 (510) -..+-+ +....+.........+-+.+. ...+.+.++..+ ...++..... ... 
....| T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~al-------~~~----~v~~~ 59 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAAL-------RNS----DLFSI 59 (392) T ss_pred CcchhhhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHhh-------ccH----HHHHH Confidence 111100 000000000000000000000 000001111000 0011110000 001 11122 Q ss_pred HHHHHHHHHh----------------cCC----HHHHHHHHHHHHhhCeEEEEEeCCC-Cc-EEEEEe--ceEEEeeCCC Q lcl|NC_012418. 105 DRKATQRLFQ----------------NAS----LAVLTQVIKLLIVTGNALLYRNSDE-AT-VVAWSL--RSYAVRRDAT 160 (510) Q Consensus 105 e~~~~~~l~~----------------snf----y~~~~~~~~dl~~~G~~~l~~~~~~-~~-~~~~pl--~~~~i~~d~~ 160 (510) =..+...++. -|- +.-+.....++...|++.+++..+. ++ ...+|+ .+.-+..|.+ T Consensus 60 i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~ 139 (392) T protein:vir:39 60 ILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEY 139 (392) T ss_pred HHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCC Confidence 2222222222 222 3344555667778888877765332 22 344444 3333333333 Q ss_pred CCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceE Q lcl|NC_012418. 161 GRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYI 240 (510) Q Consensus 161 G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~ 240 (510) |. .+++-+++ ++........+..++ .+ T Consensus 140 ~~-------------------------------------------------~~~y~~~~--~~~~~~~~~~~~~~e--ii 166 (392) T protein:vir:39 140 EN-------------------------------------------------GMYYNITF--DDPKIEPILQAPQSD--LI 166 (392) T ss_pred Cc-------------------------------------------------eEEEEEEe--cCcccceeEEEcccc--EE Confidence 22 11121111 111111111121122 56 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC-Ccccchh--------hhccCCC--cee-e Q lcl|NC_012418. 
241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVD--------DYQDAEM--GDY-V 308 (510) Q Consensus 241 ~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~-~g~~~p~--------~~~~~~~--g~~-~ 308 (510) ..|+...+|..||.||..-+...+.....+.+.......-...|..++.- ++....+ .+....+ |.+ + T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl 246 (392) T protein:vir:39 167 HMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVL 246 (392) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeec Confidence 66777777889999999999999999999998888888888888766542 2222221 1111111 111 2 Q ss_pred cCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-cCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHH Q lcl|NC_012418. 309 PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPL 387 (510) Q Consensus 309 pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pl 387 (510) +++. ...++... ..+.+. .+..+..+..|-++|=..- .-.+...-|..+ .+...=....|-|.+.++.+|+-.-| T Consensus 247 ~~g~-~~~~l~~~-~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~L 322 (392) T protein:vir:39 247 DDLE-EFTALEIK-SNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYKL 322 (392) T ss_pred CCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 3322 22333221 234443 3556677788888883221 001111112111 11222344556777777776664433 Q ss_pred HHHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHH--------HHHHH-------------HHhhcChh---- Q lcl|NC_012418. 388 AYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSML--------NASQV-------------IAGLAPIA---- 442 (510) Q Consensus 388 i~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~--------~~~q~-------------l~~~~~~~---- 442 (510) +..+ .-++.. +-..+...++..+..+. .+... 
...+.+.+ T Consensus 323 ~~~~-------------~~d~~~--~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~ 387 (392) T protein:vir:39 323 SDHI-------------SVNMRP--AIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQS 387 (392) T ss_pred cccc-------------cccchh--hhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCC Confidence 2110 000000 00011111111111110 00000 01222211 Q ss_pred -hHhh Q lcl|NC_012418. 443 -QLDP 446 (510) Q Consensus 443 -q~~~ 446 (510) +-.| T Consensus 388 ~~p~p 392 (392) T protein:vir:39 388 NEPVP 392 (392) T ss_pred CCCCC Confidence 1112 No 124 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=90.12 E-value=0.023 Score=29.61 Aligned_cols=327 Identities=11% Similarity=0.025 Sum_probs=121.1 Q ss_pred Hhcccc--cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHH Q lcl|NC_012418. 27 TTLPYL--MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARV 104 (510) Q Consensus 27 ~~lP~~--~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~ 104 (510) -..+-+ +....+.........+-+.+. ...+.+.++..+ ...++..... ... ....| T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~al-------~~~----~v~~~ 59 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAAL-------RNS----DLFSI 59 (392) T ss_pred CcchhhhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHhh-------ccH----HHHHH Confidence 111100 000000000000000000000 000001111000 0011110000 001 11122 Q ss_pred HHHHHHHHHh----------------cCC----HHHHHHHHHHHHhhCeEEEEEeCCC-Cc-EEEEEe--ceEEEeeCCC Q lcl|NC_012418. 
105 DRKATQRLFQ----------------NAS----LAVLTQVIKLLIVTGNALLYRNSDE-AT-VVAWSL--RSYAVRRDAT 160 (510) Q Consensus 105 e~~~~~~l~~----------------snf----y~~~~~~~~dl~~~G~~~l~~~~~~-~~-~~~~pl--~~~~i~~d~~ 160 (510) =..+...++. -|- +.-+.....++...|++.+++..+. ++ ...+|+ .+.-+..|.+ T Consensus 60 i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~ 139 (392) T protein:vir:10 60 ILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEY 139 (392) T ss_pred HHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCC Confidence 2222222222 222 3344555667778888877765332 22 344444 3333333333 Q ss_pred CCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceE Q lcl|NC_012418. 161 GRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYI 240 (510) Q Consensus 161 G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~ 240 (510) |. .+++-+++ ++........+..++ .+ T Consensus 140 ~~-------------------------------------------------~~~y~~~~--~~~~~~~~~~~~~~e--ii 166 (392) T protein:vir:10 140 EN-------------------------------------------------GMYYNITF--DDPKIEPILQAPQSD--LI 166 (392) T ss_pred Cc-------------------------------------------------eEEEEEEe--cCcccceeEEEcccc--EE Confidence 22 11121111 111111111121122 56 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC-Ccccchh--------hhccCCC--cee-e Q lcl|NC_012418. 
241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVD--------DYQDAEM--GDY-V 308 (510) Q Consensus 241 ~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~-~g~~~p~--------~~~~~~~--g~~-~ 308 (510) ..|+...+|..||.||..-+...+.....+.+.......-...|..++.- ++....+ .+....+ |.+ + T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl 246 (392) T protein:vir:10 167 HMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVL 246 (392) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeec Confidence 66777777889999999999999999999998888888888888766542 2222221 1111111 111 2 Q ss_pred cCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-cCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHH Q lcl|NC_012418. 309 PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPL 387 (510) Q Consensus 309 pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pl 387 (510) +++. ...++... ..+.+. .+..+..+..|-++|=..- .-.+...-|..+ .+...=....|-|.+.++.+|+-.-| T Consensus 247 ~~g~-~~~~l~~~-~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~L 322 (392) T protein:vir:10 247 DDLE-EFTALEIK-SNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYKL 322 (392) T ss_pred CCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 3322 22333221 234443 3556677788888883221 001111112111 11222344556777777776664433 Q ss_pred HHHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHH--------HHHHH-------------HHhhcChh---- Q lcl|NC_012418. 388 AYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSML--------NASQV-------------IAGLAPIA---- 442 (510) Q Consensus 388 i~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~--------~~~q~-------------l~~~~~~~---- 442 (510) +..+ .-++.. +-..+...++..+..+. .+... 
...+.+.+ T Consensus 323 ~~~~-------------~~d~~~--~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~ 387 (392) T protein:vir:10 323 SDHI-------------SVNMRP--AIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQS 387 (392) T ss_pred cccc-------------cccchh--hhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCC Confidence 2110 000000 00011111111111110 00000 01222211 Q ss_pred -hHhh Q lcl|NC_012418. 443 -QLDP 446 (510) Q Consensus 443 -q~~~ 446 (510) +-.| T Consensus 388 ~~p~p 392 (392) T protein:vir:10 388 NEPVP 392 (392) T ss_pred CCCCC Confidence 1112 No 125 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=89.19 E-value=0.028 Score=29.11 Aligned_cols=376 Identities=11% Similarity=0.026 Sum_probs=169.7 Q ss_pred ChhHHHHHHH-HHh-hccchHHHHHHHHH--hcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWE-KLR-DGSVEQRAIEFAKT--TLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~-~lk-r~~~~~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) |-..+..+.. ++. +.+...+..++|+- -+|++...- ...-...-+..-+-+..++++||..|. ..+ |+ T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~-p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G---f~ 72 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITI-PQALSQQYRSILGWCAKGVDSLADRLV----FRE---FE 72 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhh-hHHHHHHHhhhcChhHHHHHHhHhhcc----ccc---cc Confidence 6554444443 232 22222233333432 122211100 000000112233455666666655442 112 11 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC---cEEEEEece- Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS- 152 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~---~~~~~pl~~- 152 (510) ..|. + +++...+++|.....++.++..++|.+.+++..+.. 
++++++..+ T Consensus 73 --~~d~-------------~-----------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~ 126 (409) T protein:vir:16 73 --NDDF-------------T-----------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNA 126 (409) T ss_pred --Ccch-------------H-----------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccce Confidence 1111 1 233457789999999999999999998877654322 466666654 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccccc Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRW 232 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y 232 (510) +++--+..+++...++...- ..+.+ .+.+.... ++. ...++..++.......++ T Consensus 127 ~~i~D~~~~~~~~a~~~~~~------------------d~~~~--~~~~~~~~--~~~----~~~~~~~~~~~~~~~~~~ 180 (409) T protein:vir:16 127 TGIIDPITGLLTEGYAVLER------------------DENNN--VVLEAHFL--PDR----TDYYYRDSRNNISIANPT 180 (409) T ss_pred EEEeecccccceeeeEEEEe------------------cCCCc--eEEEEEEe--cCc----EEEEEecCccccceecCC Confidence 44443334555443332110 00011 11111111 111 001111222222223444 Q ss_pred ccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCcccchhhhccCCCcee- Q lcl|NC_012418. 233 PIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMGDY- 307 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~---~~~g~~~p~~~~~~~~g~~- 307 (510) ..||++.+..+...++.||+|=. +..++=+..+|+..-..+..++..+.|...+ .+||.. .+.+.... 
|.+ T Consensus 181 --g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~-~~~~~~~~-~~i~ 256 (409) T protein:vir:16 181 --GNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEP-METWKATV-SSML 256 (409) T ss_pred --CCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCc-cchhhhhh-hHhh Confidence 46999999999888999999944 5677888899999888888999999886555 222211 11111111 111 Q ss_pred -ecCCccc--ccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHHH-------HHHHHHHHHH Q lcl|NC_012418. 308 -VPGGAEA--VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEV-------RITAEEAENT 371 (510) Q Consensus 308 -~pg~~~~--v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtEi-------~~r~~E~~~~ 371 (510) +|...++ ++.-++. .++++. -++.++.-|+...+... +.....+ -+|.-| ..++++|... T Consensus 257 ~~~~d~~g~~~~v~q~~-~~~l~~---~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~ 332 (409) T protein:vir:16 257 QFTKDEDGDKPTLGQFT-QPSMSP---FTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRS 332 (409) T ss_pred ccCCCCCCCCceEEecC-CCChhH---HHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHH Confidence 2322111 2222222 234443 35555555554443322 1111122 233222 3467778888 Q ss_pred hchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccc--ccee--eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhc Q lcl|NC_012418. 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQH--KPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR 447 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~--~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~ 447 (510) +|..+.++. +.++.+.. +....+++.. ++.- +..-+.-+.|+.+..+.-+.+ ...+++. T Consensus 333 fg~~l~~~~--------rla~~~~~--~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~---a~~~~~~---- 395 (409) T protein:vir:16 333 LGAGLLNVA--------YLAACLRD--DVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQ---AIPEFIN---- 395 (409) T ss_pred HHHHHHHHH--------HHHHHHhc--CCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHh---hcccccc---- Confidence 888776552 33333333 3333333322 2211 223333334444444443322 1112211 Q ss_pred cCHHHHHHHHHHHcCCCHhH Q lcl|NC_012418. 
448 ISLPKMMDTIWAAFSVDTSQ 467 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~ 467 (510) . +.+.+.+|..... T Consensus 396 --~----~v~~~~~g~~~~d 409 (409) T protein:vir:16 396 --K----DTIRDLTGIKGAE 409 (409) T ss_pred --h----hHHHHhccCCCCC Confidence 1 1234444544332 No 126 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=88.61 E-value=0.031 Score=28.84 Aligned_cols=377 Identities=11% Similarity=0.033 Sum_probs=166.8 Q ss_pred hccchHHHHHHHHHhcccccCCC-CCCcccc---ccccccchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcc Q lcl|NC_012418. 14 DGSVEQRAIEFAKTTLPYLMVDP-MSGSRGV---VEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADS 89 (510) Q Consensus 14 r~~~~~~w~e~~~~~lP~~~~~~-~~~~~~~---~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~ 89 (510) =+-+.++-+.+.+|..-..-.++ +..-... ..+..-+-+..++++||..|. ..+ | ..+|. T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~----~~G---f--~~~d~------- 64 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLI----FRA---F--ANDDF------- 64 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhc----ccc---c--cCCCc------- Confidence 11122233333444332110000 0000000 112334555666666665443 111 1 11111 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeC--CC-CcEEEEEece-EEEeeCCCCCeEE Q lcl|NC_012418. 90 RDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DE-ATVVAWSLRS-YAVRRDATGRWMD 165 (510) Q Consensus 90 ~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~--~~-~~~~~~pl~~-~~i~~d~~G~vd~ 165 (510) + +++...+++|.....++.++..++|.+.+++.. +. -++++++..+ +++--+..+++.. 
T Consensus 65 ------~-----------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~ 127 (410) T protein:vir:95 65 ------N-----------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLLVE 127 (410) T ss_pred ------h-----------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCceEE Confidence 1 223356789999999999999999998877643 32 2467776555 4444333455544 Q ss_pred EEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeee Q lcl|NC_012418. 166 IVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWN 245 (510) Q Consensus 166 i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~ 245 (510) .++... +. +.+....+.+|. ++ ..+++.-++..-..+.++ +.||++.+..+ T Consensus 128 al~~~~---~~--------------~~~~~~~~~~~~-----~~-----~~~~~~~~~~~~~~~~~~--g~vPvV~f~n~ 178 (410) T protein:vir:95 128 GYAVLA---RD--------------DYNRPTLEAYFE-----PN-----ATHFIPKDGEPYSVTNET--GIPLLVPVIHR 178 (410) T ss_pred EEEEEE---ec--------------CCCeEEEEEEEe-----CC-----cEEEEeeCCccccccCCC--CCcceEEeccc Confidence 333211 00 001111122221 11 112222222211123344 46999999999 Q ss_pred ecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCcccchhhhccCCCcee--ecCCccc--ccc Q lcl|NC_012418. 246 LAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMGDY--VPGGAEA--VRA 317 (510) Q Consensus 246 ~~~g~~YGrgp~-~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~---~~~g~~~p~~~~~~~~g~~--~pg~~~~--v~~ 317 (510) ...++.||+|=. +..++=+..+|+..-..+..++..+.|...+ +++|...+ ..+... +.+ +|...++ ++. 
T Consensus 179 ~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~-~~~~~~-~~i~~~~~~~~~~~~~v 256 (410) T protein:vir:95 179 PDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPME-KWKATV-SSLLTISSSDKGVKPSV 256 (410) T ss_pred ccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcCc-hhhhhh-hhheeccCCCCCCcceE Confidence 888999999943 5677888899999888888999988885554 23332111 111111 111 2222111 122 Q ss_pred cccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCC-CCHHH-------HHHHHHHHHHHhchhHhHHHHHHH Q lcl|NC_012418. 318 YERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEE-------VRITAEEAENTLGGTYSLLAENLQ 384 (510) Q Consensus 318 ~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-~TAtE-------i~~r~~E~~~~LGpv~~rl~~E~l 384 (510) -++ ..++++. -++.++.-|+...+... +.....+ -+|.- ...++++|...+|.-+.++ T Consensus 257 ~q~-~~~~l~~---~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~----- 327 (410) T protein:vir:95 257 GQF-TTASMSP---FTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNV----- 327 (410) T ss_pred Eec-CCCChHH---HHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Confidence 222 2245553 34555555544443321 1111122 23322 2346677777788766654 Q ss_pred HHHHHHHHHHHhhcCCCCCCcccccceee-e---cHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHH Q lcl|NC_012418. 385 SPLAYVCLSEVDDALLQGLITKQHKPAIE-T---GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAA 460 (510) Q Consensus 385 ~Pli~r~~~il~~~~l~~~~~~~~~~~~v-~---~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~ 460 (510) ++.++.+.. +....+.+..+...+ . ..+.-+.|+.+..+.-+.+ +..+++ +. +.+.+. T Consensus 328 ---~rla~~i~~--~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~---a~~g~~------~~----~~~~~~ 389 (410) T protein:vir:95 328 ---AYVAACLRD--EFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQ---ALPGYI------NA----ETIRDL 389 (410) T ss_pred ---HHHHHHHhc--CCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHH---hccCCc------cH----HHHHHh Confidence 233344433 333333333332211 1 2222233444444433222 222221 11 224456 Q ss_pred cCCCHhHccCCHHHHHHHHHHHHHHHH Q lcl|NC_012418. 
461 FSVDTSQFYKSEEELQAEAEQRRQQAA 487 (510) Q Consensus 461 ~Gvp~~~i~rs~~ev~~~r~q~~q~~~ 487 (510) +|..+. ++..++.+.+++..+ T Consensus 390 lg~~~~------~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 390 TGIAGD------MSAKPVVSEGGSNGE 410 (410) T ss_pred cCCChH------HHHHHHHHHHHhCCC Confidence 665432 222222222222111 No 127 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=88.60 E-value=0.031 Score=28.83 Aligned_cols=413 Identities=14% Similarity=0.036 Sum_probs=146.0 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHh--cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTT--LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~--lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) +=+.|...++. ++.++ ...+++|+-- +|.+-............++..+-+...++.++..|.+- |+. +. T Consensus 9 ~~~~l~~~~~~-~~~r~-~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~------~~~-~~ 79 (456) T protein:vir:10 9 WLPVLTKRIDD-GMSRV-RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN------GIT-VG 79 (456) T ss_pred HHHHHHHHHHH-HHHHH-HHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccC------Cee-cC Confidence 11112222221 11111 1223333321 11110000011111122344556666666666655422 222 22 Q ss_pred CC-hHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCCC-cEEEEEeceEE Q lcl|NC_012418. 79 LT-DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA-TVVAWSLRSYA 154 (510) Q Consensus 79 ~~-d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~~-~~~~~pl~~~~ 154 (510) .. |... . ..+++.+.++++.....++.++...+|.+.+++. ++.. 
++++++..+.+ T Consensus 80 ~~~d~~~-------------~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~ 139 (456) T protein:vir:10 80 GSADSDL-------------A-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMV 139 (456) T ss_pred CCCCcch-------------H-------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeE Confidence 11 1110 0 1123345677888999999999999999866554 3322 46777666644 Q ss_pred EeeC-CCCC-eEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-ccccc Q lcl|NC_012418. 155 VRRD-ATGR-WMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGR 231 (510) Q Consensus 155 i~~d-~~G~-vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~ 231 (510) +..| ..++ +...+|.++ .... -+. ......++.....|..+....... .....+ .++... ..... T Consensus 140 ~i~d~~~~~~~~~~i~~~~-~~d~----~~~----~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~ 207 (456) T protein:vir:10 140 VSVDPLQPWRIRAAMRWWR-DLDA----ESD----FAIVWSGDGWQKFARPCFVQSSSR--RRLVTR-ISDSWVPVGDAV 207 (456) T ss_pred EEEcCCCCcceEEEEEEEE-ecCC----cee----EEEEEeccceeEEEEEEEEeeccc--ceeeee-cCCceeeccccC Confidence 4444 3333 333333332 1100 000 000001111222221111111110 011111 112111 11122 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC----------CCcc-cchhhhc Q lcl|NC_012418. 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD----------EAKG-AVVDDYQ 300 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~----------~~g~-~~p~~~~ 300 (510) ..+..+|++.. .+..|.|-.+..++-+..++...-..+..++..+.|...+. .+|- .++.... 
T Consensus 208 ~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~ 281 (456) T protein:vir:10 208 VTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIF 281 (456) T ss_pred CCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhh Confidence 22234555433 23468898888888888888776666666655555533221 1110 0111111 Q ss_pred cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCCCCHHHHH-------HHHHHH Q lcl|NC_012418. 301 DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERVTAEEVR-------ITAEEA 368 (510) Q Consensus 301 ~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~~TAtEi~-------~r~~E~ 368 (510) ....|.+.... .+.+..++. .++++.....++.+.+.| +.... +..+..+.|+.-|. .+++++ T Consensus 282 ~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~---~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~ 356 (456) T protein:vir:10 282 EAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQL---SSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDR 356 (456) T ss_pred hhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHH---HhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHH Confidence 11112221111 112222222 134443333344333333 32211 11222334555332 233444 Q ss_pred HHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCccccccee--eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHh Q lcl|NC_012418. 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLD 445 (510) Q Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~ 445 (510) ...+|.-+ .+.+.++.. .|.. ....+++.. ..+-+.++.|+-+.++. +. ++... T Consensus 357 ~~~f~~~l------------~~~~rl~~~~~g~~--~~~~~~v~w~~~~~~~~~~~ada~~kl~------~~--gi~~~- 413 (456) T protein:vir:10 357 LSIAKIGL------------EAILVKALQIEGES--VEDTVDVSFESPDRVTLGEKYSAASLAK------AA--GESWA- 413 (456) T ss_pred HHHHHHHH------------HHHHHHHHHhcCCC--cccceeEEecCCCCcCHHHHHHHHHHHH------Hc--CCChH- Confidence 44444433 344444322 2221 112233222 12223233322222221 11 21110 Q ss_pred hccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhh Q lcl|NC_012418. 
446 PRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASD 502 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~ 502 (510) +. ....+|+.++.+ .++|.+.+++++.+ +.++..+.+ +.-++. T Consensus 414 ------~~---~~~~lg~~~~~i--~~~e~er~~~e~~~--~~~~~~~~~-~~~~~~ 456 (456) T protein:vir:10 414 ------SI---RRNILNYNADQI--KQDDLDRAREQITL--FAGNPVQRP-QEDGSR 456 (456) T ss_pred ------HH---HHhhCCCCHHHH--HHHHHHHHHHHHHH--HhhhhhhcC-CCCCCC Confidence 11 223457654322 11222222222221 111111111 111111 No 128 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=88.60 E-value=0.031 Score=28.83 Aligned_cols=413 Identities=14% Similarity=0.036 Sum_probs=146.0 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHh--cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTT--LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~--lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) +=+.|...++. ++.++ ...+++|+-- +|.+-............++..+-+...++.++..|.+- |+. +. T Consensus 9 ~~~~l~~~~~~-~~~r~-~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~------~~~-~~ 79 (456) T protein:vir:10 9 WLPVLTKRIDD-GMSRV-RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN------GIT-VG 79 (456) T ss_pred HHHHHHHHHHH-HHHHH-HHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccC------Cee-cC Confidence 11112222221 11111 1223333321 11110000011111122344556666666666655422 222 22 Q ss_pred CC-hHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCCC-cEEEEEeceEE Q lcl|NC_012418. 79 LT-DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA-TVVAWSLRSYA 154 (510) Q Consensus 79 ~~-d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~~-~~~~~pl~~~~ 154 (510) .. |... . ..+++.+.++++.....++.++...+|.+.+++. ++.. 
++++++..+.+ T Consensus 80 ~~~d~~~-------------~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~ 139 (456) T protein:vir:10 80 GSADSDL-------------A-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMV 139 (456) T ss_pred CCCCcch-------------H-------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeE Confidence 11 1110 0 1123345677888999999999999999866554 3322 46777666644 Q ss_pred EeeC-CCCC-eEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-ccccc Q lcl|NC_012418. 155 VRRD-ATGR-WMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEEGR 231 (510) Q Consensus 155 i~~d-~~G~-vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~~~ 231 (510) +..| ..++ +...+|.++ .... -+. ......++.....|..+....... .....+ .++... ..... T Consensus 140 ~i~d~~~~~~~~~~i~~~~-~~d~----~~~----~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~ 207 (456) T protein:vir:10 140 VSVDPLQPWRIRAAMRWWR-DLDA----ESD----FAIVWSGDGWQKFARPCFVQSSSR--RRLVTR-ISDSWVPVGDAV 207 (456) T ss_pred EEEcCCCCcceEEEEEEEE-ecCC----cee----EEEEEeccceeEEEEEEEEeeccc--ceeeee-cCCceeeccccC Confidence 4444 3333 333333332 1100 000 000001111222221111111110 011111 112111 11122 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC----------CCcc-cchhhhc Q lcl|NC_012418. 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD----------EAKG-AVVDDYQ 300 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~----------~~g~-~~p~~~~ 300 (510) ..+..+|++.. .+..|.|-.+..++-+..++...-..+..++..+.|...+. .+|- .++.... 
T Consensus 208 ~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~ 281 (456) T protein:vir:10 208 VTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIF 281 (456) T ss_pred CCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhh Confidence 22234555433 23468898888888888888776666666655555533221 1110 0111111 Q ss_pred cCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCCCCHHHHH-------HHHHHH Q lcl|NC_012418. 301 DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERVTAEEVR-------ITAEEA 368 (510) Q Consensus 301 ~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~~TAtEi~-------~r~~E~ 368 (510) ....|.+.... .+.+..++. .++++.....++.+.+.| +.... +..+..+.|+.-|. .+++++ T Consensus 282 ~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~---~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~ 356 (456) T protein:vir:10 282 EAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQL---SSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDR 356 (456) T ss_pred hhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHH---HhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHH Confidence 11112221111 112222222 134443333344333333 32211 11222334555332 233444 Q ss_pred HHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCccccccee--eecHHHHHHHHHHHHHHHHHHHHHhhcChhhHh Q lcl|NC_012418. 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLD 445 (510) Q Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~ 445 (510) ...+|.-+ .+.+.++.. .|.. ....+++.. ..+-+.++.|+-+.++. +. ++... T Consensus 357 ~~~f~~~l------------~~~~rl~~~~~g~~--~~~~~~v~w~~~~~~~~~~~ada~~kl~------~~--gi~~~- 413 (456) T protein:vir:10 357 LSIAKIGL------------EAILVKALQIEGES--VEDTVDVSFESPDRVTLGEKYSAASLAK------AA--GESWA- 413 (456) T ss_pred HHHHHHHH------------HHHHHHHHHhcCCC--cccceeEEecCCCCcCHHHHHHHHHHHH------Hc--CCChH- Confidence 44444433 344444322 2221 112233222 12223233322222221 11 21110 Q ss_pred hccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhh Q lcl|NC_012418. 
446 PRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASD 502 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~ 502 (510) +. ....+|+.++.+ .++|.+.+++++.+ +.++..+.+ +.-++. T Consensus 414 ------~~---~~~~lg~~~~~i--~~~e~er~~~e~~~--~~~~~~~~~-~~~~~~ 456 (456) T protein:vir:10 414 ------SI---RRNILNYNADQI--KQDDLDRAREQITL--FAGNPVQRP-QEDGSR 456 (456) T ss_pred ------HH---HHhhCCCCHHHH--HHHHHHHHHHHHHH--HhhhhhhcC-CCCCCC Confidence 11 223457654322 11222222222221 111111111 111111 No 129 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=88.53 E-value=0.032 Score=28.80 Aligned_cols=311 Identities=11% Similarity=0.025 Sum_probs=118.1 Q ss_pred HHHhhcC------------cCCcccccCCChHHHhhhc------ccc----hhHHHHHHHHHHHHHHH------------ Q lcl|NC_012418. 63 LARSLFP------------TGIPFFRSELTDAIRREAD------SRD----TDITEVTAALARVDRKA------------ 108 (510) Q Consensus 63 l~~~ltp------------p~~~WF~l~~~d~~~~~~~------~~~----~~~~~v~~~L~~~e~~~------------ 108 (510) |+.++|. .-..||.- ..++.+-... ... .....|..-.+.+...+ T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~ 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPD-GNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN 79 (392) T ss_pred CcchhhhhhhcccCccccccccccccc-CchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh Confidence 4444331 00011110 0000000000 000 00011111111111111 Q ss_pred HHHHHhcCC----HHHHHHHHHHHHhhCeEEEEEeCCC-Cc-EEEEEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHH Q lcl|NC_012418. 109 TQRLFQNAS----LAVLTQVIKLLIVTGNALLYRNSDE-AT-VVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEA 180 (510) Q Consensus 109 ~~~l~~snf----y~~~~~~~~dl~~~G~~~l~~~~~~-~~-~~~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~ 180 (510) ...+.+-|- +.-+...+.++...|++.+++..+. ++ ...+|| ...-+..+.+|. 
T Consensus 80 ~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~------------------ 141 (392) T protein:vir:74 80 QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYEN------------------ 141 (392) T ss_pred hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCc------------------ Confidence 011222222 3334445556677777776654332 21 233444 233333333332 Q ss_pred hhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeeecCCCccccchHHHH Q lcl|NC_012418. 181 YKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDY 260 (510) Q Consensus 181 ~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~ 260 (510) .+++.++. ++........+..++ ++..|+...+|..||.||..-+ T Consensus 142 -------------------------------~~~y~~~~--~~~~~~~~~~~~~~e--vih~~~~~~~~~~~G~s~i~~~ 186 (392) T protein:vir:74 142 -------------------------------GMYYNITF--DDPKIEPILQAPQSD--LIHMKLLSIDGGKTGISPLYSL 186 (392) T ss_pred -------------------------------eEEEEEEe--cCCccceeEEEcCcc--EEEecCCCCCCccccccHHHHH Confidence 12222211 111111111111122 4555666667778999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCcccchhh--------hccCCC--c-eeecCCcccccccccCcccchHH Q lcl|NC_012418. 261 IGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAVVDD--------YQDAEM--G-DYVPGGAEAVRAYERGDYNKMAA 328 (510) Q Consensus 261 L~d~r~L~~l~~~~l~~~~~a~~p~~l~~-~~g~~~p~~--------~~~~~~--g-~~~pg~~~~v~~~~~~~~~~~~~ 328 (510) ...+.......+.......-...|..++. +++....+. +....+ + .+++++. .+.++... ..+.+. T Consensus 187 ~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~-~~~~l~~~-~~d~q~ 264 (392) T protein:vir:74 187 RRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLE-EFTALEIK-SNVAQL 264 (392) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCc-eEEEccCC-hhHHHH Confidence 99999999999888888888888876654 333322221 111111 1 1122222 22333322 234443 Q ss_pred HHHHHHHHHHHHHHHhhcccc-CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccc Q lcl|NC_012418. 
329 IQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ 407 (510) Q Consensus 329 ~~~~i~~~~~~I~~af~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~ 407 (510) .+..+..+..|-++|=..-. -.+...-|.. +.+..+-....|.|.+.++.+|+-.-|+.. ...+ T Consensus 265 -~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~-------------~~~~ 329 (392) T protein:vir:74 265 -LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSS-IQQISGMYASALNRYLRPAISELEYKLSDH-------------ISVN 329 (392) T ss_pred -HHHHHHHHHHHHHHhCCCHHHhCCCCCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhccch-------------hccc Confidence 45566777889898833210 0111111211 111222344556676666666653332211 0001 Q ss_pred ccceeeecHHHHHHHHHHHHHH--------HHHHH-------------HHhhcChh-----hHhh Q lcl|NC_012418. 408 HKPAIETGLPALSRSAAVQSML--------NASQV-------------IAGLAPIA-----QLDP 446 (510) Q Consensus 408 ~~~~~v~~is~L~raq~~~~~~--------~~~q~-------------l~~~~~~~-----q~~~ 446 (510) +.. .-..+...++..+..+. .+... ...+.+.+ +=.| T Consensus 330 ~~~--~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 330 MRP--AIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSNEPVP 392 (392) T ss_pred chh--hhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCCCCCCCCCCCC Confidence 110 00111122222221111 00000 01222211 1112 No 130 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=83.72 E-value=0.066 Score=27.05 Aligned_cols=412 Identities=13% Similarity=0.004 Sum_probs=146.3 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHh--cccccCCCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTT--LPYLMVDPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~--lP~~~~~~~~~-~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) .-+.|.+.|+. +.+.....+++|+-- +++. +..... ......+...+-+..+++.+++.|++- ++ +. 
T Consensus 9 ~~~~l~~~~~~--~~~r~~~l~~Yy~g~~~i~~~-~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~------g~-~~ 78 (456) T protein:vir:79 9 WLPVLTKRIDD--GMSRVRLLARYSNGDAPLPEL-TRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN------GI-TV 78 (456) T ss_pred HHHHHHHHHHH--HHHHHHHHHHHHhccCChhhc-CcccChhhchhhhhhhcchHHHHHHHHHhhhccC------Ce-ec Confidence 11112222321 111112223333321 1111 000011 111112234556677777777666433 21 22 Q ss_pred CCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCC-CcEEEEEeceEE Q lcl|NC_012418. 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSLRSYA 154 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~-~~~~~~pl~~~~ 154 (510) ...++. + +.+ .+.+.+.+++|.....++.++...+|.+.+++ +++. .++++++..+.+ T Consensus 79 ~~~~d~---------~---~~~-------~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~ 139 (456) T protein:vir:79 79 GGSADS---------D---LAL-------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMV 139 (456) T ss_pred CCCCCc---------c---HHH-------HHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeE Confidence 221110 0 111 12334566789999999999999999986554 4332 246666555544 Q ss_pred EeeC-CCCC-eEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEec-Ce-eecccc Q lcl|NC_012418. 155 VRRD-ATGR-WMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEID-GV-RVGEEG 230 (510) Q Consensus 155 i~~d-~~G~-vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~-~~-~~~~~~ 230 (510) +..| ..++ +...+|.+. .... .. ....-..++..+..+..+....+. .++. .... +. ....+. 
T Consensus 140 ~i~d~~~~~~~~~~~~~~~-~~d~----~~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~~~~ 206 (456) T protein:vir:79 140 VSVDPLQPWRIRSAMRWWR-DLDA----ES----DFAIVWSGDGWQKFARPCFVQSSS-RRRL---VTRISDSWVPVGDA 206 (456) T ss_pred EEEcCCCCCceEEEEEEEE-ecCC----ce----eEEEEEcCCceEEEEEEEEeeccc-ccee---eeccCCceeecccc Confidence 4444 3443 444444432 1110 00 000001112222222221111111 1111 1111 11 111222 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC----------Ccc-cchhhh Q lcl|NC_012418. 231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE----------AKG-AVVDDY 299 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~----------~g~-~~p~~~ 299 (510) ...+..+|++.++ ...|.|=.+..++-+-.++...-..+..++..+.|...+.- .|- .++... T Consensus 207 ~~~~~~~pvv~~~------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~ 280 (456) T protein:vir:79 207 VVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASI 280 (456) T ss_pred cCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhh Confidence 2223456766542 35678877777776666666554545555555555433310 010 011111 Q ss_pred ccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc-----cCCCCCCCCHHHHHH-------HHHH Q lcl|NC_012418. 300 QDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERVTAEEVRI-------TAEE 367 (510) Q Consensus 300 ~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~~TAtEi~~-------r~~E 367 (510) .....|.+..+ ..+.+..++. ..+++... +.++.-|...+.... +..+..+.++.-+.. +.+. 
T Consensus 281 ~~~~~~~~~~~-~~~~~~~q~~-~~~~~~~~---~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~ 355 (456) T protein:vir:79 281 FEAAPGALWEL-PPGVDIWESQ-TNDFTPML---SAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCED 355 (456) T ss_pred hhhhccccccC-CCCcceeeec-ccChHHHH---HHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHH Confidence 11111111111 1112222222 12343333 333344433332221 112223446654433 3344 Q ss_pred HHHHhchhHhHHHHHHHHHHHHHHHHHHh-hcCCCCCCccccccee--eecHHHHHHHHHHHHHHHHHHHHHhhcChhhH Q lcl|NC_012418. 368 AENTLGGTYSLLAENLQSPLAYVCLSEVD-DALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPIAQL 444 (510) Q Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~-~~~l~~~~~~~~~~~~--v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~ 444 (510) +...+++ -+.+.+.++. -.|..+ ...+++.. ..+.+.+++|+-+.++. +. ++... T Consensus 356 ~~~~f~~------------~l~~~~~l~~~~~g~~~--~~~i~v~w~~~~~~s~~~~ada~~kl~------~~--G~~~~ 413 (456) T protein:vir:79 356 RLSIAKI------------GLEAILVKALQIEGESV--EDTVDVSFESPDRVTLGEKYSAASLAK------AA--GESWA 413 (456) T ss_pred HHHHHHH------------HHHHHHHHHHHhcCCCc--cccceEEeCCCCCcCHHHHHHHHHHHH------hc--CCChH Confidence 4444444 3444444432 233221 12233222 12233333333332221 11 21110 Q ss_pred hhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcc Q lcl|NC_012418. 445 DPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNAL 507 (510) Q Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ 507 (510) +. ....+|+++..+- ++|.+.+++++. +++...+.. 
.+.+.+- T Consensus 414 -------~~---~~~~lg~~~~~i~--~~e~~r~~~e~~------~~~~~~~~~--~~~~~~~ 456 (456) T protein:vir:79 414 -------SI---RRNILNYNADQIK--QDDLDRAREQIT------LFAGNPVQR--PQEDGSR 456 (456) T ss_pred -------HH---HHhcCCCCHHHHH--HHHHHHHHHHHH------HHhhhHhhc--CCCCCCC Confidence 11 2234566543221 112112222111 111111111 1111111 No 131 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=82.73 E-value=0.074 Score=26.77 Aligned_cols=455 Identities=9% Similarity=-0.018 Sum_probs=179.0 Q ss_pred HHHHHHHHhhccchH------HHHHHHHHhcccccCCCCCCccccccccccchHHHH-HHHHHHHHHHhhcCcCCccccc Q lcl|NC_012418. 5 AAMLWEKLRDGSVEQ------RAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALL-VNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 5 ~~~r~~~lkr~~~~~------~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a-~~~LAa~l~~~ltpp~~~WF~l 77 (510) .++. + ...|+. .|+.-++=+.=+..|.-....+....+ ...+..++ .-.....|.++|++.=.| T Consensus 1 m~~~---~-~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~-~~~~~~dst~~~a~~~Laa~l~~~ltp---- 71 (555) T protein:vir:17 1 MKHS---A-QAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGG-YLPTPWQSVGSKGVNVLASKLMLSLFP---- 71 (555) T ss_pred ChhH---H-HHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccc-cccccccccHHHHHHHHHHHHHHhhcC---- Confidence 1111 1 122322 244333333323334322222211111 22223222 334667788888875333 Q ss_pred CCChHHHhhhcccchh------HHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEec Q lcl|NC_012418. 78 ELTDAIRREADSRDTD------ITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLR 151 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~------~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~ 151 (510) +..---.+...+.. ..+++.|++..-..+.+.+...-.++.....+.++.. 
+ .+-.| T Consensus 72 --p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~--------~-------L~~~G 134 (555) T protein:vir:17 72 --VNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMK--------H-------LIVTG 134 (555) T ss_pred --CCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHH--------H-------HHhHC Confidence 22111122222211 2356777776655555555544444433333333311 1 01113 Q ss_pred eEEEeeCCCC-CeEEEEEEEeecHHH------hh-------HHhhHhhhhhhhc-c--CCCc-----eEEEEEEEEeecC Q lcl|NC_012418. 152 SYAVRRDATG-RWMDIVLKQRYKSKD------LD-------EAYKQDLMRAGRN-L--SGSG-----SVDLYTHVQRKKG 209 (510) Q Consensus 152 ~~~i~~d~~G-~vd~i~r~~~~t~~~------l~-------~~~~~~~~~~~~~-~--~~~~-----~v~i~~~v~~~~~ 209 (510) +-++-+|.++ ++-.+ .+|.+..+. +. ..+.+...+.... . +..+ .+.++|.+.+++. T Consensus 135 ~a~ly~~~~~~~~~pl-~~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~ 213 (555) T protein:vir:17 135 NALLYQGKKNLKLYPL-DRFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDK 213 (555) T ss_pred eEEEEecCCceeEEEc-CeEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhccccc Confidence 3333333332 11111 112222111 11 1111111111100 0 0011 1245566666665 Q ss_pred CCceEEEEEEEec---Ceeeccccccc------cccCceEEEeeeecCCC-ccccchHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_012418. 210 TAMEYAELYHEID---GVRVGEEGRWP------IHLCPYIVPTWNLAPGE-HYGRGHVEDYI-GDFAKLSLLSEKLGLYE 278 (510) Q Consensus 210 ~~~p~~sv~~e~~---~~~~~~~~~y~------~~~~P~~~~Rw~~~~g~-~YGrgp~~~~L-~d~r~L~~l~~~~l~~~ 278 (510) ++.....+|-++. +......+.-+ ..+.||--.=|....=+ +.|--.|.-.. .-+-.+..+.+. .+.. 
T Consensus 214 ~~~~~~~v~t~~~~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l-~~~~ 292 (555) T protein:vir:17 214 GKSNDALVYTYVCRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEAL-SQAM 292 (555) T ss_pred CCCcceeEeecccccCCeeEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHH-HHHH Confidence 5555555554321 11110000000 12334433333333222 11222333222 222333333333 3333 Q ss_pred HHhhCCceeeCCCcccchhhhccCCCceeecCCcccccccccCcccchHHH-------HHHHHHHHHHHHHHhhccccCC Q lcl|NC_012418. 279 LESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAI-------QQSLQAVVVRLNQAFMYGANQR 351 (510) Q Consensus 279 ~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~-------~~~i~~~~~~I~~af~~~~~~~ 351 (510) .+++. ..++|--+..|+.+.. ...+.||..+.+.+-. ..++..+ +..++...+.|+ ..+-++++- T Consensus 293 l~~~~--~~~~pp~lv~~~g~~~--~~~l~~~~~g~v~~g~---~~~v~~~~~~~~~~~~~~~~~i~~~~-~~I~~aFm~ 364 (555) T protein:vir:17 293 VEGSA--ASAKVVFMVSPSATTK--PQNLALAANGAIIQGR---PDDVSVVQANKAADFRTVLEMIQKLE-QRISDAFLM 364 (555) T ss_pred HHHHH--HHhCCceeeccccccC--cceeecCCCceeecCC---cccceeeeccccchhhHHHHHHHHHH-HHHHHHHhh Confidence 33333 2455533444554433 2467888877764322 2233222 344555566664 355666765 Q ss_pred CCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHh-----hc-CCC---CCCcccc-cceeeecHHHHHH Q lcl|NC_012418. 352 DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVD-----DA-LLQ---GLITKQH-KPAIETGLPALSR 421 (510) Q Consensus 352 ~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~-----~~-~l~---~~~~~~~-~~~~v~~is~L~r 421 (510) ... .++.-+ -+.|.... ..--..+|+|++.|+-..+. |. .+. ...|..= ....+++++++.. 
T Consensus 365 ~~~-~d~~r~--TAtEV~~r-----~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~ 436 (555) T protein:vir:17 365 LQV-RQSERT--TATEVQAT-----VQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWG 436 (555) T ss_pred cCC-CCcccc--hHHHHHHH-----HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHH Confidence 543 455443 23443222 12223467788877765432 21 121 1222211 1235789999999 Q ss_pred HHHHHHHHHHHHHHHhhcChhhHhhcc-CHHHHHHHHHHHcCCCHhHccCCHHHH----HHHHHHHHHHHHHHHHhhHHH Q lcl|NC_012418. 422 SAAVQSMLNASQVIAGLAPIAQLDPRI-SLPKMMDTIWAAFSVDTSQFYKSEEEL----QAEAEQRRQQAAQAQAAQETL 496 (510) Q Consensus 422 aq~~~~~~~~~q~l~~~~~~~q~~~~i-d~d~~~~~~a~~~Gvp~~~i~rs~~ev----~~~r~q~~q~~~~~~~~~~~~ 496 (510) .++.+.+..+.++++.++++.. .|.+ |.=+ .+.+.+.+.-- +==++..+ ++.++.+++++++++ ++++ T Consensus 437 l~r~~~~~~l~~~~~~laq~~~-~p~~~d~id-~d~~~~~~a~~---~Gv~p~~ivrs~eev~~~rq~~~~~~~--q~~~ 509 (555) T protein:vir:17 437 VGRGQDKQQLMEFITTLAQTMG-PEIAMKYIN-PTEFIKRLAAA---QGIDTLQLINSPETMKQLGDQQKQDMV--QASL 509 (555) T ss_pred HHHHHHHHHHHHHHHHHHhhcC-chhHhhcCC-HHHHHHHHHHH---cCCChhhhcCCHHHHHHHHHHHHHHHH--HHHH Confidence 9999999888888777665333 1221 1101 11222222210 00022222 222233333332222 3333 Q ss_pred hhhhhhhhhcccCC Q lcl|NC_012418. 497 LEGASDMTNALAGV 510 (510) Q Consensus 497 ~~ga~~~~~~~ag~ 510 (510) ..-+.+...++++. T Consensus 510 ~~qa~~~~~~~~~~ 523 (555) T protein:vir:17 510 INQAGQLAKTPMAE 523 (555) T ss_pred HHHHHHHHhhhhhh Confidence 33344444445444 No 132 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=82.16 E-value=0.079 Score=26.62 Aligned_cols=435 Identities=9% Similarity=-0.008 Sum_probs=173.7 Q ss_pred Chh------HHHHHHHHHhhccchHHHHHHHHHhcccccCCCC------CCcccccccc--ccchHHHHHHHHHHHHHH- Q lcl|NC_012418. 
1 MKS------TAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPM------SGSRGVVEHD--FQSAGALLVNNLAAKLAR- 65 (510) Q Consensus 1 ~~~------~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~~------~~~~~~~~~~--~dstg~~a~~~LAa~l~~- 65 (510) .+. .....|+.-..++--..|. ..|.....+. ..-..+...+ -++.+..+++.+++.+++ T Consensus 19 ~~~~~~~~~~~~~~y~aa~~~r~~~~w~-----~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~ 93 (505) T protein:vir:96 19 WYRYVEPQKNAARAFEAARRDRLGKAWL-----RRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNNVIGP 93 (505) T ss_pred hhhhHHHHHHhhhhcccccCCCcccccc-----CCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhcCC Confidence 111 1112233222111111221 1222211110 1111122223 377899999999999996 Q ss_pred -hhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE-eCCCC Q lcl|NC_012418. 66 -SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR-NSDEA 143 (510) Q Consensus 66 -~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~-~~~~~ 143 (510) |++|..++.......++.+.+. -....+.|.+.- -+.+=.+.+||.....++...+.-|-+++-. ..+.. T Consensus 94 ~Gi~~~~~~~~~~~~~~~~~~~~-----ie~~w~~Wa~~~---~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~ 165 (505) T protein:vir:96 94 KGMTFQSRVKRRNGKPDDRANTL-----IEGNWQQWIKKG---NCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPN 165 (505) T ss_pred CcceeeecCCcccccccHHHHHH-----HHHHHHHhcCCc---CcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCC Confidence 8999998876655545544321 112234443311 1133345679999999999999988765322 11111 Q ss_pred --cEEEEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEe Q lcl|NC_012418. 144 --TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEI 221 (510) Q Consensus 144 --~~~~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~ 221 (510) +++.- -+..+.|+... ....++.. .|..=|+.+. .+.|. 
.+|+.- T Consensus 166 ~~~~~lq----------------------liepd~l~~~~--------n~~~~~~~-~i~~GIe~d~-~Gr~~-aY~i~~ 212 (505) T protein:vir:96 166 KWGYALQ----------------------ILECDRLDLNY--------NADLQNGN-RIRMSIELDA-WERPV-AYHLLV 212 (505) T ss_pred CcceEEE----------------------EechhhcCCCC--------CcccCCcC-eEEeceEECC-CCceE-EEEEee Confidence 11111 11111111000 00000000 1333333322 22221 112110 Q ss_pred ---cCeeecccc-ccccccCc--eEEEeee-ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCc- Q lcl|NC_012418. 222 ---DGVRVGEEG-RWPIHLCP--YIVPTWN-LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAK- 292 (510) Q Consensus 222 ---~~~~~~~~~-~y~~~~~P--~~~~Rw~-~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~-~~g- 292 (510) ++....... +..+...| -|..-|. ..+|.+=|.+.-.-+|..++.|.....+.+.++..++.....+. +.+ T Consensus 213 ~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~ 292 (505) T protein:vir:96 213 NHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEA 292 (505) T ss_pred cCCCccccccccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCcc Confidence 000000000 00011223 2333333 34778889999999999999999999999999988888765553 222 Q ss_pred ccchhhhccC-CCceeecCCcc------cccccccCc-ccchHHHHHHHHHHHHHHHHHh--hccccCCCCCCCCHHHHH Q lcl|NC_012418. 293 GAVVDDYQDA-EMGDYVPGGAE------AVRAYERGD-YNKMAAIQQSLQAVVVRLNQAF--MYGANQRDAERVTAEEVR 362 (510) Q Consensus 293 ~~~p~~~~~~-~~g~~~pg~~~------~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af--~~~~~~~~~~~~TAtEi~ 362 (510) +..+..-..+ ..-.+-||... +++....+. .++|. 
.-...+...|..++ -+..+..|-..++-.-++ T Consensus 293 ~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~---~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R 369 (505) T protein:vir:96 293 YDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFG---AFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLR 369 (505) T ss_pred CCCccccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhcccccccHHHHH Confidence 2211110000 00011233322 233333222 23332 11222233333333 112233444444444344 Q ss_pred HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccc----ccee----eecHHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 363 ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQH----KPAI----ETGLPALSRSAAVQSMLNASQV 434 (510) Q Consensus 363 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~----~~~~----v~~is~L~raq~~~~~~~~~q~ 434 (510) .-..|....+--.=..+..-|+.|+.++++..+.-.|.+|+|.... +... ...++|+--++..... T Consensus 370 ~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~------ 443 (505) T protein:vir:96 370 SGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSES------ 443 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHH------ Confidence 3333433333333334455677788888887776666666664321 1111 1233443222211000 Q ss_pred HHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHh-----hhhhhhhhcccC Q lcl|NC_012418. 435 IAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLL-----EGASDMTNALAG 509 (510) Q Consensus 435 l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~-----~ga~~~~~~~ag 509 (510) +. +++... ..++...|.+++ +.++++....+. ++.. .+..... .+..+.....+| T Consensus 444 i~--~G~~t~----------~~~~a~~G~D~~------~v~~q~a~e~~~-~~~~-Gl~~~~~~~~~~~~~~~~~~~~~~ 503 (505) T protein:vir:96 444 IK--NRTRSR----------SSIIRAAGDDPE------DVFDEIAWEEQL-MRDK-GVNPTPPEQESKDATTDEEDDSAS 503 (505) T ss_pred HH--cCCCCH----------HHHHHHcCCCHH------HHHHHHHHHHHH-HHHc-CCCCCCCCCCCCCCCCCCCCCCCC Confidence 00 011111 112223454432 111122211111 1111 1100000 000000000000 Q ss_pred C Q lcl|NC_012418. 
510 V 510 (510) Q Consensus 510 ~ 510 (510) = T Consensus 504 d 504 (505) T protein:vir:96 504 D 504 (505) T ss_pred C Confidence 0 No 133 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=79.98 E-value=0.099 Score=26.09 Aligned_cols=390 Identities=10% Similarity=0.005 Sum_probs=151.7 Q ss_pred ChhHHHHHHHHHhh--ccchHHHHHHHHHhcccc---cCCCCCCccccc--cccc-cchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAAMLWEKLRD--GSVEQRAIEFAKTTLPYL---MVDPMSGSRGVV--EHDF-QSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~~r~~~lkr--~~~~~~w~e~~~~~lP~~---~~~~~~~~~~~~--~~~~-dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) ||+...+++.+++- ..|... .++ ...|+. +....+...... .... .++--.|++.+|+.+.+ - T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~--~~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~------l 71 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGV--PIS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIAT------L 71 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCC--ccc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhh------C Confidence 99999999988752 233221 000 000111 110001111000 1112 23334456666665543 2 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCeEEEEEeCCCCcE-E Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDEATV-V 146 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~-~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~~~~-~ 146 (510) ||.-....+..-.+. + .+..++..|. +-| .+.-....+.+|...||+.+++..+.++. . 
T Consensus 72 p~~~~~~~~~g~~~~---------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~g~~~~ 136 (437) T protein:vir:10 72 PLNLYQTKPDGTRVL---------A------KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSAGVLIG 136 (437) T ss_pred ceeEEEEcCCCceee---------c------cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEE Confidence 553222211110000 0 0112222233 333 44445666777888999998887655542 3 Q ss_pred EEEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCe Q lcl|NC_012418. 147 AWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGV 224 (510) Q Consensus 147 ~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~ 224 (510) .||| ....+..+.+|.+- +. |...+|. T Consensus 137 L~~l~p~~v~i~~~~~g~~~--------------------------------------------------y~-~~~~~g~ 165 (437) T protein:vir:10 137 LELMLPQRTTVKRLTSGALQ--------------------------------------------------YT-YRNVDGT 165 (437) T ss_pred EEEEcCcceEEEECCCCeEE--------------------------------------------------EE-EEecCce Confidence 4555 44444444433221 10 1111222 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCC- Q lcl|NC_012418. 225 RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE- 303 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~- 303 (510) ... +..++ ++..|....+ ..||.||..-+...+.....+.+.......-...|-.++.-++.+.++...... T Consensus 166 ~~~----~~~~d--Iih~r~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 238 (437) T protein:vir:10 166 VST----LAEDD--VFHVRGFSLD-GLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRT 238 (437) T ss_pred EEE----Ecccc--EEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHH Confidence 110 00011 2333433323 389999999999999888888888887777777787777655666664432111 Q ss_pred ----------C-c-e-eecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccc--cC-CCCCCCCHHHHHHHHHH Q lcl|NC_012418. 
304 ----------M-G-D-YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA--NQ-RDAERVTAEEVRITAEE 367 (510) Q Consensus 304 ----------~-g-~-~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~--~~-~~~~~~TAtEi~~r~~E 367 (510) | | . +++++. ...++.. +..+.+. .+..+-.+..|-++|=... +. .+....+.+-+.+.... T Consensus 239 ~~~~~~~g~~nag~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~ 315 (437) T protein:vir:10 239 DLAEQFGGAMQAGKTMVLEAGM-KYQAITM-NPGDVQL-LETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLG 315 (437) T ss_pred HHHHHhcCccccCcceeccCCc-eEEeccC-ChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH Confidence 0 1 1 122222 2223322 1234443 3344555678888884321 11 11112222333332222 Q ss_pred HHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeee-cHHHHHHHHHHHHHHHHHHHHHhhcChhhHhh Q lcl|NC_012418. 368 AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET-GLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~-~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~ 446 (510) +...-|.|++.+.-..|.+..+++...... ++. .++.|-|+--.+....+...+.. ++ T Consensus 316 -----------f~~~tl~P~~~~ie~~l~~kll~~~e~~~~---~~~fd~~~ll~~d~~~r~~~~~~~~~~--G~----- 374 (437) T protein:vir:10 316 -----------FLTFTLRPWLTRIEQAARRSLLRPGERDQF---YAEFSVEGLLRADSAGRAAFYSTMTQN--GL----- 374 (437) T ss_pred -----------HHHHHHHHHHHHHHHHHHhhccCccccCce---EEEEechhhhccCHHHHHHHHHHHHhC--CC----- Confidence 123334555555555555554433222211 121 22333332111112111111211 10 Q ss_pred ccCHHHHHHHHHHHcCCCHhHccCCHHHHH----------HHHH-HHHHHHHHHHHh--hHHHhhhhhhhhh Q lcl|NC_012418. 447 RISLPKMMDTIWAAFSVDTSQFYKSEEELQ----------AEAE-QRRQQAAQAQAA--QETLLEGASDMTN 505 (510) Q Consensus 447 ~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~----------~~r~-q~~q~~~~~~~~--~~~~~~ga~~~~~ 505 (510) +..+ ++-+.+|.|| + ...+++- ..-+ +-.... +.+.. ......+.++++. 
T Consensus 375 -~T~N----E~R~~~gl~p--i-~gg~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 375 -MTRD----ECRAKENLPP--M-GGNAAVLTVQSALLPIDKLGEHTTATAA-QDALKAWLYQEEKTRATQER 437 (437) T ss_pred -cCHH----HHHHHhCCCC--C-CCCcceEeecCcccchhhccCcCCCcch-hccccccCCCCCCCCccccC Confidence 1111 1122233332 1 1111100 0000 000000 00000 0001112222222 No 134 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=78.95 E-value=0.11 Score=25.86 Aligned_cols=409 Identities=11% Similarity=-0.020 Sum_probs=141.1 Q ss_pred ccccc--cchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHH--HHHH-HHHHhcCCH Q lcl|NC_012418. 44 VEHDF--QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVD--RKAT-QRLFQNASL 118 (510) Q Consensus 44 ~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e--~~~~-~~l~~snfy 118 (510) +..+. +++.-.|++.+|..+.+ .||- +...+..... .........+..+|...+ ..+. ..+....+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~------~p~~-i~~~~~~~~~-~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~ 72 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAG------FGIN-IIPHPEAEDP-DRDGEQYERVWDFWFGDDSNWQVGPMESERATAT 72 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhc------CCeE-EEEccCcccc-cchhhhhhhHHHHhhccCCCccccchhhHhhHHH Confidence 44432 45666777777777742 2332 1111100000 000001111111111110 0000 011122345 Q ss_pred HHHHHHHHHHHhhCeEEEEEeCCC--CcEEEEEeceEEEe--eCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCC Q lcl|NC_012418. 119 AVLTQVIKLLIVTGNALLYRNSDE--ATVVAWSLRSYAVR--RDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSG 194 (510) Q Consensus 119 ~~~~~~~~dl~~~G~~~l~~~~~~--~~~~~~pl~~~~i~--~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~ 194 (510) .-+.....++..+||+.+++..+. .....+||..-.|. .|..+.+. .+ ... 
T Consensus 73 ~~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~-~~------------------------~~~ 127 (467) T protein:vir:31 73 NVLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQ-LL------------------------EEK 127 (467) T ss_pred HHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEe-ec------------------------CCc Confidence 566778888999999998875442 24566777443332 22221110 00 000 Q ss_pred CceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 195 SGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKL 274 (510) Q Consensus 195 ~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~ 274 (510) ...+.++...+.....+ ...-++.+.+.......-.++ .--.+..|.....+..||.+|..-++..+.......+.. T Consensus 128 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~--~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 204 (467) T protein:vir:31 128 EKYFGVAGDRYQTNGNG-DLDPVFVDADDGSTGTSVSNP--ANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYN 204 (467) T ss_pred eeeEEeccccceeeccc-ceeeeeeeeccccccceeEec--cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 11111111111111111 111122222211111111111 112455666666678999999999999888777776666 Q ss_pred HHHHHHhhCCceeeC-CCcccchhhhccCCC-------c------------------eeecCCcc----cccccccC--c Q lcl|NC_012418. 275 GLYELESLEVLNLVD-EAKGAVVDDYQDAEM-------G------------------DYVPGGAE----AVRAYERG--D 322 (510) Q Consensus 275 l~~~~~a~~p~~l~~-~~g~~~p~~~~~~~~-------g------------------~~~pg~~~----~v~~~~~~--~ 322 (510) .....-...|..++. +++.++++....... | .+.++... .++...+. . 
T Consensus 205 ~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~ 284 (467) T protein:vir:31 205 IDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGI 284 (467) T ss_pred HHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccC Confidence 655555556655542 455555543221110 0 01111111 01101110 1 Q ss_pred ccchHHHHHHHHHHHHHHHHHhhcc----ccCCCCCCCC-HHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012418. 323 YNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVT-AEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD 397 (510) Q Consensus 323 ~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~~T-AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~ 397 (510) ..+.+ ..+.....+..|.++|=.. ....++..-| +++... .=....|.|.+.++..+|-.-|+.+..... T Consensus 285 ~~d~q-f~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~--~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~-- 359 (467) T protein:vir:31 285 DEEAS-FLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRK--EFAEETIQPKQHDFGELLYELVHKQGLDAP-- 359 (467) T ss_pred hhhHH-HHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcchhhccC-- Confidence 11211 2244455667788888322 1111221111 222222 223344556555555443333321111100 Q ss_pred cCCCCCCcccccceeeecHHHHHHHHHHHH-----HHHHHHHHHhhcChhhHhhc-cCHHHHHHHHHHHcC--CCHhHcc Q lcl|NC_012418. 398 ALLQGLITKQHKPAIETGLPALSRSAAVQS-----MLNASQVIAGLAPIAQLDPR-ISLPKMMDTIWAAFS--VDTSQFY 469 (510) Q Consensus 398 ~~l~~~~~~~~~~~~v~~is~L~raq~~~~-----~~~~~q~l~~~~~~~q~~~~-id~d~~~~~~a~~~G--vp~~~i~ 469 (510) ...-.+....+...+...++.-... +.+.......++ ++.+.+. ........ +...| .|.. T Consensus 360 -----~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~G-l~pi~d~~~~~~~~~~--~~~~~~~~~~~--- 428 (467) T protein:vir:31 360 -----DWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFG-FEPFPEEHVYGGETLV--AEVTGGSGPGG--- 428 (467) T ss_pred -----CceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC-CCCCCcccccCCcccc--cccccccCCCC--- Confidence 0000111111222344444332222 122222222221 1111110 00000000 00001 0100 Q ss_pred CCHHHHHH----HHHHHHHHHHHHHHhhHHHhhhhhhhh Q lcl|NC_012418. 
470 KSEEELQA----EAEQRRQQAAQAQAAQETLLEGASDMT 504 (510) Q Consensus 470 rs~~ev~~----~r~q~~q~~~~~~~~~~~~~~ga~~~~ 504 (510) .++++..+ ..++.-...+....-+.....|+.+-. T Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 429 GIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred cccCcCCCCCCCcccchHhhhhhccccchhhhhccccCC Confidence 00000000 000000000000001112223333322 No 135 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=77.84 E-value=0.12 Score=25.62 Aligned_cols=403 Identities=11% Similarity=0.005 Sum_probs=168.4 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccccCCCC----C---Ccc-cccccc----------ccchHHHHHHHHHHH Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPM----S---GSR-GVVEHD----------FQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~~----~---~~~-~~~~~~----------~dstg~~a~~~LAa~ 62 (510) -=..+..+|+..+. .+...=+...+-.||.....+. + ... .+.++. |=+.-.+.++ . T Consensus 22 ~y~a~~~~W~~~~d-~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~tl~----~ 96 (488) T protein:vir:96 22 DYLVNAPQWLRNLD-CVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNPTMN----A 96 (488) T ss_pred HHHHHhhhhhHhhh-hhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHHHHH----H Confidence 11123344443322 2333334444555665321111 1 000 111111 2222233333 3 Q ss_pred HHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCC Q lcl|NC_012418. 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE 142 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~ 142 (510) |++.+|- .++ .++.+ ...+++.++++| -....+.+.-+.....+...+|-+.+++|-+. 
T Consensus 97 l~G~vfr--k~p-~~~~~------------~~~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~ 155 (488) T protein:vir:96 97 ITGAVMR--REP-EFDTM------------DNPVLIGLRDNI------DGKGNGIDQECKQALNALQWGSRCGWLVRSHP 155 (488) T ss_pred hcchhhc--cCc-eeccC------------CcHHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEecCC Confidence 4444432 111 11111 112355566555 24567788888999999999999999998653 Q ss_pred C--------------cEEEEEeceE-EEeeCC---CCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEE Q lcl|NC_012418. 143 A--------------TVVAWSLRSY-AVRRDA---TGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 143 ~--------------~~~~~pl~~~-~i~~d~---~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v 204 (510) . .+..|+-.+. =+..+. ...+.-+..++....++ .......+.+... T Consensus 156 ~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D---------------~~~~~~~~~~~~~ 220 (488) T protein:vir:96 156 ESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERD---------------GGTYVSKQRLINH 220 (488) T ss_pred CcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEecc---------------CCCcccceEEEEE Confidence 2 1445554432 122222 22344454555443221 0001111222221 Q ss_pred EeecCCCceEEEEEEEecCeee----ccccccccccCceEEEeeeecCCCccccc-hHHHHHHHHHHHHHH---HHHHHH Q lcl|NC_012418. 205 QRKKGTAMEYAELYHEIDGVRV----GEEGRWPIHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLL---SEKLGL 276 (510) Q Consensus 205 ~~~~~~~~p~~sv~~e~~~~~~----~~~~~y~~~~~P~~~~Rw~~~~g~~YGrg-p~~~~L~d~r~L~~l---~~~~l~ 276 (510) ...++. +.+|...++... ...++ .+.+++|++.|-...+..+..| |- |=|+..||.- ..+-++ T Consensus 221 ~l~~g~----~~v~~~~~~~~~~e~~~~~~g--~~~l~~IP~v~~~~~~~~~~~~~pP---LldLA~lnl~Hy~~ssd~~ 291 (488) T protein:vir:96 221 RLVDGL----CEFQEVTDDEYSDEWTPVLIN--SKQSDTIPFFLASSQSNEWCIDSTP---LTSLAEISLSIYVMNAYSN 291 (488) T ss_pred EEECcE----EEEEEEecCCcccceEeecCC--CcccCeeEEEEEecCCCCCCCCCCc---hHHHHHHHHHHHhhhhHHH Confidence 122221 455443333221 11222 2356777777776666655444 22 2255555543 333333 Q ss_pred HH-HHhhCCceeeCCCcccchhhhccCCCceeecCC-------cccccccccCcccchHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012418. 
277 YE-LESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG-------AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA 348 (510) Q Consensus 277 ~~-~~a~~p~~l~~~~g~~~p~~~~~~~~g~~~pg~-------~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~ 348 (510) .+ ..+.-|+|+...++. .++.......+.+..|. .++.+..+.+ ++ ..+.+.+++++++++++=.. + T Consensus 292 ~il~~~~~p~lv~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~-~~--~l~~~~l~~l~~qm~~~Ga~-l 366 (488) T protein:vir:96 292 KAMILANEAKWMVDMGDM-NKTMASEMNPLGFTLAGRMPYYVKNGDVKVIQAQ-FS--PETENKVEKLFEQAVKVGAS-L 366 (488) T ss_pred HHHHhcCCceeeeccCCC-CcccccccccceeeecccccccccCCceeecCCc-hh--HHHHHHHHHHHHHHHHHhHh-h Confidence 33 344455555432232 22211111111111111 1122222221 11 12466677777776553221 2 Q ss_pred cCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCC--CCcccccceee-ecHHHHHHHHH Q lcl|NC_012418. 349 NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQG--LITKQHKPAIE-TGLPALSRSAA 424 (510) Q Consensus 349 ~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~--~~~~~~~~~~v-~~is~L~raq~ 424 (510) .+.. .+.|||+...+.+.--..|..+...+.+- +++++.++.+ -|... ..+..++..+- .+...---++. T Consensus 367 ~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~le~a-----l~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~ 440 (488) T protein:vir:96 367 FTQQ-SNETATGAAIRSGSSTASMATLGNNVEDT-----VRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQM 440 (488) T ss_pred ccCC-CcchHHHHHHHHHHhhHHHHHHHHHHHHH-----HHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHHH Confidence 2333 34799999999999999999888877654 3445555433 12111 11222222110 11110001111 Q ss_pred HHHH--------HHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCH Q lcl|NC_012418. 
425 VQSM--------LNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDT 465 (510) Q Consensus 425 ~~~~--------~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~ 465 (510) +..+ .+..-+...+-.---+.|.+++++..+++.+ -|+.- T Consensus 441 ~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~-~g~~~ 488 (488) T protein:vir:96 441 LQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAE-LGFGM 488 (488) T ss_pred HHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhh-cCCCC Confidence 1111 1111122222111123455677777777653 33221 No 136 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=75.95 E-value=0.14 Score=25.25 Aligned_cols=346 Identities=11% Similarity=0.019 Sum_probs=135.7 Q ss_pred ChhHHHHHHHHHhhccchH-HH-HHHHHHhcccccCCCCCCccccc-cccc-cchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQ-RA-IEFAKTTLPYLMVDPMSGSRGVV-EHDF-QSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~-~w-~e~~~~~lP~~~~~~~~~~~~~~-~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) |+ .|+++...+..+ .+ ..+..+..|..+... ..+..-. .+.. .++--.|++.+|+.+.+. ||- T Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l------~~~- 67 (384) T protein:vir:49 1 MP-----IFNITNLATESPPSNQDSFFDITDPEFLDAL-NGSEWVSAETALKNSDLFSIISQLSNDLATA------KIT- 67 (384) T ss_pred Cc-----cccccccCcccccccchhhccccchhhcccc-cCCceechhhhhccHHHHHHHHHHHHHHhhC------cee- Confidence 43 233321111110 00 112233333332211 1111000 1112 333334455544444422 321 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-C-cEEEEEe Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL 150 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~-~~~~~pl 150 (510) +.+.... . .+.+-| .+.=+...+.++...|++.+++..+. 
+ ....+|+ T Consensus 68 --~~~~~~~---------------------~---l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l 121 (384) T protein:vir:49 68 --TSRKQLQ---------------------G---IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYL 121 (384) T ss_pred --eecchhh---------------------h---hhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEE Confidence 1111100 0 111222 33444566677888999988876443 2 2344555 Q ss_pred --ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecc Q lcl|NC_012418. 151 --RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGE 228 (510) Q Consensus 151 --~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~ 228 (510) ..+-+..+.++. ..++.++. ++..... T Consensus 122 ~~~~v~v~~~~~~~-------------------------------------------------~~~y~~~~--~~~~~~~ 150 (384) T protein:vir:49 122 RPSQVSFNRLDNQN-------------------------------------------------GLYYNITF--DDPRIPP 150 (384) T ss_pred cCceeEEEEcCCCc-------------------------------------------------eEEEEEEe--cCccccc Confidence 334333332221 11111111 1111001 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhh--------- Q lcl|NC_012418. 229 EGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY--------- 299 (510) Q Consensus 229 ~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~--------- 299 (510) ...+..++ ++..|+....+..||.||...+...+.......+.......-...|..++.-++....+.. T Consensus 151 ~~~~~~~e--Vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~ 228 (384) T protein:vir:49 151 KQHVPQGD--ILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQA 228 (384) T ss_pred eeEecCcc--EEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHh Confidence 11111111 4555655566779999999999999999998888888887777788777654444433211 Q ss_pred --ccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--ccC-CCCCCCCHHHHHHHHHH-HHHHhc Q lcl|NC_012418. 
300 --QDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RDAERVTAEEVRITAEE-AENTLG 373 (510) Q Consensus 300 --~~~~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~~~~~TAtEi~~r~~E-~~~~LG 373 (510) .+...-.+++++. ++.++.. +..+.+. .+..+..++.|-++|=.. .+. ..+..-|++.+.+.... ....|- T Consensus 229 ~~~n~~~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~ 305 (384) T protein:vir:49 229 MKQMQGGPLVLDDLE-DFTPLEI-KSNVAQL-LSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLR 305 (384) T ss_pred cccCCccceecCCCc-eEEEccC-ChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHH Confidence 1111111222222 2333332 2234442 456677888999988322 111 12223455554433322 223466 Q ss_pred hhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc--ceeeecHHHHHHHHHHHHHH-------HHHHHHHhhcChh-- Q lcl|NC_012418. 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK--PAIETGLPALSRSAAVQSML-------NASQVIAGLAPIA-- 442 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~--~~~v~~is~L~raq~~~~~~-------~~~q~l~~~~~~~-- 442 (510) |+.+++..+|..-+..-...... +.+..+. ...+..-+...+++-...+. .+ +.+-...+++ T Consensus 306 pi~~~i~~~l~~~l~~~~~~~~~------~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~-r~~~~~~p~~gG 378 (384) T protein:vir:49 306 PFVSELSKKLSCEVDADILPAVD------PTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDL-PEGETDSTLKGG 378 (384) T ss_pred HHHHHHHHHhchhhhhhhhhhhh------ccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhH-HHHcCCCCCCCC Confidence 77777766654322100000000 0000000 00000000111111111110 01 1111222221 Q ss_pred hHhhcc Q lcl|NC_012418. 
443 QLDPRI 448 (510) Q Consensus 443 q~~~~i 448 (510) ..+.+- T Consensus 379 d~~~~~ 384 (384) T protein:vir:49 379 ETNEQY 384 (384) T ss_pred CCCCCC Confidence 111111 No 137 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=70.00 E-value=0.21 Score=24.24 Aligned_cols=433 Identities=14% Similarity=0.047 Sum_probs=178.6 Q ss_pred Ch---------hHHHHHHHHHhhcc-chHHHHHHHHHhcccccCCCCCCccccccc-cccchHHHHHHHHHHHHHHhhcC Q lcl|NC_012418. 1 MK---------STAAMLWEKLRDGS-VEQRAIEFAKTTLPYLMVDPMSGSRGVVEH-DFQSAGALLVNNLAAKLARSLFP 69 (510) Q Consensus 1 ~~---------~~~~~r~~~lkr~~-~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ltp 69 (510) .+ ..+..+|+.++..- =....++...-.||.....+...-..++.+ .|-+.-.+.++.++..+..- | T Consensus 6 ~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf~k--~ 83 (513) T protein:vir:97 6 PKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPFSE--P 83 (513) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhhhc--C Confidence 11 12334444332110 022233343444553211111111222222 45566667777776544431 2 Q ss_pred cCCcccccCCChHHHhhhcccchhHHHHHH-HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCC----- Q lcl|NC_012418. 70 TGIPFFRSELTDAIRREADSRDTDITEVTA-ALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----- 143 (510) Q Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~-~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~----- 143 (510) |. .|.. . .+.+.+ ++++| -....+++.-+...+.+...+|-+.+++|-+.. 
T Consensus 84 p~-~~~~--~--------------p~~~~~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~ 140 (513) T protein:vir:97 84 IK-LNED--V--------------PKAIEETILPDV------DLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPRED 140 (513) T ss_pred cc-cCcC--c--------------hHHHHHHHhhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccc Confidence 21 1111 1 112333 33343 235667888888899999999999888985421 Q ss_pred ---------------c-EEEEEeceE---EEe-eCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEE Q lcl|NC_012418. 144 ---------------T-VVAWSLRSY---AVR-RDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTH 203 (510) Q Consensus 144 ---------------~-~~~~pl~~~---~i~-~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~ 203 (510) + +..|+-.+. -.. .|..+.+.-+..++....+ +.|... .++.|.+ T Consensus 141 ~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~---Dgf~~~------------~~~q~rv 205 (513) T protein:vir:97 141 GQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQ---DGFAEV------------CKRRIRV 205 (513) T ss_pred hhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeec---CCCcce------------EEEEEEE Confidence 1 455554442 121 2444445555556655422 112211 1222222 Q ss_pred EEeecCCCceEEEEEEEecC-------eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHH---HHH Q lcl|NC_012418. 204 VQRKKGTAMEYAELYHEIDG-------VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLL---SEK 273 (510) Q Consensus 204 v~~~~~~~~p~~sv~~e~~~-------~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l---~~~ 273 (510) +. ++. +.+|-..++ -.+...++ +.+++|++.|....+..+..| .--|=|+..||.- ..+ T Consensus 206 L~--~g~----~~v~r~~~~~~~~~~e~~~~~~g~---~~l~~IP~v~~~~~~~~~~~~--~pPLl~LA~ln~~hy~~~S 274 (513) T protein:vir:97 206 LE--PGL----VQLWEPVKKSNAQKEEWALADEWA---TGLNYVPLVTFYADRQGFMMG--KPPLLDLAHLNVAHWQSAS 274 (513) T ss_pred Ee--Cce----EEEEEeecCCCccccceEEecCCC---CcCCceeEEEEecCCCCCCCC--ccchHHHHHHHHHHHhhhh Confidence 21 221 333322211 12222332 346777777766555544333 2223366666654 333 Q ss_pred HHHHH-HHhhCCceeeCCCcccc--hhhhccCCCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_012418. 
274 LGLYE-LESLEVLNLVDEAKGAV--VDDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN 349 (510) Q Consensus 274 ~l~~~-~~a~~p~~l~~~~g~~~--p~~~~~~~~g~~-~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~ 349 (510) -++.+ ..+..|.+.+. |... .+.+..+++..+ .|+.....+-++. .+..+......+.++++.++++= +.+. T Consensus 275 d~~~il~~~~~P~l~~~--G~~~~~~~~i~iG~~~~~~lpe~~~~~~yie~-~g~~i~~~~~~l~~le~qm~~~G-a~ll 350 (513) T protein:vir:97 275 DQRHILTVSRFPILACS--GASGEDSDPVVVGPNKVLYNPDPAGRFYYVEH-TGQAIAAGRTDLKDLEEQMAGYG-AEFL 350 (513) T ss_pred hHHHHHHhcccceeeee--cCCcCCCCceEeeccccccCCCCCCcceeecc-CchhHHHHHHHHHHHHHHHHHHH-HHhh Confidence 33333 34444443332 2211 123444444333 3432233333333 24567778889999999987654 2233 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhc-CCCCCCcccccceee-ecHHHHHHHHHHHH Q lcl|NC_012418. 350 QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIE-TGLPALSRSAAVQS 427 (510) Q Consensus 350 ~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~-~l~~~~~~~~~~~~v-~~is~L~raq~~~~ 427 (510) +......|||+.+.+.+..-..|.-+...+++ .+++++.++.+- |. -++.++..+- .+...---++.+.. T Consensus 351 ~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~-----al~~~l~~~a~wlg~---~~~~~~v~in~dF~~~~~~~~~~~a 422 (513) T protein:vir:97 351 KRKTGGQTATARALDSAEATSDLSAMTGLFED-----ALAQALDITADWLRL---GPNGGTVELVKDYDLEEMDAPGLQA 422 (513) T ss_pred ccCCccccHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhCC---CCCccEEEeccccCcccCCHHHHHH Confidence 33344589999999999999999988877654 335555555331 11 1222222221 11111111111111 Q ss_pred HHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCC-CHhHccCCHHHHHHHHHHHHHHHHHH----HHhh--------- Q lcl|NC_012418. 428 MLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSV-DTSQFYKSEEELQAEAEQRRQQAAQA----QAAQ--------- 493 (510) Q Consensus 428 ~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv-p~~~i~rs~~ev~~~r~q~~q~~~~~----~~~~--------- 493 (510) +.+... ++ .|....+.+++-+ .|| ++. + ..+++.+.++++-+.+.... ..+. 
T Consensus 423 ---l~~a~~--~G------~is~~t~~~~L~r-~gvl~~d-~-d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 488 (513) T protein:vir:97 423 ---LQVARE--KR------DISRKTYLNGLRL-RGVLPED-F-DEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEG 488 (513) T ss_pred ---HHHHHh--CC------CCCHHHHHHHHHh-ccCCCcc-C-CHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCC Confidence 111111 11 1222233333332 333 221 1 11222222222211100000 0000 Q ss_pred --HHHhhhhhhhhhcccCC Q lcl|NC_012418. 494 --ETLLEGASDMTNALAGV 510 (510) Q Consensus 494 --~~~~~ga~~~~~~~ag~ 510 (510) .....|+..+..--+|- T Consensus 489 ~~~~~~~~~~~~~~~~~~~ 507 (513) T protein:vir:97 489 EGEGEGEGGEGGEGGEGGG 507 (513) T ss_pred CCCCCCCCCCCCCccccCC Confidence 00000000011000111 No 138 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=68.54 E-value=0.24 Score=24.02 Aligned_cols=292 Identities=9% Similarity=0.016 Sum_probs=115.9 Q ss_pred eEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeee-ccc---cccccccC- Q lcl|NC_012418. 163 WMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GEE---GRWPIHLC- 237 (510) Q Consensus 163 vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~-~~~---~~y~~~~~- 237 (510) |-++++++. +.. .....+.+++.+...++ .+..++..+ ... .+....++ T Consensus 1 v~Eivw~~~-----------------------~g~-~~~~~l~~r~~~~~~~f--~~~~~~~l~~~~~~~~~g~~~~~lp 54 (355) T protein:vir:78 1 MFEQVYRIE-----------------------NGR-ARLGKLAWRPPRTISRF--DVAPDGGLVAIEQWGVFGKATVRIP 54 (355) T ss_pred CeEEEEEee-----------------------CCe-EEEeeeeecCccceeee--eeccCCceeEEEecCCCCCCcceec Confidence 222222110 000 11111222222211111 122233221 111 11111112 Q ss_pred --ceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCC-ceeeCCCcccc--hh--------------- Q lcl|NC_012418. 
238 --PYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV-LNLVDEAKGAV--VD--------------- 297 (510) Q Consensus 238 --P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p-~~l~~~~g~~~--p~--------------- 297 (510) =|++.|....+|+.||.|....+..-..--+...+..+..+++-..| |+..-|.|... .+ T Consensus 55 ~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l 134 (355) T protein:vir:78 55 VDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEG 134 (355) T ss_pred cCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHH Confidence 28999999999999999999999999988898999999998875444 33332322111 10 Q ss_pred -----hhccC-CCceeecCCcccccccccC-cccchHHHHHHHHHHHHHHHHHhhccccCC----CCCCCCHHHHHHHHH Q lcl|NC_012418. 298 -----DYQDA-EMGDYVPGGAEAVRAYERG-DYNKMAAIQQSLQAVVVRLNQAFMYGANQR----DAERVTAEEVRITAE 366 (510) Q Consensus 298 -----~~~~~-~~g~~~pg~~~~v~~~~~~-~~~~~~~~~~~i~~~~~~I~~af~~~~~~~----~~~~~TAtEi~~r~~ 366 (510) .+..+ ..|.++|-+. .+..++.. ..++ ....|+-+.+.|+++++....+- ++......|++.... T Consensus 135 ~~~~~~i~~g~~a~~iip~g~-~ie~~ea~g~~~~---~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~ 210 (355) T protein:vir:78 135 LQLAKEFRAGEAAGGYIPHGA-NFTLTGVQGKLPE---MDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFF 210 (355) T ss_pred HHHHHHhhCCcceeEeecCCc-eEEEeecCCCccc---HHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHH Confidence 01111 1244566433 34444432 2233 34678999999999998764332 122234466643222 Q ss_pred -HHHHHhch-hHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhH Q lcl|NC_012418. 367 -EAENTLGG-TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQL 444 (510) Q Consensus 367 -E~~~~LGp-v~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~ 444 (510) .+...-.- +-+.++..++.||+..-|. ...+.|.-.+. .+.. +...+....+.+..++. 
T Consensus 211 ~~~~~aD~~~i~~~ln~~li~~l~~lN~~-----~~~~~P~~~~~-----~~~~-----~~~~~a~~~~~l~~~G~---- 271 (355) T protein:vir:78 211 TGSLNAVMKHIADVTQQHVVEDLVDQNWG-----PEEPAPRLVPA-----QLGK-----EQPVTAEAIRALVECGA---- 271 (355) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----CCCCCCEEEec-----CcCh-----hHHHHHHHHHHHHhCCC---- Confidence 22222111 1122233333344332111 11111111111 1111 11111222233333322 Q ss_pred hhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhc------------------ Q lcl|NC_012418. 445 DPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNA------------------ 506 (510) Q Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~------------------ 506 (510) .+..+....++.+.+|+|.. - ..+++++.-.+.... .++..+......++...++. T Consensus 272 --~~~~~~~~~~~~e~~gip~p-~-~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~~~~~~~~~ 345 (355) T protein:vir:78 272 --FTADPELEKDLRARYGLPAP-A-ERDDGADAAAAKAAG--RRRAKRLPGQRQGAALPSRSPRADPPRRRGPLRRRPRH 345 (355) T ss_pred --ccccHHHHHHHHHHhCCCCC-C-CCCcccCCccccccc--cccccccCCccccccccccCCCCCChhhhHHHHHHhhc Confidence 23334556778889998743 1 122222111100000 00000000000000001111 Q ss_pred -------ccC Q lcl|NC_012418. 507 -------LAG 509 (510) Q Consensus 507 -------~ag 509 (510) .+| T Consensus 346 ~~~~~~~~~~ 355 (355) T protein:vir:78 346 PAHRRCAPDG 355 (355) T ss_pred cccCCCCCCC Confidence 111 No 139 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=67.40 E-value=0.25 Score=23.86 Aligned_cols=415 Identities=13% Similarity=0.039 Sum_probs=144.3 Q ss_pred Ch--hHHHHHHHHHhhccchH-HHH--HHHHHhcccccCCCCCCccccc--cccc-cchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 
1 MK--STAAMLWEKLRDGSVEQ-RAI--EFAKTTLPYLMVDPMSGSRGVV--EHDF-QSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~--~~~~~r~~~lkr~~~~~-~w~--e~~~~~lP~~~~~~~~~~~~~~--~~~~-dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) |= +.+.+|...-.....+. .|. +-+.+.+ +. .+..+ ... .... .++--.|++.+|+.+.+. T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~g-~~V~~~~al~~~~V~~~v~~Ia~~iA~l------ 69 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNL---GA-VAASG-ETVTPHDALQVSAVFASVRLLSETIATL------ 69 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhh---cc-cccCC-ceechHHhhccHHHHHHHHHHHHhhccC------ Confidence 21 11222221100001111 110 0011111 00 00011 110 0111 334445666666665442 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhc----CCHHHHHHHHHHHHhhCeEEEEEeCCCCc-EEE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQN----ASLAVLTQVIKLLIVTGNALLYRNSDEAT-VVA 147 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~s----nfy~~~~~~~~dl~~~G~~~l~~~~~~~~-~~~ 147 (510) |+--..-.+... .++. ...++..++.. +.+.-+..++.++...||+.+++..+.++ ... T Consensus 70 p~~~~~~~~~~~----------~~~~------~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~~~~l 133 (457) T protein:vir:13 70 PLSTYSKRGGSR----------KEIV------TPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPNIVGL 133 (457) T ss_pred ceEEEEecCCcc----------cccc------cchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEE Confidence 332111111100 0011 11223334432 23445666777888899999887655443 344 Q ss_pred EEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCee Q lcl|NC_012418. 148 WSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVR 225 (510) Q Consensus 148 ~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~ 225 (510) +|| ..+.+..+..+... ...|..+.+..++.. 
T Consensus 134 ~~l~p~~v~v~~~~~~~~~----------------------------------------------~~~~~~y~~~~~~~~ 167 (457) T protein:vir:13 134 DVLDPTKIHVHMVMVDGLR----------------------------------------------RKVFEAYDIDADGNE 167 (457) T ss_pred EEEccCceEEEEecCCCcc----------------------------------------------ceeEEEEEEecCCce Confidence 554 23333332222110 001111112222222 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCC-- Q lcl|NC_012418. 226 VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-- 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~-- 303 (510) .. ...|. .--++..|+....|..||.||...+...+.....+.+.......-...|..++.-++.+.++...... T Consensus 168 ~~-~~~~~--~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~ 244 (457) T protein:vir:13 168 VL-LGWFT--PRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREA 244 (457) T ss_pred ee-EEeeC--ccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHH Confidence 11 11111 11255556666667789999999999999999988888888777777887777666666665322111 Q ss_pred ---------C--c-eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--ccC-CCCCCCCHHHHHHHHHH- Q lcl|NC_012418. 304 ---------M--G-DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RDAERVTAEEVRITAEE- 367 (510) Q Consensus 304 ---------~--g-~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~~~~~TAtEi~~r~~E- 367 (510) | + .+++++. +..++... ..+.+. .+..+..+..|.++|=.. +.. .+....+..-+.+.... 
T Consensus 245 ~~~~~~g~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f 321 (457) T protein:vir:13 245 WRAANSGVDNAHRVALLTEGA-KFSKVAMS-PDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAF 321 (457) T ss_pred HHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHH Confidence 1 1 1223222 22233221 234443 344456677888888332 111 11111222222222222 Q ss_pred HHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeec-HHHHHHHHHHHHHHHHHHHHHh-hcChhhHh Q lcl|NC_012418. 368 AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG-LPALSRSAAVQSMLNASQVIAG-LAPIAQLD 445 (510) Q Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~-is~L~raq~~~~~~~~~q~l~~-~~~~~q~~ 445 (510) ....|.|.+.++. ..|.+..+++.-. +-.++.+ ++.|-|.--......+...++. +.-+..+- T Consensus 322 ~~~tl~P~~~~ie------------~~ln~~L~~~~~~---~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R 386 (457) T protein:vir:13 322 TMFSLRPWLERIE------------AGFNRLLFAETAD---RFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVR 386 (457) T ss_pred HHHHHHHHHHHHH------------HHHHHhhcCcccc---CceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 2333455544444 4443333322111 1111221 2222222111111111111111 00011111 Q ss_pred hccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHH--HHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 446 PRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEA--EQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r--~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ..++++-+=...++.+=+|. .+..-.+..+... ...+.+.-..+..+.+...|...-+.+.... 
T Consensus 387 ~~~gl~Pi~~g~~d~~~~~~-n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~ 452 (457) T protein:vir:13 387 AAEDMTPLPDGLGEKYRVPL-NLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEED 452 (457) T ss_pred HHhCCCCCCCCcccceeecc-ccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCc Confidence 11111110000111111111 1111000000000 0000000000000001111211111111111 No 140 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=66.01 E-value=0.27 Score=23.66 Aligned_cols=239 Identities=12% Similarity=0.048 Sum_probs=95.2 Q ss_pred ChhHHHHHHHHH-hhc-cchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKL-RDG-SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~l-kr~-~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) |.= |.+. +|+ .....|..-.--+.|+........-. ...-+-.++--.|++.+|+.+.+. ||.-.. T Consensus 1 Mgl-----F~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~-~~~al~~~~v~~~i~~ia~~iA~l------p~~~~~ 68 (251) T protein:vir:46 1 MGI-----FYKNEKRDLQYNEDDLQMMVQTLPSFQGTKLRQYK-DIEAIRHSDIFTAVMMIASDLARM------PIRVTV 68 (251) T ss_pred CCc-----cccccccccCCCccchhhhhhhhccccCcCcceec-hhhhhccHHHHHHHHHHHHhHhhC------ceEEee Confidence 431 2221 232 12222111111113332211111100 000112344455666666666553 333221 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHH-HhcCCHH----HHHHHHHHHHhhCeEEEEEeCCC-C-cEEEEEe- Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRL-FQNASLA----VLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL- 150 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l-~~snfy~----~~~~~~~dl~~~G~~~l~~~~~~-~-~~~~~pl- 150 (510) ..... .++-+...| .+-|-+. -+.....++..+||+.+|+..+. 
+ ....+|| T Consensus 69 -~~~~~-------------------~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 128 (251) T protein:vir:46 69 -NGQIN-------------------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 128 (251) T ss_pred -Ccccc-------------------ccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEC Confidence 11100 011122223 2444443 34455667788899998876443 2 3445555 Q ss_pred -ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccc Q lcl|NC_012418. 151 -RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEE 229 (510) Q Consensus 151 -~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~ 229 (510) ...-+..|.+|++- |+ + ...++...... T Consensus 129 ~~~v~v~~~~~g~~~--~~------------------------------------------------~-~~~~~~~~g~~ 157 (251) T protein:vir:46 129 TSEIELKSDARGRLY--YF------------------------------------------------H-QRIDSNGNNIE 157 (251) T ss_pred CceEEEEECCCCcEE--EE------------------------------------------------E-EEeccCCccee Confidence 55556666666331 10 0 00000000000 Q ss_pred cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc-ccchhhhccCCCceee Q lcl|NC_012418. 230 GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDDYQDAEMGDYV 308 (510) Q Consensus 230 ~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g-~~~p~~~~~~~~g~~~ 308 (510) ..|..++ ++..|....+| .||.||...+...+...+...+.......-...|..++.-++ +.++ T Consensus 158 ~~~~~~d--iiH~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~------------ 222 (251) T protein:vir:46 158 RNVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNK------------ 222 (251) T ss_pred EEECCcc--EEEecCcCCCC-eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCH------------ Confidence 1111111 34444443334 799999999999999888888777776555444543332111 1111 Q ss_pred cCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhccccCCCCCCCCHHH Q lcl|NC_012418. 
309 PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEE 360 (510) Q Consensus 309 pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~~TAtE 360 (510) +..+.+++++++.+-..-+ ...-.+--+| T Consensus 223 ----------------------e~~~~~~~~~~~~~~g~~n-~g~~~~gm~~ 251 (251) T protein:vir:46 223 ----------------------KARDRAREEFPKVLVELNK-LGKLSYSMNQ 251 (251) T ss_pred ----------------------HHHHHHHHHHHHHhcCccc-ccccccccCC Confidence 1122233333333211000 0000000011 No 141 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=63.25 E-value=0.32 Score=23.29 Aligned_cols=422 Identities=12% Similarity=0.065 Sum_probs=167.3 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccccCCC-----CCCcccccccc--ccchHHHHHHHHHHHHHH-hhcCcCC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDP-----MSGSRGVVEHD--FQSAGALLVNNLAAKLAR-SLFPTGI 72 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~-----~~~~~~~~~~~--~dstg~~a~~~LAa~l~~-~ltpp~~ 72 (510) .+....+.|+.-.+.. +|+. .|...++. .+.-..+...+ -++.+..+++.+.+.+++ |++|..+ T Consensus 16 ~~~~~~~~y~aa~~~~---~~~~-----~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~ 87 (495) T protein:vir:10 16 LVPVGASAYEGASGGH---RWQD-----IGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNGLTPRWR 87 (495) T ss_pred hhHHHhhhhhccccCc---ccCC-----CCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCCcccccC Confidence 3333344454332211 1110 01111110 01111111222 377889999988887764 5666554 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCC--C----c Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDE--A----T 144 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~--~----~ 144 (510) + .++.+.+ .-....+.|.+.| .+-.+.+||.....++...+..|-+++-+. +.. . 
+ T Consensus 88 ~------~~~~~~~-----~ie~~w~~wa~~~-----D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~ 151 (495) T protein:vir:10 88 M------KEQELRQ-----ELQELWGDWVNEA-----DFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQ 151 (495) T ss_pred C------chHHHHH-----HHHHHHHHhhcCc-----ccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceE Confidence 4 3333322 1223345565433 334567899999999999999998764221 111 1 1 Q ss_pred EEEEEeceEEEeeC----CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEE Q lcl|NC_012418. 145 VVAWSLRSYAVRRD----ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHE 220 (510) Q Consensus 145 ~~~~pl~~~~i~~d----~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e 220 (510) ++.+.........+ ++|+. .+...+.+......-||.....++....... . T Consensus 152 lqliepd~l~~~~~~~~~~~g~~----------------------i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~---~ 206 (495) T protein:vir:10 152 LQIIEPDMLASDIPDETLPSGGY----------------------VKGGIRFSNGGKRKAYCFYRNHPAESSLIGD---P 206 (495) T ss_pred EEEechhhcCCCCCCCCCCCCCE----------------------EEeceEECCCCceEEEEEeecCCCccccccc---c Confidence 22222111111111 11110 1112222333344455544433332110000 0 Q ss_pred ecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCcccchhhh Q lcl|NC_012418. 221 IDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAVVDDY 299 (510) Q Consensus 221 ~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~-~~g~~~p~~~ 299 (510) ..-+.+.. +-+..-|.+.+|.+=|.+..- .+-.++.|+....+.+.++..++.....+. +++--..... 
T Consensus 207 ~~~~rvpA---------~~vlH~f~~r~gQ~RGis~la-~i~~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~ 276 (495) T protein:vir:10 207 VDTVWIKA---------EHVLHVTVLTVRSDAGAPWFQ-LLLRLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPT 276 (495) T ss_pred cceeeech---------hheEeccccCCCcccCcchhH-HHHHHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccc Confidence 00011111 123333556688898997654 566799999999999999888877765542 2221110000 Q ss_pred -----ccCC---CceeecCCcc------cccccccC-cccchHHHHHHHHHHHHHHHHHhh--ccccCCCCCCCCHHHHH Q lcl|NC_012418. 300 -----QDAE---MGDYVPGGAE------AVRAYERG-DYNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVR 362 (510) Q Consensus 300 -----~~~~---~g~~~pg~~~------~v~~~~~~-~~~~~~~~~~~i~~~~~~I~~af~--~~~~~~~~~~~TAtEi~ 362 (510) .... .-.+-||... +++..... ..+++. .-...+...|-.++= +..+..|-..++-.-++ T Consensus 277 ~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R 353 (495) T protein:vir:10 277 IGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYE---PWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIR 353 (495) T ss_pred cCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhcccccccHHHHH Confidence 0000 0011233222 23333222 122332 122333334545441 12233454444443333 Q ss_pred HHHHHHHHHhchhHh-HHHHHHHHHHHHHHHHHHhhcCCCCCCccc------cccee----eecHHHHHHHHHHHHHHHH Q lcl|NC_012418. 363 ITAEEAENTLGGTYS-LLAENLQSPLAYVCLSEVDDALLQGLITKQ------HKPAI----ETGLPALSRSAAVQSMLNA 431 (510) Q Consensus 363 ~r~~E~~~~LGpv~~-rl~~E~l~Pli~r~~~il~~~~l~~~~~~~------~~~~~----v~~is~L~raq~~~~~~~~ 431 (510) .=..|.....--.=. .+...|+.|+.++++....-.|.+++|+.. .+... ...++|+--++..... 
T Consensus 354 ~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~--- 430 (495) T protein:vir:10 354 AGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGD--- 430 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHH--- Confidence 333333333222111 133456778888888876555655554321 11111 1234443322211100 Q ss_pred HHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhH------HHhhhhhhhhh Q lcl|NC_012418. 432 SQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQE------TLLEGASDMTN 505 (510) Q Consensus 432 ~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~------~~~~ga~~~~~ 505 (510) +. +++... ..++...|.+++ ||...+....+.+... ++.- ....|+.+... T Consensus 431 ---i~--~G~~s~----------~~~~a~~G~D~~-------~v~~q~a~e~~~~~~~-Gl~~~~~p~~~~~~~~~~~~~ 487 (495) T protein:vir:10 431 ---VR--AGFAPI----------SDKQAERGYDME-------ELFDMISDANQLIDEY-DLRLDSDPRYVNGSGAEQKSV 487 (495) T ss_pred ---HH--cCCCCH----------HHHHHHcCCCHH-------HHHHHHHHHHHHHHHc-CCCCCCCCCcCCCccCCCCCC Confidence 00 111111 122233454442 2222222222111111 1110 01111111111 Q ss_pred cccCC Q lcl|NC_012418. 506 ALAGV 510 (510) Q Consensus 506 ~~ag~ 510 (510) ..+.. T Consensus 488 ~~~~~ 492 (495) T protein:vir:10 488 MEAAL 492 (495) T ss_pred CCCCC Confidence 11111 No 142 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=61.10 E-value=0.36 Score=23.02 Aligned_cols=432 Identities=9% Similarity=-0.029 Sum_probs=162.6 Q ss_pred Ch--hHHHHHHHHHhhccchHHHH-HHHHHhcccccCCC-----CCCcccccccc--ccchHHHHHHHHHHHHHH--hhc Q lcl|NC_012418. 1 MK--STAAMLWEKLRDGSVEQRAI-EFAKTTLPYLMVDP-----MSGSRGVVEHD--FQSAGALLVNNLAAKLAR--SLF 68 (510) Q Consensus 1 ~~--~~~~~r~~~lkr~~~~~~w~-e~~~~~lP~~~~~~-----~~~~~~~~~~~--~dstg~~a~~~LAa~l~~--~lt 68 (510) +- ..++..-.+..+..|+.--. .-..+--+....+. 
...-..+...+ -++.+..+++.+++.+++ +++ T Consensus 11 ~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~ggi~ 90 (502) T protein:vir:79 11 FSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKNGII 90 (502) T ss_pred cChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCcee Confidence 11 11110000011112211000 00000001110000 00011111223 378999999999999996 566 Q ss_pred CcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEE--eCCC---- Q lcl|NC_012418. 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE---- 142 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~--~~~~---- 142 (510) |..++=......+..+.+ .-....+.|.+.| ..=.+.+||.....++...+.-|-+++-. +++. T Consensus 91 ~~~~~~~~~~~~~~~~~~-----~ie~~w~~Wa~~~-----D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 91 VEPHPVLRNGAIARDLAA-----EIRTRWSEWSVSP-----EVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred eeeccCCCChhHHHHHHH-----HHHHHHHHhhcCc-----CccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 655542111100111110 1112233443332 22346789999999999999999876543 2211 Q ss_pred -C----cEEEEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEE Q lcl|NC_012418. 143 -A----TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAEL 217 (510) Q Consensus 143 -~----~~~~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv 217 (510) . .++.++....-...+ +| . 
..+...+.+.+....=||.....++......+ T Consensus 161 g~~~~l~lq~iepd~l~~~~~-~~---------------------~-~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~- 216 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSD-ES---------------------N-RLNQGVFVDDWGRPEKYLVYKSRPVSGRQMET- 216 (502) T ss_pred CcccceEEEEecchhcCCCCC-CC---------------------C-eeEeeeEECCCCceEEEEEeecCCCCCcccce- Confidence 1 122222211110000 00 0 00111122222223333332222221100000 Q ss_pred EEEecCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCcc-cc Q lcl|NC_012418. 218 YHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKG-AV 295 (510) Q Consensus 218 ~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~-~~g~-~~ 295 (510) ..+.- ++ ++..-....+|..=|.+...-+|..++.|..+..+.+.++..++.....+. +++- .. T Consensus 217 ------~rvpA------~~--vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~ 282 (502) T protein:vir:79 217 ------KEVDA------ER--MLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYE 282 (502) T ss_pred ------eEech------hh--eEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccc Confidence 00100 00 222223355888999999999999999999999999999987777765553 2211 00 Q ss_pred hhh--------hccCCCceeecC-C-cccccccccCc-ccchHHHHHHHHHHHHHHHHHhh--ccccCCCCCCCCHHHHH Q lcl|NC_012418. 296 VDD--------YQDAEMGDYVPG-G-AEAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVR 362 (510) Q Consensus 296 p~~--------~~~~~~g~~~pg-~-~~~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~--~~~~~~~~~~~TAtEi~ 362 (510) +.. .....+|.+++. . ...++...... .++|. .-...+...|..++= +..+..|-.. 
+-.-++ T Consensus 283 ~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~ye~lt~D~s~-nySs~R 358 (502) T protein:vir:79 283 PDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLE---TFRNGQLRAVAAGSRLSFSSTARNYNG-TYSAQR 358 (502) T ss_pred cccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhccccc-hHHHHH Confidence 100 001122332221 1 11333333221 22332 222333334444441 1122223211 222222 Q ss_pred HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc-----ce----eeecHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 363 ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK-----PA----IETGLPALSRSAAVQSMLNASQ 433 (510) Q Consensus 363 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~-----~~----~v~~is~L~raq~~~~~~~~~q 433 (510) .=..|....+--.=..+...|+.|+.++++....-.|.+++|...-. .. ....++|+--++-... T Consensus 359 ~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~------ 432 (502) T protein:vir:79 359 QELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKI------ 432 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHH------ Confidence 22222222222222234446777888888877665666665543211 11 1123444322221100 Q ss_pred HHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 434 VIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 434 ~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) .+. +++... 
..++...|.+++ +.++++.+..+.+ + ...+......+......+..+- T Consensus 433 ~i~--~Gl~t~----------~~~~a~~G~D~~------~v~~q~a~e~~~~-~-~~Gl~~~~~~~~~~~~~~~~~~ 489 (502) T protein:vir:79 433 QIR--GGAATE----------SDWVRAGGRNPD------DVKRRRKAEIDEN-R-KLDLVFDTDPASDKGGSSAATK 489 (502) T ss_pred HHH--cCCCCH----------HHHHHHcCCCHH------HHHHHHHHHHHHH-H-HcCCCCCCCCCCCCCCCCCCCC Confidence 000 111111 123334455543 1112222222211 1 1111111111111111111111 No 143 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=58.39 E-value=0.41 Score=22.68 Aligned_cols=345 Identities=10% Similarity=0.048 Sum_probs=125.1 Q ss_pred cccccchHHH--HHHHHHHHHHHhhcCcCCcccccCCChHH--Hhhhcccc----hhHHHHHHHHHHHH----------- Q lcl|NC_012418. 45 EHDFQSAGAL--LVNNLAAKLARSLFPTGIPFFRSELTDAI--RREADSRD----TDITEVTAALARVD----------- 105 (510) Q Consensus 45 ~~~~dstg~~--a~~~LAa~l~~~ltpp~~~WF~l~~~d~~--~~~~~~~~----~~~~~v~~~L~~~e----------- 105 (510) -.+|+-.... +... ....|+.+..++.. ........ .....|..-++.+. T Consensus 1 M~~f~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~ 69 (386) T protein:vir:48 1 MPIFNITNLATESPPI-----------SQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTAS 69 (386) T ss_pred Cccccccccccccccc-----------ccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeec Confidence 1222211000 0000 01111111110000 00000000 01111222111111 Q ss_pred -HHHHHHHHhcCCHHH----HHHHHHHHHhhCeEEEEEeCCCC--cEEEEEe--ceEEEeeCCCCCeEEEEEEEeecHHH Q lcl|NC_012418. 106 -RKATQRLFQNASLAV----LTQVIKLLIVTGNALLYRNSDEA--TVVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKD 176 (510) Q Consensus 106 -~~~~~~l~~snfy~~----~~~~~~dl~~~G~~~l~~~~~~~--~~~~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~ 176 (510) +.....+.+-|.+.. +...+.++...|++.+++..+.. ....+|+ ..+.+.++.+|.. 
T Consensus 70 ~~~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~------------- 136 (386) T protein:vir:48 70 RKQLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDG------------- 136 (386) T ss_pred cchhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCce------------- Confidence 112222333343333 34445567778888877654432 2333444 4454545443321 Q ss_pred hhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeeecCCCccccch Q lcl|NC_012418. 177 LDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGH 256 (510) Q Consensus 177 l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp 256 (510) .++- +..++........+..++ .+..|.....+..||.|| T Consensus 137 ------------------------------------~~y~--~~~~~~~~~~~~~~~~~e--vih~~~~~~~~~~~G~s~ 176 (386) T protein:vir:48 137 ------------------------------------IYYN--ITFDDPRIPPKQHVPQGD--VLHFKLLSVDGGLTSVSP 176 (386) T ss_pred ------------------------------------EEEE--EEecCccccceeEecCcc--EEEecCCCCCCceeeccH Confidence 1111 111111111111111112 444555556677999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccC----------CCcee-ecCCcccccccccCcccc Q lcl|NC_012418. 257 VEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA----------EMGDY-VPGGAEAVRAYERGDYNK 325 (510) Q Consensus 257 ~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~----------~~g~~-~pg~~~~v~~~~~~~~~~ 325 (510) ..-+...+..+..+.+.......-...|..++..++.+.++..... ..+.+ ++++. ++.++.. +..+ T Consensus 177 i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~-~~~~l~~-~~~d 254 (386) T protein:vir:48 177 LMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLE-EFTPLEI-KSNV 254 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCceecCCCc-eEEEcCC-ChhH Confidence 9999999999999999888888887888877766565555332211 00111 12221 2233322 1233 Q ss_pred hHHHHHHHHHHHHHHHHHhhcc--ccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCC Q lcl|NC_012418. 
326 MAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGL 403 (510) Q Consensus 326 ~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~ 403 (510) .+ ..+..+..++.|-++|=.. ++...+..-+++|-. ..-....|-|.+..+.+|+-.-|+.+ . T Consensus 255 ~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~--~~~~~~~l~P~~~~ie~~l~~~l~~~------------~ 319 (386) T protein:vir:48 255 SQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS--LDLYNKAVSRYLRPFLSELSQKLSCD------------V 319 (386) T ss_pred HH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--HHHHHHHHHHHHHHHHHHHHHhhcch------------h Confidence 33 2455667778888888322 111111111222211 11233345555555555543222211 0 Q ss_pred CcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHHHHHHHHHHHH Q lcl|NC_012418. 404 ITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEEELQAEAEQRR 483 (510) Q Consensus 404 ~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~ev~~~r~q~~ 483 (510) ..++... ..-+...++..+..+ +. ++ -+..+++ +.....-|+++. ++... T Consensus 320 -~~~~~~~--~~~d~~~~~~~~~~l------~~--~g------~~t~nE~-r~~lg~~~~~~~-------~~~~~----- 369 (386) T protein:vir:48 320 -DADILPA--VDPTGSNSVSRINSM------VK--SG------TLAQNQG-LYILQQAEILPK-------ELPEG----- 369 (386) T ss_pred -hcchhhh--hccChHHHHHHHHHH------Hh--CC------CcCHHHH-HHHhhcCCCCCc-------cchhh----- Confidence 0011000 001111222222111 11 01 1122222 222222222221 10000 Q ss_pred HHHHHHHHhhHHHhhhhhhhhhc Q lcl|NC_012418. 
484 QQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 484 q~~~~~~~~~~~~~~ga~~~~~~ 506 (510) +........|+..-.+- T Consensus 370 ------~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 370 ------ENPNKTTLKGGEINGED 386 (386) T ss_pred ------cCCCCCccCCCCCCCCC Confidence 00100111111111111 No 144 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=54.14 E-value=0.51 Score=22.18 Aligned_cols=385 Identities=11% Similarity=0.044 Sum_probs=150.1 Q ss_pred HHHHHH--hhccchHHHHHHHHHhcccccCCCCCCccccc---cccccchHHHHHHHHHHHHHHhhcCcCCcccccCCCh Q lcl|NC_012418. 7 MLWEKL--RDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVV---EHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTD 81 (510) Q Consensus 7 ~r~~~l--kr~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~---~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d 81 (510) ..|+++ ||+.......-........ |........... .-.-.++--.|++.+|+.+.+. ||--....+ T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l------~~~~~~~~~ 73 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNM-FGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKL------PIHTYKRTD 73 (416) T ss_pred CccchhcccccCccccCccchhHHHHh-hcCcccccCceechhhhhccHHHHHHHHHHHHhhhhC------ceEEEEecC Confidence 333333 2332211100011111111 111111111111 1123455556677766666533 442222222 Q ss_pred HHHhhhcccchhHHHHHHHHHHHHHHHHHHH-HhcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-C-cEEEEEece-- Q lcl|NC_012418. 82 AIRREADSRDTDITEVTAALARVDRKATQRL-FQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSLRS-- 152 (510) Q Consensus 82 ~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l-~~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~-~~~~~pl~~-- 152 (510) ....+ +. +.-++..| .+-| .+.-+.....++...|++.+|+..+. + ....|||.. 
T Consensus 74 ~~~~~----------~~------~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~ 137 (416) T protein:vir:12 74 GGIER----------KP------EHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDY 137 (416) T ss_pred Ccccc----------cc------ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcc Confidence 11110 00 00111112 2333 33345666777888999998876432 2 244566633 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccccc Q lcl|NC_012418. 153 YAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRW 232 (510) Q Consensus 153 ~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y 232 (510) .-+..+..+. ..|+.+ ..+|..+ .+ T Consensus 138 v~v~~~~~~~-------------------------------------------------~~~~~~--~~~g~~~----~~ 162 (416) T protein:vir:12 138 TNAYVHPTTG-------------------------------------------------MLWYQT--VLNGKAI----EL 162 (416) T ss_pred eEEEEeCCCc-------------------------------------------------EEEEEE--ecCCeEE----Ee Confidence 2222222221 111111 1233322 11 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhc----------cC Q lcl|NC_012418. 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ----------DA 302 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~----------~~ 302 (510) . ..-++..|+...+ ..||.||..-+...+.......+.......-...|.+++.-++.++++... ++ T Consensus 163 ~--~~eiih~~~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~ 239 (416) T protein:vir:12 163 Y--DYEVLHFKGLSTD-GIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVNKV 239 (416) T ss_pred c--CccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhcC Confidence 1 1234555655444 489999999999999998888888888777777787777655666554321 22 Q ss_pred CCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--cc--CCCCCCCCHHHHHHHHHHHHHHhchhHhH Q lcl|NC_012418. 
303 EMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN--QRDAERVTAEEVRITAEEAENTLGGTYSL 378 (510) Q Consensus 303 ~~g~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~--~~~~~~~TAtEi~~r~~E~~~~LGpv~~r 378 (510) .+-.+++++. ...++... ..+.+.+ +.....+..|-++|=.. .+ ..++..=+++|.... T Consensus 240 ~~~~vl~~g~-~~~~l~~~-~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~-------------- 302 (416) T protein:vir:12 240 ENIAIIDYGL-EYQSISMP-LQEAQFV-ESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIE-------------- 302 (416) T ss_pred CCeeecCCCc-eEEEccCC-hhhHHHH-HHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHH-------------- Confidence 2222334433 23344332 2445543 44566778888888322 11 112221223332211 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeec-HHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHH Q lcl|NC_012418. 379 LAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG-LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTI 457 (510) Q Consensus 379 l~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~-is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~ 457 (510) +...-|.|++.+....|.+..+++...... .++.+ ++.|-|+-...........+.. ++ +. .+++ T Consensus 303 f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g--~~i~fd~~~l~~~d~~~~~~~~~~~~~~--G~------~T----~NE~ 368 (416) T protein:vir:12 303 YVRNTLQPWIVNFEQELNVKLFLDHDQKSG--HYVKFNIDSELRGDSKTQAEYLKTLHET--GV------LN----KDEI 368 (416) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCchhhcCC--ceEEeechhhhccCHHHHHHHHHHHHhC--CC------cC----HHHH Confidence 223345566666666555444433222211 11111 2223222222222222222211 10 11 1222 Q ss_pred HHHcCCCHhHccCCHHHHHH-HHHHHHHHHHHHHHhhHH-HhhhhhhhhhcccC Q lcl|NC_012418. 458 WAAFSVDTSQFYKSEEELQA-EAEQRRQQAAQAQAAQET-LLEGASDMTNALAG 509 (510) Q Consensus 458 a~~~Gvp~~~i~rs~~ev~~-~r~q~~q~~~~~~~~~~~-~~~ga~~~~~~~ag 509 (510) -+.+|.|| + ..=|++-. .--........++...+. 
...| .++.=-| T Consensus 369 R~~~gl~P--i-~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~g---ge~~~~g 416 (416) T protein:vir:12 369 RELLERNP--I-ENGDKYISSLNYVFLDFLEEYQRLKAGGAMKG---GDNKNEG 416 (416) T ss_pred HHHhCCCC--C-CCcceeeeccccccccccchhhccccccccCC---CCCcCCC Confidence 33344443 1 11111000 000000000000000000 0111 1122222 No 145 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=49.58 E-value=0.63 Score=21.67 Aligned_cols=359 Identities=9% Similarity=0.059 Sum_probs=125.2 Q ss_pred cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHH- Q lcl|NC_012418. 29 LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRK- 107 (510) Q Consensus 29 lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~- 107 (510) ++- |...... .+ ......++ +...+.++ .+..+ ++..... ....|..-.+.+... T Consensus 1 M~~-f~~~~~~--~~-~~~~~~~~------~~~~~~~~---~~~~~----v~~~~al-------~~~~V~~~v~~ia~~i 56 (397) T protein:vir:38 1 MPL-LKLNKSH--SQ-GFSLNDPD------WVNFLTGG---EAQKY----VSADTAL-------KNSDIFSLIMQLSGDL 56 (397) T ss_pred Ccc-hhhhhcc--cC-cccCCchh------hhhhhcCC---cCCce----echHHhh-------ccHHHHHHHHHHHHHH Confidence 221 1100000 00 01111110 00000000 00000 1111000 011111111111111 Q ss_pred -----------HHHHHHhcC----CHHHHHHHHHHHHhhCeEEEEEeCCCC--cEEEEEe--ceEEEeeCCCCCeEEEEE Q lcl|NC_012418. 108 -----------ATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDEA--TVVAWSL--RSYAVRRDATGRWMDIVL 168 (510) Q Consensus 108 -----------~~~~l~~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~~--~~~~~pl--~~~~i~~d~~G~vd~i~r 168 (510) .+..+.+-| .+.-+..+..+|...|++.+++..+.. ....+|+ ..+.+..+.+|+. ++. 
T Consensus 57 a~~p~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~--~~y 134 (397) T protein:vir:38 57 AMVRYTSESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSG--LIY 134 (397) T ss_pred hhCcccccccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce--EEE Confidence 111121222 333445666677788998877654322 2344444 4455555555432 111 Q ss_pred EEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeeecC Q lcl|NC_012418. 169 KQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAP 248 (510) Q Consensus 169 ~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ 248 (510) ++.+ ++...+....+..++ .+..|..... T Consensus 135 ~~~~-------------------------------------------------~~~~~~~~~~~~~~e--iih~~~~~~~ 163 (397) T protein:vir:38 135 NINF-------------------------------------------------DEPAIGYMENVPAAD--VIHIRLLSKN 163 (397) T ss_pred EEEe-------------------------------------------------ccccccceeEecCcc--EEEecCCCCC Confidence 1111 000000000111111 4444555556 Q ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhcc----------CCC-c-ee-ecCCcccc Q lcl|NC_012418. 249 GEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD----------AEM-G-DY-VPGGAEAV 315 (510) Q Consensus 249 g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~----------~~~-g-~~-~pg~~~~v 315 (510) +..||.||..-+...+.......+.......-...|..++.-++.++++.... +.| | .+ +++ .+ T Consensus 164 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~---g~ 240 (397) T protein:vir:38 164 GGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDA---LE 240 (397) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCC---Cc Confidence 77899999999999999999988888887777777777765444444432111 111 1 11 222 22 Q ss_pred cccccCc-ccchHHHHHHHHHHHHHHHHHhhccc--cCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHH Q lcl|NC_012418. 
316 RAYERGD-YNKMAAIQQSLQAVVVRLNQAFMYGA--NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) Q Consensus 316 ~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~~~--~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) +..++.. ..+.+ ..+..+..+..|-.+|-... +.......+..| +...-....|-|.+..+.+|| T Consensus 241 ~~~~l~~~~~d~~-~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e--~~~~~~~~~l~P~~~~ie~~l--------- 308 (397) T protein:vir:38 241 DYKPLEVKGNIAS-LLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT--QISGQYAKSLNRYVQAIVGEL--------- 308 (397) T ss_pred eEEecCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH--HHHHHHHHHHHHHHHHHHHHH--------- Confidence 2222222 23343 34556778889999984431 111111112222 111122234445555554443 Q ss_pred HHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHccCCH Q lcl|NC_012418. 393 SEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSE 472 (510) Q Consensus 393 ~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~ 472 (510) .+..++. .+......-.-+...|+.....+ +.. + .+..+++- ..+|.|+ +- .. T Consensus 309 ---n~~l~~~---~~~~~~~~~~~d~~~~~~~~~~~------~~~--G------~~t~nE~R----~~lg~~p--~~-~~ 361 (397) T protein:vir:38 309 ---NDKLHAN---ISANIRFAIDAMGDQYASTISSS------VKG--G------TIAGNQAR----FILQNSG--YL-AK 361 (397) T ss_pred ---HHhccCh---hcccccccccCCHHHHHHHHHHH------HhC--C------CcCHHHHH----HHhCCCC--CC-CC Confidence 3222211 11111111111233333322221 110 1 12222222 2334433 11 11 Q ss_pred HHHH--HHHHHHHHHHHHHHHhhHHHhhhhhhhhhc Q lcl|NC_012418. 473 EELQ--AEAEQRRQQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 473 ~ev~--~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~ 506 (510) +... ..-.........++.-......+-...++. 
T Consensus 362 d~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 362 DLPDPEKEPQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred ccccccccccccccccccccCCCCCCCCCCCCCCCC Confidence 1000 000000000000000000000000001111 No 146 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=44.71 E-value=0.8 Score=21.13 Aligned_cols=347 Identities=10% Similarity=0.039 Sum_probs=132.2 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCccccc-cccc-cchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVV-EHDF-QSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~-~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) |+= |.+++ +.+....-.....+..+..+... ..+..-. .... .++--.|++.+|+.+.+ +| + T Consensus 1 M~~-----f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~al~~~~v~~~i~~ia~~ia~--~p----~-- 66 (386) T protein:vir:49 1 MPI-----FNITNLATESPPINQESFFDIADSDFLASL-NSSEWVSAENALKNSDLFSIISQLSNDLAT--AK----I-- 66 (386) T ss_pred Cch-----hhhhccCCCCcccchhhhhhhhhccccccc-cCCceechhhhhccHHHHHHHHHHHHHhhh--Cc----e-- Confidence 442 44443 22111111111222222221111 1110000 0111 33333455555554433 22 2 Q ss_pred cCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-C-cEEEEEe Q lcl|NC_012418. 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL 150 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~-~~~~~pl 150 (510) .+.+... .. .+.+-| .+.-+...+.+|...|++.+++..+. 
+ ....+|+ T Consensus 67 -~~~~~~~-------------~~-----------l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i 121 (386) T protein:vir:49 67 -TTSRKQL-------------QG-----------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYL 121 (386) T ss_pred -eeccchh-------------hh-----------hhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEe Confidence 1111110 01 122223 34445666778888999998875432 2 2333444 Q ss_pred --ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecc Q lcl|NC_012418. 151 --RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGE 228 (510) Q Consensus 151 --~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~ 228 (510) ...-+..+.+|.. .++.+++ ++..... T Consensus 122 ~~~~v~v~~~~~~~~-------------------------------------------------~~y~~~~--~~~~~~~ 150 (386) T protein:vir:49 122 RPSQVSFNRLDNQNG-------------------------------------------------LYYNITF--DDPHIAP 150 (386) T ss_pred cCceeEEEEcCCCce-------------------------------------------------EEEEEEE--cCccccc Confidence 3444444433321 1111111 1111111 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhh---------- Q lcl|NC_012418. 229 EGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD---------- 298 (510) Q Consensus 229 ~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~---------- 298 (510) ...+..+ -++..|+....+..||.||..-+...+.......+.......-...|..++.-++.+.++. T Consensus 151 ~~~~~~~--evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~ 228 (386) T protein:vir:49 151 KQHVPQN--DILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQA 228 (386) T ss_pred eeEEccc--cEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHH Confidence 1111111 2555666666688999999999999999999988888888777777877664434444421 Q ss_pred hccCCCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--ccCCCC-CCCCHHHHHHHHHHHHHHhch Q lcl|NC_012418. 
299 YQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDA-ERVTAEEVRITAEEAENTLGG 374 (510) Q Consensus 299 ~~~~~~g~~-~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~-~~~TAtEi~~r~~E~~~~LGp 374 (510) ......+.+ ++++. .+.++... ..+.+ ..+..+..+..|-++|=.. ++..+. ..-+++.+.. -....+-| T Consensus 229 ~~~n~g~~~vl~~g~-~~~~l~~~-~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~---~~~~~i~~ 302 (386) T protein:vir:49 229 MKQMQGGPLVLDDLE-DFTPLEIK-SNVAQ-LLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYN---IYFKSVSR 302 (386) T ss_pred hccCCCCceecCCCc-eEEEccCC-hhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHH---HHHHHHHH Confidence 111111222 22222 23333321 23333 3445677888899998432 122121 2223332221 12223334 Q ss_pred hHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHH---hhcC-hhhHh----h Q lcl|NC_012418. 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIA---GLAP-IAQLD----P 446 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~---~~~~-~~q~~----~ 446 (510) .+..+..++-.-|...+. .....+....+.. +...++.|-++. .-....+...+. .... +.... + T Consensus 303 ~l~~i~~~~~~~l~~~~~--~~~~~~~~~d~~~----~~~~~~~l~~~g-~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~ 375 (386) T protein:vir:49 303 YLRPFVSEMSKKLSCEVD--VDISPAVDPTGSN----YISLINSMVKSG-TLAQNQGLYILQQAEILPKELPDGKNPNRT 375 (386) T ss_pred HHHHHHHHHHHHhcchhc--ccchhhhccCHHH----HHHHHHHHHhCC-CcCHHHHHHHHhhCCCCCCcCcchhccCCC Confidence 444443333221111100 0000000000000 011111221110 011111212221 1111 11111 1 Q ss_pred c-----cCHHH Q lcl|NC_012418. 447 R-----ISLPK 452 (510) Q Consensus 447 ~-----id~d~ 452 (510) . .|-.. 
T Consensus 376 ~~~gGd~~~~~ 386 (386) T protein:vir:49 376 SLKGGEINEQD 386 (386) T ss_pred CCCCCCCCCCC Confidence 1 22222 No 147 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=44.31 E-value=0.81 Score=21.08 Aligned_cols=409 Identities=8% Similarity=0.017 Sum_probs=161.0 Q ss_pred ChhHHH-HHHHHHh-hccchHHHHHHHHH--hcccccC--CCCCC--ccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAA-MLWEKLR-DGSVEQRAIEFAKT--TLPYLMV--DPMSG--SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~-~r~~~lk-r~~~~~~w~e~~~~--~lP~~~~--~~~~~--~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) |-.... +..++.+ |.......+++|.= -++.+-. ..... ......++..+-....++..++-|.+ . T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G------~ 74 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFT------Y 74 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheec------c Confidence 222222 2222221 22222222333221 1111100 00000 11111244455555556655554432 1 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEE--EEeCCC-------C Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE-------A 143 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l--~~~~~~-------~ 143 (510) | ..++..+.. +..+.+ ..+..++|.....++.++...+|.+.+ |.+++. 
+ T Consensus 75 p-~~~~~~~~~------------~~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~ 133 (451) T protein:vir:10 75 P-VLFDIDNNK------------ELNEKV--------TDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQ 133 (451) T ss_pred c-ceeecCCcH------------HHHHHH--------HHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCccccccccc Confidence 1 112222211 111111 223357899999999999999998754 555431 2 Q ss_pred --cEEEEEece-EEEeeC-CCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEE-eecCCCceEEEEE Q lcl|NC_012418. 144 --TVVAWSLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQ-RKKGTAMEYAELY 218 (510) Q Consensus 144 --~~~~~pl~~-~~i~~d-~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~-~~~~~~~p~~sv~ 218 (510) ++.+++..+ |++-.| ..+++.-.+|.+......- ..... ..+++++ ..++.-..|.... T Consensus 134 ~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~---------------~~~~~-~~~~~~e~yt~~~~~~~~~~~ 197 (451) T protein:vir:10 134 TFKYGVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVK---------------GQIQK-QAYTYVEFWTDKILDKYKFFG 197 (451) T ss_pred ceeEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc---------------ccccc-eEEEEEEEEeCCeEEEEEecc Confidence 355554444 555433 3577776666664332210 00000 1111111 1122110010000 Q ss_pred EEecCee-eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccc-h Q lcl|NC_012418. 219 HEIDGVR-VGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV-V 296 (510) Q Consensus 219 ~e~~~~~-~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~-p 296 (510) ....+.. .......++..+|++.++. +.+|.|=.+...+-+..+|.+.-......+...+|.+++.--+... . 
T Consensus 198 ~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~ 272 (451) T protein:vir:10 198 VSCCGSQIEHITVQHRFNSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTS 272 (451) T ss_pred cCccccccccccccCCCCeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccch Confidence 0011111 1111123456788876653 4568888889999999999888788888888888877663111111 1 Q ss_pred hhhccC-CCceee-cC----CcccccccccCcccchHHHHHHHHHHHHHHHHHhhc-cccCCCCCCCCHHHHH------- Q lcl|NC_012418. 297 DDYQDA-EMGDYV-PG----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVR------- 362 (510) Q Consensus 297 ~~~~~~-~~g~~~-pg----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~~TAtEi~------- 362 (510) +..... ..+.+. ++ ...+++.+. ...+.+.....++.++..|...-+. +.........|+.-+. T Consensus 273 ~~~~~~~~~~~i~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~ 350 (451) T protein:vir:10 273 EFLKELKRYKTIKTETDSEGDSGGLKTMQ--IEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLE 350 (451) T ss_pred hhHHHHhhCCeEEecCcCCccCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHH Confidence 111111 112221 21 112333332 2245677788888888877654332 2111111234554332 Q ss_pred HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhh-cCCCCCCcccccceee--ecHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012418. 363 ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLA 439 (510) Q Consensus 363 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~~~~~~~~~~v--~~is~L~raq~~~~~~~~~q~l~~~~ 439 (510) .++.+++..++..+ .+.+.++.. -+.. ...++.+... .+.+....++-+. 
...+.+ T Consensus 351 ~k~~~k~~~f~~~l------------~~~~~li~~~~~~~--d~~~i~i~f~~~~p~n~~e~~~~~~------kl~g~i- 409 (451) T protein:vir:10 351 LKSGLLETEFRTSF------------DKLIKAILYFLGVT--DYKKIQQTYTRNMMSNDLEDADIAT------KSVGII- 409 (451) T ss_pred HHHHHHHHHHHHHH------------HHHHHHHHHHhCCC--CccceeEEecCCCCCCHHHHHHHHH------HHhccC- Confidence 23344444444433 333333211 1111 1223332221 1222222211111 111111 Q ss_pred ChhhHhhccCHHHHHHHHHHHcCCCHhHccCCHH-HHHHHHHHHHHHHHHHHHhhHHHhhhhhh Q lcl|NC_012418. 440 PIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSEE-ELQAEAEQRRQQAAQAQAAQETLLEGASD 502 (510) Q Consensus 440 ~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~rs~~-ev~~~r~q~~q~~~~~~~~~~~~~~ga~~ 502 (510) .-..++ ..++ ++.+.+ |.+.+.++++.+. ++... ......+ T Consensus 410 ---------S~et~~----~~~p-----~v~d~~~e~~~~~ee~~~~~--~~~~~--~~~~~~~ 451 (451) T protein:vir:10 410 ---------PTKIIL----RHHP-----WVDDVEEAEKLYLEEKKIQA--SKVSD--DYNNFTE 451 (451) T ss_pred ---------chHHHH----HhCC-----CCCCHHHHHHHHHHHHHHHH--HHHHh--hcCCCCC Confidence 111122 2222 233333 3333322222111 11111 1111111 No 148 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=41.63 E-value=0.92 Score=20.79 Aligned_cols=363 Identities=9% Similarity=-0.024 Sum_probs=147.9 Q ss_pred ChhHHHHHHHHHh-h-ccchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLR-D-GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lk-r-~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) +.+.+.+.+.+-. . ....+.| ..+ +..+..+.... .-.++--.|++.+|+.+.+ -||--.. 
T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~---~~~-----~g~~~~~~~~a---l~~~~V~~~v~~Ia~~iA~------lp~~~~~ 65 (411) T protein:vir:81 3 WWSRLTRFFRPRNETVDMTNPLL---LQW-----LGVDPDTPRNQ---LSEATYFACLKILSESLGK------LPLKMYQ 65 (411) T ss_pred hHHHHHhhccCcccccccchHHH---HHH-----hcCcccChhhh---hccHHHHHHHHHHHHhHhh------CceeEEE Confidence 3333433332111 0 0011111 111 11111111111 1233334455555554432 2443222 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCeEEEEEeCCCCc---EEEEEe Q lcl|NC_012418. 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDEAT---VVAWSL 150 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~-~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~~~---~~~~pl 150 (510) -.+....+. . +..++..|. +-| .+.-+...+.+|...|++.+++..+.++ +..+|. T Consensus 66 ~~~~~~~~~----------~------~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~l~~l~~ 129 (411) T protein:vir:81 66 KTERGIVKS----------D------REELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSGPQLQALWILPS 129 (411) T ss_pred ecCCceeee----------c------ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCceEEEEEECC Confidence 111110000 0 111222232 233 3344566677788899999887765443 333344 Q ss_pred ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccc Q lcl|NC_012418. 151 RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEG 230 (510) Q Consensus 151 ~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~ 230 (510) ..+.+..|..|.+. ..+..++.+....+|.... T Consensus 130 ~~v~~~~~~~~~~~--------------------------------------------~~~~~~~~~~~~~~g~~~~--- 162 (411) T protein:vir:81 130 QYVTIVVDDRGLLG--------------------------------------------EKNAIWYRYNDPYDGKMYV--- 162 (411) T ss_pred ceEEEEEcCccccc--------------------------------------------ccceEEEEEEecCCceEEE--- Confidence 55656666555311 0000011111112232211 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhcc--------- Q lcl|NC_012418. 
231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD--------- 301 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~--------- 301 (510) +..+ -++..|+....+..||.||..-+...+.......+.......-...|..++.-++.++++.... T Consensus 163 -~~~~--eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~ 239 (411) T protein:vir:81 163 -FRND--EILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFA 239 (411) T ss_pred -Eccc--cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHh Confidence 1111 2566666555567899999999999999999988888888777777877765555555543211 Q ss_pred --CCC-c--eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc----ccCCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_012418. 302 --AEM-G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 302 --~~~-g--~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~~TAtEi~~r~~E~~~~L 372 (510) ..| | .+++++. +..++... ..+.+.+ +..+..+..|-.+|=.. ....++..=++++.. T Consensus 240 ~g~~n~g~~~vl~~g~-~~~~l~~~-~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~---------- 306 (411) T protein:vir:81 240 NGSKNAGKIIPVPLGM-KLVPLDIK-LTDSQFF-ELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQN---------- 306 (411) T ss_pred cCccccCCceecCCCc-eEEEccCC-HHHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHH---------- Confidence 011 1 1122222 23333321 2344433 44566788898888332 111122211222221 Q ss_pred chhHhHHHHHHHHHHHHHHHHHHhhcCCCCCC-ccccccee----eecHHHHHHHHHHHH-----HHHHHHHHHh--hcC Q lcl|NC_012418. 373 GGTYSLLAENLQSPLAYVCLSEVDDALLQGLI-TKQHKPAI----ETGLPALSRSAAVQS-----MLNASQVIAG--LAP 440 (510) Q Consensus 373 Gpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~-~~~~~~~~----v~~is~L~raq~~~~-----~~~~~q~l~~--~~~ 440 (510) ..+...-|.|++.+.-..+.+..+++.. +......+ +-..+...|+.-.+. +.+....-.. 
+.+ T Consensus 307 ----~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p 382 (411) T protein:vir:81 307 ----LAFYVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPA 382 (411) T ss_pred ----HHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 1122233445555555544443332211 01111110 111122333332222 2222222222 223 Q ss_pred hhhHhh------ccCHHHHHHHHHHHcCCCH Q lcl|NC_012418. 441 IAQLDP------RISLPKMMDTIWAAFSVDT 465 (510) Q Consensus 441 ~~q~~~------~id~d~~~~~~a~~~Gvp~ 465 (510) ++--+. .+-.+.+.+...+ |=+. T Consensus 383 ~~ggD~~~~~~n~~pl~~~~~~~~k--gGd~ 411 (411) T protein:vir:81 383 DDYGNNLMANGNYIPLSMLGANYGK--GGDS 411 (411) T ss_pred CCCCCeeeeccCccchhhhhhhhcc--CCCC Confidence 321111 1223333332221 2232 No 149 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=39.37 E-value=1 Score=20.53 Aligned_cols=330 Identities=8% Similarity=0.030 Sum_probs=126.2 Q ss_pred cccccCCCCCCccccc-cccccchHHHHHHHHHHHHHHhhcCcCCcccc-cCCChHHHhhhcccchhHHHHHHHHHHHHH Q lcl|NC_012418. 29 LPYLMVDPMSGSRGVV-EHDFQSAGALLVNNLAAKLARSLFPTGIPFFR-SELTDAIRREADSRDTDITEVTAALARVDR 106 (510) Q Consensus 29 lP~~~~~~~~~~~~~~-~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~-l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~ 106 (510) +. +|.. ........ ...++.+.. ..++ .|.. -.++.... .....|..-.+.+.. T Consensus 1 Mg-~f~~-~~~~~~~~~~~~~~~~~~------------~~~~---~~~~~~~v~~~~~-------l~~~~v~~~i~~ia~ 56 (382) T protein:vir:48 1 MP-IFNL-ATESPPDNQGGFFDVVDS------------DFLA---SLKGNEWVSAETA-------LRNSDLFSIINQLSN 56 (382) T ss_pred Cc-cccc-cccCCcccccccccchhh------------hccc---cccCCcccchHhh-------hccHHHHHHHHHHHH Confidence 21 1100 00000000 000111100 0000 0100 00100000 001112221111111 Q ss_pred HHH------------HHHHhcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-C-cEEEEEe--ceEEEeeCCCCCeEEE Q lcl|NC_012418. 
107 KAT------------QRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL--RSYAVRRDATGRWMDI 166 (510) Q Consensus 107 ~~~------------~~l~~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~-~~~~~pl--~~~~i~~d~~G~vd~i 166 (510) .+. ..+.+-| .+.=+..+..+|...|++.+++..+. + ....+|+ ..+-+..+.+|.. T Consensus 57 ~ia~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~--- 133 (382) T protein:vir:48 57 DLATVKLITSRKKLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDG--- 133 (382) T ss_pred hhccCceeeecchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCe--- Confidence 110 1111222 24445566667778899888775442 2 2344454 3344444443321 Q ss_pred EEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeee Q lcl|NC_012418. 167 VLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNL 246 (510) Q Consensus 167 ~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~ 246 (510) .++- +..++........+..+ -++..|+.. T Consensus 134 ----------------------------------------------~~y~--~~~~~~~~~~~~~~~~~--evih~~~~~ 163 (382) T protein:vir:48 134 ----------------------------------------------IYYN--ITFDDPRIPPKQHVPQN--DVLHFRLLS 163 (382) T ss_pred ----------------------------------------------EEEE--EEecCccccceeEEcCc--cEEEecCCC Confidence 1111 11222211111112112 256667766 Q ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhcc----------CCCcee-ecCCcccc Q lcl|NC_012418. 247 APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD----------AEMGDY-VPGGAEAV 315 (510) Q Consensus 247 ~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~----------~~~g~~-~pg~~~~v 315 (510) ..+..||.||..-+...+...+...+.......-...|.+++.-++.++++.... ...|.+ ++++. 
.+ T Consensus 164 ~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~-~~ 242 (382) T protein:vir:48 164 VDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLE-DF 242 (382) T ss_pred CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCCCeeEcCCCc-eE Confidence 7788999999999999999999999988888888888888776555555533221 011222 22222 23 Q ss_pred cccccCcccchHHHHHHHHHHHHHHHHHhhcc--ccCCCCCCCCHHHHHHHHHHHHHHhchhHhHHHHHHHHHHHHHHHH Q lcl|NC_012418. 316 RAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 316 ~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) .++.. +..+.+. .+..+..+..|-++|=.. ++...+. -|.+| .....-....|-|...++.+|+-.-|..+.-. T Consensus 243 ~~l~~-~~~d~q~-~e~~~~~~~~Ia~afgVp~~~lg~~~~-~~~~~-~~~~~~~~~~l~p~~~~i~~~l~~~l~~~~~~ 318 (382) T protein:vir:48 243 TPLEI-KSNVSQL-LKQADWTTGQFAKVYGIPDNVVGGQGD-QQSSL-EMSSDLYSKAVSRYLRPFLSELSQKLSCDVDA 318 (382) T ss_pred EEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCC-cccHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcChhhh Confidence 33332 2234443 355677788899998433 1111111 12221 11233445556677777766654333211100 Q ss_pred HHhhcCCCCCCcccccceeeecHHHH------HHHHHHHH----------HHHHHHHHHhh-cChhhHhhccC Q lcl|NC_012418. 394 EVDDALLQGLITKQHKPAIETGLPAL------SRSAAVQS----------MLNASQVIAGL-APIAQLDPRIS 449 (510) Q Consensus 394 il~~~~l~~~~~~~~~~~~v~~is~L------~raq~~~~----------~~~~~q~l~~~-~~~~q~~~~id 449 (510) -+. ..+.+....+. .-+..| .+++-.+. +-........+ ++-..-. 
| T Consensus 319 ~~~--~~~~~~~~~~~----~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~~~~~GGd~~~~---~ 382 (382) T protein:vir:48 319 DIF--PAVDPTGSNYI----SRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPNSTLKGGEEDGQ---D 382 (382) T ss_pred hhh--hhhccchhHHH----HHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCCCCCCCCCCCCC---C Confidence 000 00000000000 000111 11111000 00001100101 1111111 1 No 150 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=36.41 E-value=1.2 Score=20.20 Aligned_cols=405 Identities=11% Similarity=0.052 Sum_probs=160.6 Q ss_pred hHHHHHHHHHhcccccCCCCCCccc--cc--cc-cccchHHHHHHHHHHHHHHhhcCcC---CcccccCCChHHHhhhcc Q lcl|NC_012418. 18 EQRAIEFAKTTLPYLMVDPMSGSRG--VV--EH-DFQSAGALLVNNLAAKLARSLFPTG---IPFFRSELTDAIRREADS 89 (510) Q Consensus 18 ~~~w~e~~~~~lP~~~~~~~~~~~~--~~--~~-~~dstg~~a~~~LAa~l~~~ltpp~---~~WF~l~~~d~~~~~~~~ 89 (510) ...-+-+..+.. .+. ...++... -. .. .++-.+.-+.+-++.+++.. |+. +.|+.+...+..- T Consensus 1 ~~~~D~~~~~~~-~~g-~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~--~a~d~~r~~~~i~~~d~~~----- 71 (437) T protein:vir:52 1 MKFFDGIKSLAL-KLG-SKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIK--RPEDMVRNWREIYSNDLNS----- 71 (437) T ss_pred CchhhhhHhHHh-cCC-CccccceeecCccccccHHHHHHHHHhCchhhHHhhc--chHHhhcCCceEecCCCCH----- Confidence 111111111111 000 00000000 00 00 11112233344455555544 433 6888886532211 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeCCCCcEEEEEeceEEEeeCCCCCeEE--EE Q lcl|NC_012418. 90 RDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDATGRWMD--IV 167 (510) Q Consensus 90 ~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~pl~~~~i~~d~~G~vd~--i~ 167 (510) ..+ + .+.+.+.+-++...+.++++.--.+|.+++++..+... -.-|+. ..|.+.. 
++ T Consensus 72 -----~~~-~-------~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~-~~~pl~-------~~~~~~~~~v~ 130 (437) T protein:vir:52 72 -----KQL-D-------LFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQN-TSAPLK-------PTERLKRLIIL 130 (437) T ss_pred -----HHH-H-------HHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCC-cccccc-------cCCceeEEEEe Confidence 111 1 12333444478899999999888899988887654322 123331 1233321 11 Q ss_pred EEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeeccccccccccCceEEEeeeec Q lcl|NC_012418. 168 LKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLA 247 (510) Q Consensus 168 r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~ 247 (510) -+.+++... .......+..+.+.+.|... . +... ++++- .++....+ .+.| .. T Consensus 131 ~~~~v~~~~--------~~~~dp~s~~fg~p~~y~v~---~-~~~~---~~iH~--SRii~~~~---~~~~-------~~ 183 (437) T protein:vir:52 131 PKWKISPTG--------TKDDDVLSPNFGRYSEYSIL---G-GSQS---ITVHH--SRLIILNA---NDAP-------LS 183 (437) T ss_pred chhhccccc--------cccccccccccCcceEEEEe---c-CCcc---eeEcc--ceeEEecC---ccCC-------Cc Confidence 111111100 00000000111222333321 1 1000 11111 11111111 1122 22 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC-Ccccc---------h----hhhccCCCceeecCCcc Q lcl|NC_012418. 248 PGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAV---------V----DDYQDAEMGDYVPGGAE 313 (510) Q Consensus 248 ~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~-~g~~~---------p----~~~~~~~~g~~~pg~~~ 313 (510) ...-||+|+.+-.+..++..+.........+..+....+-++- ...+. . 
...++ ..|.++-+..+ T Consensus 184 ~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~d~~~ 262 (437) T protein:vir:52 184 DNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKS-ATNSLLLDAEN 262 (437) T ss_pred cccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcC-CCceEEEcCCc Confidence 3567899999999999999998888877766655544443320 00000 0 01111 12333323333 Q ss_pred cccccccCcccchHHHHHHHHHHHHHHHHHhhc---cccCCCCCCC-CHH-HHHHHHHHHHHHhchhHhHHHHHHHHHHH Q lcl|NC_012418. 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY---GANQRDAERV-TAE-EVRITAEEAENTLGGTYSLLAENLQSPLA 388 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~---~~~~~~~~~~-TAt-Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli 388 (510) +...+.. +|.-+...+....+.|..++=. -++-.....+ |.+ +++.=. =-+..++...+.|++ T Consensus 263 ~~e~~~~----~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yy--------d~i~~~Qe~~l~p~l 330 (437) T protein:vir:52 263 EYDRKEL----TFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYH--------EAIRRLQETRLRPIF 330 (437) T ss_pred ceEEEec----CcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHH--------HHHHHHHHHHHHHHH Confidence 4433332 2444456667777888877711 1222222223 211 322211 124456666789999 Q ss_pred HHHHHHHhhcCCCCCCcccccceeeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHhHc Q lcl|NC_012418. 389 YVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQF 468 (510) Q Consensus 389 ~r~~~il~~~~l~~~~~~~~~~~~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i 468 (510) ++.+.++.+....++++ ++.... .++..+....+++......+......+ ...++++++.+.+.+. |+=+. 
| T Consensus 331 e~l~~~i~~~~~g~~~~-~~~~~f-~pL~~~s~kekae~~~~~a~a~~~~~~----~g~i~~~e~r~~L~~~-g~~~~-i 402 (437) T protein:vir:52 331 EIIDPLICNELFGGLPA-DWWFEF-VPLTTVKQEQQINMLNTFATAANTLIQ----NGVLNEYQIANELRES-GLFAN-I 402 (437) T ss_pred HHHHHHHHHHhcCCCCC-cceEEe-CCcCCcCHHHHHHHHHHHHHHHHHHHh----cCCCCHHHHHHHHHhc-CCCCC-C Confidence 99999887654444443 333321 123333333333322222222222211 1236777877776553 42211 2 Q ss_pred cCCHHHHHHHHHHH--HHHHHHHHH-hhHHHhhhhhh Q lcl|NC_012418. 469 YKSEEELQAEAEQR--RQQAAQAQA-AQETLLEGASD 502 (510) Q Consensus 469 ~rs~~ev~~~r~q~--~q~~~~~~~-~~~~~~~ga~~ 502 (510) +++++....-.. ....++... .......++.+ T Consensus 403 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 403 --SAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred --CccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 222222111000 000000000 00011111111 No 151 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=30.20 E-value=1.6 Score=19.47 Aligned_cols=442 Identities=10% Similarity=-0.033 Sum_probs=173.5 Q ss_pred ChhHHHHHHHH-HhhccchHHH--HHHHHHhcccccCC-----CCCCcccccccc--ccchHHHHHHHHHHHHHH--hhc Q lcl|NC_012418. 1 MKSTAAMLWEK-LRDGSVEQRA--IEFAKTTLPYLMVD-----PMSGSRGVVEHD--FQSAGALLVNNLAAKLAR--SLF 68 (510) Q Consensus 1 ~~~~~~~r~~~-lkr~~~~~~w--~e~~~~~lP~~~~~-----~~~~~~~~~~~~--~dstg~~a~~~LAa~l~~--~lt 68 (510) --....+|-.. -.++.|+.-- +-..-+. |.+..+ +.+.-..+...+ -++.+..+++.+++.+++ ++. 
T Consensus 12 sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~-~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~G~~ 90 (548) T protein:vir:95 12 APELVARRLAAREAIQAYEAARPGRTHKAKR-QPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVVGGSGIG 90 (548) T ss_pred chHHHHHHHHhHHHhccccccCccccccccC-CCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhccCccccc Confidence 11111111110 0111121110 0000000 111000 001111111222 377899999999999997 355 Q ss_pred CcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEe--CCCC--- Q lcl|NC_012418. 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA--- 143 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~--~~~~--- 143 (510) +..++ +..++....++. ..-....+.|.+.| ..-.+.+||.....++...+.-|-+++-.. .... T Consensus 91 i~p~~---l~~d~~~a~~l~--~~ie~~w~~Wa~~~-----D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 91 VEPLP---LRLDGSVHAELA--MEIRSAWAEWSLSP-----ETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred eeeee---cCCCHHHHHHHH--HHHHHHHHHhhcCc-----cccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 44444 333322211111 01112234454332 233467899999999999999997764332 2111 Q ss_pred ------cEEEEEeceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEE Q lcl|NC_012418. 144 ------TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAEL 217 (510) Q Consensus 144 ------~~~~~pl~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv 217 (510) .++.+....+....+..|+ . .+...+.+......-||.....++....... 
T Consensus 161 g~~~~~~lqliepd~l~~~~~~~~~---------------------~-i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~- 217 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNLSK---------------------G-IVQGIERDTWRRKRAYHLLKDHPGNLQTLGG- 217 (548) T ss_pred CcccceEEEEechhhcCCCCCCCCC---------------------c-eeeeeEECCCCceEEEEEeecCCCccccccc- Confidence 1222222211111111000 0 1112233444444555544444332100000 Q ss_pred EEEecCeeeccccccccccCceEEEe-eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCcccc Q lcl|NC_012418. 218 YHEIDGVRVGEEGRWPIHLCPYIVPT-WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAV 295 (510) Q Consensus 218 ~~e~~~~~~~~~~~y~~~~~P~~~~R-w~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~-~~g~~~ 295 (510) .+ .-+.+..+ -++.- ....+|..=|.+..--+|..+++|.....+.+.++..++.....+. +++-.. T Consensus 218 ~~--~~~rvpA~---------~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~ 286 (548) T protein:vir:95 218 SL--AVKRVEAE---------RIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSY 286 (548) T ss_pred cc--ceeeechh---------HheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccc Confidence 00 00111111 12222 2345788999999999999999999999999999888887766653 221110 Q ss_pred h---hh-----hccCCCceeecC--CcccccccccCc-ccchHHHHHHHHHHHHHHHHHhh--ccccCCCCCCCCHHHHH Q lcl|NC_012418. 296 V---DD-----YQDAEMGDYVPG--GAEAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVR 362 (510) Q Consensus 296 p---~~-----~~~~~~g~~~pg--~~~~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~--~~~~~~~~~~~TAtEi~ 362 (510) . .. ...-.+|.+++. ...+++...... .++|. .-...+...|..++= +..+..|-. 
.|-.=++ T Consensus 287 ~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~IAaglGipYe~ltgD~s-~nYSS~R 362 (548) T protein:vir:95 287 TVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLE---GFRNGQLRMIGAGTRSTYSSVSRAYD-GTYSAQR 362 (548) T ss_pred cCCCCcccccccccccCCccccccCCCceeeecCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhcccc-hhHHHHH Confidence 0 00 001112332221 111344433321 22332 222333444555541 122333422 1333333 Q ss_pred HHHHHHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccc-----cccee----eecHHHHHHHHHHHH-----H Q lcl|NC_012418. 363 ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ-----HKPAI----ETGLPALSRSAAVQS-----M 428 (510) Q Consensus 363 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~-----~~~~~----v~~is~L~raq~~~~-----~ 428 (510) .=..|....+--.=..+...|+.|+..+++....-.|.+++|... +.... ...++|+--++.... + T Consensus 363 ~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl 442 (548) T protein:vir:95 363 QELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGF 442 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCC Confidence 333333333333333445567778888888777666666655431 11211 234666544332211 1 Q ss_pred HHHHHHHHhhcChhhHhhccCHHHHHHH------HHHHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhh Q lcl|NC_012418. 429 LNASQVIAGLAPIAQLDPRISLPKMMDT------IWAAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASD 502 (510) Q Consensus 429 ~~~~q~l~~~~~~~q~~~~id~d~~~~~------~a~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~ 502 (510) .+..+.++..+ .|+++.++. .++.+|++...--+.... . ...+...+.+++-+.. T Consensus 443 ~T~~~~~a~~G--------~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~----~-------~~~~~~~~~~~~~~~~ 503 (548) T protein:vir:95 443 ADEAEVARARG--------RDPRELKKSRETEIKANRAAGLVFSSDAYHQLV----K-------SGMDPVEAVQKVYLGV 503 (548) T ss_pred CCHHHHHHHhC--------CCHHHHHHHHHHHHHHHHHcCCCCCCccccccc----c-------cccCCCCchhhhcccc Confidence 11111222111 233322222 233444432100000000 0 0000000111111111 Q ss_pred hh------------hcccCC Q lcl|NC_012418. 
503 MT------------NALAGV 510 (510) Q Consensus 503 ~~------------~~~ag~ 510 (510) .. +-.||. T Consensus 504 ~~~~~~~~~~~~~~~~~~~~ 523 (548) T protein:vir:95 504 GKMLTADEARELVNRYGAGL 523 (548) T ss_pred ccccccchhHHhhccCCCCC Confidence 11 222333 No 152 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=27.84 E-value=0.74 Score=21.29 Aligned_cols=106 Identities=8% Similarity=-0.033 Sum_probs=46.0 Q ss_pred ChhHHHHHHHHHhh---ccchHHHHHHHHHhccc-cc-------CCCCCCcccccc---ccccchHHHH---------HH Q lcl|NC_012418. 1 MKSTAAMLWEKLRD---GSVEQRAIEFAKTTLPY-LM-------VDPMSGSRGVVE---HDFQSAGALL---------VN 57 (510) Q Consensus 1 ~~~~~~~r~~~lkr---~~~~~~w~e~~~~~lP~-~~-------~~~~~~~~~~~~---~~~dstg~~a---------~~ 57 (510) |+.....||+.-++ .+|.+.|.......-.. .. ......+...+. ++.+|....+ -. T Consensus 39 l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~ 118 (175) T protein:vir:10 39 LVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGSNK 118 (175) T ss_pred HHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhhhhhhhhccCCCcceechhhhhhhheeecCCEEEEecCh Confidence 66666677754432 34444443221100000 00 000000000000 0111110000 00 Q ss_pred HHHHHHHHhh--------cCcCCcccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_012418. 58 NLAAKLARSL--------FPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQN 115 (510) Q Consensus 58 ~LAa~l~~~l--------tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~s 115 (510) ..|+-.+-|. +=|.|||+.++-.|+. 
+..+++.+|+.+.+.+...|.+- T Consensus 119 ~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~~d~~---------~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 119 EYAAIHQFGGQAGRGLKVTIPARPWLPVTADGEL---------QPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred hhhhhhhcccccCCCCccccCCccccCCCccccc---------chHHHHHHHHHHHHHHHHHhccC Confidence 1122222232 4589999998876653 22457788888888888777766 No 153 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=27.31 E-value=1.9 Score=19.11 Aligned_cols=395 Identities=8% Similarity=0.052 Sum_probs=154.7 Q ss_pred ChhHHHHHHHHHh--hccchHHHHHHHHHhcccccCCCCCCcccccc--ccccchHHHHHHHHHHHHHHhhcCcCCcccc Q lcl|NC_012418. 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~~~~~~~r~~~lk--r~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~--~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) |=.-+.+...+++ ..++.+.| .++.-|+....+ .++..-.. -...++--.|++.+|+.+.+ -||.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~~~~~~-~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~------lp~~v 70 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAF---IKYIGQTFTKYD-NNGKTYLEQGYNINPDVYSCISQMAAKTVA------VPYTI 70 (460) T ss_pred CchhHHHHHhhhhccCCCchHHH---HHhhccccCCCc-cchhhhhHHHHhcchHHHHHHHHHHHhhhh------CceEE Confidence 4444433333322 22344455 456666432211 12221111 12455666777777777643 35543 Q ss_pred cCCChHHH-hhhcccchh-----------HHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCeEEEEEeC Q lcl|NC_012418. 77 SELTDAIR-READSRDTD-----------ITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNS 140 (510) Q Consensus 77 l~~~d~~~-~~~~~~~~~-----------~~~v~~~L~~~e~~~~~~l~~sn----fy~~~~~~~~dl~~~G~~~l~~~~ 140 (510) .......- .+....... ..-....+...+......+.+=| .+.-...+..++..+|++.+|+.. 
T Consensus 71 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 150 (460) T protein:vir:10 71 KVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMS 150 (460) T ss_pred EeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 22211100 000000000 00000112222223333344434 344456666788899999888764 Q ss_pred CC-----Cc-EEEEEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCc Q lcl|NC_012418. 141 DE-----AT-VVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAM 212 (510) Q Consensus 141 ~~-----~~-~~~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~ 212 (510) +. +. ...||| ..+-+..+.+|.+-. +++ .+ T Consensus 151 ~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~--~~~----------------------------~~------------ 188 (460) T protein:vir:10 151 PDDGINAGVPSQMYVLPAHLIKIVLKDDINLLS--TDS----------------------------PI------------ 188 (460) T ss_pred cCCCccCceeEEEEEEcCceEEEEEcCCCceee--eee----------------------------ee------------ Confidence 31 22 234454 556666666664321 110 00 Q ss_pred eEEEEEEEecCeeeccccccccccCceEEEeeee-----cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCcee Q lcl|NC_012418. 213 EYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNL-----APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNL 287 (510) Q Consensus 213 p~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~-----~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l 287 (510) ..+.+..+|.... +..++ .+..|+.. ..+..||.||..-+...+.......+...........|-.+ T Consensus 189 --~~~~~~~~g~~~~----~~~~e--vih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i 260 (460) T protein:vir:10 189 --KSYMLIQGDQFIE----FNEDE--VIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFI 260 (460) T ss_pred --eEEEEecCceeEE----ecccc--eEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccee Confidence 1111111221110 11111 33344332 23567999999999999999888888877777777777777 Q ss_pred eCCCcccchhhhccCC-----------C-c-e-eecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--cc-C Q lcl|NC_012418. 
288 VDEAKGAVVDDYQDAE-----------M-G-D-YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN-Q 350 (510) Q Consensus 288 ~~~~g~~~p~~~~~~~-----------~-g-~-~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~-~ 350 (510) +..++-+.++...... | | . +++++. ...++... ..+.+. .+..+..+..|-++|=.. ++ . T Consensus 261 ~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~ 337 (460) T protein:vir:10 261 HGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEI-AFTKISLN-TDELKP-FDYLKYDQKAICNALGWSDKLLNN 337 (460) T ss_pred eecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCC Confidence 7766666654322111 1 1 1 122221 22233221 233332 455567778888888322 11 1 Q ss_pred CCCCCCCHHHHHHHHHH-HHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcc-ccccee-eecHHHHHHHHHHHH Q lcl|NC_012418. 351 RDAERVTAEEVRITAEE-AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK-QHKPAI-ETGLPALSRSAAVQS 427 (510) Q Consensus 351 ~~~~~~TAtEi~~r~~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~-~~~~~~-v~~is~L~raq~~~~ 427 (510) .++...|-.-+.+.... ....|.|...++.+|| .+.-+++.... .....+ ...+..+. .+... T Consensus 338 ~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~l------------n~kl~~~~~~~~~~~i~~d~~~l~~l~--~d~~~ 403 (460) T protein:vir:10 338 NEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAF------------DKKFIKRFKGYENAVIEWDISELPEMQ--TDMVA 403 (460) T ss_pred CCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHH------------HHhhcCcccccCCceEEeecchhhhHH--HHHHH Confidence 12211222222222222 2224555555555443 33322211111 111111 12222221 11211 Q ss_pred HHHHHHHHHhhcChhhHhhccCHHHHHHHHHHHcCCCHh------------HccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_012418. 428 MLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTS------------QFYKSEEELQAEAEQRRQQAAQA 489 (510) Q Consensus 428 ~~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~------------~i~rs~~ev~~~r~q~~q~~~~~ 489 (510) ... .+.. ++ +- .+++-+.+|.||- .++..++.-+..... 
...+.| T Consensus 404 ~~~---~~~~--g~------~T----~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~--~~nq~~ 460 (460) T protein:vir:10 404 MAS---WLNT--IP------VT----PNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDS--AFNQNQ 460 (460) T ss_pred HHH---HHhC--CC------CC----HHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCCC--cccCCC Confidence 111 1110 11 11 1222233344431 111111100000000 000000 No 154 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=24.84 E-value=2.1 Score=18.79 Aligned_cols=425 Identities=8% Similarity=-0.068 Sum_probs=164.6 Q ss_pred ChhHHHHHHHHHhhccchHHHHHHHHHhcccccCCCCCCcccccccc--ccchHHHHHHHHHHHHHHhhcCcCCcccccC Q lcl|NC_012418. 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHD--FQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~~~~~~~r~~~lkr~~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~--~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) |.++-...-.--.+...... .-+..|..+..|++ . .+-.. .+..+-.+|++.|..++ +.|+.+. T Consensus 70 ~d~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-----~-~l~a~Y~~~~l~r~iVd~~A~d~~-------r~~~~i~ 135 (537) T protein:vir:10 70 MDGLDVEGGTFSAYANPNLS-EGLVLWYAQQAFIG-----H-QMCALIATHWLVNKACSQMPRDAM-------RKGYKII 135 (537) T ss_pred ccccccchhhhhhhcccccc-chhhhhccccCCcc-----H-HHHHHHHhCchhhhhhhhhhHHhh-------cCCceee Confidence 33321111000001110000 00122222222221 0 11112 24455555665555443 5788887 Q ss_pred CChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCeEEEEEeC--CCCcEEEEEeceEEEe Q lcl|NC_012418. 
79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DEATVVAWSLRSYAVR 156 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~snfy~~~~~~~~dl~~~G~~~l~~~~--~~~~~~~~pl~~~~i~ 156 (510) ..+....+ ...++ .+.+.+.+-++...+.+++..--.+|.+.+++.- .....-.-||.---| T Consensus 136 ~~~~~~~~-------~~~~~--------~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i- 199 (537) T protein:vir:10 136 SDDGNELD-------PKDAK--------FIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGV- 199 (537) T ss_pred cCCccccc-------HHHHH--------HHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccc- Confidence 65432111 11122 2333444557889999999997788877665532 222222233321111 Q ss_pred eCCCCCeEEE--EEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeecccccccc Q lcl|NC_012418. 157 RDATGRWMDI--VLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGEEGRWPI 234 (510) Q Consensus 157 ~d~~G~vd~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~~~~~y~~ 234 (510) ..|.+..+ +-+.+.+..- ...+-++..+. .+.+.+.| .+.|+.+....-..| T Consensus 200 --~kg~~k~l~vidp~~~~~~~-~~~~~~dp~sp-----~fg~P~~y------------------~v~g~~iH~SRli~f 253 (537) T protein:vir:10 200 --MPGAYKGIVQIDPYWCAPLL-DAQASSNPVSM-----HFYEPTYW------------------LINGKKYHRSHLAIY 253 (537) T ss_pred --cccceeEEEEechhhccccc-chhhhccCCcc-----ccCCceee------------------eecCeEecceeEEEe Confidence 12222211 1122222110 11111111110 01111122 122332221111111 Q ss_pred --ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc-ccchhhhcc---------C Q lcl|NC_012418. 235 --HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDDYQD---------A 302 (510) Q Consensus 235 --~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g-~~~p~~~~~---------~ 302 (510) ...|+ +.+....-||++..+.++..++..........+...++.-..+-++-.. +.+.+.+.. . 
T Consensus 254 ~g~~~p~----~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~ 329 (537) T protein:vir:10 254 INDEVVD----FLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRD 329 (537) T ss_pred cCCCCch----hhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcC Confidence 11233 2334444579999999999999999888888777766666655443111 111121111 1 Q ss_pred CCceeecCCc-ccccccccCcccchHHHHHHHHHHHHHHHHHhh--cc-ccCCCCCCC--CHH-HHHHHHHHHHHHhchh Q lcl|NC_012418. 303 EMGDYVPGGA-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM--YG-ANQRDAERV--TAE-EVRITAEEAENTLGGT 375 (510) Q Consensus 303 ~~g~~~pg~~-~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~--~~-~~~~~~~~~--TAt-Ei~~r~~E~~~~LGpv 375 (510) ..|.++-+.. ..+..... ++..+...+....+.|.-++= .. ++-+..... |.+ ++..=. =- T Consensus 330 n~g~~~id~e~e~~e~~~~----~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yy--------d~ 397 (537) T protein:vir:10 330 NYQVRVVDKDNEDVVQIDT----TLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYH--------EE 397 (537) T ss_pred CcceeEecCCCceeEEEec----cCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHH--------HH Confidence 1233333332 33333222 344455666777777777751 11 222211122 222 222111 11 Q ss_pred HhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccce--eeecHHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccCHHHH Q lcl|NC_012418. 376 YSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPA--IETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKM 453 (510) Q Consensus 376 ~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~--~v~~is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id~d~~ 453 (510) +..++.+ +.|++++++.+|.+..+-+++ ++... -+.-.+...+|.-......+.+.+-. 
..-|+.+++ T Consensus 398 I~~~Qe~-l~p~l~~l~~ll~~~~~~~~~--~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~-------~G~i~~~Ev 467 (537) T protein:vir:10 398 CESTQDD-MRPLIDRHHQLVCRSHLRKRI--RVKVEFPPMDAPKESERADTFLKKMQAAKLAFE-------MGAVDGVDV 467 (537) T ss_pred HHHHHHH-HHHHHHHHHHHHHHhcCCCCc--ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-------cCCCCHHHH Confidence 3444544 678888888888765544332 22221 11222233333322222222222211 124788888 Q ss_pred HHHHHHH-----cCCCHhHccCCHHHHHHHHHH-HHHHHH-HHHH-----hhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 454 MDTIWAA-----FSVDTSQFYKSEEELQAEAEQ-RRQQAA-QAQA-----AQETLLEGASDMTNALAGV 510 (510) Q Consensus 454 ~~~~a~~-----~Gvp~~~i~rs~~ev~~~r~q-~~q~~~-~~~~-----~~~~~~~ga~~~~~~~ag~ 510 (510) -+.+... .|+.+. + ++++.+....+ ..+... .... .+.+...|....++.-+|- T Consensus 468 r~~L~~~~~~g~~~l~~~-~--~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 533 (537) T protein:vir:10 468 NEYLRMDPTLGFTSITPA-M--RPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSGA 533 (537) T ss_pred HHHHhccCccccccccCC-C--ChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccCcc Confidence 8887764 233321 1 11211111000 000000 0000 0001111111122222222 No 155 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=22.90 E-value=2.4 Score=18.52 Aligned_cols=394 Identities=14% Similarity=0.035 Sum_probs=143.9 Q ss_pred HHHHHHHh-h--c-----cchHHHHHHHHHhcccccCCCCCCcccccc--ccc-cchHHHHHHHHHHHHHHhhcCcCCcc Q lcl|NC_012418. 6 AMLWEKLR-D--G-----SVEQRAIEFAKTTLPYLMVDPMSGSRGVVE--HDF-QSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 6 ~~r~~~lk-r--~-----~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~--~~~-dstg~~a~~~LAa~l~~~ltpp~~~W 74 (510) =..|+.|. | . .-...|..+.-... ..+. . ..++.... ... .++--.|++.+|..+.+. 
|| T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~-~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~l------p~ 71 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIY-NLGA-T-ASSGERVTPHDALQVSAVFASVRLLSETIATL------PL 71 (457) T ss_pred Cchhhhhhccccccccccccccccccchhhhh-hccc-c-ccCCceechHHhhccHHHHHHHHHHHHhHhhC------ce Confidence 22233321 1 0 00111111111100 1110 0 01111111 111 234445565555555433 33 Q ss_pred cccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHh----cCCHHHHHHHHHHHHhhCeEEEEEeCCCCc-EEEEE Q lcl|NC_012418. 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQ----NASLAVLTQVIKLLIVTGNALLYRNSDEAT-VVAWS 149 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~----snfy~~~~~~~~dl~~~G~~~l~~~~~~~~-~~~~p 149 (510) .-..-.+.... ++.. ..+...+.+ -+.+.-+..+..++...||+.+++..+.++ ...+| T Consensus 72 ~~~~~~~~~~~----------~~~~------~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~~ 135 (457) T protein:vir:62 72 STYSKRGGTRK----------EIDT------PEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLDV 135 (457) T ss_pred EEEEecCCccc----------cccc------hHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEE Confidence 21111111000 0100 011112222 235556677778888999999888655443 34455 Q ss_pred ec--eEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeec Q lcl|NC_012418. 150 LR--SYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVG 227 (510) Q Consensus 150 l~--~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~ 227 (510) |. ...+.++..+... ...|..+.+..+|... T Consensus 136 l~p~~v~v~~~~~~~~~----------------------------------------------~~~~~~y~~~~~g~~~- 168 (457) T protein:vir:62 136 LDPTKIHVHMVMVDGLR----------------------------------------------RKVFEAYDIDADGNEV- 168 (457) T ss_pred EcCcceEEEEeccCCcc----------------------------------------------ceeEEEEEEccCCcee- Confidence 52 3333333222110 0011111122222211 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccCC---- Q lcl|NC_012418. 
228 EEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE---- 303 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~~---- 303 (510) ....|..++ .|..|.....|..||.||..-+...+.....+.+.......-...|..++.-++.+.++...... T Consensus 169 ~~~~~~~~e--iih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~ 246 (457) T protein:vir:62 169 LLGWFTPRD--VLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWR 246 (457) T ss_pred EEEeeCccc--eEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHH Confidence 111111112 45556555567789999999999989888888887777777667777766655666664332211 Q ss_pred -------C-c--eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--cc-CCCCCCCCHHHHHHHHHHHHH Q lcl|NC_012418. 304 -------M-G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN-QRDAERVTAEEVRITAEEAEN 370 (510) Q Consensus 304 -------~-g--~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~-~~~~~~~TAtEi~~r~~E~~~ 370 (510) | | .+++++. +..++... ..+.+. .+..+..+..|-++|=.. +. ..+....+..-+.+..... T Consensus 247 ~~~~G~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f-- 321 (457) T protein:vir:62 247 AANSGVDNAHRVALLTEGA-KFSKVAMS-PDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAF-- 321 (457) T ss_pred HHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHH-- Confidence 1 1 1122221 22233221 233443 344456777888888322 11 1111112222222222221 Q ss_pred HhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccceeeec-HHHHHHHHHHHHHHHHHHHHHhhcChhhHhhccC Q lcl|NC_012418. 371 TLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG-LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~~~v~~-is~L~raq~~~~~~~~~q~l~~~~~~~q~~~~id 449 (510) ...-|.|++.+.-..|.+..+++.-.. -.++.+ ++.|-|+--..+...+...++. 
++ + T Consensus 322 ---------~~~~l~P~~~~ie~~ln~~L~~~~~~~---~~~i~fd~~~l~~~d~~~r~~~~~~~~~~--G~------~- 380 (457) T protein:vir:62 322 ---------TMFSLRPWLERIEAGFNRLLFAETADR---FRFVKFNLDEIKRGAPKERMELWSLGLQN--GI------Y- 380 (457) T ss_pred ---------HHHHHHHHHHHHHHHHHhhhcCccccC---ceEEEeechhhhccCHHHHHHHHHHHHhC--CC------c- Confidence 122234455444444443333322111 112221 2233332212222222222221 11 0 Q ss_pred HHHHHHHHHHHcCCCHh------------HccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhhhhcccCC Q lcl|NC_012418. 450 LPKMMDTIWAAFSVDTS------------QFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~Gvp~~------------~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~~~~~ag~ 510 (510) ..+++-+.+|.||- .+....++.+. .. ..........+...+ .++.+.|. T Consensus 381 ---T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~----~~--~~~~~~~~~~~~~~~--~~~~~~~~ 442 (457) T protein:vir:62 381 ---SIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEP----EP--APAPPAIDPPAEEPA--DDEEPDNA 442 (457) T ss_pred ---CHHHHHHHhCCCCCCCCCcceeeeccccccccccccc----cc--cCCCccCCCCccCCC--CCCCCCCC Confidence 11222233333320 11111111000 00 000000011111111 11222233 No 156 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=21.55 E-value=2.6 Score=18.32 Aligned_cols=366 Identities=12% Similarity=0.086 Sum_probs=134.8 Q ss_pred ChhHHHH------HHHHH-hhc-cchHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAAM------LWEKL-RDG-SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~~------r~~~l-kr~-~~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) =|++.++ .|.+. +|+ .+-..|-....-.+|.........-. 
...-+-.++--.|++.+|+.+.+ - T Consensus 15 ~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~al~~~~V~acv~~Ia~~iA~------l 87 (441) T protein:vir:98 15 SRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYK-DIEAIRHSDIFTAVMMIASDLAR------M 87 (441) T ss_pred cccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccc-hhhhhccHHHHHHHHHHHHhhcc------C Confidence 1222211 22211 232 11111111111123332111111000 00011233434466666666554 1 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-C-cE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TV 145 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~-~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~-~~ 145 (510) | |++.-.... . . ++-++..|. +-| .+.-....+.++..+||+.+++..+. + .. T Consensus 88 p-l~~~~~~~~-------~-----~-------~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:98 88 P-IRVTVNGQI-------N-----Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred c-eEEecCCcc-------c-----c-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE Confidence 3 233211100 0 0 111222232 223 33445666777888999988876432 2 34 Q ss_pred EEEEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC Q lcl|NC_012418. 146 VAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG 223 (510) Q Consensus 146 ~~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~ 223 (510) ..+|+ +.+.+..|.+|++- |+... +++ T Consensus 148 ~L~~i~~~~v~v~~~~~g~~~--~~~~~-------------------------------------------------~~~ 176 (441) T protein:vir:98 148 NLTFRKTSEIELKLDARGRLY--YFHQR-------------------------------------------------IDS 176 (441) T ss_pred EEEEEcCceeEEEECCCCcEE--EEEEE-------------------------------------------------ecc Confidence 44555 66777778777541 11110 000 Q ss_pred eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc-ccchhhh--- Q lcl|NC_012418. 
224 VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDDY--- 299 (510) Q Consensus 224 ~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g-~~~p~~~--- 299 (510) ........+..++ ++..|+...+| .||.||...+...+...+.+.+.......-...|..++.-++ +.+++.. T Consensus 177 ~~~~~~~~~~~~d--viHir~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~ 253 (441) T protein:vir:98 177 NGNNIERNVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRA 253 (441) T ss_pred CcceeeEEEcccc--EEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHH Confidence 0000000111111 33444443344 799999999988888888777777776666666766654333 3333321 Q ss_pred ccCC----Cc-------eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--ccCCCCCCCCHHHHHHHHH Q lcl|NC_012418. 300 QDAE----MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAE 366 (510) Q Consensus 300 ~~~~----~g-------~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~~TAtEi~~r~~ 366 (510) +..- .| .+++++. ...++.+. ..+.+. .+.....+..|-++|=.. ++..+...-+.+|. .. T Consensus 254 ~~~~~~~~~G~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~---~~ 327 (441) T protein:vir:98 254 REEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NL 327 (441) T ss_pred HHHHHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH---HH Confidence 1100 01 1112221 22233221 123332 344455667788888432 11112222233332 12 Q ss_pred HHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc--ceeeecHHHHHHHHHHHHH-----HHHHHH--HHh Q lcl|NC_012418. 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK--PAIETGLPALSRSAAVQSM-----LNASQV--IAG 437 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~--~~~v~~is~L~raq~~~~~-----~~~~q~--l~~ 437 (510) .....|-|.+.++.+||-.-|.. ....-.++ ...+...+..+|+.-...+ .+.... +-. 
T Consensus 328 ~y~~tl~P~~~~ie~~ln~~L~~------------~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 395 (441) T protein:vir:98 328 DYLSTLKPYITCVCAELNFKFND------------EYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred HHHHHHHHHHHHHHHHHHhhccc------------cccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 23334555555555554332211 10000111 1111122333343322221 122111 112 Q ss_pred hcChhhHh------h--ccCHHHH----------HHHHHHHcCCCHh Q lcl|NC_012418. 438 LAPIAQLD------P--RISLPKM----------MDTIWAAFSVDTS 466 (510) Q Consensus 438 ~~~~~q~~------~--~id~d~~----------~~~~a~~~Gvp~~ 466 (510) +.+++--+ + .++.|.+ .+. ...-|=.-+ T Consensus 396 l~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~-~~kgGe~ne 441 (441) T protein:vir:98 396 LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDK-KLKGGEENE 441 (441) T ss_pred CCCCCCCCcceEeeccccccccccccccccccccccc-ccCCCCCCC Confidence 22221100 0 0111100 000 001111111 No 157 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=21.34 E-value=2.6 Score=18.29 Aligned_cols=366 Identities=11% Similarity=0.076 Sum_probs=138.9 Q ss_pred ChhHHHH------HHHHH-hhcc-chHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAAM------LWEKL-RDGS-VEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~~------r~~~l-kr~~-~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) =|++-++ .|.+. +|+. +-..|-...--++|+........-. ...-+-.++--.|++.+|+.+.+. 
T Consensus 15 ~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~al~~~~V~~cv~~Ia~~iA~l------ 87 (441) T protein:vir:94 15 SRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYK-DIEAIRHSDIFTAVMMIASDLARM------ 87 (441) T ss_pred ccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccc-hhhhhccHHHHHHHHHHHHhhccC------ Confidence 2222222 22222 2321 1111111111122322111100000 000112344445677766666542 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-C-cE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TV 145 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~-~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~-~~ 145 (510) || ++.-... .. . ++-++..|. +-| .+.-....+.++..+||+.+++..+. + .. T Consensus 88 p~-~~~~~~~-~~-----------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:94 88 PI-RVTVNGQ-IN-----------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred ce-eeecCcc-cc-----------c-------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 33 3321110 00 0 111222232 333 23445667777888999988876432 2 23 Q ss_pred EEEEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC Q lcl|NC_012418. 146 VAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG 223 (510) Q Consensus 146 ~~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~ 223 (510) ..+|+ ..+.+..|.+|++- |+ ++..+. ++ T Consensus 148 ~L~~i~~~~v~v~~d~~g~~~--~~-----------------------------------~~~~~~------------~~ 178 (441) T protein:vir:94 148 NLTFRKTSEIELKSDARGRLY--YF-----------------------------------HQRIDS------------NG 178 (441) T ss_pred EEEEEcCceeEEEECCCccEE--EE-----------------------------------EEEecc------------CC Confidence 44555 66777777776431 10 000000 00 Q ss_pred eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc-ccchhh---h Q lcl|NC_012418. 
224 VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDD---Y 299 (510) Q Consensus 224 ~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g-~~~p~~---~ 299 (510) ..+ ...|..++ +|..|+...+| .||.||..-+...+.......+.......-...|..++.-++ +.+++. + T Consensus 179 ~~~--~~~~~~~d--vih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~ 253 (441) T protein:vir:94 179 NNI--ERNVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRA 253 (441) T ss_pred cee--EEEEcccc--EEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHH Confidence 000 01111111 34445444444 799999998888888877777777777666777776654333 333332 1 Q ss_pred ccCC----Cc-------eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--ccCCCCCCCCHHHHHHHHH Q lcl|NC_012418. 300 QDAE----MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAE 366 (510) Q Consensus 300 ~~~~----~g-------~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~~TAtEi~~r~~ 366 (510) +..- .| .+++++. ...++... ..+.+. .+.....+..|-++|=.. ++..+...-+.+|. .. T Consensus 254 r~~~~~~~~G~~nag~~~vl~~G~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~---~~ 327 (441) T protein:vir:94 254 REEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NL 327 (441) T ss_pred HHHHHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH---HH Confidence 1110 01 1122222 22233221 123332 344566677888888432 11112222232332 12 Q ss_pred HHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc--ceeeecHHHHHHHHHHHHH-----HHHHHH--HHh Q lcl|NC_012418. 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK--PAIETGLPALSRSAAVQSM-----LNASQV--IAG 437 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~--~~~v~~is~L~raq~~~~~-----~~~~q~--l~~ 437 (510) .....|-|.+.++.+||-.-|..+ -..-.++ ...+...+...|+.-...+ .+.... +-. 
T Consensus 328 ~~~~tl~P~~~~ie~eln~kl~~~------------~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~g 395 (441) T protein:vir:94 328 DYLSTLKPYITCVCAELNFKFNDE------------YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred HHHHHHHHHHHHHHHHHhhhcccc------------ccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 233345555555555544333211 0000111 1111122333444332222 112111 112 Q ss_pred hcChhhHh--------hccCHHHH----------HHHHHHHcCCCHh Q lcl|NC_012418. 438 LAPIAQLD--------PRISLPKM----------MDTIWAAFSVDTS 466 (510) Q Consensus 438 ~~~~~q~~--------~~id~d~~----------~~~~a~~~Gvp~~ 466 (510) +.+++--+ ..+..+.+ .+ -...-|=.-+ T Consensus 396 l~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~-~~~kgGe~~e 441 (441) T protein:vir:94 396 LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD-KKLKGGEENE 441 (441) T ss_pred CCCCCCCCcceEeecccccccccccccccccccccc-cccCCCCCCC Confidence 22221100 00111100 00 0001111111 No 158 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=21.34 E-value=2.6 Score=18.29 Aligned_cols=366 Identities=11% Similarity=0.076 Sum_probs=138.9 Q ss_pred ChhHHHH------HHHHH-hhcc-chHHHHHHHHHhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCcCC Q lcl|NC_012418. 1 MKSTAAM------LWEKL-RDGS-VEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~~~~~~~------r~~~l-kr~~-~~~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 72 (510) =|++-++ .|.+. +|+. +-..|-...--++|+........-. ...-+-.++--.|++.+|+.+.+. 
T Consensus 15 ~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~al~~~~V~~cv~~Ia~~iA~l------ 87 (441) T protein:vir:79 15 SRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYK-DIEAIRHSDIFTAVMMIASDLARM------ 87 (441) T ss_pred ccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccc-hhhhhccHHHHHHHHHHHHhhccC------ Confidence 2222222 22222 2321 1111111111122322111100000 000112344445677766666542 Q ss_pred cccccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-C-cE Q lcl|NC_012418. 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TV 145 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~-~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~-~~ 145 (510) || ++.-... .. . ++-++..|. +-| .+.-....+.++..+||+.+++..+. + .. T Consensus 88 p~-~~~~~~~-~~-----------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:79 88 PI-RVTVNGQ-IN-----------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred ce-eeecCcc-cc-----------c-------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 33 3321110 00 0 111222232 333 23445667777888999988876432 2 23 Q ss_pred EEEEe--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecC Q lcl|NC_012418. 146 VAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDG 223 (510) Q Consensus 146 ~~~pl--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~ 223 (510) ..+|+ ..+.+..|.+|++- |+ ++..+. ++ T Consensus 148 ~L~~i~~~~v~v~~d~~g~~~--~~-----------------------------------~~~~~~------------~~ 178 (441) T protein:vir:79 148 NLTFRKTSEIELKSDARGRLY--YF-----------------------------------HQRIDS------------NG 178 (441) T ss_pred EEEEEcCceeEEEECCCccEE--EE-----------------------------------EEEecc------------CC Confidence 44555 66777777776431 10 000000 00 Q ss_pred eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCc-ccchhh---h Q lcl|NC_012418. 
224 VRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDD---Y 299 (510) Q Consensus 224 ~~~~~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g-~~~p~~---~ 299 (510) ..+ ...|..++ +|..|+...+| .||.||..-+...+.......+.......-...|..++.-++ +.+++. + T Consensus 179 ~~~--~~~~~~~d--vih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~ 253 (441) T protein:vir:79 179 NNI--ERNVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRA 253 (441) T ss_pred cee--EEEEcccc--EEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHH Confidence 000 01111111 34445444444 799999998888888877777777777666777776654333 333332 1 Q ss_pred ccCC----Cc-------eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc--ccCCCCCCCCHHHHHHHHH Q lcl|NC_012418. 300 QDAE----MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAE 366 (510) Q Consensus 300 ~~~~----~g-------~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~~TAtEi~~r~~ 366 (510) +..- .| .+++++. ...++... ..+.+. .+.....+..|-++|=.. ++..+...-+.+|. .. T Consensus 254 r~~~~~~~~G~~nag~~~vl~~G~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~---~~ 327 (441) T protein:vir:79 254 REEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NL 327 (441) T ss_pred HHHHHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH---HH Confidence 1110 01 1122222 22233221 123332 344566677888888432 11112222232332 12 Q ss_pred HHHHHhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCccccc--ceeeecHHHHHHHHHHHHH-----HHHHHH--HHh Q lcl|NC_012418. 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK--PAIETGLPALSRSAAVQSM-----LNASQV--IAG 437 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~--~~~v~~is~L~raq~~~~~-----~~~~q~--l~~ 437 (510) .....|-|.+.++.+||-.-|..+ -..-.++ ...+...+...|+.-...+ .+.... +-. 
T Consensus 328 ~~~~tl~P~~~~ie~eln~kl~~~------------~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~g 395 (441) T protein:vir:79 328 DYLSTLKPYITCVCAELNFKFNDE------------YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred HHHHHHHHHHHHHHHHHhhhcccc------------ccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 233345555555555544333211 0000111 1111122333444332222 112111 112 Q ss_pred hcChhhHh--------hccCHHHH----------HHHHHHHcCCCHh Q lcl|NC_012418. 438 LAPIAQLD--------PRISLPKM----------MDTIWAAFSVDTS 466 (510) Q Consensus 438 ~~~~~q~~--------~~id~d~~----------~~~~a~~~Gvp~~ 466 (510) +.+++--+ ..+..+.+ .+ -...-|=.-+ T Consensus 396 l~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~-~~~kgGe~~e 441 (441) T protein:vir:79 396 LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD-KKLKGGEENE 441 (441) T ss_pred CCCCCCCCcceEeecccccccccccccccccccccc-cccCCCCCCC Confidence 22221100 00111100 00 0001111111 No 159 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=21.01 E-value=2.7 Score=18.24 Aligned_cols=397 Identities=10% Similarity=0.013 Sum_probs=142.0 Q ss_pred HHHHHhhcc---------chHHHHHHHHHhcccccCCCCCCccccc--ccccc-chHHHHHHHHHHHHHHhhcCcCCccc Q lcl|NC_012418. 8 LWEKLRDGS---------VEQRAIEFAKTTLPYLMVDPMSGSRGVV--EHDFQ-SAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 8 r~~~lkr~~---------~~~~w~e~~~~~lP~~~~~~~~~~~~~~--~~~~d-stg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) .|.-++|+. -.+.|-....++- ..+.+....+ ... .+... ++--.|++.+|..+. . -||. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~~g-~~v~~~~al~~~~V~~~v~~Ia~~iA-~-----lp~~ 72 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVA-EPFAGAWQQG-VKADPEAVLSFHAVFACISLISQDIA-K-----MRLR 72 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhh-hhhcchhhcC-cccChHHhhccHHHHHHHHHHHHhhc-c-----CceE Confidence 555443321 1233544433221 1111111111 111 11122 222334444444333 2 2664 Q ss_pred ccCCChHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCeEEEEEeCCC-Cc-EEEEE Q lcl|NC_012418. 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-AT-VVAWS 149 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~~e~~~~~~l~~sn----fy~~~~~~~~dl~~~G~~~l~~~~~~-~~-~~~~p 149 (510) -..-..... ..++.. ..++..+.+=| .+.=...++.+|...||+.+++..+. ++ ...+| T Consensus 73 ~~~~~~~g~---------~~~~~~------~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~ 137 (454) T protein:vir:93 73 LMQTDAQGI---------RRETRR------GDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRI 137 (454) T ss_pred EEEeccCCc---------cchhhh------HHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEE Confidence 322111110 001111 11222233334 33445666667888999998876432 22 33455 Q ss_pred e--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHhhHhhhhhhhccCCCceEEEEEEEEeecCCCceEEEEEEEecCeeec Q lcl|NC_012418. 150 L--RSYAVRRDATGRWMDIVLKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVG 227 (510) Q Consensus 150 l--~~~~i~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~p~~sv~~e~~~~~~~ 227 (510) + ...-+..+.+|.+- | ++..... .... T Consensus 138 i~~~~v~v~~~~~g~~~--y-~~~~~~~------------------------------------------------~~~~ 166 (454) T protein:vir:93 138 LDWNRVEPLVADDGEVF--Y-RITPDRN------------------------------------------------CGIT 166 (454) T ss_pred EcCcceEEEEcCCCcEE--E-EEEeccc------------------------------------------------cccc Confidence 4 44444455554321 1 1100000 0000 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCcccchhhhccC----- Q lcl|NC_012418. 
228 EEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA----- 302 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~L~d~r~L~~l~~~~l~~~~~a~~p~~l~~~~g~~~p~~~~~~----- 302 (510) ....+..++ .+..|+....+..||.||...+...+.....+.+.......-...|..++.-++.+.++..... T Consensus 167 ~~~~~~~~e--ViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~ 244 (454) T protein:vir:93 167 EAVTVPARE--VIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWD 244 (454) T ss_pred eeEEecCcc--eEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHH Confidence 000111111 3444444455678999999999999998888888877777777777776654455555432211 Q ss_pred -----CC-c--eeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHhhcc---ccCC-CCCCCCHHHHHHHHHHHHH Q lcl|NC_012418. 303 -----EM-G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG---ANQR-DAERVTAEEVRITAEEAEN 370 (510) Q Consensus 303 -----~~-g--~~~pg~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~---~~~~-~~~~~TAtEi~~r~~E~~~ 370 (510) .| | .+++++. +..++.. +..+.+. .+..+..+..|-++|=.. +... ++..-+++|.. ..=... T Consensus 245 ~~~~g~n~g~~~vl~~g~-~~~~l~~-~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~--~~f~~~ 319 (454) T protein:vir:93 245 SGYTGENAGKTAILSNGA-KYNPTTF-SPVDSQT-VEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALE--QQYYSQ 319 (454) T ss_pred HHhcccccCCceeccCCc-eEEEccc-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHH--HHHHHH Confidence 11 1 1122222 2233332 1234443 344566677888888321 1111 11111222221 112344 Q ss_pred HhchhHhHHHHHHHHHHHHHHHHHHhhcCCCCCCcccccc--eeeecHHHHHHHHHHHHH-----HHHHHHHHhh--cCh Q lcl|NC_012418. 371 TLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKP--AIETGLPALSRSAAVQSM-----LNASQVIAGL--API 441 (510) Q Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~~~~~~~~--~~v~~is~L~raq~~~~~-----~~~~q~l~~~--~~~ 441 (510) .|.|.+.++..++-.-| + +.....++. 
.-+...+...|+.....+ .+....-..+ .++ T Consensus 320 ~l~P~~~~ie~~ln~~L------------~-~~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi 386 (454) T protein:vir:93 320 CLQTLIESIELLLDEAL------------E-TGENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPL 386 (454) T ss_pred HHHHHHHHHHHHHHHhh------------c-CCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 56677776666643222 1 111111111 001112223333322221 1222221111 122 Q ss_pred hhHhh------ccCHHHHHHHHH-----HHcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHhhHHHhhhhhhh Q lcl|NC_012418. 442 AQLDP------RISLPKMMDTIW-----AAFSVDTSQFYKSEEELQAEAEQRRQQAAQAQAAQETLLEGASDM 503 (510) Q Consensus 442 ~q~~~------~id~d~~~~~~a-----~~~Gvp~~~i~rs~~ev~~~r~q~~q~~~~~~~~~~~~~~ga~~~ 503 (510) +--+. .+-.+.+-..-. ...|.|.+.- ....+...-+.-. ....-...+..-|..++ T Consensus 387 ~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~~~~~----e~~~d~~~~~~~~~~~~ 454 (454) T protein:vir:93 387 AGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVP-QAVAASDGNKAIT----ETEHDAVKAMFRGILKK 454 (454) T ss_pred CCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCC-CCCCCCCCCCCcc----CCccchhhhhhhhhhcC Confidence 11010 011111111000 0011111100 0000000000000 00000000000011111 Done!