Query lcl|NC_019916.1_cdsid_YP_007236689.1 [gene=G168_gp03] [protein=minor capsid protein] [protein_id=YP_007236689.1] [location=1861..3402] Match_columns 513 No_of_seqs 141 out of 474 Neff 9.5 Searched_HMMs 1612 Date Thu Nov 7 15:55:29 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:94546 Length: 506 100.0 1E-109 7E-113 617.8 51.4 496 1-513 5-504 (506) 2 protein:vir:97171 Length: 512 100.0 2E-105 1E-108 594.8 52.3 479 1-511 13-512 (512) 3 protein:vir:9306 Length: 511 # 100.0 4E-105 2E-108 593.0 53.1 479 1-511 19-511 (511) 4 protein:vir:99781 Length: 511 100.0 8E-105 5E-108 591.2 53.1 479 1-509 19-511 (511) 5 protein:vir:103951 Length: 511 100.0 1E-104 6E-108 590.8 53.0 479 1-511 19-511 (511) 6 protein:vir:96366 Length: 511 100.0 2E-104 1E-107 588.9 53.3 479 1-511 19-511 (511) 7 protein:vir:78805 Length: 511 100.0 2E-104 1E-107 588.9 53.3 479 1-511 19-511 (511) 8 protein:vir:96240 Length: 511 100.0 3E-104 2E-107 588.2 53.0 479 1-511 19-511 (511) 9 protein:vir:2732 Length: 501 # 100.0 3E-103 2E-106 582.4 52.1 473 1-509 18-501 (501) 10 protein:vir:4898 Length: 502 # 100.0 1E-102 8E-106 579.2 52.2 471 1-507 19-502 (502) 11 protein:vir:96494 Length: 501 100.0 2E-102 1E-105 578.2 51.3 473 1-509 18-501 (501) 12 protein:vir:106571 Length: 499 100.0 9E-100 6E-103 563.5 51.0 472 1-513 1-493 (499) 13 protein:vir:94805 Length: 492 100.0 1.1E-99 7E-103 563.1 49.0 452 1-501 27-492 (492) 14 protein:vir:1236 Length: 483 # 100.0 1.6E-99 1E-102 562.2 49.1 451 1-501 19-483 (483) 15 protein:vir:99522 Length: 470 100.0 5.9E-99 4E-102 559.1 51.9 459 1-497 1-470 (470) 16 protein:vir:3964 Length: 453 # 100.0 5.1E-99 3E-102 559.4 51.5 450 1-502 1-453 (453) 17 protein:vir:97336 Length: 492 100.0 3.8E-99 2E-102 560.2 49.1 452 1-501 26-492 (492) 18 protein:vir:106639 Length: 481 100.0 2.1E-98 1E-101 556.1 52.2 463 1-504 6-481 (481) 19 protein:vir:95806 Length: 440 100.0 8.8E-99 5E-102 558.2 47.8 434 26-495 1-440 (440) 20 protein:vir:3609 Length: 452 # 100.0 8.8E-98 5E-101 552.7 51.3 449 1-509 1-452 (452) 21 protein:vir:733 Length: 453 # 100.0 6.7E-98 4E-101 553.3 50.0 447 2-493 1-453 (453) 22 protein:vir:102330 Length: 451 100.0 3.2E-98 2E-101 555.1 48.2 433 18-487 1-451 (451) 23 protein:vir:105461 Length: 470 100.0 9.7E-98 6E-101 552.5 49.1 439 18-496 1-470 (470) 24 protein:vir:95899 Length: 474 100.0 2.3E-97 1E-100 550.4 49.7 451 1-501 1-474 (474) 25 protein:vir:96266 Length: 474 100.0 2.3E-97 1E-100 550.4 49.7 451 1-501 1-474 (474) 26 protein:vir:9922 Length: 489 # 100.0 4E-97 2E-100 549.1 50.8 478 4-506 1-489 (489) 27 protein:vir:105889 Length: 474 100.0 2.3E-97 1E-100 550.4 49.3 451 2-513 1-474 (474) 28 protein:vir:94101 Length: 474 100.0 2.3E-97 1E-100 550.4 49.3 451 2-513 1-474 (474) 29 protein:vir:9871 Length: 429 # 100.0 1.3E-96 8E-100 546.3 49.9 426 18-495 1-429 (429) 30 protein:vir:102950 Length: 471 100.0 9E-97 6E-100 547.1 48.7 438 13-491 1-471 (471) 31 protein:vir:93747 Length: 472 100.0 1.4E-96 9E-100 546.0 49.4 452 1-501 1-472 (472) 32 protein:vir:105292 Length: 478 100.0 2E-96 1.2E-99 545.2 50.0 452 1-511 1-478 (478) 33 protein:vir:107112 Length: 478 100.0 2.3E-96 1.5E-99 544.9 49.4 452 1-501 1-478 (478) 34 protein:vir:94498 Length: 474 100.0 6E-96 3.7E-99 542.6 50.1 450 1-501 1-474 (474) 35 protein:vir:97447 Length: 474 100.0 6E-96 3.7E-99 542.6 50.1 450 1-501 1-474 (474) 36 protein:vir:79043 Length: 479 100.0 1.4E-95 8.4E-99 540.7 49.5 449 1-499 6-479 (479) 37 protein:vir:5961 Length: 503 # 100.0 4.1E-95 2.6E-98 538.0 49.9 467 1-507 1-503 (503) 38 protein:vir:96839 Length: 474 100.0 6.7E-95 4.2E-98 536.9 49.0 448 1-513 1-474 (474) 39 protein:vir:96179 Length: 468 100.0 1.3E-94 8E-98 535.3 49.9 442 1-490 1-468 (468) 40 protein:vir:95113 Length: 474 100.0 1.7E-94 1.1E-97 534.6 49.2 451 1-513 1-474 (474) 41 protein:vir:78083 Length: 537 100.0 2.7E-94 1.7E-97 533.6 47.7 468 1-513 1-522 (537) 42 protein:vir:2427 Length: 485 # 100.0 1.2E-81 7.3E-85 464.3 46.6 457 9-510 1-485 (485) 43 protein:vir:4223 Length: 486 # 100.0 3.2E-81 2E-84 461.9 47.1 455 9-507 1-486 (486) 44 protein:vir:78227 Length: 480 100.0 5.7E-81 3.6E-84 460.5 47.2 453 17-513 1-479 (480) 45 protein:vir:78537 Length: 480 100.0 1E-80 6.4E-84 459.1 47.8 450 17-513 1-479 (480) 46 protein:vir:7768 Length: 484 # 100.0 5.6E-81 3.5E-84 460.6 46.0 460 1-512 1-484 (484) 47 protein:vir:2500 Length: 501 # 100.0 1.3E-80 8E-84 458.6 47.3 461 1-513 1-498 (501) 48 protein:vir:104082 Length: 485 100.0 2.7E-79 1.7E-82 451.3 47.4 456 9-508 1-485 (485) 49 protein:vir:2341 Length: 488 # 100.0 4.4E-79 2.7E-82 450.2 46.7 452 12-513 1-487 (488) 50 protein:vir:105819 Length: 456 100.0 7.4E-79 4.6E-82 448.9 45.2 435 15-493 1-456 (456) 51 protein:vir:102602 Length: 456 100.0 7.4E-79 4.6E-82 448.9 45.2 435 15-493 1-456 (456) 52 protein:vir:99072 Length: 479 100.0 2.5E-78 1.5E-81 446.0 44.7 451 9-513 1-478 (479) 53 protein:vir:80680 Length: 441 100.0 5.4E-78 3.3E-81 444.2 43.5 427 15-492 1-441 (441) 54 protein:vir:7987 Length: 456 # 100.0 2.3E-77 1.4E-80 440.8 46.2 435 15-496 1-456 (456) 55 protein:vir:99916 Length: 504 100.0 1.4E-75 9E-79 430.9 45.0 477 1-513 1-503 (504) 56 protein:vir:98444 Length: 434 100.0 3.7E-71 2.3E-74 406.7 41.5 411 53-504 1-434 (434) 57 protein:vir:9751 Length: 422 # 100.0 4.3E-68 2.7E-71 389.9 38.5 406 18-474 1-422 (422) 58 protein:vir:9568 Length: 410 # 100.0 8.6E-68 5.3E-71 388.3 37.1 395 31-476 1-410 (410) 59 protein:vir:8184 Length: 474 # 100.0 1.9E-66 1.2E-69 380.9 42.5 455 1-492 1-474 (474) 60 protein:vir:94742 Length: 409 100.0 3.1E-67 1.9E-70 385.2 37.8 394 18-461 1-409 (409) 61 protein:vir:1634 Length: 409 # 100.0 1.3E-65 7.9E-69 376.4 37.8 394 18-461 1-409 (409) 62 protein:vir:38 Length: 496 # N 100.0 7.1E-58 4.4E-61 333.9 44.7 456 1-497 1-496 (496) 63 protein:vir:80959 Length: 499 100.0 4.2E-54 2.6E-57 313.2 45.7 444 1-497 1-499 (499) 64 protein:vir:79703 Length: 505 100.0 4.1E-47 2.6E-50 274.9 44.7 456 2-484 1-505 (505) 65 protein:vir:1587 Length: 508 # 100.0 6.6E-45 4.1E-48 262.8 45.6 451 2-495 1-508 (508) 66 protein:vir:101494 Length: 527 100.0 3.1E-46 1.9E-49 270.0 37.2 469 4-509 1-527 (527) 67 protein:vir:102239 Length: 527 100.0 3.6E-46 2.2E-49 269.7 37.2 469 4-509 1-527 (527) 68 protein:vir:3028 Length: 500 # 100.0 1.4E-43 8.4E-47 255.6 44.6 459 2-497 1-500 (500) 69 protein:vir:9815 Length: 500 # 100.0 1.4E-43 8.4E-47 255.6 44.6 459 2-497 1-500 (500) 70 protein:vir:4782 Length: 522 # 100.0 1.3E-41 8.2E-45 244.7 44.2 456 13-498 1-522 (522) 71 protein:vir:7430 Length: 563 # 100.0 1.9E-41 1.2E-44 243.8 36.2 475 5-513 1-555 (563) 72 protein:vir:78907 Length: 518 100.0 3.3E-39 2.1E-42 231.5 42.8 453 1-494 4-518 (518) 73 protein:vir:98883 Length: 517 100.0 2E-35 1.2E-38 210.9 46.7 450 13-510 1-517 (517) 74 protein:vir:97265 Length: 513 99.9 1.2E-24 7.6E-28 151.7 40.0 448 13-510 1-513 (513) 75 protein:vir:94956 Length: 452 99.9 2E-25 1.2E-28 156.0 35.3 421 13-510 1-452 (452) 76 protein:vir:95149 Length: 501 99.9 1.5E-20 9.3E-24 129.3 36.8 432 13-503 1-501 (501) 77 protein:vir:80453 Length: 535 99.9 3.8E-19 2.4E-22 121.6 40.7 464 1-509 1-535 (535) 78 protein:vir:96783 Length: 488 99.8 1.1E-19 6.5E-23 124.6 32.9 437 1-486 1-488 (488) 79 protein:vir:78393 Length: 489 99.8 3.7E-18 2.3E-21 116.2 39.4 438 1-506 1-489 (489) 80 protein:vir:95014 Length: 491 99.8 6.1E-18 3.8E-21 114.9 35.9 437 1-506 1-491 (491) 81 protein:vir:93630 Length: 776 99.8 2.7E-19 1.6E-22 122.4 27.2 471 1-513 22-674 (776) 82 protein:vir:108295 Length: 711 99.8 5.5E-17 3.4E-20 109.7 34.6 475 1-513 12-660 (711) 83 protein:vir:817 Length: 714 # 99.7 4.7E-16 2.9E-19 104.6 37.7 465 2-513 1-644 (714) 84 protein:vir:9950 Length: 714 # 99.7 4.7E-16 2.9E-19 104.6 37.7 465 2-513 1-644 (714) 85 protein:vir:10117 Length: 714 99.7 4.7E-16 2.9E-19 104.6 37.7 465 2-513 1-644 (714) 86 protein:vir:3296 Length: 714 # 99.7 4.7E-16 2.9E-19 104.6 37.7 465 2-513 1-644 (714) 87 protein:vir:2764 Length: 714 # 99.7 4.7E-16 2.9E-19 104.6 37.7 465 2-513 1-644 (714) 88 protein:vir:104437 Length: 714 99.7 1.4E-15 9E-19 101.9 36.8 466 1-513 1-628 (714) 89 protein:vir:80040 Length: 461 99.7 1.2E-15 7.4E-19 102.4 31.9 418 1-513 1-460 (461) 90 protein:vir:105619 Length: 772 99.6 1.5E-14 9.3E-18 96.4 31.8 473 1-513 3-647 (772) 91 protein:vir:8846 Length: 705 # 99.6 9.1E-14 5.6E-17 92.1 30.1 447 4-513 1-615 (705) 92 protein:vir:79538 Length: 502 99.5 2.3E-12 1.4E-15 84.4 40.1 439 13-512 1-502 (502) 93 protein:vir:100920 Length: 725 99.5 8.5E-14 5.3E-17 92.2 28.2 466 13-513 1-635 (725) 94 protein:vir:96738 Length: 505 99.5 5.4E-12 3.3E-15 82.4 38.0 445 13-509 1-505 (505) 95 protein:vir:389 Length: 530 # 99.5 4.6E-12 2.8E-15 82.8 34.0 459 13-508 1-530 (530) 96 protein:vir:105429 Length: 708 99.4 2.5E-12 1.5E-15 84.2 30.4 473 15-513 1-654 (708) 97 protein:vir:80165 Length: 651 99.4 9.7E-12 6E-15 81.0 32.6 462 4-513 1-638 (651) 98 protein:vir:77597 Length: 725 99.4 3.2E-13 2E-16 89.1 24.1 462 13-513 1-635 (725) 99 protein:vir:9263 Length: 725 # 99.4 9E-13 5.6E-16 86.6 26.2 467 13-513 1-635 (725) 100 protein:vir:3420 Length: 533 # 99.4 2.4E-11 1.5E-14 78.8 35.7 457 13-512 1-533 (533) 101 protein:vir:172 Length: 708 # 99.4 5.9E-12 3.6E-15 82.2 28.8 467 15-513 1-647 (708) 102 protein:vir:105520 Length: 706 99.4 1.6E-11 9.7E-15 79.8 30.0 460 13-513 1-638 (706) 103 protein:vir:6382 Length: 553 # 99.4 5.2E-11 3.2E-14 77.0 35.6 475 1-513 1-553 (553) 104 protein:vir:5249 Length: 437 # 99.4 6.8E-12 4.2E-15 81.8 27.5 396 35-512 1-437 (437) 105 protein:vir:79647 Length: 435 99.3 8.6E-12 5.3E-15 81.3 27.0 410 1-510 5-435 (435) 106 protein:vir:107662 Length: 427 99.3 2.1E-11 1.3E-14 79.2 26.5 406 27-513 1-427 (427) 107 protein:vir:104338 Length: 422 99.3 6.9E-11 4.3E-14 76.3 28.7 399 35-513 1-422 (422) 108 protein:vir:95542 Length: 548 99.3 2.3E-10 1.4E-13 73.4 37.3 443 13-513 1-515 (548) 109 protein:vir:95449 Length: 584 99.2 5.9E-10 3.7E-13 71.2 36.0 438 1-487 1-584 (584) 110 protein:vir:94049 Length: 532 99.2 2.2E-10 1.4E-13 73.6 27.3 442 1-513 35-526 (532) 111 protein:vir:10321 Length: 495 99.2 8.3E-10 5.2E-13 70.4 33.7 448 2-509 1-495 (495) 112 protein:vir:107742 Length: 537 99.2 1.6E-10 9.7E-14 74.3 25.0 444 1-513 47-532 (537) 113 protein:vir:96068 Length: 765 99.1 2.3E-10 1.5E-13 73.4 25.2 441 1-513 37-565 (765) 114 protein:vir:3520 Length: 720 # 99.1 1.3E-09 7.9E-13 69.3 34.5 459 15-513 1-624 (720) 115 protein:vir:99563 Length: 862 99.0 2.4E-09 1.5E-12 67.8 25.4 440 1-513 74-581 (862) 116 protein:vir:95821 Length: 763 99.0 7.4E-09 4.6E-12 65.2 32.3 448 1-513 8-656 (763) 117 protein:vir:3139 Length: 599 # 98.9 1.4E-08 9E-12 63.6 28.9 445 1-491 5-599 (599) 118 protein:vir:4156 Length: 542 # 98.7 8.6E-08 5.4E-11 59.3 25.0 442 8-513 1-476 (542) 119 protein:vir:6240 Length: 457 # 98.5 3.5E-07 2.2E-10 55.9 31.4 425 24-513 1-453 (457) 120 protein:vir:102668 Length: 547 98.5 3.6E-07 2.2E-10 55.9 39.0 429 18-504 1-547 (547) 121 protein:vir:63755 Length: 547 98.5 4.1E-07 2.6E-10 55.6 28.4 455 1-513 4-528 (547) 122 protein:vir:1326 Length: 457 # 98.4 9.8E-07 6.1E-10 53.5 30.0 425 24-513 1-456 (457) 123 protein:vir:94709 Length: 522 98.3 1.7E-06 1.1E-09 52.2 42.0 445 13-504 1-522 (522) 124 protein:vir:80644 Length: 551 98.3 2E-06 1.2E-09 51.9 30.6 451 1-513 22-532 (551) 125 protein:vir:95599 Length: 563 98.3 2E-06 1.2E-09 51.9 25.9 451 1-513 1-560 (563) 126 protein:vir:99312 Length: 563 98.3 2E-06 1.2E-09 51.9 25.9 451 1-513 1-560 (563) 127 protein:vir:3153 Length: 467 # 98.3 2E-06 1.2E-09 51.9 30.2 390 66-512 1-467 (467) 128 protein:vir:3843 Length: 397 # 98.3 2E-06 1.2E-09 51.8 28.3 385 24-507 1-397 (397) 129 protein:vir:100150 Length: 437 98.2 2.3E-06 1.4E-09 51.5 28.6 406 27-510 1-437 (437) 130 protein:vir:10447 Length: 536 98.2 2.4E-06 1.5E-09 51.4 38.0 443 13-508 1-536 (536) 131 protein:vir:94599 Length: 641 98.2 2.8E-06 1.7E-09 51.0 34.2 462 1-513 1-609 (641) 132 protein:vir:8883 Length: 543 # 98.1 4.2E-06 2.6E-09 50.1 37.2 451 10-511 1-543 (543) 133 protein:vir:1538 Length: 535 # 98.1 5E-06 3.1E-09 49.6 40.1 442 13-508 1-535 (535) 134 protein:vir:3361 Length: 535 # 98.1 5.3E-06 3.3E-09 49.5 38.4 442 13-508 1-535 (535) 135 protein:vir:102080 Length: 429 98.1 5.3E-06 3.3E-09 49.5 29.0 398 24-508 1-429 (429) 136 protein:vir:4194 Length: 540 # 98.0 6.2E-06 3.9E-09 49.1 24.8 439 8-513 1-470 (540) 137 protein:vir:94572 Length: 535 98.0 6.5E-06 4E-09 49.0 36.3 443 9-508 1-535 (535) 138 protein:vir:4952 Length: 386 # 98.0 7.5E-06 4.6E-09 48.7 27.9 378 24-513 1-386 (386) 139 protein:vir:102727 Length: 945 98.0 8E-06 5E-09 48.5 28.2 431 1-513 62-537 (945) 140 protein:vir:2198 Length: 536 # 98.0 8.8E-06 5.4E-09 48.3 40.2 443 13-508 1-536 (536) 141 protein:vir:1380 Length: 422 # 97.9 9.6E-06 6E-09 48.1 29.2 399 24-509 1-422 (422) 142 protein:vir:81152 Length: 411 97.9 1E-05 6.3E-09 48.0 28.5 389 2-496 1-411 (411) 143 protein:vir:102855 Length: 432 97.9 1.3E-05 7.8E-09 47.4 29.9 404 13-508 1-432 (432) 144 protein:vir:105002 Length: 432 97.9 1.3E-05 7.8E-09 47.4 29.9 404 13-508 1-432 (432) 145 protein:vir:107605 Length: 432 97.9 1.3E-05 7.8E-09 47.4 29.9 404 13-508 1-432 (432) 146 protein:vir:96579 Length: 576 97.8 1.5E-05 9.5E-09 47.0 18.9 451 1-513 1-537 (576) 147 protein:vir:93610 Length: 454 97.8 1.6E-05 1E-08 46.8 31.6 418 23-513 1-442 (454) 148 protein:vir:102118 Length: 409 97.8 1.7E-05 1E-08 46.8 30.3 386 26-510 1-409 (409) 149 protein:vir:80796 Length: 574 97.8 1.8E-05 1.1E-08 46.6 28.4 451 1-513 1-529 (574) 150 protein:vir:100039 Length: 522 97.8 2.2E-05 1.3E-08 46.1 37.1 431 18-509 1-522 (522) 151 protein:vir:7407 Length: 392 # 97.7 3.1E-05 1.9E-08 45.3 26.6 384 1-500 2-392 (392) 152 protein:vir:1266 Length: 416 # 97.6 3.5E-05 2.2E-08 45.0 30.6 391 21-502 1-416 (416) 153 protein:vir:4598 Length: 416 # 97.6 3.5E-05 2.2E-08 45.0 27.4 402 2-509 1-416 (416) 154 protein:vir:81095 Length: 416 97.6 3.5E-05 2.2E-08 45.0 27.4 402 2-509 1-416 (416) 155 protein:vir:95315 Length: 559 97.6 3.6E-05 2.3E-08 44.9 41.1 449 13-511 1-559 (559) 156 protein:vir:8418 Length: 409 # 97.5 4.8E-05 3E-08 44.3 29.5 391 24-510 1-409 (409) 157 protein:vir:103860 Length: 528 97.5 4.8E-05 3E-08 44.3 32.4 407 1-513 17-452 (528) 158 protein:vir:1023 Length: 392 # 97.5 5.7E-05 3.6E-08 43.8 25.5 381 22-500 1-392 (392) 159 protein:vir:3989 Length: 392 # 97.5 5.7E-05 3.6E-08 43.8 25.5 381 22-500 1-392 (392) 160 protein:vir:107822 Length: 555 97.5 6.2E-05 3.8E-08 43.7 39.6 438 13-507 1-555 (555) 161 protein:vir:98506 Length: 555 97.5 6.2E-05 3.8E-08 43.7 39.6 438 13-507 1-555 (555) 162 protein:vir:107404 Length: 555 97.5 6.2E-05 3.8E-08 43.7 39.6 438 13-507 1-555 (555) 163 protein:vir:7853 Length: 518 # 97.5 6.2E-05 3.9E-08 43.6 29.7 415 25-513 1-455 (518) 164 protein:vir:9359 Length: 348 # 97.4 6.8E-05 4.2E-08 43.4 25.8 332 85-502 1-348 (348) 165 protein:vir:101648 Length: 518 97.4 8.6E-05 5.4E-08 42.9 30.9 418 13-513 1-455 (518) 166 protein:vir:4454 Length: 414 # 97.3 9.1E-05 5.6E-08 42.8 32.3 392 24-512 1-414 (414) 167 protein:vir:105064 Length: 421 97.3 0.0001 6.2E-08 42.5 25.5 399 17-513 1-420 (421) 168 protein:vir:79772 Length: 648 97.3 0.0001 6.3E-08 42.5 31.7 424 1-513 33-506 (648) 169 protein:vir:99232 Length: 526 97.2 0.00012 7.3E-08 42.1 35.8 403 1-513 17-447 (526) 170 protein:vir:105782 Length: 449 97.2 0.00013 8.3E-08 41.8 28.2 414 18-510 1-449 (449) 171 protein:vir:101541 Length: 694 97.1 0.00017 1E-07 41.3 19.9 439 1-513 51-559 (694) 172 protein:vir:4828 Length: 382 # 97.1 0.00019 1.2E-07 41.0 26.1 371 24-513 1-382 (382) 173 protein:vir:99672 Length: 532 97.1 0.00019 1.2E-07 41.0 36.6 436 13-509 1-532 (532) 174 protein:vir:78696 Length: 542 97.0 0.00021 1.3E-07 40.7 41.0 434 18-513 1-541 (542) 175 protein:vir:4995 Length: 384 # 97.0 0.00022 1.4E-07 40.6 23.6 377 24-494 1-384 (384) 176 protein:vir:79233 Length: 526 96.9 0.00025 1.6E-07 40.3 35.3 400 1-513 17-447 (526) 177 protein:vir:103219 Length: 201 96.9 0.00026 1.6E-07 40.2 15.5 196 220-513 1-201 (201) 178 protein:vir:7321 Length: 556 # 96.9 0.00029 1.8E-07 40.0 40.1 444 13-508 1-556 (556) 179 protein:vir:5737 Length: 419 # 96.8 0.00035 2.2E-07 39.5 29.4 393 21-513 1-412 (419) 180 protein:vir:3648 Length: 695 # 96.6 0.00047 2.9E-07 38.8 21.0 432 1-513 52-560 (695) 181 protein:vir:103765 Length: 549 96.6 0.00049 3.1E-07 38.7 40.1 439 13-506 1-549 (549) 182 protein:vir:78589 Length: 695 96.6 0.00051 3.2E-07 38.6 20.7 432 1-513 52-560 (695) 183 protein:vir:2683 Length: 412 # 96.6 0.00051 3.2E-07 38.6 29.7 392 24-502 1-412 (412) 184 protein:vir:93943 Length: 409 96.4 0.00066 4.1E-07 38.0 26.9 396 13-502 1-409 (409) 185 protein:vir:483 Length: 413 # 96.3 0.00078 4.9E-07 37.6 32.1 394 21-512 1-413 (413) 186 protein:vir:10362 Length: 432 96.3 0.00082 5.1E-07 37.5 27.3 391 18-512 1-432 (432) 187 protein:vir:94426 Length: 409 96.2 0.00092 5.7E-07 37.2 28.2 390 18-502 1-409 (409) 188 protein:vir:4854 Length: 386 # 96.1 0.00094 5.9E-07 37.2 29.2 376 24-501 1-386 (386) 189 protein:vir:100691 Length: 535 96.1 0.0011 6.5E-07 36.9 35.9 449 1-513 1-532 (535) 190 protein:vir:96980 Length: 409 96.1 0.0011 6.6E-07 36.9 28.5 394 13-502 1-409 (409) 191 protein:vir:101647 Length: 460 96.0 0.0011 6.7E-07 36.9 28.6 409 19-512 1-460 (460) 192 protein:vir:106716 Length: 698 95.9 0.0013 7.9E-07 36.4 22.8 440 1-513 52-574 (698) 193 protein:vir:97060 Length: 432 95.7 0.0016 9.7E-07 36.0 27.0 402 18-513 1-432 (432) 194 protein:vir:105641 Length: 516 95.7 0.0017 1E-06 35.8 35.7 426 1-502 1-516 (516) 195 protein:vir:4337 Length: 434 # 95.6 0.0018 1.1E-06 35.7 28.4 403 13-510 1-434 (434) 196 protein:vir:78641 Length: 278 95.4 0.0021 1.3E-06 35.3 23.9 269 85-428 1-278 (278) 197 protein:vir:1785 Length: 555 # 95.3 0.0023 1.4E-06 35.1 38.5 432 18-512 1-555 (555) 198 protein:vir:9408 Length: 441 # 95.3 0.0023 1.4E-06 35.0 31.7 411 4-509 1-441 (441) 199 protein:vir:79984 Length: 441 95.3 0.0023 1.4E-06 35.0 31.7 411 4-509 1-441 (441) 200 protein:vir:81072 Length: 432 95.1 0.0028 1.8E-06 34.5 28.3 394 18-512 1-432 (432) 201 protein:vir:96988 Length: 516 94.8 0.0034 2.1E-06 34.1 36.3 424 9-502 1-516 (516) 202 protein:vir:345 Length: 663 # 94.5 0.0042 2.6E-06 33.6 25.8 469 1-513 1-603 (663) 203 protein:vir:1082 Length: 359 # 94.4 0.0046 2.8E-06 33.4 23.4 346 24-458 1-359 (359) 204 protein:vir:7017 Length: 515 # 94.0 0.0057 3.6E-06 32.9 37.4 427 4-502 1-515 (515) 205 protein:vir:3868 Length: 417 # 93.7 0.0065 4E-06 32.6 27.8 382 42-511 1-417 (417) 206 protein:vir:1986 Length: 512 # 93.7 0.0068 4.2E-06 32.5 33.9 403 1-513 14-449 (512) 207 protein:vir:189 Length: 424 # 93.4 0.0077 4.8E-06 32.2 28.9 393 13-499 1-424 (424) 208 protein:vir:4509 Length: 424 # 92.8 0.0099 6.1E-06 31.6 27.6 399 9-513 1-424 (424) 209 protein:vir:95378 Length: 406 92.6 0.011 6.6E-06 31.4 29.6 384 24-511 1-406 (406) 210 protein:vir:1431 Length: 419 # 92.6 0.011 6.7E-06 31.4 29.1 395 21-513 1-413 (419) 211 protein:vir:80333 Length: 419 92.5 0.011 6.8E-06 31.3 27.1 393 18-513 1-413 (419) 212 protein:vir:98396 Length: 441 92.2 0.013 7.8E-06 31.0 34.1 418 4-509 1-441 (441) 213 protein:vir:9702 Length: 406 # 91.6 0.015 9.5E-06 30.5 26.2 390 24-511 1-406 (406) 214 protein:vir:100882 Length: 383 91.4 0.016 1E-05 30.4 28.9 375 13-498 1-383 (383) 215 protein:vir:960 Length: 413 # 91.3 0.017 1E-05 30.4 26.6 390 1-507 4-413 (413) 216 protein:vir:103330 Length: 517 91.2 0.017 1.1E-05 30.3 40.8 415 13-496 1-517 (517) 217 protein:vir:99452 Length: 651 91.2 0.017 1.1E-05 30.3 23.9 458 1-513 1-541 (651) 218 protein:vir:99853 Length: 488 91.0 0.018 1.1E-05 30.1 30.5 395 13-513 1-418 (488) 219 protein:vir:100187 Length: 385 89.2 0.028 1.7E-05 29.1 29.7 368 24-497 1-385 (385) 220 protein:vir:104500 Length: 537 89.1 0.028 1.8E-05 29.1 25.0 453 4-513 1-536 (537) 221 protein:vir:78161 Length: 355 88.8 0.03 1.9E-05 28.9 20.2 318 141-513 1-338 (355) 222 protein:vir:79063 Length: 491 88.7 0.03 1.9E-05 28.9 28.8 398 1-513 1-421 (491) 223 protein:vir:107880 Length: 491 88.5 0.032 2E-05 28.8 33.3 397 1-513 1-421 (491) 224 protein:vir:1884 Length: 424 # 88.0 0.035 2.2E-05 28.6 31.8 393 2-499 1-424 (424) 225 protein:vir:6322 Length: 510 # 88.0 0.035 2.2E-05 28.5 37.1 407 23-492 1-510 (510) 226 protein:vir:8100 Length: 466 # 81.5 0.085 5.3E-05 26.5 28.1 402 24-512 1-466 (466) 227 protein:vir:100650 Length: 395 79.2 0.11 6.6E-05 25.9 25.0 372 24-513 1-395 (395) 228 protein:vir:9507 Length: 395 # 79.2 0.11 6.6E-05 25.9 25.0 372 24-513 1-395 (395) 229 protein:vir:101289 Length: 395 79.2 0.11 6.6E-05 25.9 25.0 372 24-513 1-395 (395) 230 protein:vir:77981 Length: 448 76.9 0.13 8.1E-05 25.4 25.0 414 1-513 1-443 (448) 231 protein:vir:106999 Length: 564 72.1 0.19 0.00012 24.6 23.8 476 1-513 1-562 (564) 232 protein:vir:100249 Length: 431 72.1 0.19 0.00012 24.6 30.9 389 24-503 1-431 (431) 233 protein:vir:95965 Length: 385 70.9 0.2 0.00013 24.4 23.4 359 24-513 1-385 (385) 234 protein:vir:78942 Length: 510 69.3 0.22 0.00014 24.1 39.1 421 23-499 1-510 (510) 235 protein:vir:94666 Length: 723 69.0 0.23 0.00014 24.1 32.7 398 49-513 1-444 (723) 236 protein:vir:108215 Length: 469 67.4 0.25 0.00016 23.9 26.9 416 1-513 1-468 (469) 237 protein:vir:6210 Length: 394 # 66.8 0.26 0.00016 23.8 25.5 375 24-512 1-394 (394) 238 protein:vir:104259 Length: 403 65.6 0.28 0.00017 23.6 26.7 378 13-505 1-403 (403) 239 protein:vir:80211 Length: 514 64.9 0.29 0.00018 23.5 37.5 420 18-495 1-514 (514) 240 protein:vir:4089 Length: 395 # 56.8 0.45 0.00028 22.5 25.2 372 13-513 1-394 (395) 241 protein:vir:80134 Length: 403 55.0 0.49 0.0003 22.3 28.2 385 24-512 1-403 (403) 242 protein:vir:5665 Length: 511 # 54.0 0.51 0.00032 22.2 21.1 422 1-485 39-511 (511) 243 protein:vir:79150 Length: 368 50.5 0.61 0.00038 21.8 18.1 330 31-442 1-368 (368) 244 protein:vir:5691 Length: 344 # 47.4 0.7 0.00044 21.4 21.3 295 27-433 1-344 (344) 245 protein:vir:104892 Length: 558 44.2 0.82 0.00051 21.1 26.8 466 1-513 1-552 (558) 246 protein:vir:98816 Length: 446 42.2 0.9 0.00056 20.8 21.8 409 13-464 1-446 (446) 247 protein:vir:6058 Length: 344 # 40.8 0.95 0.00059 20.7 20.0 301 27-432 1-344 (344) 248 protein:vir:98567 Length: 340 40.1 0.99 0.00061 20.6 19.7 291 46-431 1-340 (340) 249 protein:vir:267 Length: 348 # 38.9 1 0.00065 20.5 20.9 318 31-438 1-348 (348) 250 protein:vir:78310 Length: 376 38.7 1.1 0.00065 20.5 23.5 353 24-498 1-376 (376) 251 protein:vir:79207 Length: 351 38.1 1.1 0.00067 20.4 20.5 316 40-441 1-351 (351) 252 protein:vir:78191 Length: 351 36.2 1.2 0.00074 20.2 20.1 316 40-435 1-351 (351) 253 protein:vir:1150 Length: 350 # 36.1 1.2 0.00074 20.2 18.6 315 27-431 1-350 (350) 254 protein:vir:81218 Length: 423 30.9 1.5 0.00095 19.6 29.7 391 24-512 1-423 (423) 255 protein:vir:1661 Length: 378 # 25.7 2 0.0013 18.9 23.1 342 24-510 1-378 (378) 256 protein:vir:8317 Length: 409 # 23.4 2.3 0.0014 18.6 26.3 356 24-499 1-409 (409) 257 protein:vir:79511 Length: 448 22.9 2.4 0.0015 18.5 31.2 410 1-513 1-443 (448) 258 protein:vir:5839 Length: 533 # 22.3 2.5 0.0015 18.4 22.1 445 1-513 1-520 (533) 259 protein:vir:2013 Length: 344 # 22.0 2.5 0.0016 18.4 21.4 302 27-433 1-344 (344) 260 protein:vir:103177 Length: 533 21.9 2.5 0.0016 18.4 25.5 465 1-513 9-532 (533) 261 protein:vir:100328 Length: 346 21.6 2.6 0.0016 18.3 20.8 316 27-433 1-346 (346) 262 protein:vir:93867 Length: 378 21.1 2.7 0.0017 18.3 22.7 342 24-510 1-378 (378) No 1 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=1.1e-109 Score=617.84 Aligned_cols=496 Identities=56% Similarity=0.939 Sum_probs=440.7 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) +.+-+++++.|+.+.+++|++.|.++|++|+..+++|++++++||+|+|+++..+........++++|+++||+++||++ T Consensus 5 ~~~~~~~~~~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~ 84 (506) T protein:vir:94 5 LTEHKQANLIYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADF 84 (506) T ss_pred hhhhhcceeecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHH Confidence 78889999999999999999999999999999999999999999999998887777777788899999999999999999 Q ss_pred HHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCC Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSV 157 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~ 157 (513) .++||+|+|++|+++++ +.|++||+.|+++.++.+++++++++|+||++||++++|++++.+ ++|.+++|+||++. T Consensus 85 ~~~~l~G~p~~~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~-~~p~~~~~v~dd~~ 163 (506) T protein:vir:94 85 QTSYSVGNPINVKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAK-LDPLDTFVIYSTDV 163 (506) T ss_pred hhhhhcccCceeecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEE-EcccceEEEecCCC Confidence 99999999999987754 568999999999999999999999999999999999999988765 89999999999998 Q ss_pred CcceEEEEEEEeecccccccc-eeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHH Q lcl|NC_019916. 158 NPKPIMAVRYHAVQTVVDNIT-QTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENV 236 (513) Q Consensus 158 ~~~~~~~ir~~~~~~~~~~~~-~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v 236 (513) .++++++||+|.....++... ...+++++||+..+++|.....+ +......+|+||.||||+|+|+.+|.|+|+++ T Consensus 164 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~ 240 (506) T protein:vir:94 164 DPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIM---GKMQVDTTKPITTFPVVEFKNSNFRLGDFENV 240 (506) T ss_pred CCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCc---cceeccccccCCccceEEecCCCCCCCchhhh Confidence 889999999998776555443 34567889999999888654333 34455678999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecccccc Q lcl|NC_019916. 237 LSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMA 316 (513) Q Consensus 237 ~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 316 (513) ++|||+||+++|++++.+++|++|+++++|................+...++...........+..+++++++.+++++. T Consensus 241 ~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (506) T protein:vir:94 241 LPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMT 320 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeeccccc Confidence 99999999999999999999999999999998888777777777778888888888888888899999999999999999 Q ss_pred ccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 317 PNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ 396 (513) Q Consensus 317 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~ 396 (513) ..+...+++++||+|+++.+++++++++|.+.||.+|++|++++++++||+||+||++++++|++||+++++.|+++|++ T Consensus 321 ~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~ 400 (506) T protein:vir:94 321 VNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYA 400 (506) T ss_pred ccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 397 RYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLK 476 (513) Q Consensus 397 ~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~ 476 (513) ++++|+++++..++..++++.+++|+|++++|.|+++.|++++|++|++|.||+++++|+|+|+++|++||++|++++.+ T Consensus 401 ~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~ 480 (506) T protein:vir:94 401 RYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAGATLPQKYLYQQLPGVTNPQDIVDMMKEQSANGDY 480 (506) T ss_pred HHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHhh Confidence 99999999998888788888999999999999999999999999999999999999999999999999999999887655 Q ss_pred HhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 477 TYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .++........++ ..+.+++.+.+ T Consensus 481 ~~~~~~~~~~~~~-------------~~~~~~~~~~e 504 (506) T protein:vir:94 481 SFDQNGVISNDGQ-------------TNTTATQTDEE 504 (506) T ss_pred cchhhcCCCcccC-------------ccccccccccC Confidence 5443322222111 11111111112 No 2 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=1.8e-105 Score=594.83 Aligned_cols=479 Identities=32% Similarity=0.554 Sum_probs=394.1 Q ss_pred Ccc------chhhceeccCCccc----CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceee Q lcl|NC_019916. 1 MID------MQQANMNYQEDADK----LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAV 70 (513) Q Consensus 1 ~~~------~~~~~~~~~~~~~~----~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~ 70 (513) ++. -++||..|+++..+ .+.++|.++|++|...+++||+++++||+|+|+++++.... ..+.++++|++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~-~~~~~~~~ki~ 91 (512) T protein:vir:97 13 LRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVA 91 (512) T ss_pred eeeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-cccccCcceee Confidence 222 45677777765322 25688999999999999999999999999999998766543 55678999999 Q ss_pred cchhHHHHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 71 HSFARYIADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 71 ~n~~~~ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) +||+++||++.++||+|+||+++++++ +.+++||+.|+++.++.+++++++++|+||++||++++|++++.+ ++|+ T Consensus 92 ~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~-~~p~ 170 (512) T protein:vir:97 92 HDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK-SDAM 170 (512) T ss_pred cchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEE-Eccc Confidence 999999999999999999999988765 468999999999999999999999999999999999999988764 8999 Q ss_pred ceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc--ccccccccccCcccceEEecC Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV--PTLEVAEHSAQFGFPMIEYRN 225 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~vPvv~~~n 225 (513) ++||+||++..++++++||||.....++.....++++++||++.+++|....++... .......+|+||.||||+|+| T Consensus 171 ~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 250 (512) T protein:vir:97 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (512) T ss_pred ceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecC Confidence 999999999889999999999988888777788899999999999999876544322 234456789999999999999 Q ss_pred CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh Q lcl|NC_019916. 226 NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ 305 (513) Q Consensus 226 ~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 305 (513) +.+|+|+|+++++|||+||.++|++++.+++|++|+|+++|+....... +..+.. T Consensus 251 n~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-------------------------~~~~~~ 305 (512) T protein:vir:97 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-------------------------VRKQKE 305 (512) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchh-------------------------hhhhhh Confidence 9999999999999999999999999999999999999999975432211 122222 Q ss_pred cceeeccccc-----cccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHH Q lcl|NC_019916. 306 ANMILLKTGM-----APNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTV 380 (513) Q Consensus 306 ~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 380 (513) ++++.+.... .......+++++|++|+.+.+++++++++|.+.||.+|++|++++++++||+||+||++++++|. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~ 385 (512) T protein:vir:97 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) T ss_pred cccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHH Confidence 3333332211 12234678999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCC Q lcl|NC_019916. 381 ELASTKRKQFERGLNQRYTVVAHIEERVNG-KWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTD 459 (513) Q Consensus 381 ~k~~~~~~~f~~~l~~~~~li~~~l~~~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D 459 (513) +||+++++.|+++|++++++|++++...+. ....++.+++++|++++|.|.++.|+++++++|++|.||+++++|+|+| T Consensus 386 ~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d 465 (512) T protein:vir:97 386 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQD 465 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCC Confidence 999999999999999999999999887654 2456777899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 460 ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 460 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) +++|++||++|+++.++........... +.++++++ .+.++..++++ T Consensus 466 ~~~E~eri~~E~~~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~ 512 (512) T protein:vir:97 466 PELEVKKIEEDEKESIKKAQKGIYKDPR---DINDDEQD--DDTKDTVDKKE 512 (512) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCC---CCCCCCCC--CCccccccccC Confidence 9999999999998876655432211111 11111111 11111111111 No 3 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=3.9e-105 Score=592.97 Aligned_cols=479 Identities=33% Similarity=0.547 Sum_probs=395.6 Q ss_pred CccchhhceeccCCcc----cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD----KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) -.=-++||..|.+... ..+.+.|.++|++|...+++||+++++||+|+|+++++.... ....++++||++||+++ T Consensus 19 ~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~-~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:93 19 YLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASY 97 (511) T ss_pred hhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC-cccccCcceeecchHHH Confidence 1223567888887642 236789999999999999999999999999999998776544 45678999999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+||+++++++ +.+++||+.|+++.++.+++++++++|+||++||++++|++++.+ ++|+++||+| T Consensus 98 Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~-~~p~~~~~vy 176 (511) T protein:vir:93 98 ISDFINGYFLGNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK-SDAMSTFVIY 176 (511) T ss_pred HHHHHhhhhcccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEE-EccceeEEEE Confidence 999999999999999987765 468899999999999999999999999999999999999988764 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc--ccccccccccCcccceEEecCCCCCCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV--PTLEVAEHSAQFGFPMIEYRNNEYRQG 231 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~vPvv~~~n~~~~~s 231 (513) |++..++++++||||.....++.....++++++||++.+++|.....+... .......+|++|.||||+|+|+.+|+| T Consensus 177 dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~g 256 (511) T protein:vir:93 177 DNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKG 256 (511) T ss_pred cCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCC Confidence 999888999999999988887777788899999999999999775543321 234456789999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeec Q lcl|NC_019916. 232 DFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILL 311 (513) Q Consensus 232 d~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (513) +|+++++|||+||.++|++++.+++|++|+++++|+....... +..+.+++++.+ T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-------------------------~~~~~~~~~~~~ 311 (511) T protein:vir:93 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-------------------------VRKQKEANVLFL 311 (511) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchh-------------------------hcccccccceec Confidence 9999999999999999999999999999999999975432211 122333444443 Q ss_pred cccc----cccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGM----APNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKR 387 (513) Q Consensus 312 ~~~~----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~ 387 (513) .... .......+++++|++|+.+.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||++++ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 391 (511) T protein:vir:93 312 EPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 391 (511) T ss_pred ccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHH Confidence 3322 223456789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccc-ccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 388 KQFERGLNQRYTVVAHIEERVNG-KWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 388 ~~f~~~l~~~~~li~~~l~~~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) +.|+++|++++++|+++++.... ....++.+++++|++++|.|.++.|+++++++|++|.||+++++|+|+||++|++| T Consensus 392 ~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~r 471 (511) T protein:vir:93 392 GLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKK 471 (511) T ss_pred HHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHH Confidence 99999999999999999877653 34567788999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) |++|++++.+...........+.+ +++ ...+.++..++++ T Consensus 472 i~~E~~~~~~~~~~~~~~~~~~~~--~~~---~~~~~~~~~~~~~ 511 (511) T protein:vir:93 472 IEEDEKESIKKAQKGIYKDPRDIN--DDE---QDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHhhhcccCCCCCC--CCC---CCCcccccccccC Confidence 999998776654332221111111 111 1111111111111 No 4 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=8.2e-105 Score=591.23 Aligned_cols=479 Identities=32% Similarity=0.544 Sum_probs=396.0 Q ss_pred CccchhhceeccCCcc----cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD----KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) -.=-++||..|.+... ..+.++|.++|++|...+++||+++++||+|+|+++++.... +...++++||++||+++ T Consensus 19 ~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:99 19 YLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASY 97 (511) T ss_pred hhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-cccccCcceeecchHHH Confidence 1123567888877642 247889999999999999999999999999999998766543 55678999999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+||+++++++ +.+++||+.|+++.++.+++++++++|+||++||++++|++++.+ ++|+++||+| T Consensus 98 Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~-~~p~~~~~vy 176 (511) T protein:vir:99 98 ISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK-SDAMSTFVIY 176 (511) T ss_pred HHHHHHhhhcccCceeecCchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEE-EccceeEEEE Confidence 999999999999999988765 468899999999999999999999999999999999999988764 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc--ccccccccccCcccceEEecCCCCCCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV--PTLEVAEHSAQFGFPMIEYRNNEYRQG 231 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~vPvv~~~n~~~~~s 231 (513) |++..++++++||+|.....++....+++++++||++.+++|+....+... .......+|+||.||||+|+|+.+|+| T Consensus 177 d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~s 256 (511) T protein:vir:99 177 DNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKG 256 (511) T ss_pred cCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCC Confidence 999889999999999988877777788899999999999999775544321 234456789999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeec Q lcl|NC_019916. 232 DFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILL 311 (513) Q Consensus 232 d~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (513) +|+++++|||+||.++|++++.+++|++|+++++|........ +..+.+++++.+ T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-------------------------~~~~~~~~~~~~ 311 (511) T protein:vir:99 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-------------------------VRKQKEANVLFL 311 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchh-------------------------hcccccccceec Confidence 9999999999999999999999999999999999975432211 122233333433 Q ss_pred ccc----ccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTG----MAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKR 387 (513) Q Consensus 312 ~~~----~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~ 387 (513) ... ........+++++||+|+.+.+++++++++|.++||.+|++|++++++++||+||+||++++++|..||++++ T Consensus 312 ~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~ 391 (511) T protein:vir:99 312 EPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 391 (511) T ss_pred ccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHH Confidence 222 1223356789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccc-ccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 388 KQFERGLNQRYTVVAHIEERVNG-KWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 388 ~~f~~~l~~~~~li~~~l~~~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) +.|+.+|++++++|+++++..+. ....++.+++++|++++|.|.++.|++++|++|++|.||+++++|+|+||++|++| T Consensus 392 ~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~et~l~~l~~v~D~~~E~~r 471 (511) T protein:vir:99 392 GLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKK 471 (511) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCCHHHHHHH Confidence 99999999999999999887654 24556778999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) |++|++++++.......... .+.++++++..++...+++| T Consensus 472 i~~E~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 472 IEEDEKESIKKAQKNMYQDP---RNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred HHHHHHHHHHHHhhcccccC---CCCCCCCCCCCCcCcccccC Confidence 99999887665443222111 11111111111111111111 No 5 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=9.6e-105 Score=590.84 Aligned_cols=479 Identities=33% Similarity=0.545 Sum_probs=395.5 Q ss_pred CccchhhceeccCCcc----cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD----KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) -.=-++||..|.++.. ..+.+.|.++|++|...+++||+++++||+|+|+++++.... ..+.++++||++||+++ T Consensus 19 ~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:10 19 YLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASY 97 (511) T ss_pred hhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-cccccCcceeecchHHH Confidence 2223578888887643 246789999999999999999999999999999998766543 55678999999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+||+++++++ +.+++||+.|+++.++.+++++++++|+||++||++++|++++.+ ++|++++|+| T Consensus 98 Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~-~~p~~~~~vy 176 (511) T protein:vir:10 98 ISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYK-SDAMSTFVIY 176 (511) T ss_pred HHHHHhhhhcccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEE-EccceeEEEE Confidence 999999999999999988765 468999999999999999999999999999999999999988764 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc--ccccccccccCcccceEEecCCCCCCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV--PTLEVAEHSAQFGFPMIEYRNNEYRQG 231 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~vPvv~~~n~~~~~s 231 (513) |++..++++++||||.....++.....++++++||++.+++|....++... .......+|+||.||||+|+|+.+|+| T Consensus 177 dd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~g 256 (511) T protein:vir:10 177 DNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKG 256 (511) T ss_pred cCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCC Confidence 999888999999999998888777788899999999999998765443221 234456789999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeec Q lcl|NC_019916. 232 DFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILL 311 (513) Q Consensus 232 d~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (513) +|+++++|||+||.++|++++.+++|++|+++++|........ +..+.+++++.+ T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-------------------------~~~~~~~~~~~~ 311 (511) T protein:vir:10 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-------------------------VRKQKEANVLFL 311 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchh-------------------------hccchhccceec Confidence 9999999999999999999999999999999999965332111 122333444444 Q ss_pred cccc----cccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGM----APNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKR 387 (513) Q Consensus 312 ~~~~----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~ 387 (513) .+.. .......+++++||+|+.+.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||.+++ T Consensus 312 ~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 391 (511) T protein:vir:10 312 EPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 391 (511) T ss_pred ccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHH Confidence 3322 223356689999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccc-ccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 388 KQFERGLNQRYTVVAHIEERVNG-KWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 388 ~~f~~~l~~~~~li~~~l~~~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) +.|+++|++++++|+++++...+ ....++.+++++|++++|.|.++.|++++|+.|++|.||+++++|+|+||++|++| T Consensus 392 ~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~r 471 (511) T protein:vir:10 392 GLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKK 471 (511) T ss_pred HHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHH Confidence 99999999999999999887654 24567788999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) |++|+++..+........... ...+ ++...+..+..++++ T Consensus 472 i~~E~~~~~~~~~~~~~~~~~--~~~~---~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 472 IEEDEKESIKKAQKGIYKDPR--DIND---DEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHhhhcccCCC--CCCC---CCCCCcccCcccccC Confidence 999988766644321111111 1111 111111111111111 No 6 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=2.2e-104 Score=588.91 Aligned_cols=479 Identities=33% Similarity=0.552 Sum_probs=394.9 Q ss_pred CccchhhceeccCCcc----cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD----KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) -.=-++||..|.+... .++.+.|.++|++|...+++||+++++||+|+|+++++.... ..+.++++||++||+++ T Consensus 19 ~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~-~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:96 19 YLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASY 97 (511) T ss_pred hhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc-cccccCcceeecchHHH Confidence 1123567888887642 247889999999999999999999999999999998766543 55678899999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+||+++++++ +.+++||+.|+++.++.+++++++++|+||++||+|++|.+++.+ ++|+++||+| T Consensus 98 Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~-~~p~~~~~v~ 176 (511) T protein:vir:96 98 ISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK-SDAMSTFIIY 176 (511) T ss_pred HHHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEE-EcccceEEEE Confidence 999999999999999988765 468899999999999999999999999999999999999988764 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc--ccccccccccCcccceEEecCCCCCCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV--PTLEVAEHSAQFGFPMIEYRNNEYRQG 231 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~vPvv~~~n~~~~~s 231 (513) |++..++++++||||.....++.....++++++||++.+++|....++... .......+|++|.||||+|+|+.+|+| T Consensus 177 dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~g 256 (511) T protein:vir:96 177 DNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKG 256 (511) T ss_pred cCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCC Confidence 999889999999999988887777788889999999999998776544321 234556789999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeec Q lcl|NC_019916. 232 DFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILL 311 (513) Q Consensus 232 d~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (513) +|+++++|||+||.++|++++.+++|++|+++++|........ +..+..++++.+ T Consensus 257 d~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-------------------------~~~~~~~~~~~~ 311 (511) T protein:vir:96 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-------------------------VRKQKEANVLFL 311 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-------------------------hcccccccceec Confidence 9999999999999999999999999999999999975432211 122223333333 Q ss_pred ccc----ccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTG----MAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKR 387 (513) Q Consensus 312 ~~~----~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~ 387 (513) ... ........+++++|++++.+.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||.+++ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~ 391 (511) T protein:vir:96 312 EPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 391 (511) T ss_pred cccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHH Confidence 222 1223345679999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccc-cccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 388 KQFERGLNQRYTVVAHIEERVNGK-WDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 388 ~~f~~~l~~~~~li~~~l~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) +.|+.+|++++++|+++++..... ...++.+++++|++++|.|.++.|+++++++|++|.||+++++|+|+||++|++| T Consensus 392 ~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~r 471 (511) T protein:vir:96 392 GLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKK 471 (511) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHH Confidence 999999999999999998876543 4566788999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) |++|++++.+...........+ ..+++.+ .+.++..++.+ T Consensus 472 i~~E~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~e~~ 511 (511) T protein:vir:96 472 IEEDEKESIKKAQKGIYKDPRD--INDDEQD---DDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHhhccccCCCC--CCCCCCC---CCccCcccccC Confidence 9999887766544322211111 1111111 11111111111 No 7 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=2.2e-104 Score=588.91 Aligned_cols=479 Identities=33% Similarity=0.552 Sum_probs=394.9 Q ss_pred CccchhhceeccCCcc----cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD----KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) -.=-++||..|.+... .++.+.|.++|++|...+++||+++++||+|+|+++++.... ..+.++++||++||+++ T Consensus 19 ~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~-~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:78 19 YLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASY 97 (511) T ss_pred hhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc-cccccCcceeecchHHH Confidence 1123567888887642 247889999999999999999999999999999998766543 55678899999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+||+++++++ +.+++||+.|+++.++.+++++++++|+||++||+|++|.+++.+ ++|+++||+| T Consensus 98 Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~-~~p~~~~~v~ 176 (511) T protein:vir:78 98 ISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK-SDAMSTFIIY 176 (511) T ss_pred HHHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEE-EcccceEEEE Confidence 999999999999999988765 468899999999999999999999999999999999999988764 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc--ccccccccccCcccceEEecCCCCCCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV--PTLEVAEHSAQFGFPMIEYRNNEYRQG 231 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~vPvv~~~n~~~~~s 231 (513) |++..++++++||||.....++.....++++++||++.+++|....++... .......+|++|.||||+|+|+.+|+| T Consensus 177 dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~g 256 (511) T protein:vir:78 177 DNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKG 256 (511) T ss_pred cCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCC Confidence 999889999999999988887777788889999999999998776544321 234556789999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeec Q lcl|NC_019916. 232 DFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILL 311 (513) Q Consensus 232 d~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (513) +|+++++|||+||.++|++++.+++|++|+++++|........ +..+..++++.+ T Consensus 257 d~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-------------------------~~~~~~~~~~~~ 311 (511) T protein:vir:78 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-------------------------VRKQKEANVLFL 311 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-------------------------hcccccccceec Confidence 9999999999999999999999999999999999975432211 122223333333 Q ss_pred ccc----ccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTG----MAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKR 387 (513) Q Consensus 312 ~~~----~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~ 387 (513) ... ........+++++|++++.+.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||.+++ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~ 391 (511) T protein:vir:78 312 EPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 391 (511) T ss_pred cccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHH Confidence 222 1223345679999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccc-cccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 388 KQFERGLNQRYTVVAHIEERVNGK-WDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 388 ~~f~~~l~~~~~li~~~l~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) +.|+.+|++++++|+++++..... ...++.+++++|++++|.|.++.|+++++++|++|.||+++++|+|+||++|++| T Consensus 392 ~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~r 471 (511) T protein:vir:78 392 GLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKK 471 (511) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHH Confidence 999999999999999998876543 4566788999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) |++|++++.+...........+ ..+++.+ .+.++..++.+ T Consensus 472 i~~E~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~e~~ 511 (511) T protein:vir:78 472 IEEDEKESIKKAQKGIYKDPRD--INDDEQD---DDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHhhccccCCCC--CCCCCCC---CCccCcccccC Confidence 9999887766544322211111 1111111 11111111111 No 8 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=2.9e-104 Score=588.25 Aligned_cols=479 Identities=33% Similarity=0.553 Sum_probs=394.4 Q ss_pred CccchhhceeccCCcc----cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD----KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) -.=-++||..|.+... ..+.+.|.++|++|...+++||+++++||.|+|+++++.... +...++++||++||+++ T Consensus 19 ~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:96 19 YLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASY 97 (511) T ss_pred hhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC-cccccCcceeecchHHH Confidence 1223568888887642 247899999999999999999999999999999998765543 55678899999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+||+++++++ +.+++||+.|+++.++.+++++++++|+||++||+|++|++++.+ ++|++++|+| T Consensus 98 Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~-~~p~~~~~vy 176 (511) T protein:vir:96 98 ISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK-SDAMSTFVIY 176 (511) T ss_pred HHHHHHhhhccCCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEE-EccceeEEEE Confidence 999999999999999987764 468999999999999999999999999999999999999988764 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCc--cccccccccccCcccceEEecCCCCCCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS--VPTLEVAEHSAQFGFPMIEYRNNEYRQG 231 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vPvv~~~n~~~~~s 231 (513) |++..++++++||||.....++....+++++++||++.+++|.....+.. ........+|+||.||||+|+|+.+|+| T Consensus 177 dd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~g 256 (511) T protein:vir:96 177 DNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKG 256 (511) T ss_pred cCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCC Confidence 99988999999999998888877778889999999999999876544322 1234456789999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeec Q lcl|NC_019916. 232 DFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILL 311 (513) Q Consensus 232 d~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (513) +|+++++|||+||.++|++++.+++|++|+++++|........ +..+.+++++.+ T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-------------------------~~~~~~~~~~~~ 311 (511) T protein:vir:96 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-------------------------VRKQKEANVLFL 311 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchh-------------------------hcccccccceec Confidence 9999999999999999999999999999999999965432111 122233333333 Q ss_pred cccc----cccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGM----APNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKR 387 (513) Q Consensus 312 ~~~~----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~ 387 (513) .... .......+++++||+++.+.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||++++ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 391 (511) T protein:vir:96 312 EPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 391 (511) T ss_pred ccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHH Confidence 2221 223356789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccc-ccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 388 KQFERGLNQRYTVVAHIEERVNG-KWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 388 ~~f~~~l~~~~~li~~~l~~~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) +.|+++|++++++|+++++.... ....++.+++++|++++|.|.++.|+++++++|++|.||+++++|+++||++|++| T Consensus 392 ~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~et~l~~l~~v~D~~~E~~r 471 (511) T protein:vir:96 392 GLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKK 471 (511) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHH Confidence 99999999999999999887653 34567789999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) |++|+++..+........... ...+++.+ .+..+..++.+ T Consensus 472 i~~E~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~ 511 (511) T protein:vir:96 472 IEEDEKESIKKAQKGIYKDPR--DINDDEQD---DDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHhhccccCCC--CCCCCCCC---CcccccccccC Confidence 999988766654332211111 11111111 11111111111 No 9 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=3.3e-103 Score=582.39 Aligned_cols=473 Identities=33% Similarity=0.485 Sum_probs=394.4 Q ss_pred CccchhhceeccCCcc-c---CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD-K---LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~-~---~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) ...-++||..|..+.. + .+.++|.++|++|...+.+|++++.+||+|+|+.+.... ......++++|+++||+++ T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~-~~~~~~~~~~ki~~n~~k~ 96 (501) T protein:vir:27 18 LRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFG-RRKDREMADKRAVHNYGRM 96 (501) T ss_pred cccChhHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccC-ccCccccccceeccchHHH Confidence 4556789999987743 3 345679999999988889999999999999976554332 3455678999999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH-------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS-------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC 149 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~ 149 (513) ||++.++||+|+||+++++++ +.+++||+.|+|+.++.+++++++++|+||++||++++|++++.+ ++|.++ T Consensus 97 Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~-~~p~~~ 175 (501) T protein:vir:27 97 ISKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKR-LNPLET 175 (501) T ss_pred HHHHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEE-Ecccee Confidence 999999999999999987642 457889999999999999999999999999999999999988764 899999 Q ss_pred EEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCC Q lcl|NC_019916. 150 FIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYR 229 (513) Q Consensus 150 ~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~ 229 (513) +|+||++..++++++||+|.....++ ...++++||++.+++|.... ..+.....+|+||.||||+|+|+..| T Consensus 176 ~~v~d~~~~~~~~~~ir~~~~~~~~~----~~~~~~vyt~~~v~~~~~~~----~~~~~~~~~~~~g~vPvv~~~nn~~g 247 (501) T protein:vir:27 176 FVIYDNSLEDNSIAAVRYYNRGTLQN----AKDVVEIYTNEHIYTLDASD----DFNEISVTTHAFGTVPITEFLNNVDG 247 (501) T ss_pred EEEecCCCCCceEEEEEEEEeeecCC----cEEEEEEEeCCeEEEEEeCC----ceeeccccccCCCcccEEEecCCCCC Confidence 99999998899999999998765433 35678999999999887532 23445667899999999999999999 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhccee Q lcl|NC_019916. 230 QGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMI 309 (513) Q Consensus 230 ~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 309 (513) +|+|+++++|||+||+++|++++.+++|++|+++++|....... .....+...+++ T Consensus 248 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~------------------------~~~~~~~~~~~~ 303 (501) T protein:vir:27 248 IGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKG------------------------MQASDMKRTRLM 303 (501) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcc------------------------cchhhhhhcCce Confidence 99999999999999999999999999999999999997543211 112234556777 Q ss_pred eccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 310 LLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQ 389 (513) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~ 389 (513) .+...+...+...+++++|++|+++.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||.++++. T Consensus 304 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~ 383 (501) T protein:vir:27 304 QLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQ 383 (501) T ss_pred eecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHH Confidence 88778888888889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_019916. 390 FERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDK 469 (513) Q Consensus 390 f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~ 469 (513) |+++|++++++|+++++..+...++++.+|+|+|++++|.|.++.|++++|++|++|.||+++++|+|+||++|++||++ T Consensus 384 ~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~et~l~~l~~v~D~~~E~eri~~ 463 (501) T protein:vir:27 384 FTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINK 463 (501) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHH Confidence 99999999999999998887777888889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 470 QRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 470 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) |+++...... .+...+......++..+..+.+.++..| T Consensus 464 E~~e~~~~~~--~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 464 EVSEIDFKGY--SNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred HHHhhhHhhh--cCccccccccccCCCCCCccccccccCC Confidence 9876433322 1222211111111111111111222222 No 10 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=1.3e-102 Score=579.22 Aligned_cols=471 Identities=33% Similarity=0.498 Sum_probs=390.2 Q ss_pred CccchhhceeccCCcc-c---CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD-K---LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~-~---~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) .---++||..|.++.. + .+.++|.++|++|...+++||+++.+||+|+|+.+.... ......++++|+++||+++ T Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~-~~~~~~~~~~ki~~n~~k~ 97 (502) T protein:vir:48 19 LRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSG-RRKDNEMADKRAVHNYGRM 97 (502) T ss_pred cccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc-cccccccccceeecchHHH Confidence 4556789999997743 2 345789999999998889999999999999864333322 3355678899999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH-------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS-------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC 149 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~ 149 (513) ||++.++||+|+|++++++++ +.++++|+.|+|+.++.+++++++++|+||+++|++++|++++.+ ++|.++ T Consensus 98 Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~-~~p~~~ 176 (502) T protein:vir:48 98 ISKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKR-LSPLET 176 (502) T ss_pred HHHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEE-Ecccce Confidence 999999999999999977542 358899999999999999999999999999999999999988764 899999 Q ss_pred EEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCC Q lcl|NC_019916. 150 FIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYR 229 (513) Q Consensus 150 ~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~ 229 (513) +|+||++..++++++||+|......+ ..+++++||++.+++|... .........+|+||.||||+|+|+.+| T Consensus 177 ~~vydd~~~~~~~~~ir~~~~~~~~~----~~~~~~iyt~~~i~~~~~~----~~~~~~~~~~~~~g~vPvv~~~nn~~g 248 (502) T protein:vir:48 177 FVIYDNSLEDNSIAAVRYYNRGTLQN----AKDVVEIYTNQHIYTLDAS----DSFNEISVTPHAFGTVPITEFLNNADG 248 (502) T ss_pred EEEEcCCCCCceEEEEEEEEEeecCC----cEEEEEEEeCCeEEEEEeC----CceeeccceecCCCccceEEecCCCCC Confidence 99999988889999999998654332 2567899999999988642 223455677899999999999999999 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhccee Q lcl|NC_019916. 230 QGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMI 309 (513) Q Consensus 230 ~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 309 (513) +|+|+++++|||+||+++|++++.+++|++|+++++|........ ....+...+++ T Consensus 249 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~------------------------~~~~~~~~~~~ 304 (502) T protein:vir:48 249 IGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGM------------------------QASDMKRTRLM 304 (502) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccc------------------------chhhhhhccee Confidence 999999999999999999999999999999999999975432111 11234456777 Q ss_pred eccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 310 LLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQ 389 (513) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~ 389 (513) .+.......+...+++++|++|+++.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||.++++. T Consensus 305 ~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~ 384 (502) T protein:vir:48 305 QLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQ 384 (502) T ss_pred eccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHH Confidence 77777777888889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_019916. 390 FERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDK 469 (513) Q Consensus 390 f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~ 469 (513) |+.+|++++++|+++++..+...++++.+|+++|++++|.|.++.|++++|++|++|+||+++++|+|+|+++|++||++ T Consensus 385 ~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~l~~l~~v~D~~~E~~ri~~ 464 (502) T protein:vir:48 385 FTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLGGQVSQETALSLSGLVENPTEELDKINE 464 (502) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHH Confidence 99999999999999998887777888889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhhcCCCCC-CCCCCCC-CCCCCCCCCCCCC Q lcl|NC_019916. 470 QRKAMLKTYDTKGGLIIN-GTSGNDP-EDEGVRGQQGEPE 507 (513) Q Consensus 470 E~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~ 507 (513) |+++...... .....+ ...+.++ .++..++....++ T Consensus 465 E~~~~~~~~~--~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 465 ESSKIDFKGY--PSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred HHHhhhhhcc--cccccccccccCCCccCCCCcCcCCCCC Confidence 9775322111 111111 1111111 1111111111111 No 11 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=1.9e-102 Score=578.22 Aligned_cols=473 Identities=34% Similarity=0.493 Sum_probs=391.0 Q ss_pred CccchhhceeccCCccc----CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADK----LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) ..--++||..|.++..+ .+.++|.++|++|...+.+||+++.+||.|+|+.+..+. ......++++|+++||+++ T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~-~~~~~~~~~~ri~~n~~k~ 96 (501) T protein:vir:96 18 LRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSG-RRKDNEMADKRAVHNYGRM 96 (501) T ss_pred cccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCcc-ccCccccccceeecchHHH Confidence 45557888889877543 244679999999998889999999999999865443333 3345678899999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH-------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS-------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC 149 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~ 149 (513) ||++.++||+|+||+++++++ +.++++|+.|+|+.++.+++++++++|+||+++|++++|.+++.+ ++|.++ T Consensus 97 Ivd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~-~~p~~~ 175 (501) T protein:vir:96 97 ISKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKR-LSPLET 175 (501) T ss_pred HHHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEE-Ecccee Confidence 999999999999999976542 347889999999999999999999999999999999999988765 899999 Q ss_pred EEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCC Q lcl|NC_019916. 150 FIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYR 229 (513) Q Consensus 150 ~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~ 229 (513) +|+||++..++++++||+|......+ ...++++||++.+++|.... ..+.....+|+||.||||+|+|+.+| T Consensus 176 ~~v~d~~~~~~~~~~v~~~~~~~~~~----~~~~~~vyt~~~i~~~~~~~----~~~~~~~~~~~~g~vPvv~~~nn~~g 247 (501) T protein:vir:96 176 FVIYDNSLEDNSIAAVRYYNRGTLQS----AKDVVEIYTDEHIYTLDASD----DFNEISVTTHAFGTVPITEYLNNIDG 247 (501) T ss_pred EEEEcCCCCCceEEEEEEEEeecCCC----cEEEEEEEcCCcEEEEeeCC----CceeccccccCCCccceEEecCCccC Confidence 99999988889999999997654332 35678999999999986432 23445667899999999999999999 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhccee Q lcl|NC_019916. 230 QGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMI 309 (513) Q Consensus 230 ~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 309 (513) +|+|+++++|||+||+++|++++.+++|++|+++++|+....... ....+...+++ T Consensus 248 ~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~------------------------~~~~~~~~~~~ 303 (501) T protein:vir:96 248 IGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGM------------------------QASDMKRTRLM 303 (501) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCccc------------------------chhhhhhcCee Confidence 999999999999999999999999999999999999975432211 12334556777 Q ss_pred eccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 310 LLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQ 389 (513) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~ 389 (513) .+.......+...+++++|++++.+.+++++++++|+++|+.+|++|++++++++||+||+||++++++|.+||.++++. T Consensus 304 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~ 383 (501) T protein:vir:96 304 QLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQ 383 (501) T ss_pred eecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHH Confidence 77777777788889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_019916. 390 FERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDK 469 (513) Q Consensus 390 f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~ 469 (513) |+.+|++++++|+++++..+....+++.+|+|+|++++|.|.++.|++++|++|++|.||+++++|+++||++|++||++ T Consensus 384 ~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~ 463 (501) T protein:vir:96 384 FTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINK 463 (501) T ss_pred HHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHH Confidence 99999999999999998887777888889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 470 QRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 470 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) |+++...... .+...+.....+++..+.+...++...| T Consensus 464 E~~~~~~~~~--~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 464 EMSEIDFKGY--SNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred HHHHhhcccc--ccchhhcccccCCcCCCCCCCccccccC Confidence 9876432211 1111111111111111111111111111 No 12 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=9.4e-100 Score=563.49 Aligned_cols=472 Identities=23% Similarity=0.382 Sum_probs=374.4 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |-=.- ...+..+..+++.++|.++|++| ..+.+||+++++||+|+|+|++++. ....++++||++||+++||++ T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~i~~~i~~~-~~~~~~~~~l~~Yy~g~~~i~~~~~---~~~~~~~~ki~~n~~~~Iv~~ 74 (499) T protein:vir:10 1 MAVVI--DKDLLDDVNEPNIEAINYAIREL-QNRKKRLDKLSDYYNGKQEIEKHEF---DNATVEAANVMVNHAKYITDM 74 (499) T ss_pred Cccch--hhhHHhhhhcCCHHHHHHHHHHH-HHHHHHHHHHHHHhccccchhcCCc---CcCCCCcceeecchHHHHHHH Confidence 10000 01122234578899999999988 4578999999999999999987654 345678999999999999999 Q ss_pred HHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeE----------------E Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEV----------------S 141 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~----------------~ 141 (513) .++||+|+||+|+++++ +.++++|+.|+++.++.+++++++++|+||++||.+++|.+.+ . T Consensus 75 ~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~ 154 (499) T protein:vir:10 75 NVGFMTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKI 154 (499) T ss_pred HhhhhcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEE Confidence 99999999999987665 4588899999999999999999999999999999999885432 2 Q ss_pred EEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCc--cccccccccccCcccc Q lcl|NC_019916. 142 VKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS--VPTLEVAEHSAQFGFP 219 (513) Q Consensus 142 ~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vP 219 (513) ..++|+++|++|++...++++++||+|...+.++ ...++++++||++.+++|+....+.. ........+|+||.|| T Consensus 155 ~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~--~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 232 (499) T protein:vir:10 155 EVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEG--NTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVP 232 (499) T ss_pred EEEcccceEEEecCCCCcceEEEEEEEEEeecCC--CceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccc Confidence 3479999999999999999999999998765543 35678899999999999976544322 2234556789999999 Q ss_pred eEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 220 MIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 220 vv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) ||+|+|+..|+|+|+++++|||+||.++|++++.+++|++|+++++|+...... ++... T Consensus 233 vv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~-------------~~~~~-------- 291 (499) T protein:vir:10 233 IIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDK-------------DDIQR-------- 291 (499) T ss_pred eEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccc-------------chhhh-------- Confidence 999999999999999999999999999999999999999999999997532211 11111 Q ss_pred hhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHH Q lcl|NC_019916. 300 LEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGT 379 (513) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l 379 (513) ...+.++.+ ....+++++||+|+++.+++++++++|.+.||.+|++|++++++++||+||+||++++++| T Consensus 292 ---~~~~~~~~~-------~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l 361 (499) T protein:vir:10 292 ---LKRGAIEAP-------PREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGL 361 (499) T ss_pred ---hhhcceecc-------CCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHH Confidence 111222221 2356788999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCC Q lcl|NC_019916. 380 VELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTD 459 (513) Q Consensus 380 ~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D 459 (513) .+||.++++.|+++|++++++|+++++..+ ..+++.+++++|++++|.|+++.|+++++++|++|.||+++++|+|+| T Consensus 362 ~~k~~~k~~~~~~~l~~~~~li~~~~~~~~--~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d 439 (499) T protein:vir:10 362 ENLLSIKQRYFFDGLRRRLKLIQTIVNIKG--ANDDASGCKISLVANIPSNLSDVVNNVKNADGIIPRKYTYSWLPDVDN 439 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccC--CccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCC Confidence 999999999999999999999999987654 456778999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 460 ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 460 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +++|++||++|+++..+..............+.++..++..+..++++++.... T Consensus 440 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (499) T protein:vir:10 440 PQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGSNHNQS 493 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCccccccC Confidence 999999999998876655443222222111111111111111111111111111 No 13 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=1.1e-99 Score=563.13 Aligned_cols=452 Identities=16% Similarity=0.188 Sum_probs=370.0 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc----cCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASR----RNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~----~~~~~~~~~ri~~n~~~~ 76 (513) -.+..+-...+.....+++.+.|.++|++|. .+++|++++++||+|+|+|+.++... .....++++|+++||+++ T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~ 105 (492) T protein:vir:94 27 TQTEIFDAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHAN 105 (492) T ss_pred chhhhhhcccccCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHH Confidence 1222222233334455789999999999986 56799999999999999998765432 234567889999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+|++++++++.. +++|+ .|+++.++.+++++++++|+||++||.+++|.+++.+ ++|.+++|+| T Consensus 106 Ivd~~~~yl~G~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~-~~p~~~~~v~ 183 (492) T protein:vir:94 106 LVDQKVSYIVGKPIAFKHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFR-VPAEQGIPIW 183 (492) T ss_pred HHHHHHhhhcccCceeccCchHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEE-EcccceEEEE Confidence 99999999999999999887654 56665 4889999999999999999999999999999988765 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-------ccccccccccccCcccceEEecCC Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-------SVPTLEVAEHSAQFGFPMIEYRNN 226 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-------~~~~~~~~~~~~~g~vPvv~~~n~ 226 (513) |++..+++++++|+|..++ ..++++||+..+++|....+.. ...+.....+|+||.||||+|+|+ T Consensus 184 d~~~~~~~~a~ir~~~~~~--------~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn 255 (492) T protein:vir:94 184 TDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN 255 (492) T ss_pred cCCCCCceEEEEEEEeecc--------ceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCC Confidence 9988899999999998643 2357999999998886544321 123445567899999999999999 Q ss_pred CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhc Q lcl|NC_019916. 227 EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQA 306 (513) Q Consensus 227 ~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 306 (513) .+|+|+|+++++|||+||+++|++++.+++|++|+++++|+...... ++.. ..... T Consensus 256 ~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-------------~~~~-----------~~~~~ 311 (492) T protein:vir:94 256 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELP-------------EFKR-----------LLRYY 311 (492) T ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch-------------hhHH-----------HHhhc Confidence 99999999999999999999999999999999999999997543211 1111 11222 Q ss_pred ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 307 NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTK 386 (513) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~ 386 (513) +++. ..++++++|++|+.+.++++.++++|.++||.+|++|++++++++||+||+||++++++|+.||+++ T Consensus 312 ~~~~---------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k 382 (492) T protein:vir:94 312 GAIK---------VSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKL 382 (492) T ss_pred ccee---------cCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHH Confidence 2332 2457889999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 387 RKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 387 ~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) ++.|+.+|++++++|+++++.. .++.+++|+|++++|.|+++.|+++++++|++|.||+++++|+++|+++|++| T Consensus 383 ~~~f~~~l~~~~~li~~~~~~~-----~~~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~er 457 (492) T protein:vir:94 383 ARKAKVAIQELLWFVFEHFDIK-----GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELER 457 (492) T ss_pred HHHHHHHHHHHHHHHHHHhcCC-----cccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHH Confidence 9999999999999999987543 35568999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) |++|+++.++..+...+...+...+.+++++..++ T Consensus 458 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 458 IEQEQMEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHHHHhhccccccccCCCCccccCCccccCC Confidence 99999887776555444333322222222111111 No 14 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=1.6e-99 Score=562.18 Aligned_cols=451 Identities=17% Similarity=0.205 Sum_probs=371.3 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc----ccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS----RRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----~~~~~~~~~~ri~~n~~~~ 76 (513) -+++..+.+....+ .+++.++|.++|++|. .+++|++++++||+|+|+|+.++.. ......++++||++||+++ T Consensus 19 ~~~~~~~~~~~~~~-~e~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ 96 (483) T protein:vir:12 19 QTEIFDAIVRTNNK-PETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHAN 96 (483) T ss_pred hhhhhhcccccCCc-hhhHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHH Confidence 33444444444443 4578999999999986 5678999999999999999876533 2334567889999999999 Q ss_pred HHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) ||++.++||+|+|+++++++++. +++|+ .|+++.++.+++++++++|+||++||.|++|.+++.+ ++|.+++|+| T Consensus 97 Ivd~~~~~l~G~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~-~~p~~~~~v~ 174 (483) T protein:vir:12 97 LVDQKVSYIVGKPIAFKHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFR-VPAEQGIPIW 174 (483) T ss_pred HHHHHhhhhcccCceeccCChHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEE-EcccceEEEE Confidence 99999999999999999887754 55665 4789999999999999999999999999999988765 8999999999 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccC-------CccccccccccccCcccceEEecCC Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVA-------GSVPTLEVAEHSAQFGFPMIEYRNN 226 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~-------~~~~~~~~~~~~~~g~vPvv~~~n~ 226 (513) |++..+++++++|+|..++ ..++++|++..+++|....+. ....+.....+|+||.||||+|+|+ T Consensus 175 d~~~~~~~~~~ir~~~~~~--------~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn 246 (483) T protein:vir:12 175 TDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN 246 (483) T ss_pred cCCCCCceEEEEEEEEeec--------ceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCC Confidence 9998899999999998643 335799999999888654332 1123344567899999999999999 Q ss_pred CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhc Q lcl|NC_019916. 227 EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQA 306 (513) Q Consensus 227 ~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 306 (513) .+|+|+|+++++|||+||.++|++++.+++|++|+++++|+...... ++. ...+.. T Consensus 247 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-------------~~~-----------~~~~~~ 302 (483) T protein:vir:12 247 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELP-------------EFK-----------RLLRYY 302 (483) T ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch-------------hHH-----------Hhhhhc Confidence 99999999999999999999999999999999999999997543211 111 111222 Q ss_pred ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 307 NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTK 386 (513) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~ 386 (513) +++.+ ..+++++|++|+.+.+++++++++|.++||.+|++|++++++++||+||+||++++.+|+.||.++ T Consensus 303 ~~~~~---------~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~ 373 (483) T protein:vir:12 303 GAIKV---------SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKL 373 (483) T ss_pred ccccc---------CCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHH Confidence 33332 456899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 387 RKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 387 ~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) ++.|+.+|++++++|+++++.. .++.+++|+|++++|.|+++.|++++|++|++|+||+++++|+++||++|++| T Consensus 374 ~~~f~~~l~~~~~li~~~~~~~-----~~~~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~et~~~~~~~v~d~~~E~~r 448 (483) T protein:vir:12 374 ARKAKVAIQELLWFVFEHFDIK-----GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELER 448 (483) T ss_pred HHHHHHHHHHHHHHHHHHhcCC-----CccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHH Confidence 9999999999999999987542 35678999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) |++|+++.....+...+...++..+.+.+++...| T Consensus 449 i~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 449 IEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 483 (483) T ss_pred HHHHHHHHHhhcccccccccCCcccCCCCCcccCC Confidence 99999887776655444444333332222222222 No 15 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=5.9e-99 Score=559.13 Aligned_cols=459 Identities=22% Similarity=0.310 Sum_probs=382.4 Q ss_pred Cccch------hhceeccCCcc-cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecch Q lcl|NC_019916. 1 MIDMQ------QANMNYQEDAD-KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSF 73 (513) Q Consensus 1 ~~~~~------~~~~~~~~~~~-~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~ 73 (513) |-+|- ++|..|.++.+ +++++.|.++|++|...+++||+++++||+|+|+|+++.. ...++++|+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~----~~~~~~~ki~~n~ 76 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPE----KETGADNRIVVNS 76 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcc----cccCCcceeecch Confidence 55442 56666666644 7999999999999999999999999999999999976543 3467899999999 Q ss_pred hHHHHHHHHHHhhcCCeeecCCcH----HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce Q lcl|NC_019916. 74 ARYIADFQTSYSVGNAIAMSGPSS----DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC 149 (513) Q Consensus 74 ~~~ivd~~~~~l~g~p~~~~~~~~----~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~ 149 (513) +++||++.++||+|+||+|+++++ +.++++|+.|+++.++.+++++++++|+||++||++++|++++.+ ++|.++ T Consensus 77 ~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~-~~p~~~ 155 (470) T protein:vir:99 77 AKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMY-SSPNHA 155 (470) T ss_pred HHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEE-Ecccee Confidence 999999999999999999977543 468899999999999999999999999999999999999988764 899999 Q ss_pred EEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCC Q lcl|NC_019916. 150 FIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYR 229 (513) Q Consensus 150 ~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~ 229 (513) +|+||+...+++++++|+|.... ......++++|+++.+++|.....+.. .......+|+||.||||+|+|+.+| T Consensus 156 ~~i~d~~~~~~~~~~vr~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~vPvv~~~n~~~g 230 (470) T protein:vir:99 156 FIIYDDTVQRQPLAFVHYQIDNS----NNWTDAYGVIQYADKFYKFKGYDIEED-TNAAGYAINPYGLVPAVEFFENEER 230 (470) T ss_pred EEEEcCCCCcceEEEEEEEEEec----CCeeEEEEEEEecCeEEEEEecccccc-cccccccccCCCccceEeecCCCCC Confidence 99999998889999999997543 334567788999999998876544332 3344567899999999999999999 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhccee Q lcl|NC_019916. 230 QGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMI 309 (513) Q Consensus 230 ~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 309 (513) +|+|+++++|||+||+++|++++.++++++|+++++|+.....+ .+++ +......+++ T Consensus 231 ~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~-----------~g~~-----------~~~~~~~~~~ 288 (470) T protein:vir:99 231 QGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDD-----------EGNP-----------KFDFKNNRVL 288 (470) T ss_pred CcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccc-----------ccch-----------hhhhhhccee Confidence 99999999999999999999999999999999999997543211 1111 1222334444 Q ss_pred eccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 310 LLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQ 389 (513) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~ 389 (513) .+.. ...+.+++++|++|+.+.+++++++++|.++|+.+|++|++++++++||+||+||++++++|..||.++++. T Consensus 289 ~~~~----~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~ 364 (470) T protein:vir:99 289 YVSQ----LDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERK 364 (470) T ss_pred eecC----CCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHH Confidence 4432 234678899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_019916. 390 FERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDK 469 (513) Q Consensus 390 f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~ 469 (513) |+++|++++++++++++.... ..+++.+++++|++++|.|.++.|+++++++|++|.||+++++|+| |+++|++||++ T Consensus 365 ~~~~l~~~~~li~~~~~~~~~-~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~et~l~~l~~v-d~~~E~eri~~ 442 (470) T protein:vir:99 365 FDKSLMQLYRIVLATLFNNKQ-DQELWSELDFKFTRNLPEDMASAIDNAKNAEGIVSKKTQLGMIPDI-EPDAEMKQIAK 442 (470) T ss_pred HHHHHHHHHHHHHHHHhccCC-cccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCC-CHHHHHHHHHH Confidence 999999999999998876544 4667789999999999999999999999999999999999999998 79999999999 Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCC Q lcl|NC_019916. 470 QRKAMLKTYDTKGGLIINGTSGNDPEDE 497 (513) Q Consensus 470 E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (513) |+++..+................+++++ T Consensus 443 E~~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 443 EKADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred HHHHHHHHHHhhcCCCCcCCCCCCccCC Confidence 9987766554433222221111111111 No 16 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=5.1e-99 Score=559.44 Aligned_cols=450 Identities=23% Similarity=0.340 Sum_probs=374.8 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |--...-.|.|+.+ .+++.++|.++|++|. .+++|++++++||+|+|+++.++. ....++++|+++||+++||++ T Consensus 1 ~~~~~~~~~~~p~d-~~~~~~~l~~~i~~~~-~~~~r~~~~~~yy~g~~~i~~~~~---~~~~~~~~ki~~n~~~~ivd~ 75 (453) T protein:vir:39 1 MKYKPPKLMTFPKD-EPITNEVVTKFMEKHR-LEVARYEYLKNMYRGIMAIDAEPT---KDLWKPDNRLTVNFTKYIVDT 75 (453) T ss_pred CeecCCcceEcCCC-CCCCHHHHHHHHHHHH-HHHHHHHHHHHHhhccCchhcCCC---ccccCccceeecchHHHHHHH Confidence 55555666666666 5689999999999984 567899999999999999987764 346678999999999999999 Q ss_pred HHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCC Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSV 157 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~ 157 (513) .++||+|+|++|+++++ +.++++|+.|+|+.++.+++++++++|+||++||++++|.+++.+ ++|.+++|+||+.. T Consensus 76 ~~~~l~g~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~-~~p~~~~~v~d~~~ 154 (453) T protein:vir:39 76 FTGYFNGIPVKKSHSDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIY-NTPENMFMVYDDTI 154 (453) T ss_pred HhhhhcccCceeccCChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEE-EcccceEEEecCCC Confidence 99999999999987764 468999999999999999999999999999999999999988765 89999999999988 Q ss_pred CcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHH Q lcl|NC_019916. 158 NPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVL 237 (513) Q Consensus 158 ~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~ 237 (513) .+++++++|+|... ....++++||++.+++|....+ .+..++..+|++|.||||+|+|+.+|+|+|++++ T Consensus 155 ~~~~~~~ir~~~~~-------~~~~~~~~yt~~~i~~~~~~~~---~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~ 224 (453) T protein:vir:39 155 KQEPLFAVRYGYDD-------DYKLYGEVYTKETTYALNGTMG---FYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVI 224 (453) T ss_pred CCeEEEEEEEEEeC-------CeEEEEEEEeCCeEEEEEecCC---ceeeecccccCCCceeEEEecCCCCCCcchhhhH Confidence 88999999998632 2356899999999999875432 2344566789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAP 317 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (513) +|||+||+++|++++.+++|++|+++++|....... +..+..++++.+.. . T Consensus 225 ~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~--------------------------~~~~~~~~~~~~~~---~ 275 (453) T protein:vir:39 225 SLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEED--------------------------LKNIRSNRVINYYG---E 275 (453) T ss_pred HHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCchh--------------------------hhhhhhcceeeecC---C Confidence 999999999999999999999999999997432111 11223344444332 2 Q ss_pred cccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR 397 (513) Q Consensus 318 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 397 (513) .+..++++++|++++++.+++++++++|.++||.+|++|+++++.+ ||+||+||++++++|+.||+++++.|+.+|+++ T Consensus 276 ~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~ 354 (453) T protein:vir:39 276 SSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESF-GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSR 354 (453) T ss_pred CCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2345789999999999999999999999999999999999999887 789999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 398 YTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 398 ~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) +++|+++++..+. ..++.+|+|+|++++|.|+++.|++++|++|++|.||+|+++|+++|+++|++||++|+++..+. T Consensus 355 ~~li~~~~~~~~~--~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~ 432 (453) T protein:vir:39 355 YKLYCELSTNVSN--KEAWKDIEYTFTRNEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIF 432 (453) T ss_pred HHHHHHHHhccCC--ccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 9999998876543 56778999999999999999999999999999999999999999999999999999998876553 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) ..... .. ..+.+++.++..++ T Consensus 433 ~~~~~---~~-~~~~~~~~~~~~~e 453 (453) T protein:vir:39 433 DKDKQ---PS-EKGTDTVVPETNEE 453 (453) T ss_pred HHhcc---CC-CCCCCCCCCCcCCC Confidence 32111 11 11111111111111 No 17 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=3.8e-99 Score=560.17 Aligned_cols=452 Identities=16% Similarity=0.198 Sum_probs=368.3 Q ss_pred Cccchhhc-eeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc----cCCCCCCcceeecchhH Q lcl|NC_019916. 1 MIDMQQAN-MNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASR----RNEKGKADHRAVHSFAR 75 (513) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~----~~~~~~~~~ri~~n~~~ 75 (513) =|..+.-+ ..+.....+++.+.|.++|++|. .+++|++++++||+|+|+|++++... .....++++|+++||++ T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k 104 (492) T protein:vir:97 26 PTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHA 104 (492) T ss_pred hhhhhHhhhcccCCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHH Confidence 12222223 23333345788999999999985 57799999999999999998765432 23456788999999999 Q ss_pred HHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) +||++.++||+|+||+++++++.. +++|+ .|+++.++.+++++++++|+||+++|.+++|.+++.+ ++|.+++|+ T Consensus 105 ~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~-~~p~~~~~i 182 (492) T protein:vir:97 105 NLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFR-VPAEQGIPI 182 (492) T ss_pred HHHHHHhhhhcccCceeccCchHHHHHHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEE-EcccceEEE Confidence 999999999999999999887754 66665 4899999999999999999999999999999988765 899999999 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-------ccccccccccccCcccceEEecC Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-------SVPTLEVAEHSAQFGFPMIEYRN 225 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-------~~~~~~~~~~~~~g~vPvv~~~n 225 (513) ||++..+++++++|+|..++ ..++++|++..+++|....+.. ...+.....+|+||.||||+|+| T Consensus 183 ~d~~~~~~~~~~vr~~~~~~--------~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 254 (492) T protein:vir:97 183 WTDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN 254 (492) T ss_pred EcCCCCCceEEEEEEEeecc--------ceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecC Confidence 99988899999999998643 2367899999998886544321 12334556789999999999999 Q ss_pred CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh Q lcl|NC_019916. 226 NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ 305 (513) Q Consensus 226 ~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 305 (513) +.+|+|+|+++++|||+||+++|++++.+++|++|+++++|+...... ++.. .... T Consensus 255 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~-------------~~~~-----------~~~~ 310 (492) T protein:vir:97 255 NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELP-------------EFKR-----------LLRY 310 (492) T ss_pred CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccch-------------hHHH-----------HHhh Confidence 999999999999999999999999999999999999999997543211 1111 1122 Q ss_pred cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 306 ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELAST 385 (513) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~ 385 (513) .+++.+ .++++++|++|+.+.+++++++++|+++||.+|++|++++++++||+||+||++++++|+.||++ T Consensus 311 ~~~~~~---------~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~ 381 (492) T protein:vir:97 311 YGAIKV---------SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK 381 (492) T ss_pred ccceec---------CCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHH Confidence 333332 45688999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHH Q lcl|NC_019916. 386 KRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVK 465 (513) Q Consensus 386 ~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ 465 (513) +++.|+.+|++++++|+++++. ..++.+++++|++++|+|+++.|++++|++|++|.||+++++|+|+|+++|++ T Consensus 382 ~~~~f~~~l~~~~~li~~~~~~-----~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~G~iS~et~l~~l~~v~d~~~Ele 456 (492) T protein:vir:97 382 LARKAKVAIQELLWFVFEHFDI-----KGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELE 456 (492) T ss_pred HHHHHHHHHHHHHHHHHHHhcC-----CcccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHH Confidence 9999999999999999998753 23567899999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 466 MMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 466 ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) ||++|+++..+..+...+...+...+.++.++..++ T Consensus 457 ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 457 RIEQEQTEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHHHHHHhhhccccCCCCCCcccccccccccC Confidence 999999877665544333333222221111111111 No 18 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=2.1e-98 Score=556.14 Aligned_cols=463 Identities=36% Similarity=0.564 Sum_probs=387.5 Q ss_pred Ccc-------chhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc-cccccCCCCCCcceeecc Q lcl|NC_019916. 1 MID-------MQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILS-PASRRNEKGKADHRAVHS 72 (513) Q Consensus 1 ~~~-------~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~-~~~~~~~~~~~~~ri~~n 72 (513) |.. +....+.++.....++++.|.++|++|...+.++++++++||+|+|+.+.. .........++++|+++| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n 85 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHN 85 (481) T ss_pred eehhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecc Confidence 111 112234555556789999999999999989999999999999999876543 344455567889999999 Q ss_pred hhHHHHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce Q lcl|NC_019916. 73 FARYIADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC 149 (513) Q Consensus 73 ~~~~ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~ 149 (513) |+++||++.++||+|+||+++++++ +.++++|+.|+++.++.+++++++++|+||+++|++++|.+++.+ ++|.++ T Consensus 86 ~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~-~~p~~~ 164 (481) T protein:vir:10 86 YAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKV-LDPKST 164 (481) T ss_pred hHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEE-Ecccce Confidence 9999999999999999999987653 578999999999999999999999999999999999999888764 899999 Q ss_pred EEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCC Q lcl|NC_019916. 150 FIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYR 229 (513) Q Consensus 150 ~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~ 229 (513) +|+||+....++++++|+|...+.+ ...+.++++||++.+++|....+ .+..++..+|+||.||||+|+|+.+| T Consensus 165 ~~v~d~~~~~~~~~~i~~~~~~~~~---~~~~~~~~~y~~~~i~~~~~~~~---~~~~~~~~~~~~g~vPvv~~~n~~~g 238 (481) T protein:vir:10 165 FVVYDQTLDKKVVAGVRYFEKQDKD---KVPVQHVEVYTTDKIYYIEIKGG---TYHRVEEVEHYYNDVPIIEYLNDQFK 238 (481) T ss_pred EEEEcCCCCCceEEEEEEEEEeeCC---CceEEEEEEEecCeEEEEEecCC---ceeecccccccCCceeEEEeecCCCC Confidence 9999998888999999999865433 34567899999999999875332 24445677999999999999999999 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhccee Q lcl|NC_019916. 230 QGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMI 309 (513) Q Consensus 230 ~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 309 (513) +|+|+++++|||+||+++|++++.+++|++|+++++|....... .......++++ T Consensus 239 ~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~-------------------------~~~~~~~~~~~ 293 (481) T protein:vir:10 239 QGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSE-------------------------DAKAFRDANMI 293 (481) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCcc-------------------------chhhhhhccce Confidence 99999999999999999999999999999999999996432211 11223345555 Q ss_pred eccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 310 LLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQ 389 (513) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~ 389 (513) .+.......+.+.+++++|++++++.+++++++++|+++|+.+|++|+++++++++|+||+||++++++|..||+++++. T Consensus 294 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~ 373 (481) T protein:vir:10 294 HLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERL 373 (481) T ss_pred eccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 66556666677788999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_019916. 390 FERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDK 469 (513) Q Consensus 390 f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~ 469 (513) |+.+|++++++++++++... ...++..+++++|++++|.|.++.|+++++++|++|.||+++++|+++|+++|++||++ T Consensus 374 ~~~~l~~~~~li~~~~~~~~-~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~g~is~et~~~~l~~i~d~~~E~~ri~~ 452 (481) T protein:vir:10 374 FKKGLMKRYKLLLNNVNLTG-LKQHNYAELTITFTPNLPKSMMESINAFNALSGGVSESTRLSLLDFIDNPKEELEKMQE 452 (481) T ss_pred HHHHHHHHHHHHHHHHhccC-CCccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHH Confidence 99999999999999987654 44677889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhhcCC--CCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 470 QRKAMLKTYDTKGGL--IINGTSGNDPEDEGVRGQQG 504 (513) Q Consensus 470 E~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 504 (513) |+++..+..+..... +.++..++++ +| T Consensus 453 E~~~~~~~~~~~~~~~~~~~~~~~dd~--------~g 481 (481) T protein:vir:10 453 EEAQREKQADKRGYGEAFENHLNVDDS--------NG 481 (481) T ss_pred HHHHHHhhhhhccCCccCCCCCCCCCC--------CC Confidence 998876654432222 1111111111 11 No 19 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=8.8e-99 Score=558.15 Aligned_cols=434 Identities=37% Similarity=0.588 Sum_probs=370.1 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcH------HH Q lcl|NC_019916. 26 FIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSS------DR 99 (513) Q Consensus 26 ~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~------~~ 99 (513) ||..|+..+++||+++++||+|+|++++++.. ...+.++++||++||+++||++.++||+|+|+++++.+. +. T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~-~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~~ 79 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHR-RLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLST 79 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccc-cccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHHH Confidence 88888899999999999999999998766543 356678999999999999999999999999999965432 35 Q ss_pred HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccce Q lcl|NC_019916. 100 LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQ 179 (513) Q Consensus 100 l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~ 179 (513) ++++|+.|+++.++.+++++++++|+||+++|++++|++++.+ ++|.+++|+||+...+++++++|+|...+ T Consensus 80 l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~-~~p~~~~~~~d~~~~~~~~~~i~~~~~~~------- 151 (440) T protein:vir:95 80 IKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVL-ISPLEMFVIRDLTVEQNIIAAVHLPIYAD------- 151 (440) T ss_pred HHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEE-EcccceEEEEcCCCCCceEEEEEEEEecC------- Confidence 8899999999999999999999999999999999999887764 89999999999998889999999987543 Q ss_pred eEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 180 TKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNE 259 (513) Q Consensus 180 ~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~ 259 (513) ..++++||++.+++|.....+.......+..+|+||.||||+|+|++.|+|+|+++++|||+||+++|++++.+++|++ T Consensus 152 -~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~ 230 (440) T protein:vir:95 152 -KVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLND 230 (440) T ss_pred -ceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 3467899999999998776655556667788999999999999999999999999999999999999999999999999 Q ss_pred hhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHH Q lcl|NC_019916. 260 AMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTE 339 (513) Q Consensus 260 ~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 339 (513) |+++++|......... .....+...+++.+.......+.+.+++++|++|+++.++++ T Consensus 231 ~~~v~~g~~~~~~~~~----------------------e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~ 288 (440) T protein:vir:95 231 AMLLVKGDLDGIKLSP----------------------EDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTE 288 (440) T ss_pred ceeeeecccccCCCCc----------------------cchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHH Confidence 9999999643322111 111223445556666666667778899999999999999999 Q ss_pred HHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccee Q lcl|NC_019916. 340 LYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEI 419 (513) Q Consensus 340 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i 419 (513) +++++|.++||.+|++|++++++++||+||+||++++++|++||+++++.|+++|++++++|+.++....+ ..+++.++ T Consensus 289 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~-~~~~~~~v 367 (440) T protein:vir:95 289 AYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAING-PVIEANKL 367 (440) T ss_pred HHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-cccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999877654 46778899 Q ss_pred eEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCC Q lcl|NC_019916. 420 GFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPE 495 (513) Q Consensus 420 ~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (513) +++|++++|+|+++.|++++|++|++|.||+|+++|+++ +++|++||++|+++.........+.... .+.+.+ T Consensus 368 ~i~f~~~~p~~~~~~ad~~~kl~g~iS~et~~~~l~~~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~~--~~~~~e 440 (440) T protein:vir:95 368 TFTFHPNIPQDVWTEIKAYIEAGGEISQETLMENASFTD-YKTEHSRILKQGGSSDLEIGQIVGDADV--GQADTE 440 (440) T ss_pred eEEeCCCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCC-cHHHHHHHHHHHHHhhhhHHhhccCCCC--CCcCCC Confidence 999999999999999999999999999999999999985 4689999999988765544332221111 111111 No 20 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=8.8e-98 Score=552.69 Aligned_cols=449 Identities=24% Similarity=0.351 Sum_probs=370.6 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |--..=.-|.++.+ .+++.+.|.++|++|. .+++|++++++||+|+|+++.++. ....++++|+++||+++||++ T Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~i~~~i~~~~-~~~~r~~~~~~Yy~g~~~i~~~~~---~~~~~~~~ki~~n~~~~ivd~ 75 (452) T protein:vir:36 1 MKYKPPKLMTFSKD-EPITVEVVTKFMEKHK-LEVARYEYLKNMYLGIMAIDDEPA---KDSWKPDNRLAVNFTKYIVDT 75 (452) T ss_pred CcccCceeEEcCCc-cCCCHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccCcc---ccccCccceeecchHHHHHHH Confidence 22222233444444 5789999999999985 567899999999999999987654 345678999999999999999 Q ss_pred HHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCC Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSV 157 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~ 157 (513) .++||+|+|++++++++ +.++++|+.|+++.++.+++++++++|+||+++|++++|++++.+ ++|.+++|+||++. T Consensus 76 ~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~-~~p~~~~~v~d~~~ 154 (452) T protein:vir:36 76 FTGYFNGIPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVY-NSPENMFMVYDDTV 154 (452) T ss_pred HhhhhcccCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEE-EcccceEEEEcCCC Confidence 99999999999987764 568899999999999999999999999999999999999988764 89999999999998 Q ss_pred CcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHH Q lcl|NC_019916. 158 NPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVL 237 (513) Q Consensus 158 ~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~ 237 (513) .+++++++|+|...+ ...++++||++.+++|.... ..+......+|++|.||||+|+|+.+|+|+|++++ T Consensus 155 ~~~~~~~i~~~~~~~-------~~~~~~vyt~~~i~~~~~~~---~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~ 224 (452) T protein:vir:36 155 KQEPLFAVRYGVDED-------KKLQGEVYTLLETIKISGEN---DEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVI 224 (452) T ss_pred CCceEEEEEEEEecC-------ceEEEEEEecCeEEEEEEcC---CceEEecceeccCCcccEEEecCCCCCCcchHHHH Confidence 889999999986432 24578999999999987543 23455667899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAP 317 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (513) +|||+||+++|++++.+++|++|+++++|....... ......++++.+.. T Consensus 225 ~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~~~--------------------------~~~~~~~~~~~~~~---- 274 (452) T protein:vir:36 225 SLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEED--------------------------LKNIRSNRVINYYA---- 274 (452) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCchh--------------------------hhhhhhcceEEecC---- Confidence 999999999999999999999999999997432111 01122234444433 Q ss_pred cccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR 397 (513) Q Consensus 318 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 397 (513) .+.+.+++++|++|+++.+++++++++|.++||.+|++|+++++++ ||+||+||++++++|..||+++++.|+.+|+++ T Consensus 275 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 353 (452) T protein:vir:36 275 DGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESF-GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSR 353 (452) T ss_pred CCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-cCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2345678999999999999999999999999999999999998887 789999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 398 YTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 398 ~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) +++|++++...+. ..++.+|+|+|++++|.|+++.|++++|++|++|.||+|+++|+++||++|++||++|++++.+. T Consensus 354 ~~li~~~~~~~~~--~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~ 431 (452) T protein:vir:36 354 YKLFCELSTNVSN--KDSWKDIEYTFTRNEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIF 431 (452) T ss_pred HHHHHHHHhccCC--ccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 9999999877643 45677899999999999999999999999999999999999999999999999999998875443 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) ... .... .++.+ +..++++.| T Consensus 432 ~~~-~~~~------~~~~~----~~~~~~~~e 452 (452) T protein:vir:36 432 DKD-KQPS------EKGTD----TVVSETNEE 452 (452) T ss_pred Hhh-ccCC------CCccc----ccCccccCC Confidence 221 1000 01111 111111111 No 21 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=6.7e-98 Score=553.32 Aligned_cols=447 Identities=24% Similarity=0.349 Sum_probs=372.6 Q ss_pred ccchhhceeccCC-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 2 IDMQQANMNYQED-ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 2 ~~~~~~~~~~~~~-~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |+| +.+..+.++ .++++++.|.++|++| ..+++||+++.+||+|+|+++++.. ....++++|+++||+++||++ T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~i~~~i~~~-~~~~~r~~~~~~yy~g~~~i~~~~~---~~~~~~~~ki~~n~~~~ivd~ 75 (453) T protein:vir:73 1 MNL-KPIKLMTYSRDEEITDKVVNDFMKKH-QEEVERYEYLGNMYKGIMEISSQKA---KDSWKPDNRLTNNFAKYIVDT 75 (453) T ss_pred Ccc-ccceeeeccccccCCHHHHHHHHHHH-HHHHHHHHHHHHHhccccchhcCCC---CCccCccceeecchHHHHHHH Confidence 443 334444444 4689999999999998 4678999999999999999987553 345678999999999999999 Q ss_pred HHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCC Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSV 157 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~ 157 (513) .++||+|+|++++++++ +.+++||+.|+|+.++.+++++++++|+||++||++++|.+++.+ ++|.+++|+|++.. T Consensus 76 ~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~-~~p~~~~~v~dd~~ 154 (453) T protein:vir:73 76 FVGYFNGIPIKKTHDDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIY-CSPLNVFMVYDDSI 154 (453) T ss_pred hhhhhcccCceeecCChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEE-EcccceEEEEeCCC Confidence 99999999999988765 468899999999999999999999999999999999999988764 89999999999998 Q ss_pred CcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHH Q lcl|NC_019916. 158 NPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVL 237 (513) Q Consensus 158 ~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~ 237 (513) ++++++++|||.... ...++++||++.+++|....+ .+......+|+||.||||+|+|+.+|+|+|++++ T Consensus 155 ~~~~~~~i~~~~~~~-------~~~~~~vyt~~~i~~~~~~~~---~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~ 224 (453) T protein:vir:73 155 KQKPLFAVYYGFDEE-------GNLSGTVYTLLETISITGKAG---EVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVH 224 (453) T ss_pred CceeEEEEEEEEecC-------ceEEEEEEeCCeEEEEEecCC---ceEEccceeccCCceeEEEecCCCCCCcchhhHH Confidence 899999999876321 245789999999999876432 2344566789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecc--ccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLK--TGM 315 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~ 315 (513) +|||+||+++|++++.+++|++|+|+++|+........ .++ ..+++.+. ... T Consensus 225 ~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~---------------~~~-----------~~~~~~~~~~~~~ 278 (453) T protein:vir:73 225 SLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDEEDAK---------------NIK-----------DNRLINFFDKNSN 278 (453) T ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhh---------------ccc-----------ccccccccccccc Confidence 99999999999999999999999999999754322111 111 11111111 112 Q ss_pred cccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 316 APNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLN 395 (513) Q Consensus 316 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~ 395 (513) .......+++++|++|+.+.+++++++++|.++||.+|++|+++++++ ||+||+||++++++|++||+++++.|+.+|+ T Consensus 279 ~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~ 357 (453) T protein:vir:73 279 GQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENF-GNSSGVALAYKLQAMSNLALSFQRKFQSALN 357 (453) T ss_pred cccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223455678899999999999999999999999999999999999887 7899999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019916. 396 QRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAML 475 (513) Q Consensus 396 ~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~ 475 (513) +++++++++++..+. ..++.+++++|++++|.|+++.|++++|++|++|.||+++++|+++||++|++||++|+++++ T Consensus 358 ~~~~li~~~~~~~~~--~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~ 435 (453) T protein:vir:73 358 RRYSLWSSLSTNASN--KDAWKDIEYTFTRNEPKDIKEQAETANILKGITSEETALSVISVIPDVQAEMEKIKKKKLLQL 435 (453) T ss_pred HHHHHHHHHHhccCC--ccccccceEEeCCCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 999999998876543 456778999999999999999999999999999999999999999999999999999999877 Q ss_pred HHhhhhcCCCCCCCCCCC Q lcl|NC_019916. 476 KTYDTKGGLIINGTSGND 493 (513) Q Consensus 476 ~~~~~~~~~~~~~~~~~~ 493 (513) ..........+++..+.- T Consensus 436 ~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 436 SLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HHHHhccCCcchhhhcCC Confidence 665543322222222222 No 22 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=3.2e-98 Score=555.12 Aligned_cols=433 Identities=18% Similarity=0.228 Sum_probs=362.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc----cCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASR----RNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS 93 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~----~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~ 93 (513) ||.+.|.++|++|. .+++|++++++||+|+|+|+++.... .....++++||++||+++||++.++||+|+||+|+ T Consensus 1 l~~~~i~~~i~~~~-~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~ 79 (451) T protein:vir:10 1 MELEKIRAIISADA-ARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFD 79 (451) T ss_pred CCHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceee Confidence 99999999999986 46899999999999999998775432 23345688999999999999999999999999998 Q ss_pred CCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCC--------CceeEEEEEcccceEEEecCCCCcceE Q lcl|NC_019916. 94 GPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPS--------QKGEVSVKLDPMECFIIYDRSVNPKPI 162 (513) Q Consensus 94 ~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~--------~~~~~~~~~~p~~~~~~~d~~~~~~~~ 162 (513) +++++. +.+.|..|+++.++.+++++++++|+||+++|++++ |+.++. .++|.+++|+||++..+++. T Consensus 80 ~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~-~i~p~~~~~vydd~~~~~~~ 158 (451) T protein:vir:10 80 IDNNKELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYG-VVNTEEIIPIYRNGIERELE 158 (451) T ss_pred cCCcHHHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEE-EEcccceEEEEcCCCCCceE Confidence 765433 334444699999999999999999999999999986 444444 37999999999999889999 Q ss_pred EEEEEEeeccccccc--ceeEEEEEEEcCCcEEEEEeeccCCcc-ccccccccccCcccceEEecCCCCCCcchhHHHHH Q lcl|NC_019916. 163 MAVRYHAVQTVVDNI--TQTKYEVETWTENDYTRYKPIVVAGSV-PTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSL 239 (513) Q Consensus 163 ~~ir~~~~~~~~~~~--~~~~~~ve~yt~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~l 239 (513) ++||+|......... .++..++++||++.+++|+....+... .......+|+||.||||+|+|+..|.|+|+++++| T Consensus 159 ~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~l 238 (451) T protein:vir:10 159 AVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKKI 238 (451) T ss_pred EEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCCCCchhhHHHH Confidence 999999876554332 356778999999999999876554332 33445668999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccc Q lcl|NC_019916. 240 IDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNG 319 (513) Q Consensus 240 iD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (513) ||+||.++|++++.+++|++|+++++|+.+... .++.. .+...+++.+.. .. T Consensus 239 iDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~-------------~~~~~-----------~~~~~~~i~~~~----~~ 290 (451) T protein:vir:10 239 LDLYDRVMSGFANDLEDIQQIIYILENFGGEDT-------------SEFLK-----------ELKRYKTIKTET----DS 290 (451) T ss_pred HHHHHHHHHHHHHHHHHhccceeeeecCCcccc-------------hhhHH-----------HHhhCCeEEecC----cC Confidence 999999999999999999999999999754321 11212 233344444432 23 Q ss_pred cccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYT 399 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~ 399 (513) .+.+++++||+|+.+.+++++++++|.++||.+|++|+++++++ ||+||+||++++++|++||+++++.|+++|+++++ T Consensus 291 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~ 369 (451) T protein:vir:10 291 EGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENF-GNASGVALKFFYRKLELKSGLLETEFRTSFDKLIK 369 (451) T ss_pred CccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45678999999999999999999999999999999999999887 78999999999999999999999999999999999 Q ss_pred HHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019916. 400 VVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYD 479 (513) Q Consensus 400 li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~ 479 (513) +|+++++. .++.+++++|++++|.|+++.|+++++++|++|+||+++++|+++||++|++++++|++++..... T Consensus 370 li~~~~~~------~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~ 443 (451) T protein:vir:10 370 AILYFLGV------TDYKKIQQTYTRNMMSNDLEDADIATKSVGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVS 443 (451) T ss_pred HHHHHhCC------CCccceeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH Confidence 99998753 356789999999999999999999999999999999999999999999999999888776555443 Q ss_pred hhcCCCCC Q lcl|NC_019916. 480 TKGGLIIN 487 (513) Q Consensus 480 ~~~~~~~~ 487 (513) ...+...+ T Consensus 444 ~~~~~~~~ 451 (451) T protein:vir:10 444 DDYNNFTE 451 (451) T ss_pred hhcCCCCC Confidence 32222222 No 23 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=9.7e-98 Score=552.46 Aligned_cols=439 Identities=17% Similarity=0.195 Sum_probs=362.1 Q ss_pred CCHHHHHHHHHHH---HHHHHHHHHHHHHHhcCCCccccccccc--------cCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 18 LTPTRIAAFIRHH---YNNQRPRLEMLYDYYRGQNDGILSPASR--------RNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 18 ~~~~~i~~~i~~~---~~~~~~~~~~~~~YY~G~~~i~~~~~~~--------~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |..+.|.++|..+ +..+.++|+++++||+|+|+|+.++... ....+++++||++||+++||++.++||+ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 7777777777653 3456789999999999999998876433 2334578999999999999999999999 Q ss_pred cCCeeecCCcHH---HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEE Q lcl|NC_019916. 87 GNAIAMSGPSSD---RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIM 163 (513) Q Consensus 87 g~p~~~~~~~~~---~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~ 163 (513) |+||+|++++++ .++++++ ++++..+.+++++++++|+||+++|+|++|+.++.+ ++|.++||+||++..+++++ T Consensus 81 G~p~~~~~~d~~~~~~l~~~~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~-~~p~~~~~v~d~~~~~~~~a 158 (470) T protein:vir:10 81 SVFPDIDVGKDADNKKIIDVLG-DDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGI-IQPDQITPIYATTLDNKLLG 158 (470) T ss_pred ccceeeecCchHHHHHHHHHHh-hhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEE-EcccceEEEEcCCCCCceEE Confidence 999999887764 4666666 568888899999999999999999999999988764 89999999999999899999 Q ss_pred EEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc-----------------ccccccccccCcccceEEecCC Q lcl|NC_019916. 164 AVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV-----------------PTLEVAEHSAQFGFPMIEYRNN 226 (513) Q Consensus 164 ~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~g~vPvv~~~n~ 226 (513) +||+|...+.++ .....++++||++.+++|.....+... .......+|+||.||||+|+|| T Consensus 159 ~ir~y~~~~~~~--~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn 236 (470) T protein:vir:10 159 ILRSYKQLDPDS--GKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKN 236 (470) T ss_pred EEEEEEeeecCC--ceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecC Confidence 999998766554 345678899999999998765443211 1223445799999999999999 Q ss_pred CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhc Q lcl|NC_019916. 227 EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQA 306 (513) Q Consensus 227 ~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 306 (513) .+|+|+|+++++|||+||.++|++++.+++|++|+++++|+.+... +++.. .+... T Consensus 237 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~-------------~~~~~-----------~~~~~ 292 (470) T protein:vir:10 237 KYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADL-------------HQFMN-----------DLRKY 292 (470) T ss_pred CCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCcccc-------------chhhh-----------hhhhc Confidence 9999999999999999999999999999999999999999754221 11111 22333 Q ss_pred ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 307 NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTK 386 (513) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~ 386 (513) +++.+.. .+.+.+++|+|++|+++.++++.++++|.++||.+|++|+++++.+ ||+||+||++++++|++||+++ T Consensus 293 ~~i~~~~----~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~ 367 (470) T protein:vir:10 293 KSIKINN----TGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKT 367 (470) T ss_pred CeEeccC----CCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccchHHHHHHHHHHHHHHHHHH Confidence 4444432 2345689999999999999999999999999999999999999887 7999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHH Q lcl|NC_019916. 387 RKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKM 466 (513) Q Consensus 387 ~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~r 466 (513) ++.|+++|++++++|+++++.. ..++.+++++|++++|.|+++.|+++++++|++|.||+++++|+|+||++|++| T Consensus 368 ~~~~~~~l~~~~~~i~~~l~~~----~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~g~iS~et~l~~~p~v~D~~~E~er 443 (470) T protein:vir:10 368 QTYFEHAINELVRAIMRYLNFS----DADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKD 443 (470) T ss_pred HHHHHHHHHHHHHHHHHHhccc----CcccceeeEEeccCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHH Confidence 9999999999999999987542 456789999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcCCCCCCCCCCCCCC Q lcl|NC_019916. 467 MDKQRKAMLKTYDTKGGLIINGTSGNDPED 496 (513) Q Consensus 467 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (513) |++|+++.++........ ...+.++++ T Consensus 444 i~~E~~e~~~~~~~~~~~---~~~~~dde~ 470 (470) T protein:vir:10 444 LAKDKEENDPYSNQADEL---NGKGVNDEQ 470 (470) T ss_pred HHHHHHHHHHhhcccccc---CCCCCCCCC Confidence 999988876654332111 111111111 No 24 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=2.3e-97 Score=550.37 Aligned_cols=451 Identities=17% Similarity=0.205 Sum_probs=365.8 Q ss_pred Cccch------hhceeccC---CcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc----ccCCCCCCcc Q lcl|NC_019916. 1 MIDMQ------QANMNYQE---DADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS----RRNEKGKADH 67 (513) Q Consensus 1 ~~~~~------~~~~~~~~---~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----~~~~~~~~~~ 67 (513) ||+.- -+...|.. ...+.+.+.|.++|++|. .++++++++++||+|+|+|++++.. ......++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK-QKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDW 79 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccchhhhccccccccccc Confidence 43210 01111111 123567789999999986 5788999999999999999876542 2234457889 Q ss_pred eeecchhHHHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEE Q lcl|NC_019916. 68 RAVHSFARYIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKL 144 (513) Q Consensus 68 ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~ 144 (513) ||++||+++||++.++||+|+||+++++++.. +++|+ .|+++.++.+++++++++|+||+++|++++|++++.+ + T Consensus 80 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~-~ 157 (474) T protein:vir:95 80 RITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFR-V 157 (474) T ss_pred ccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEE-E Confidence 99999999999999999999999999887654 55555 5889999999999999999999999999999988765 8 Q ss_pred cccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-------ccccccccccccCcc Q lcl|NC_019916. 145 DPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-------SVPTLEVAEHSAQFG 217 (513) Q Consensus 145 ~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-------~~~~~~~~~~~~~g~ 217 (513) +|.++||+||++...++++++|+|..+. ..++++||++.+++|.....+. .........+|++|. T Consensus 158 ~p~~~~~v~d~~~~~~~~a~ir~~~~~~--------~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (474) T protein:vir:95 158 PAEQAIPIWTDKEREQLNAFIRIFTFNG--------ETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWER 229 (474) T ss_pred cccceEEEEcCCCCCceEEEEEEEeecC--------eeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCc Confidence 9999999999998899999999997532 3578999999999987654321 122334556899999 Q ss_pred cceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccc Q lcl|NC_019916. 218 FPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKM 297 (513) Q Consensus 218 vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 297 (513) ||||+|+|+.+|.|+|+++++|||+||.++|++++.+++|++|+++++|+...... ++ T Consensus 230 vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-------------~~--------- 287 (474) T protein:vir:95 230 VPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS-------------EF--------- 287 (474) T ss_pred cceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc-------------ch--------- Confidence 99999999999999999999999999999999999999999999999997542111 11 Q ss_pred hhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHH Q lcl|NC_019916. 298 AQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVL 377 (513) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 377 (513) ...+...+++.+ ..+++++|++|+.+.+++++++++|.++||.+|++|++++++++||+||+||+++++ T Consensus 288 --~~~~~~~~~i~~---------~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~ 356 (474) T protein:vir:95 288 --MEGLKYYKAINV---------SSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYT 356 (474) T ss_pred --hhhhhccceeec---------cCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHH Confidence 112233344443 346899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCC Q lcl|NC_019916. 378 GTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNV 457 (513) Q Consensus 378 ~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v 457 (513) +|.+||.++++.|+++|++++++|+++++. .+++.+|+++|++++|.|+++.|+++++ +|++|.||+++++|+| T Consensus 357 ~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~-----~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v 430 (474) T protein:vir:95 357 NLNLKANKLKNKANVALQELMQFILDFNKI-----KLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCC Confidence 999999999999999999999999988642 4567889999999999999999999887 5999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 458 TDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 458 ~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) +|+++|++||++|+++..+.+....+..+++..+.+++++..++ T Consensus 431 ~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 431 DDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 99999999999999887666554444433322222222222211 No 25 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=2.3e-97 Score=550.37 Aligned_cols=451 Identities=17% Similarity=0.205 Sum_probs=365.8 Q ss_pred Cccch------hhceeccC---CcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc----ccCCCCCCcc Q lcl|NC_019916. 1 MIDMQ------QANMNYQE---DADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS----RRNEKGKADH 67 (513) Q Consensus 1 ~~~~~------~~~~~~~~---~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----~~~~~~~~~~ 67 (513) ||+.- -+...|.. ...+.+.+.|.++|++|. .++++++++++||+|+|+|++++.. ......++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK-QKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDW 79 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccchhhhccccccccccc Confidence 43210 01111111 123567789999999986 5788999999999999999876542 2234457889 Q ss_pred eeecchhHHHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEE Q lcl|NC_019916. 68 RAVHSFARYIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKL 144 (513) Q Consensus 68 ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~ 144 (513) ||++||+++||++.++||+|+||+++++++.. +++|+ .|+++.++.+++++++++|+||+++|++++|++++.+ + T Consensus 80 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~-~ 157 (474) T protein:vir:96 80 RITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFR-V 157 (474) T ss_pred ccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEE-E Confidence 99999999999999999999999999887654 55555 5889999999999999999999999999999988765 8 Q ss_pred cccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-------ccccccccccccCcc Q lcl|NC_019916. 145 DPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-------SVPTLEVAEHSAQFG 217 (513) Q Consensus 145 ~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-------~~~~~~~~~~~~~g~ 217 (513) +|.++||+||++...++++++|+|..+. ..++++||++.+++|.....+. .........+|++|. T Consensus 158 ~p~~~~~v~d~~~~~~~~a~ir~~~~~~--------~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (474) T protein:vir:96 158 PAEQAIPIWTDKEREQLNAFIRIFTFNG--------ETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWER 229 (474) T ss_pred cccceEEEEcCCCCCceEEEEEEEeecC--------eeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCc Confidence 9999999999998899999999997532 3578999999999987654321 122334556899999 Q ss_pred cceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccc Q lcl|NC_019916. 218 FPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKM 297 (513) Q Consensus 218 vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 297 (513) ||||+|+|+.+|.|+|+++++|||+||.++|++++.+++|++|+++++|+...... ++ T Consensus 230 vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-------------~~--------- 287 (474) T protein:vir:96 230 VPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS-------------EF--------- 287 (474) T ss_pred cceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc-------------ch--------- Confidence 99999999999999999999999999999999999999999999999997542111 11 Q ss_pred hhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHH Q lcl|NC_019916. 298 AQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVL 377 (513) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 377 (513) ...+...+++.+ ..+++++|++|+.+.+++++++++|.++||.+|++|++++++++||+||+||+++++ T Consensus 288 --~~~~~~~~~i~~---------~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~ 356 (474) T protein:vir:96 288 --MEGLKYYKAINV---------SSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYT 356 (474) T ss_pred --hhhhhccceeec---------cCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHH Confidence 112233344443 346899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCC Q lcl|NC_019916. 378 GTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNV 457 (513) Q Consensus 378 ~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v 457 (513) +|.+||.++++.|+++|++++++|+++++. .+++.+|+++|++++|.|+++.|+++++ +|++|.||+++++|+| T Consensus 357 ~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~-----~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v 430 (474) T protein:vir:96 357 NLNLKANKLKNKANVALQELMQFILDFNKI-----KLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCC Confidence 999999999999999999999999988642 4567889999999999999999999887 5999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 458 TDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 458 ~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) +|+++|++||++|+++..+.+....+..+++..+.+++++..++ T Consensus 431 ~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 431 DDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 99999999999999887666554444433322222222222211 No 26 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=4e-97 Score=549.08 Aligned_cols=478 Identities=28% Similarity=0.420 Sum_probs=387.9 Q ss_pred chhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHH Q lcl|NC_019916. 4 MQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTS 83 (513) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~ 83 (513) |=+.++.+..-..++++++|.++|++|...+++||+++++||+|+|++++++. .....++++||++||+++||++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~--~~~~~~~~~ki~~n~~~~iv~~~~~ 78 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPA--KTDKYAADNRIASDFAKYITVFEQG 78 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccc--cccccCCcceeecchHHHHHHHHhh Confidence 66777777777788999999999999988889999999999999999987653 3455678999999999999999999 Q ss_pred HhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec----CCCceeEEEEEcccceEEEecCC Q lcl|NC_019916. 84 YSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD----PSQKGEVSVKLDPMECFIIYDRS 156 (513) Q Consensus 84 ~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d----~~~~~~~~~~~~p~~~~~~~d~~ 156 (513) ||+|+|++++++++ +.+++||+.|+++.++.+++++++++|+||+++|+. +++.+++. .++|.+++|+||+. T Consensus 79 ~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~-~~~p~~~~~v~dd~ 157 (489) T protein:vir:99 79 YMLGVPVEYKNENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLY-QLPAEQTFVIYDDT 157 (489) T ss_pred hhccCCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEE-EEcccceEEEEcCC Confidence 99999999987765 468899999999999999999999999999999974 44555554 47999999999998 Q ss_pred CCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHH Q lcl|NC_019916. 157 VNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENV 236 (513) Q Consensus 157 ~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v 236 (513) ..++++++||+|.....+ .....++++||++.+++|+....+..........+|++|.||||+|+|+..|+|+|+++ T Consensus 158 ~~~~~~~~i~~~~~~~~~---~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~v 234 (489) T protein:vir:99 158 YQRNSLMAVHFYDIDYGS---GKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNEERTGAYESV 234 (489) T ss_pred CCCceEEEEEEEEEecCC---CceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecCCCCCCchhhh Confidence 888999999999865433 24467889999999999987665555555667789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecccccc Q lcl|NC_019916. 237 LSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMA 316 (513) Q Consensus 237 ~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 316 (513) ++|||+||.++|++++.++++++|+|+++|................ ... .........+..++++.+..... T Consensus 235 ~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~-------~~~-~~~~~~~~~~~~~~~~~~~~~~~ 306 (489) T protein:vir:99 235 LDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGR-------LNP-NGRLAISIGFKKAQVLILDDNPN 306 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcc-------ccc-ccccccccccccceeeeeccccC Confidence 9999999999999999999999999999997654333222111110 000 11111222334455555554433 Q ss_pred ccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 317 PNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ 396 (513) Q Consensus 317 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~ 396 (513) ..+.+++++||+++++.+++++++++|.++||.+|++|++++++++||+||+||++++++|.+||.++++.|+.+|++ T Consensus 307 --~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~ 384 (489) T protein:vir:99 307 --PNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMR 384 (489) T ss_pred --ccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 345678899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhccccc--ccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCC--CHHHHHHHHHHHHH Q lcl|NC_019916. 397 RYTVVAHIEERVNGKW--DIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVT--DADEIVKMMDKQRK 472 (513) Q Consensus 397 ~~~li~~~l~~~~~~~--~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~--D~~~E~~ri~~E~~ 472 (513) ++++|+++++..++.. ...+.+++|+|++++|.|.++.|++++|++|++|+||+++++|+|+ |+++|++||++|++ T Consensus 385 ~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~giis~et~~~~l~~v~~~d~~~E~~ri~~E~~ 464 (489) T protein:vir:99 385 RLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEILNTVTGVDAEAELKRLKEEAD 464 (489) T ss_pred HHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhcCCCCchhHHHHHHHHHHHHH Confidence 9999999998766543 3345689999999999999999999999999999999999999997 78999999999987 Q ss_pred HHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 473 AMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEP 506 (513) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (513) +.+...+..... ..+++.++. +++| T Consensus 465 ~~~~~~~~~~~~------~~~~~~~~~---~~~p 489 (489) T protein:vir:99 465 KKQSLPEPRLVG------DASGQEEPT---AEKP 489 (489) T ss_pred HHhccccccccC------CCCCCcCCC---CCCC Confidence 655432211111 111111112 2222 No 27 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=2.3e-97 Score=550.35 Aligned_cols=451 Identities=19% Similarity=0.240 Sum_probs=371.6 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---ccccc-----------cccCCCCCCcc Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDG---ILSPA-----------SRRNEKGKADH 67 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i---~~~~~-----------~~~~~~~~~~~ 67 (513) |+|+|... .....+++++.|.++|+.|. ..++|+.++.+||+|.++. +.++. .......++++ T Consensus 1 ~~~~~~~~--~~~~~~~~~e~i~~~i~~~~-~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:10 1 MTLYKLID--DIEAQGILPKHIEALIESHK-DDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNN 77 (474) T ss_pred CchHHHHh--hccccCCCHHHHHHHHHHhh-hhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccc Confidence 88888764 44455899999999999985 5678999999999997653 22221 11233456889 Q ss_pred eeecchhHHHHHHHHHHhhcCCeeecCCc----H----HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCcee Q lcl|NC_019916. 68 RAVHSFARYIADFQTSYSVGNAIAMSGPS----S----DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGE 139 (513) Q Consensus 68 ri~~n~~~~ivd~~~~~l~g~p~~~~~~~----~----~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~ 139 (513) ||++||+++||++.++||+|+||+|++++ + +.+++||+.|+++.++.+++++++++|+||++||.+++|+++ T Consensus 78 ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~ 157 (474) T protein:vir:10 78 KLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIR 157 (474) T ss_pred ccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeE Confidence 99999999999999999999999997643 1 357889999999999999999999999999999999999988 Q ss_pred EEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccc Q lcl|NC_019916. 140 VSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFP 219 (513) Q Consensus 140 ~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 219 (513) +.+ ++|.+++|+||++. +++++||+|...+.. .....+++++||+..+++|.....+ .+......+|++|.|| T Consensus 158 ~~~-i~p~~~~~v~d~~~--~~~~~i~~~~~~~~~--~~~~~~~~~~y~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vP 230 (474) T protein:vir:10 158 IKN-IDPYNVIFVGDNIL--EPTYSLRYFYEKDDD--NGTDYVYAEFYDNAYYYVFRGEGID--ALQEVGRYEHLFDYNP 230 (474) T ss_pred EEE-EcccceEEEEcCCC--ceEEEEEEEEEeeCC--CceEEEEEEEEcCceEEEEeecCCC--cccccccccCCCCccc Confidence 764 89999999998754 588999999876533 3456778999999999998764332 2345566789999999 Q ss_pred eEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 220 MIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 220 vv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) ||+|+|+.+|+|+|+++++|||+||.++|++++.+++|++|+++++|+..... . T Consensus 231 vv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~---------------~----------- 284 (474) T protein:vir:10 231 LFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEE---------------M----------- 284 (474) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCch---------------h----------- Confidence 99999999999999999999999999999999999999999999999743211 0 Q ss_pred hhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHH Q lcl|NC_019916. 300 LEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGT 379 (513) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l 379 (513) ...+...+++.+ .+.+++++|++|+.+.+++++++++|.++||.+|++|++++++++||+||+||++++++| T Consensus 285 ~~~~~~~~~i~~--------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 356 (474) T protein:vir:10 285 IQETQKSGAFEL--------FDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMAL 356 (474) T ss_pred hhhhhhcceeEe--------cCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHH Confidence 111233444444 345789999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCC Q lcl|NC_019916. 380 VELASTKRKQFERGLNQRYTVVAHIEERVNGK-WDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVT 458 (513) Q Consensus 380 ~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~ 458 (513) .+||.++++.|+.+|++++++|+++++..+.. .++++.+++++|++++|.|+++.|++++|++|++|+||+++++|+|+ T Consensus 357 ~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~ 436 (474) T protein:vir:10 357 ENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVD 436 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCC Confidence 99999999999999999999999999887543 45677899999999999999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 459 DADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 459 D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ||++|++||++|+++..+..........++..+ +. ++| T Consensus 437 d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~-----~~------------~s~ 474 (474) T protein:vir:10 437 DVDYELDEMEKESLEFNDKLPDIDEGDANDKSQ-----NN------------QSE 474 (474) T ss_pred CHHHHHHHHHHHHHHHHhhcccccCCCcCCCCc-----cc------------cCC Confidence 999999999999877655432221111111110 00 011 No 28 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=2.3e-97 Score=550.35 Aligned_cols=451 Identities=19% Similarity=0.240 Sum_probs=371.6 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---ccccc-----------cccCCCCCCcc Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDG---ILSPA-----------SRRNEKGKADH 67 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i---~~~~~-----------~~~~~~~~~~~ 67 (513) |+|+|... .....+++++.|.++|+.|. ..++|+.++.+||+|.++. +.++. .......++++ T Consensus 1 ~~~~~~~~--~~~~~~~~~e~i~~~i~~~~-~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:94 1 MTLYKLID--DIEAQGILPKHIEALIESHK-DDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNN 77 (474) T ss_pred CchHHHHh--hccccCCCHHHHHHHHHHhh-hhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccc Confidence 88888764 44455899999999999985 5678999999999997653 22221 11233456889 Q ss_pred eeecchhHHHHHHHHHHhhcCCeeecCCc----H----HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCcee Q lcl|NC_019916. 68 RAVHSFARYIADFQTSYSVGNAIAMSGPS----S----DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGE 139 (513) Q Consensus 68 ri~~n~~~~ivd~~~~~l~g~p~~~~~~~----~----~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~ 139 (513) ||++||+++||++.++||+|+||+|++++ + +.+++||+.|+++.++.+++++++++|+||++||.+++|+++ T Consensus 78 ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~ 157 (474) T protein:vir:94 78 KLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIR 157 (474) T ss_pred ccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeE Confidence 99999999999999999999999997643 1 357889999999999999999999999999999999999988 Q ss_pred EEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccc Q lcl|NC_019916. 140 VSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFP 219 (513) Q Consensus 140 ~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 219 (513) +.+ ++|.+++|+||++. +++++||+|...+.. .....+++++||+..+++|.....+ .+......+|++|.|| T Consensus 158 ~~~-i~p~~~~~v~d~~~--~~~~~i~~~~~~~~~--~~~~~~~~~~y~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vP 230 (474) T protein:vir:94 158 IKN-IDPYNVIFVGDNIL--EPTYSLRYFYEKDDD--NGTDYVYAEFYDNAYYYVFRGEGID--ALQEVGRYEHLFDYNP 230 (474) T ss_pred EEE-EcccceEEEEcCCC--ceEEEEEEEEEeeCC--CceEEEEEEEEcCceEEEEeecCCC--cccccccccCCCCccc Confidence 764 89999999998754 588999999876533 3456778999999999998764332 2345566789999999 Q ss_pred eEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 220 MIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 220 vv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) ||+|+|+.+|+|+|+++++|||+||.++|++++.+++|++|+++++|+..... . T Consensus 231 vv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~---------------~----------- 284 (474) T protein:vir:94 231 LFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEE---------------M----------- 284 (474) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCch---------------h----------- Confidence 99999999999999999999999999999999999999999999999743211 0 Q ss_pred hhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHH Q lcl|NC_019916. 300 LEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGT 379 (513) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l 379 (513) ...+...+++.+ .+.+++++|++|+.+.+++++++++|.++||.+|++|++++++++||+||+||++++++| T Consensus 285 ~~~~~~~~~i~~--------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 356 (474) T protein:vir:94 285 IQETQKSGAFEL--------FDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMAL 356 (474) T ss_pred hhhhhhcceeEe--------cCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHH Confidence 111233444444 345789999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCC Q lcl|NC_019916. 380 VELASTKRKQFERGLNQRYTVVAHIEERVNGK-WDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVT 458 (513) Q Consensus 380 ~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~ 458 (513) .+||.++++.|+.+|++++++|+++++..+.. .++++.+++++|++++|.|+++.|++++|++|++|+||+++++|+|+ T Consensus 357 ~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~ 436 (474) T protein:vir:94 357 ENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVD 436 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCC Confidence 99999999999999999999999999887543 45677899999999999999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 459 DADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 459 D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ||++|++||++|+++..+..........++..+ +. ++| T Consensus 437 d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~-----~~------------~s~ 474 (474) T protein:vir:94 437 DVDYELDEMEKESLEFNDKLPDIDEGDANDKSQ-----NN------------QSE 474 (474) T ss_pred CHHHHHHHHHHHHHHHHhhcccccCCCcCCCCc-----cc------------cCC Confidence 999999999999877655432221111111110 00 011 No 29 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=1.3e-96 Score=546.29 Aligned_cols=426 Identities=25% Similarity=0.362 Sum_probs=364.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcH Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSS 97 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~ 97 (513) ||+++|.++|++|. .+.+||+++++||+|+|+|+.+.. ....++++||++||+++||++.++||+|+|++++++++ T Consensus 1 l~~~~l~~~i~~~~-~~~~r~~~l~~yy~g~~~il~~~~---~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~ 76 (429) T protein:vir:98 1 MTKDLLSELIQKHR-SFNLSYSAYKQLYEGDHAILQQKQ---KEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENK 76 (429) T ss_pred CCHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccc---cccCCCcceeecchHHHHHHHHhhhhcccCceeecCCh Confidence 99999999999985 557999999999999999986654 34567899999999999999999999999999988765 Q ss_pred ---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccc Q lcl|NC_019916. 98 ---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV 174 (513) Q Consensus 98 ---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~ 174 (513) +.+++||+.|+++.++.+++++++++|+||++||++++|.+++.+ ++|.+++|+||+...+++++++|+|.... T Consensus 77 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~-~~p~~~~~v~dd~~~~~~~~~i~~~~~~~-- 153 (429) T protein:vir:98 77 QVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITY-LTPLEAFIVYDDSIRQKPLFAVRYFYNKG-- 153 (429) T ss_pred HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEE-EcccceEEEEeCCCCCceEEEEEEEEecC-- Confidence 468999999999999999999999999999999999999988764 89999999999988889999999986432 Q ss_pred cccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 175 DNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYM 254 (513) Q Consensus 175 ~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~ 254 (513) ...+.++|+.+.+++|..... .....+..+|++|+||||+|+|+.+|+|+|+++++|||+||+++|++++.+ T Consensus 154 -----~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~ 225 (429) T protein:vir:98 154 -----GVLEGSYSDASNITYFKDGEK---GIEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAISEKANDV 225 (429) T ss_pred -----ceEEEEEEeCceEEEEEecCC---ceEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 356678899999888865332 244556778999999999999999999999999999999999999999999 Q ss_pred HHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCC Q lcl|NC_019916. 255 TDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYD 334 (513) Q Consensus 255 ~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 334 (513) ++|++|+++++|...... ....+..++++.++.+ .+++++++|++|+.+ T Consensus 226 ~~~~~p~~~i~g~~~~~~--------------------------~~~~~~~~~~~~~~~~-----~~~~~~~~~l~~~~~ 274 (429) T protein:vir:98 226 EYFADAYLKILGAELDDE--------------------------TLKSLRDTRIINLKDT-----DAQQLTVEFLQKPDA 274 (429) T ss_pred HHhcCceeeeecCCCCcc--------------------------hhhhHhhCceeeccCC-----CCCCcceeEEeecCC Confidence 999999999999743211 1122344555555432 356789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_019916. 335 SAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDI 414 (513) Q Consensus 335 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~ 414 (513) .+++++++++|.++|+.+|++|+++++++ ||+||+||++++++|..||.++++.|+.+|++++++|+++++...+ .. T Consensus 275 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~--~~ 351 (429) T protein:vir:98 275 DATQEHLLDRLENLIFRTAMVANISDESF-GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIG--PK 351 (429) T ss_pred HHHHHHHHHHHHHHHHHHhCccccCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--cc Confidence 99999999999999999999999999887 7899999999999999999999999999999999999999866543 45 Q ss_pred ccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCC Q lcl|NC_019916. 415 DPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDP 494 (513) Q Consensus 415 ~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 494 (513) ++.+|+|+|++++|.|+++.|++++|++|++|.||+++++|+|+|+++|++||++|+++..+.... ....+.++.+. T Consensus 352 d~~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~---~~~~~~~~~~~ 428 (429) T protein:vir:98 352 DWIGIKYKFTRNLPANLLEESQIAGNLAGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAG---GLNGQNTTTIL 428 (429) T ss_pred ccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh---hhcCCCCCCCC Confidence 677899999999999999999999999999999999999999999999999999998876553221 22211111111 Q ss_pred C Q lcl|NC_019916. 495 E 495 (513) Q Consensus 495 ~ 495 (513) + T Consensus 429 ~ 429 (429) T protein:vir:98 429 E 429 (429) T ss_pred C Confidence 1 No 30 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=9e-97 Score=547.15 Aligned_cols=438 Identities=18% Similarity=0.221 Sum_probs=357.2 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc------------cCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASR------------RNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------------~~~~~~~~~ri~~n~~~~ivd~ 80 (513) ++.+ +..+.|.+++.+| ..++++++++++||+|+|+|++++... .....++++||++||+++||++ T Consensus 1 ~~~e-~~~~~i~~~~~~~-~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~ 78 (471) T protein:vir:10 1 MEIE-VIKKIISSQMVKH-GKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQ 78 (471) T ss_pred CCHH-HHHHHHHHHHHHH-HHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHh Confidence 3322 3355555555555 346789999999999999998764321 1223457899999999999999 Q ss_pred HHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC-CCceeEEEEEcccceEEEecCC Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP-SQKGEVSVKLDPMECFIIYDRS 156 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~-~~~~~~~~~~~p~~~~~~~d~~ 156 (513) .++|++|+||++++++++. ++.|+ .|+++.++.+++++++++|+||+++|+++ +|.+++. .++|++++|+||++ T Consensus 79 ~~~yl~G~p~~~~~~~~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~-~~~p~~~~~i~d~~ 156 (471) T protein:vir:10 79 KKAYALTYPPTFDVDDKKVNDMIVDVL-GDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYA-CVDSKEVIPIYSKS 156 (471) T ss_pred hhhhhcccCceeccCChHHHHHHHHHH-hcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEE-EEcccceEEEEcCC Confidence 9999999999999887653 55555 48999999999999999999999999985 5777665 48999999999999 Q ss_pred CCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-----------------ccccccccccccCcccc Q lcl|NC_019916. 157 VNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-----------------SVPTLEVAEHSAQFGFP 219 (513) Q Consensus 157 ~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~g~vP 219 (513) ..++++++||+|......+ .....++++||++.+++|.....+. .........+|+||.|| T Consensus 157 ~~~~~~~~ir~~~~~~~~~--~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 234 (471) T protein:vir:10 157 LDKKSIGVLRVYSSIDETD--GKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVP 234 (471) T ss_pred CCCceEEEEEEEEeeccCC--CceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCcee Confidence 8889999999998755433 3557789999999999987654431 11233455689999999 Q ss_pred eEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 220 MIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 220 vv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) ||+|+|+.+|.|+|+++++|||+||.++|++++.+++|++|+++++|+.+... +++. T Consensus 235 vv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~-------------~~~~---------- 291 (471) T protein:vir:10 235 FIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDK-------------QEFL---------- 291 (471) T ss_pred EEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccc-------------chhH---------- Confidence 99999999999999999999999999999999999999999999999753221 1111 Q ss_pred hhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHH Q lcl|NC_019916. 300 LEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGT 379 (513) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l 379 (513) ..+..++++.+.. .+.+.+++++|++|+++.++++.++++|.++||.+|++|+++++++ ||+||+||++++++| T Consensus 292 -~~~~~~~~i~~~~----~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~-gn~Sg~Alk~~~~~l 365 (471) T protein:vir:10 292 -EDLKRYKMIKMDN----DGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKL-GNSSGVALKFLYSLL 365 (471) T ss_pred -HHhhcCCeEEecC----CCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc-cCccHHHHHHHHHHH Confidence 2223344444432 2346778999999999999999999999999999999999999887 789999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCC Q lcl|NC_019916. 380 VELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTD 459 (513) Q Consensus 380 ~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D 459 (513) .+||.++++.|+++|++++++|+++++.. +..+++|+|++++|.|+++.|+++++++|++|.||+++++|+++| T Consensus 366 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~------d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~D 439 (471) T protein:vir:10 366 ELKAGNMETQFRSGYATLVKMILKHLGLS------DKLKIKQTWTRNSINNDTEMAQVVSTLATITSRENVAKSNPIVED 439 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccC------CCceeEEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCC Confidence 99999999999999999999999987542 456889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCC Q lcl|NC_019916. 460 ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSG 491 (513) Q Consensus 460 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 491 (513) |++|++||++|+++..+......+...+...+ T Consensus 440 ~~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 440 WQDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred HHHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 99999999999887655444332222221111 No 31 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=1.4e-96 Score=546.01 Aligned_cols=452 Identities=17% Similarity=0.204 Sum_probs=366.8 Q ss_pred Cccc-----hhhceeccCC-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc----ccCCCCCCcceee Q lcl|NC_019916. 1 MIDM-----QQANMNYQED-ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS----RRNEKGKADHRAV 70 (513) Q Consensus 1 ~~~~-----~~~~~~~~~~-~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----~~~~~~~~~~ri~ 70 (513) |-+. .--|..+.++ ..+++.+.|.++|++|. .+++|++++++||+|+|+|+.++.. ......++++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~ 79 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 79 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccchhhccccccccccccccc Confidence 2111 1122222233 34678999999999985 5678999999999999999876542 2234557889999 Q ss_pred cchhHHHHHHHHHHhhcCCeeecCCcHH---HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 71 HSFARYIADFQTSYSVGNAIAMSGPSSD---RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 71 ~n~~~~ivd~~~~~l~g~p~~~~~~~~~---~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) +||+++||++.++||+|+|+++++++++ .+++|+ .|+++.++.+++++++++|+||++||.+++|++++.+ ++|. T Consensus 80 ~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~-~~p~ 157 (472) T protein:vir:93 80 TNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFR-VPAE 157 (472) T ss_pred cchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEE-Eccc Confidence 9999999999999999999999888764 456666 4899999999999999999999999999999988764 8999 Q ss_pred ceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-------ccccccccccccCcccce Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-------SVPTLEVAEHSAQFGFPM 220 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-------~~~~~~~~~~~~~g~vPv 220 (513) +++|+||++..+++.+++|+|..++ ..++++|++..+++|......- ...+.....+|+||.||| T Consensus 158 ~~~~i~d~~~~~~~~~~ir~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv 229 (472) T protein:vir:93 158 QGIPIWTDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPF 229 (472) T ss_pred ceEEEEcCCCCCceEEEEEEEEeec--------ceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcce Confidence 9999999988899999999997643 2357899999988876543321 112334556899999999 Q ss_pred EEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhh Q lcl|NC_019916. 221 IEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQL 300 (513) Q Consensus 221 v~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 300 (513) |+|+|+.+|+|+|+++++|||+||+++|++++.+++|++|+++++|+...... ++.. T Consensus 230 v~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~-------------~~~~---------- 286 (472) T protein:vir:93 230 IPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELP-------------EFKR---------- 286 (472) T ss_pred EEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccch-------------hhHH---------- Confidence 99999999999999999999999999999999999999999999997543211 1111 Q ss_pred hcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHH Q lcl|NC_019916. 301 EAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTV 380 (513) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 380 (513) .....+++. ..++++++|++|+++.+++++++++|+++||.+|++|+++++.++||+||+||++++.+|+ T Consensus 287 -~~~~~~~~~---------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 356 (472) T protein:vir:93 287 -LLRYYGAIK---------VSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLN 356 (472) T ss_pred -HHhhccccc---------cCCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHH Confidence 111222222 2457899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCH Q lcl|NC_019916. 381 ELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDA 460 (513) Q Consensus 381 ~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~ 460 (513) .||+++++.|+++|++++++|+++++. ..++.+++|+|++++|.|+++.|++++|++|++|.||+++++|+++|+ T Consensus 357 ~ka~~~~~~~~~~l~~~~~li~~~~~~-----~~~~~~i~v~f~~~~p~~~~~~~~~~~k~~giis~et~l~~l~~~~d~ 431 (472) T protein:vir:93 357 LKADKLARKAKVAIQELLWFVFEHFDI-----KGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDL 431 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCH Confidence 999999999999999999999988753 235668999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 461 DEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 461 ~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) ++|++||++|+++.....+...+..+++..+++++++..++ T Consensus 432 ~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 432 QAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred HHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCCCcccCC Confidence 99999999999887776655544444333332222222222 No 32 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=2e-96 Score=545.22 Aligned_cols=452 Identities=16% Similarity=0.205 Sum_probs=367.6 Q ss_pred Cccc--------hhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc----CCCCCCcce Q lcl|NC_019916. 1 MIDM--------QQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRR----NEKGKADHR 68 (513) Q Consensus 1 ~~~~--------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~----~~~~~~~~r 68 (513) |++. .+.-+.+..+..+.+.++|.++|++|. .+.++++++++||+|+|++++++.+.. ....++++| T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHK-ENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWR 79 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCchhccccccccccccccccccce Confidence 6554 122233455666789999999999985 567899999999999999887654332 234568899 Q ss_pred eecchhHHHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEc Q lcl|NC_019916. 69 AVHSFARYIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLD 145 (513) Q Consensus 69 i~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~ 145 (513) |++||+++||++.++||+|+||++++++++. +++++ .|+++.++.+++++++++|+||+++|.+++|++++.+ ++ T Consensus 80 i~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~-~~ 157 (478) T protein:vir:10 80 MYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTL-NHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFR-VP 157 (478) T ss_pred eccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHH-hcCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEE-Ec Confidence 9999999999999999999999999888765 44555 3889999999999999999999999999999988765 89 Q ss_pred ccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCc-----------ccccccccccc Q lcl|NC_019916. 146 PMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS-----------VPTLEVAEHSA 214 (513) Q Consensus 146 p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~-----------~~~~~~~~~~~ 214 (513) |.+++|+||++..+++.+++|+|.... ..++++||++.+++|+...+... ........+|+ T Consensus 158 p~~~~~i~d~~~~~~~~~~v~~~~~~~--------~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (478) T protein:vir:10 158 AEQAVPIWTNKERDELQAFIRVYELDG--------AERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMS 229 (478) T ss_pred ccceEEEEcCCCCCceEEEEEEEEecC--------ceEEEEEeCCeEEEEEEcCCeeeccccccccccccceeccccccc Confidence 999999999988889999999997543 34689999999988876443211 12234556899 Q ss_pred CcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccc Q lcl|NC_019916. 215 QFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLAD 294 (513) Q Consensus 215 ~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 294 (513) +|.||||+|+|+.+|+|+|+++++|||+||.++|++++.+++|++|+++++|+...... ++.. T Consensus 230 ~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~-------------~~~~---- 292 (478) T protein:vir:10 230 WGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMK-------------DFMH---- 292 (478) T ss_pred CCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc-------------hhhh---- Confidence 99999999999999999999999999999999999999999999999999997543211 1111 Q ss_pred ccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHH Q lcl|NC_019916. 295 EKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKY 374 (513) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~ 374 (513) .+...+++.+. ..++++++|++++++.++++.++++|.+.||.+|++|++++++++||+||+||++ T Consensus 293 -------~~~~~~~~~~~-------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~ 358 (478) T protein:vir:10 293 -------NLKYYKAISVA-------GESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKF 358 (478) T ss_pred -------hhhhcceEEec-------CCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHH Confidence 12223344332 1356899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhC Q lcl|NC_019916. 375 KVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYL 454 (513) Q Consensus 375 ~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l 454 (513) ++++|.+||+++++.|+++|++++++|+++++ .+++..+|+|+|++++|.|+++.|++++|++|++|+||+++++ T Consensus 359 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g-----~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l 433 (478) T protein:vir:10 359 MYSNLDLKANKLKNKTLTALQELLQYIIDFYR-----LDVKVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILSNH 433 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccccceEEecCCCCCCHHHHHHHHHHHhCCCChHHHHHhC Confidence 99999999999999999999999999998863 3567788999999999999999999999999999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 455 PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 455 ~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) |+++|+++|++||++|+++..+........... ..+.+++ .++.+ T Consensus 434 ~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~~-----------~~~~~ 478 (478) T protein:vir:10 434 AWVEDPVAEMERIEQENIELNQQLPDIEEGLNG-EQQRQSE-----------NNQPE 478 (478) T ss_pred CCCCCHHHHHHHHHHHHHHHHhhccccccccCC-CCCCCCC-----------CCCCC Confidence 999999999999999987765544332221111 1111111 11111 No 33 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=2.3e-96 Score=544.86 Aligned_cols=452 Identities=16% Similarity=0.200 Sum_probs=367.1 Q ss_pred Cccch----h----hceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc----cCCCCCCcce Q lcl|NC_019916. 1 MIDMQ----Q----ANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASR----RNEKGKADHR 68 (513) Q Consensus 1 ~~~~~----~----~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~----~~~~~~~~~r 68 (513) |++.. + .-+....+..+.+.++|.+++++|. .+++|++++.+||+|+|++++++... .....++++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHK-ENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWR 79 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHH-HHHHHHHHHHHHhcccccccccchhhhcccccccccccce Confidence 55541 1 1122344556788999999999985 56789999999999999998765432 2345678899 Q ss_pred eecchhHHHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEc Q lcl|NC_019916. 69 AVHSFARYIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLD 145 (513) Q Consensus 69 i~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~ 145 (513) |++||+++||++.++||+|+||++++++++. ++++++ |+++.++.+++++++++|+||++||.+++|++++.+ ++ T Consensus 80 i~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~-~~ 157 (478) T protein:vir:10 80 MYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFR-VP 157 (478) T ss_pred eccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEE-Ec Confidence 9999999999999999999999998887654 555554 899999999999999999999999999999988764 89 Q ss_pred ccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-----------cccccccccccc Q lcl|NC_019916. 146 PMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-----------SVPTLEVAEHSA 214 (513) Q Consensus 146 p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-----------~~~~~~~~~~~~ 214 (513) |.+++|+|+++..+++.+++|+|..++ ..++++||++.+++|+...... .........+|+ T Consensus 158 p~~~~~v~d~~~~~~~~~~ir~~~~~~--------~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (478) T protein:vir:10 158 AEQAVPIWTNKERDELQAFIRVYELDG--------AERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMS 229 (478) T ss_pred ccceEEEEcCCCCCceEEEEEEEeeeC--------ceEEEEEeCCcEEEEEecCCeeeccccccccccccceeccccccc Confidence 999999999988899999999997543 3468999999998886643321 112234456899 Q ss_pred CcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccc Q lcl|NC_019916. 215 QFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLAD 294 (513) Q Consensus 215 ~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 294 (513) +|+||||+|+|+.+|+|+|+++++|||+||.++|++++.+++|++|+++++|+...... ++... T Consensus 230 ~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~-------------~~~~~--- 293 (478) T protein:vir:10 230 WGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMK-------------DFMHN--- 293 (478) T ss_pred CCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCccccc-------------chhhh--- Confidence 99999999999999999999999999999999999999999999999999997543211 11111 Q ss_pred ccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHH Q lcl|NC_019916. 295 EKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKY 374 (513) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~ 374 (513) +...+++.+. ..++++++|++|+++.+++++++++|.+.||.+|++|++++++++||+||+||++ T Consensus 294 --------~~~~~~~~~~-------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~ 358 (478) T protein:vir:10 294 --------LKYYKAISVA-------GESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKF 358 (478) T ss_pred --------hhhCceeEec-------CCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHH Confidence 2223344332 2356889999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhC Q lcl|NC_019916. 375 KVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYL 454 (513) Q Consensus 375 ~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l 454 (513) ++++|++||.++++.|+++|++++++|+++++ ..+++.+|+|+|++++|.|+++.|+++++++|++|.||+++++ T Consensus 359 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-----~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~~g~iS~et~i~~~ 433 (478) T protein:vir:10 359 MYSNLDLKANKLKNKTLTALQELLQYIIDFYR-----LDVRVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILGNH 433 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccccceEEeCCCCCCCHHHHHHHHHHHhCCCChHHHHHhC Confidence 99999999999999999999999999998863 2567778999999999999999999999999999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 455 PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 455 ~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) |+|+|+++|++||++|+++.........+...+.+. ..++++..| T Consensus 434 ~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~--~~~~d~~~e 478 (478) T protein:vir:10 434 SWVQDPVAEMERIEQENIELNQQLPDIEEGLNDEQQ--RQSEDNQSE 478 (478) T ss_pred CCCCCHHHHHHHHHHHHHHHHHhccccCCCCccccc--ccCcCCCCC Confidence 999999999999999998765544332222221110 000111001 No 34 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=6e-96 Score=542.61 Aligned_cols=450 Identities=17% Similarity=0.195 Sum_probs=365.5 Q ss_pred Ccc----------chhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc----ccCCCCCCc Q lcl|NC_019916. 1 MID----------MQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS----RRNEKGKAD 66 (513) Q Consensus 1 ~~~----------~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----~~~~~~~~~ 66 (513) |++ .++....+... .+++.+.|.++++.|. .+.+|++++++||+|+|+|+.+... ......+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~i~~~~-~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQ-FETQEEMIVRLIDDHR-KQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPD 78 (474) T ss_pred CcccccccCCCchhhHHHHhhhhc-ccCHHHHHHHHHHHHH-HHHHHHHHHHHHhccccchhcccchhccccccccccCc Confidence 222 23333333333 4578899999999985 5689999999999999999865432 223456789 Q ss_pred ceeecchhHHHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 67 HRAVHSFARYIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 67 ~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) +||++||+++||++.++||+|+|+++++++++. ++.|+ .|+++.++.+++++++++|+||+++|++++|++++.+ T Consensus 79 ~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~- 156 (474) T protein:vir:94 79 WRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFR- 156 (474) T ss_pred ceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEE- Confidence 999999999999999999999999999887653 44444 5889999999999999999999999999999888765 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc-------ccccccccccCc Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV-------PTLEVAEHSAQF 216 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~-------~~~~~~~~~~~g 216 (513) ++|.+++|+||++..+++++++|+|.... ..++++||++.+++|+...++... .......+|++| T Consensus 157 ~~p~~~~~v~d~~~~~~~~~~ir~~~~~~--------~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g 228 (474) T protein:vir:94 157 VPAEQAIPIWVDKEREELKSFIRYYKFNN--------EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWG 228 (474) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEecC--------eEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCC Confidence 89999999999998899999999997543 346899999999998765443211 122334679999 Q ss_pred ccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccccc Q lcl|NC_019916. 217 GFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEK 296 (513) Q Consensus 217 ~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 296 (513) +||||+|+|+..|+|+|+++++|||+||+++|++++.+++|++|+++++|+.+.... ++. T Consensus 229 ~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~-------------~~~------- 288 (474) T protein:vir:94 229 RVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLE-------------EFM------- 288 (474) T ss_pred ccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch-------------hhh------- Confidence 999999999999999999999999999999999999999999999999997543211 111 Q ss_pred chhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHH Q lcl|NC_019916. 297 MAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKV 376 (513) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~ 376 (513) ..+...+++.+ ..+++++|++++.+.+++++++++|.+.||.+|++|++++++++||+||+||++++ T Consensus 289 ----~~~~~~~~i~~---------~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~ 355 (474) T protein:vir:94 289 ----RGLKYYKAINV---------DGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLY 355 (474) T ss_pred ----hhhhccceeec---------cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHH Confidence 11223344433 35678999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCC Q lcl|NC_019916. 377 LGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPN 456 (513) Q Consensus 377 ~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~ 456 (513) ++|++||.++++.|+++|++++++|+++++. ..++.+|+|+|++++|.|+++.|++++++ |++|+||+++++|+ T Consensus 356 ~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~-----~~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~ 429 (474) T protein:vir:94 356 GNLDLKANKLKNKATVAIQELISFIIDFNNL-----KTDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPL 429 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCC Confidence 9999999999999999999999999988643 35677899999999999999999999986 89999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 457 VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 457 v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) |+|+++|++||++|+++..+......+...++..+.++.++..+| T Consensus 430 v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 430 VDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred CCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 999999999999999877665544333333222221111111111 No 35 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=6e-96 Score=542.61 Aligned_cols=450 Identities=17% Similarity=0.195 Sum_probs=365.5 Q ss_pred Ccc----------chhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc----ccCCCCCCc Q lcl|NC_019916. 1 MID----------MQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS----RRNEKGKAD 66 (513) Q Consensus 1 ~~~----------~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----~~~~~~~~~ 66 (513) |++ .++....+... .+++.+.|.++++.|. .+.+|++++++||+|+|+|+.+... ......+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~i~~~~-~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQ-FETQEEMIVRLIDDHR-KQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPD 78 (474) T ss_pred CcccccccCCCchhhHHHHhhhhc-ccCHHHHHHHHHHHHH-HHHHHHHHHHHHhccccchhcccchhccccccccccCc Confidence 222 23333333333 4578899999999985 5689999999999999999865432 223456789 Q ss_pred ceeecchhHHHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 67 HRAVHSFARYIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 67 ~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) +||++||+++||++.++||+|+|+++++++++. ++.|+ .|+++.++.+++++++++|+||+++|++++|++++.+ T Consensus 79 ~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~- 156 (474) T protein:vir:97 79 WRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFR- 156 (474) T ss_pred ceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEE- Confidence 999999999999999999999999999887653 44444 5889999999999999999999999999999888765 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc-------ccccccccccCc Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV-------PTLEVAEHSAQF 216 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~-------~~~~~~~~~~~g 216 (513) ++|.+++|+||++..+++++++|+|.... ..++++||++.+++|+...++... .......+|++| T Consensus 157 ~~p~~~~~v~d~~~~~~~~~~ir~~~~~~--------~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g 228 (474) T protein:vir:97 157 VPAEQAIPIWVDKEREELKSFIRYYKFNN--------EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWG 228 (474) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEecC--------eEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCC Confidence 89999999999998899999999997543 346899999999998765443211 122334679999 Q ss_pred ccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccccc Q lcl|NC_019916. 217 GFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEK 296 (513) Q Consensus 217 ~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 296 (513) +||||+|+|+..|+|+|+++++|||+||+++|++++.+++|++|+++++|+.+.... ++. T Consensus 229 ~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~-------------~~~------- 288 (474) T protein:vir:97 229 RVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLE-------------EFM------- 288 (474) T ss_pred ccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch-------------hhh------- Confidence 999999999999999999999999999999999999999999999999997543211 111 Q ss_pred chhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHH Q lcl|NC_019916. 297 MAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKV 376 (513) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~ 376 (513) ..+...+++.+ ..+++++|++++.+.+++++++++|.+.||.+|++|++++++++||+||+||++++ T Consensus 289 ----~~~~~~~~i~~---------~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~ 355 (474) T protein:vir:97 289 ----RGLKYYKAINV---------DGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLY 355 (474) T ss_pred ----hhhhccceeec---------cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHH Confidence 11223344433 35678999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCC Q lcl|NC_019916. 377 LGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPN 456 (513) Q Consensus 377 ~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~ 456 (513) ++|++||.++++.|+++|++++++|+++++. ..++.+|+|+|++++|.|+++.|++++++ |++|+||+++++|+ T Consensus 356 ~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~-----~~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~ 429 (474) T protein:vir:97 356 GNLDLKANKLKNKATVAIQELISFIIDFNNL-----KTDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPL 429 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCC Confidence 9999999999999999999999999988643 35677899999999999999999999986 89999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 457 VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 457 v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) |+|+++|++||++|+++..+......+...++..+.++.++..+| T Consensus 430 v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 430 VDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred CCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 999999999999999877665544333333222221111111111 No 36 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=1.4e-95 Score=540.68 Aligned_cols=449 Identities=18% Similarity=0.220 Sum_probs=367.8 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccccccc------CCCCCCcceeecch Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNN-QRPRLEMLYDYYRGQNDGILSPASRR------NEKGKADHRAVHSF 73 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~------~~~~~~~~ri~~n~ 73 (513) +++..--+..+.+ .+..++.++|+.+... +.++++++++||+|+|++++++.... ....++++|+++|| T Consensus 6 ~~~~~~~~~~~~~----~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~ 81 (479) T protein:vir:79 6 ISETDLIKVQLKK----ESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNY 81 (479) T ss_pred ecccceEeecccc----CChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecch Confidence 4444444434433 3556666767666544 56889999999999999987754332 23456889999999 Q ss_pred hHHHHHHHHHHhhcCCeeecCCcHH--HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEE Q lcl|NC_019916. 74 ARYIADFQTSYSVGNAIAMSGPSSD--RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFI 151 (513) Q Consensus 74 ~~~ivd~~~~~l~g~p~~~~~~~~~--~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~ 151 (513) +++||++.++||+|+|++++++++. .+.+.|..|+++.++.+++++++++|+||++||.+++|.+++.+ ++|.+++| T Consensus 82 ~~~Ivd~~~~~l~g~p~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~-~~p~~~~~ 160 (479) T protein:vir:79 82 HKLLVDQKVGYSVGNPIVFNADDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVI-IPAEEAIP 160 (479) T ss_pred HHHHHHHHHhhhhcCCceeccCCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEE-EccceeEE Confidence 9999999999999999999887754 35556667999999999999999999999999999999988765 89999999 Q ss_pred EecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCc----------------cccccccccccC Q lcl|NC_019916. 152 IYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS----------------VPTLEVAEHSAQ 215 (513) Q Consensus 152 ~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~ 215 (513) +||+...+++++++|+|...+.++ +...++++|+++.+++|.....+.. ........+|+| T Consensus 161 v~d~~~~~~~~~~ir~y~~~~~~~---~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (479) T protein:vir:79 161 IWDSKRQRELVAFIRFYYIEDIDG---NKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGW 237 (479) T ss_pred EEeCCCCCceEEEEEEEEEeecCC---ceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCC Confidence 999998889999999998765443 3467899999999999876543321 112234568999 Q ss_pred cccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccc Q lcl|NC_019916. 216 FGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADE 295 (513) Q Consensus 216 g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 295 (513) |.||||+|+|+..|+|+|+++++|||+||.++|++++.+++|++|+++++|+....... + T Consensus 238 ~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~-------------~------- 297 (479) T protein:vir:79 238 GKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQE-------------F------- 297 (479) T ss_pred CcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccccc-------------c------- Confidence 99999999999999999999999999999999999999999999999999975432211 1 Q ss_pred cchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHH Q lcl|NC_019916. 296 KMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYK 375 (513) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~ 375 (513) ...+..++++.+ .++++++|++++.+.+++++++++|.++||.+|++|+++++++ ||+||+||+++ T Consensus 298 ----~~~~~~~~~i~~---------~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-gn~Sg~Ai~~~ 363 (479) T protein:vir:79 298 ----IDNIRYYKSIKV---------DGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNT-GDKSGVALKFL 363 (479) T ss_pred ----hhhhhhccceec---------CCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccc-cchhHHHHHHH Confidence 112233444443 3568899999999999999999999999999999999998876 78999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCC Q lcl|NC_019916. 376 VLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLP 455 (513) Q Consensus 376 ~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~ 455 (513) +++|.+||.++++.|+++|++++++++++++..+ ..+++..+++|+|++++|.|+++.|+++++++|++|.||+++++| T Consensus 364 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~g~iS~et~l~~l~ 442 (479) T protein:vir:79 364 YSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISG-NKSYDYKTVQITFNHSMIINEAEKIDMAAKSTGIVSDETIVSNHP 442 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CCccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCC Confidence 9999999999999999999999999999987654 457788899999999999999999999999999999999999999 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 456 NVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGV 499 (513) Q Consensus 456 ~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (513) +++|+++|++||++|+++..+..+..++...+ ..++. T Consensus 443 ~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-------~~~e~ 479 (479) T protein:vir:79 443 WVEDVNDELERLKKQEDTQKEYDDLIPNNQDG-------VIDET 479 (479) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhccCcccCC-------CcCcC Confidence 99999999999999998776655443321111 11111 No 37 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=4.1e-95 Score=538.03 Aligned_cols=467 Identities=18% Similarity=0.206 Sum_probs=368.2 Q ss_pred Cccc--------hhhceeccCCc---ccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc-------CCC Q lcl|NC_019916. 1 MIDM--------QQANMNYQEDA---DKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRR-------NEK 62 (513) Q Consensus 1 ~~~~--------~~~~~~~~~~~---~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~-------~~~ 62 (513) |.++ +..+..+.... .+.+.+.|.++|++| ++++++++++||.|+|+|+.++.... ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~---~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~ 77 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEH---NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDD 77 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhh---cHHHHHHHHHHhccccchhhccchhccccccccccc Confidence 3322 22233333332 244566777777765 46789999999999999887654332 234 Q ss_pred CCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHHH--HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeE Q lcl|NC_019916. 63 GKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDR--LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEV 140 (513) Q Consensus 63 ~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~--l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~ 140 (513) .++++|+++||+++||++.++|++|+|++++++++.. ..+.|..|+++.++.+++++++++|+||++||++++|++++ T Consensus 78 ~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i 157 (503) T protein:vir:59 78 TKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDY 157 (503) T ss_pred ccccceeecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEE Confidence 5678999999999999999999999999999887653 23334569999999999999999999999999999998887 Q ss_pred EEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc-----------ccccc Q lcl|NC_019916. 141 SVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV-----------PTLEV 209 (513) Q Consensus 141 ~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~-----------~~~~~ 209 (513) .+ ++|.+++|+||+...+++.++||+|.....+ .....++++||++.+++|.....+... ..... T Consensus 158 ~~-~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~---~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (503) T protein:vir:59 158 VI-FPAEEMIVVYKDNTRRDILFALRYYSYKGIM---GEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKG 233 (503) T ss_pred EE-EccceeEEEEeCCCCCceEEEEEEEEEecCC---CceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeec Confidence 64 8999999999999889999999999876543 345678999999999998765443211 12234 Q ss_pred cccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhh Q lcl|NC_019916. 210 AEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAM 289 (513) Q Consensus 210 ~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~ 289 (513) ..+|+++.||||+|+|+..|+|+|+++++|||+||.++|++++.+++|++|+++++|+.+... .++. T Consensus 234 ~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~-------------~~~~ 300 (503) T protein:vir:59 234 GQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENP-------------KEFT 300 (503) T ss_pred ceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCcccc-------------chhh Confidence 568999999999999999999999999999999999999999999999999999999754321 1111 Q ss_pred hccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccH Q lcl|NC_019916. 290 KKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG 369 (513) Q Consensus 290 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg 369 (513) .++ ...+++. ..++++++|++++++.++++.++++|+++|+.+|++|+++++.++||+|| T Consensus 301 ~~~-----------~~~~~~~---------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg 360 (503) T protein:vir:59 301 ANL-----------RYHSVIK---------VSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATG 360 (503) T ss_pred hhh-----------hccccee---------ccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccH Confidence 111 2223332 23568899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCH Q lcl|NC_019916. 370 VAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQ 447 (513) Q Consensus 370 ~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~ 447 (513) +||++++++|.+||+++++.|+.+|++++++|+++++...........+|+++|++++|.|+++.|++++++ +|++|. T Consensus 361 ~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~ 440 (503) T protein:vir:59 361 PALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSK 440 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCch Confidence 999999999999999999999999999999999999877655445567899999999999999999999998 689999 Q ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCC---CCCCCCCCCCCCCC Q lcl|NC_019916. 448 EYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGND---PEDEGVRGQQGEPE 507 (513) Q Consensus 448 et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 507 (513) ||+++++|+++||++|++||++|+++..+......+......++.. ..++..++++|... T Consensus 441 et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 441 ETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 9999999999999999999999988776655443332222111111 11111111122111 No 38 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=6.7e-95 Score=536.88 Aligned_cols=448 Identities=17% Similarity=0.200 Sum_probs=363.0 Q ss_pred Cccc-------hhhceeccCC-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc----CCCCCCcce Q lcl|NC_019916. 1 MIDM-------QQANMNYQED-ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRR----NEKGKADHR 68 (513) Q Consensus 1 ~~~~-------~~~~~~~~~~-~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~----~~~~~~~~r 68 (513) |+.. +..+.....+ ..+++.+.|.++|++|. .+.++++++++||+|+|+|+.++.... ....++++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHK-PKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWR 79 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHH-HHHHHHHHHHHHhccCCcchhccchhcccccccccccchh Confidence 4443 3333322222 34678899999999985 568999999999999999987764322 234578899 Q ss_pred eecchhHHHHHHHHHHhhcCCeeecCCcHH---HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEc Q lcl|NC_019916. 69 AVHSFARYIADFQTSYSVGNAIAMSGPSSD---RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLD 145 (513) Q Consensus 69 i~~n~~~~ivd~~~~~l~g~p~~~~~~~~~---~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~ 145 (513) |++||+++||++.++||+|+|++|++++++ .+++|++ |+++.++.+++++++++|+||++||++++|++++.+ ++ T Consensus 80 i~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~-~~ 157 (474) T protein:vir:96 80 MFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFR-VP 157 (474) T ss_pred cccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEE-Ec Confidence 999999999999999999999999987754 4666765 789999999999999999999999999999988764 89 Q ss_pred ccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCc-----------ccccccccccc Q lcl|NC_019916. 146 PMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS-----------VPTLEVAEHSA 214 (513) Q Consensus 146 p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~-----------~~~~~~~~~~~ 214 (513) |.++||+||++...++.+++|+|..+. ..++++||++.+++|........ ........+|+ T Consensus 158 p~~~~~v~d~~~~~~~~~~vr~~~~~~--------~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (474) T protein:vir:96 158 AEQAIPIWTNKERDTLKAFIRYYRLDG--------AERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVS 229 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEeecC--------ceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccC Confidence 999999999988889999999997543 33578999999998875443211 11223456899 Q ss_pred CcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccc Q lcl|NC_019916. 215 QFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLAD 294 (513) Q Consensus 215 ~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 294 (513) +|.||||+|+|+.+|+|+|+++++|||+||.++|++++.+++|++|+++++|+.+.... ++ T Consensus 230 ~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-------------~~------ 290 (474) T protein:vir:96 230 WGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLD-------------EF------ 290 (474) T ss_pred CCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCccccc-------------ch------ Confidence 99999999999999999999999999999999999999999999999999997542211 11 Q ss_pred ccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHH Q lcl|NC_019916. 295 EKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKY 374 (513) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~ 374 (513) ...+...+++.+ .+.+++|+|++|+++.+++++++++|.++||.+|++|++++++++||+||+||++ T Consensus 291 -----~~~~~~~~~i~~--------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~ 357 (474) T protein:vir:96 291 -----MRNLKYYKAINV--------DGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKF 357 (474) T ss_pred -----hhhhhcCceEEe--------cCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHH Confidence 112233444444 2457889999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhC Q lcl|NC_019916. 375 KVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYL 454 (513) Q Consensus 375 ~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l 454 (513) ++++|++||.++++.|+++|++++++|+++++ ..+++.+++|+|++++|.|+++.|+++.+ +|++|+||+++++ T Consensus 358 ~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~-----~~~~~~~i~i~f~~~~p~~~~e~~~~~~~-ag~iS~et~~~~~ 431 (474) T protein:vir:96 358 MYSNLDLKANKLKNKTLTALQELLQYIIDFYK-----LNIKVQDVEITFNFNVMVNELEQSQIGVQ-SQYLSKETVVTNH 431 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccceeeEEeccCCCcCHHHHHHHHHh-cCCCchHHHHHhC Confidence 99999999999999999999999999998863 34667789999999999999999998865 6999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 455 PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 455 ~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) |+|+|+++|++||++|+++..+......+.... +..++++++| T Consensus 432 ~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~~~~----------------~~~d~~~e~~ 474 (474) T protein:vir:96 432 PWVDDPVAELERIEQDNIDFNKQLPPLEGDANG----------------RAQDNESETN 474 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHhccccccccccc----------------ccCCCcccCC Confidence 999999999999999988765543322111100 0111122222 No 39 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=1.3e-94 Score=535.32 Aligned_cols=442 Identities=17% Similarity=0.206 Sum_probs=367.1 Q ss_pred Cccc----hhh----ceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc----CCCCCCcce Q lcl|NC_019916. 1 MIDM----QQA----NMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRR----NEKGKADHR 68 (513) Q Consensus 1 ~~~~----~~~----~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~----~~~~~~~~r 68 (513) |++. .+. -+..+.+..+++.+.|.++|+.|. .+.++++++++||+|+|++++++.... ....++++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHK-ENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWR 79 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccccc Confidence 5543 111 133334456889999999999985 567899999999999999987755432 234568899 Q ss_pred eecchhHHHHHHHHHHhhcCCeeecCCcHH---HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEc Q lcl|NC_019916. 69 AVHSFARYIADFQTSYSVGNAIAMSGPSSD---RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLD 145 (513) Q Consensus 69 i~~n~~~~ivd~~~~~l~g~p~~~~~~~~~---~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~ 145 (513) |++||++.||++.++||+|+||++++++++ .++++|+ |+++..+.+++++++++|+||++||++++|.+++.+ ++ T Consensus 80 i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~-~~ 157 (468) T protein:vir:96 80 MYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFR-VP 157 (468) T ss_pred cccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEE-Ec Confidence 999999999999999999999999887754 4667775 889999999999999999999999999999888764 89 Q ss_pred ccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC-----------cccccccccccc Q lcl|NC_019916. 146 PMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-----------SVPTLEVAEHSA 214 (513) Q Consensus 146 p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-----------~~~~~~~~~~~~ 214 (513) |.+++|+|+++..+++++++|+|..+. ..++++|+++.+++|....... .........+|+ T Consensus 158 p~~~~~v~~~~~~~~~~~~ir~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (468) T protein:vir:96 158 AEQAIPIWTNKERDELKAFIRLYELDG--------GERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMS 229 (468) T ss_pred ccceEEEEcCCCCCceEEEEEEEEecC--------ceEEEEEeCCeEEEEEEcCCceeecccccccccccceeecccccc Confidence 999999999988889999999997543 3467899999998887654321 122344566899 Q ss_pred CcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccc Q lcl|NC_019916. 215 QFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLAD 294 (513) Q Consensus 215 ~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 294 (513) +|+||||+|+|+.+|+|+|+++++|||+||.++|++++.+++|++|+++++|+..... +++ T Consensus 230 ~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~-------------~~~------ 290 (468) T protein:vir:96 230 WNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDL-------------EEF------ 290 (468) T ss_pred CCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccc-------------chh------ Confidence 9999999999999999999999999999999999999999999999999999754211 111 Q ss_pred ccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHH Q lcl|NC_019916. 295 EKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKY 374 (513) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~ 374 (513) ...+..++++.+.. .++++++|++|+.+.+++++++++|.++||.+|++|++++++++||+||+||++ T Consensus 291 -----~~~~~~~~~i~~~~-------d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~ 358 (468) T protein:vir:96 291 -----MYNLKYYKAINVDG-------DGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKF 358 (468) T ss_pred -----hhhhhcCceEEecC-------CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHH Confidence 11223344444432 346789999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhC Q lcl|NC_019916. 375 KVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYL 454 (513) Q Consensus 375 ~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l 454 (513) ++++|.+||.++++.|+++|++++++|+++++ ..+++.+++|+|++++|.|+++.|+++++ +|++|+||+++++ T Consensus 359 ~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g-----~~~d~~~i~i~f~~~~p~d~~e~a~~~~~-~g~iS~et~i~~l 432 (468) T protein:vir:96 359 MYSNLDLKANKLKNKTLTALQELLQYIIDFYK-----LSIKVQDVEITFNFNVMVNELEQSQIGVN-SQYLSKETVVTNH 432 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccceeeEEecCCCCcCHHHHHHHHHh-cCCCchHHHHHhC Confidence 99999999999999999999999999998863 34677789999999999999999999877 4999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCC Q lcl|NC_019916. 455 PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTS 490 (513) Q Consensus 455 ~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 490 (513) |+++||++|++||++|+++..+..+.+.+...+.++ T Consensus 433 ~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 433 PWVDDPVAEMERIDQEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhccCCCCCCCCC Confidence 999999999999999998877665554443333322 No 40 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=1.7e-94 Score=534.61 Aligned_cols=451 Identities=17% Similarity=0.200 Sum_probs=364.4 Q ss_pred Cccchh------h---ceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc----cCCCCCCcc Q lcl|NC_019916. 1 MIDMQQ------A---NMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASR----RNEKGKADH 67 (513) Q Consensus 1 ~~~~~~------~---~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~----~~~~~~~~~ 67 (513) |+++=. . -+....+..+++++.|.++|++|. .+.+|++++++||.|+|+|+++.... .....++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHR-KQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDW 79 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHH-HHHHHHHHHHHHhcccCchhccccccccccccccccccc Confidence 443211 0 112223345688999999999874 67889999999999999998765432 234467889 Q ss_pred eeecchhHHHHHHHHHHhhcCCeeecCCcHH---HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEE Q lcl|NC_019916. 68 RAVHSFARYIADFQTSYSVGNAIAMSGPSSD---RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKL 144 (513) Q Consensus 68 ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~---~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~ 144 (513) ||++||+++||++.++||+|+||+++++++. .++.|+ .|+++.++.+++++++++|+||++||++++|++++.+ + T Consensus 80 ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~-~ 157 (474) T protein:vir:95 80 RITTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFR-V 157 (474) T ss_pred eeccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHH-hccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEE-E Confidence 9999999999999999999999999988764 355555 4889999999999999999999999999999888764 8 Q ss_pred cccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc-------ccccccccccCcc Q lcl|NC_019916. 145 DPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV-------PTLEVAEHSAQFG 217 (513) Q Consensus 145 ~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~-------~~~~~~~~~~~g~ 217 (513) +|.+++|+|+++..+++.+++|+|.... ..++++||++.+++|+....+-.. .......+|++|. T Consensus 158 ~p~~~~~v~d~~~~~~~~~~i~~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 229 (474) T protein:vir:95 158 PAEQAIPIWVDKEREELKSFIRYYKFNN--------EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGR 229 (474) T ss_pred cccceEEEEcCCCCCceEEEEEEEEEcC--------eeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCc Confidence 9999999999988889999999997543 346899999999998765443211 1233446799999 Q ss_pred cceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccc Q lcl|NC_019916. 218 FPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKM 297 (513) Q Consensus 218 vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 297 (513) ||||+|+|+..|+|+|+++++|||+||.++|++++.+++|++|+++++|+.+.... ++. T Consensus 230 iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-------------~~~-------- 288 (474) T protein:vir:95 230 VPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLE-------------EFM-------- 288 (474) T ss_pred cceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch-------------hhh-------- Confidence 99999999999999999999999999999999999999999999999997543211 111 Q ss_pred hhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHH Q lcl|NC_019916. 298 AQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVL 377 (513) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 377 (513) ..+...+++.+ ..+++++|++++++.++++.++++|.++||.+|++|++++++++||+||+||+++++ T Consensus 289 ---~~~~~~~~i~~---------~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~ 356 (474) T protein:vir:95 289 ---RGLKYYKAINV---------DGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYG 356 (474) T ss_pred ---hhhhccceeec---------cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHH Confidence 11222334333 356889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCC Q lcl|NC_019916. 378 GTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNV 457 (513) Q Consensus 378 ~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v 457 (513) +|+.||+++++.|+++|++++++|+++++. ..++.+++|+|++++|.|+++.|++++++ |++|.||++.++|++ T Consensus 357 ~l~~k~~~k~~~~~~~l~~~~~li~~~~g~-----~~d~~~i~v~f~~~~p~d~~e~a~~~~~~-g~iS~et~i~~l~~v 430 (474) T protein:vir:95 357 NLDLKANKLKNKATVAIQELIGFIIDFNNL-----KMDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLV 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeEEeccCCCcCHHHHHHHHHhc-CCCchHHHHHhCCCC Confidence 999999999999999999999999988643 45778899999999999999999999985 999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 458 TDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 458 ~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +|+++|++||++|+++..+......+...+...+.+.+++. ++| T Consensus 431 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~------------~~~ 474 (474) T protein:vir:95 431 DDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDK------------ESE 474 (474) T ss_pred CCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccC------------CCC Confidence 99999999999998877665544333322222221111111 111 No 41 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=2.7e-94 Score=533.56 Aligned_cols=468 Identities=16% Similarity=0.124 Sum_probs=348.0 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccc-------cCCCCCCcceeecc Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHH-YNNQRPRLEMLYDYYRGQNDGILSPASR-------RNEKGKADHRAVHS 72 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~-~~~~~~~~~~~~~YY~G~~~i~~~~~~~-------~~~~~~~~~ri~~n 72 (513) ||+. ...+..+ ...+.|.+.+..| ..+++++++++++||+|+|+|++++... ..+..++++||++| T Consensus 1 ~~~~-----~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~n 74 (537) T protein:vir:78 1 MTSP-----LLNKPID-QLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHG 74 (537) T ss_pred CCcc-----cccccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccc Confidence 4432 1111211 2234454544444 4567899999999999999998876543 23455789999999 Q ss_pred hhHHHHHHHHHHhhcCCeeecCCcH--H----HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcc Q lcl|NC_019916. 73 FARYIADFQTSYSVGNAIAMSGPSS--D----RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDP 146 (513) Q Consensus 73 ~~~~ivd~~~~~l~g~p~~~~~~~~--~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p 146 (513) |+++||++.++||+|+||+|+++++ + .++.++ .|+++.++.+++++++++|+||+++|.+++|+.++.+ ++| T Consensus 75 f~k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~-i~p 152 (537) T protein:vir:78 75 FFTELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQEYF-DEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQT-VDG 152 (537) T ss_pred hHHHHHHHHhhhhcccCceeecCcchhHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEE-Ecc Confidence 9999999999999999999987643 2 234444 4889999999999999999999999999999988764 899 Q ss_pred cceEEEecCCCCcceEEEEEEEeecccc--cccceeEEEEEEEcCCcEEEEEeeccCCc--------------------- Q lcl|NC_019916. 147 MECFIIYDRSVNPKPIMAVRYHAVQTVV--DNITQTKYEVETWTENDYTRYKPIVVAGS--------------------- 203 (513) Q Consensus 147 ~~~~~~~d~~~~~~~~~~ir~~~~~~~~--~~~~~~~~~ve~yt~~~~~~~~~~~~~~~--------------------- 203 (513) .++||+||+.. ++.+++|+|...... ......++++++||++.+++|....++.. T Consensus 153 ~~~~pv~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 230 (537) T protein:vir:78 153 LTLIPVFDDYG--VLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIE 230 (537) T ss_pred ceeEEEEcCCC--CceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecc Confidence 99999999754 677888888665433 23446788999999999999976544321 Q ss_pred --------cccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccc Q lcl|NC_019916. 204 --------VPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDS 275 (513) Q Consensus 204 --------~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~ 275 (513) ........+|+||.||||+|+||.+|+|+|+++++|||+||.++|+++|.+++|++|+++++|+.+.. T Consensus 231 ~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~---- 306 (537) T protein:vir:78 231 ESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDS---- 306 (537) T ss_pred ccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCcc---- Confidence 11223445799999999999999999999999999999999999999999999999999999975421 Q ss_pred cccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_019916. 276 TLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHT 355 (513) Q Consensus 276 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~ 355 (513) .+++..++ +..+++.+ .+.+++|+|++|+++.+++++++++|.++||.+|++ T Consensus 307 ---------~~~~~~~l-----------~~~~~i~v--------~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~ 358 (537) T protein:vir:78 307 ---------TDKLRQNI-----------KAKKMIGV--------NGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMG 358 (537) T ss_pred ---------chhHHHHH-----------hhcCceee--------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCC Confidence 11222222 22333332 246789999999999999999999999999999999 Q ss_pred cccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHH Q lcl|NC_019916. 356 PDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAII 435 (513) Q Consensus 356 p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a 435 (513) |+.+. .++||+||+||++++++|++||.++++.|+++|++++++|+++++..+ ...+++.+|+++|++++|.|+++.| T Consensus 359 ~~~~~-~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~-~~~~d~~~i~i~f~~~~P~n~~e~a 436 (537) T protein:vir:78 359 FNSTA-VGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRG-LGEYDSNDICFEIEPHVLANELDIA 436 (537) T ss_pred CCCcc-ccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CcccccceeeEEeccCCCCCHHHHH Confidence 99765 467899999999999999999999999999999999999999987654 4577889999999999999999999 Q ss_pred HHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCC-CC----CCCCCCCCCCCCCCCCC Q lcl|NC_019916. 436 TALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGT-SG----NDPEDEGVRGQQGEPED 508 (513) Q Consensus 436 ~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~----~~~~~~~~~~~~~~~~~ 508 (513) ++++++ .|++|+||+++++|+|+|++.| +++++|.+...+...........+. .. ....+....+...+|.+ T Consensus 437 ~~~~~l~~~giiS~eT~l~~~p~vdd~e~e-k~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 515 (537) T protein:vir:78 437 TTRKTEAETEALKIGNIMTVAPRIGDDETL-KLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVD 515 (537) T ss_pred HHHHHHHhcCcchHHHHHHhCCCCCCHHHH-HHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCC Confidence 999987 4899999999999999998433 3334343322222111111100000 00 00000111111111122 Q ss_pred --ccCCC Q lcl|NC_019916. 509 --ERTSD 513 (513) Q Consensus 509 --~~~~~ 513 (513) +.+-| T Consensus 516 ~~~~~~~ 522 (537) T protein:vir:78 516 PNQPVAD 522 (537) T ss_pred ccCCCCC Confidence 22222 No 42 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=1.2e-81 Score=464.28 Aligned_cols=457 Identities=13% Similarity=0.080 Sum_probs=339.3 Q ss_pred eeccCCc---ccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHh Q lcl|NC_019916. 9 MNYQEDA---DKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYS 85 (513) Q Consensus 9 ~~~~~~~---~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 85 (513) ..++.+. .+++...+..|+++| ..+.+|++++++||+|+|++.+...... ...+++++++||+++||++.++|| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~-~~~~~r~~~~~~YY~G~~~i~~~~~~~~--~~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAF-EDQNQNLRSNTSYYEAERRPEAIGVTVP--VQMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHH-HHHHHHHHHHHHHHhccCchhhcCcccc--hhhhhhhhccchHHHHHHHHhhhh Confidence 4445443 245666667777776 5667999999999999998865443322 233577899999999999999999 Q ss_pred hcCCeeecCCc--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc--------eeEEEEEcccceEEEecC Q lcl|NC_019916. 86 VGNAIAMSGPS--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK--------GEVSVKLDPMECFIIYDR 155 (513) Q Consensus 86 ~g~p~~~~~~~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~--------~~~~~~~~p~~~~~~~d~ 155 (513) ++++++...++ ++.++++|+.|+|+.++.+++++++++|+||++||.++++. +++. .++|.+++++||+ T Consensus 78 ~~~g~~~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~-~~~p~~~~~i~D~ 156 (485) T protein:vir:24 78 AVEGFRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIR-VEPPTRMYAEIDP 156 (485) T ss_pred ccCceecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEE-EeccceeEEEeeC Confidence 99999976543 35699999999999999999999999999999999997643 3333 5899999999998 Q ss_pred CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCC Q lcl|NC_019916. 156 SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQ 230 (513) Q Consensus 156 ~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~ 230 (513) .. +++.+++++|... ......++++|+++.+++|....++ +......+|+||.||||+|+|+. +|+ T Consensus 157 ~~-~~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~~~~~---~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~ 227 (485) T protein:vir:24 157 RI-GRPAKAIRVAYDA-----EGNEIQAATLYTPNETFGWFRAEGE---WVEWFSDPHGLGAVPVVPLPNRTRLSDLYGT 227 (485) T ss_pred Cc-CceeEEEEEEEee-----cCCeEEEEEEEcCCcEEEEEecCCc---eEeecccccCCCcccEEEeccCcccCCcCCc Confidence 76 5677777766532 1234667899999999988764332 33445678999999999999985 578 Q ss_pred cchh-HHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhccee Q lcl|NC_019916. 231 GDFE-NVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMI 309 (513) Q Consensus 231 sd~e-~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 309 (513) |+++ .|++|||+||+++|++++.+++|++|+++++|.......... +.....+ ....+.++ T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~-----------------~~~~~~~-~~~~~~i~ 289 (485) T protein:vir:24 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDP-----------------ETGQTLF-DAYLARIL 289 (485) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCcccccccc-----------------ccccchh-hhccccee Confidence 8988 599999999999999999999999999999997543221100 0000001 11111222 Q ss_pred eccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc----ccHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 310 LLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN----SSGVAMKYKVLGTVELAST 385 (513) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n----~Sg~Ai~~~~~~l~~k~~~ 385 (513) . .+++++++.+ .+.++.++++++|+..|+.++++|++++..|+++ +||+||++++.+|++||++ T Consensus 290 ~----------~~~~~~~~~q--~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~ 357 (485) T protein:vir:24 290 A----------FEDAEGKIQQ--FSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVER 357 (485) T ss_pred c----------cCCCCceEEe--ecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHH Confidence 2 2345566644 4556788999999999999999888887777653 7999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCCCCHH Q lcl|NC_019916. 386 KRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNVTDAD 461 (513) Q Consensus 386 ~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~ 461 (513) +++.|+++|++++++++.+.+. .....+...|+++|+++.|+|+++.|++++|+. |++|+||+++++||+.|+. T Consensus 358 ~~~~f~~~l~~~~~l~~~~~~~--~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~ 435 (485) T protein:vir:24 358 KNAIFGGAWEEAMRLAYRLMKG--GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAER 435 (485) T ss_pred HHHHHHHHHHHHHHHHHHHhcC--CCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHH Confidence 9999999999999999887653 234567789999999999999999999999973 5799999999999999988 Q ss_pred HHHHHHHHHHHHH-HHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 462 EIVKMMDKQRKAM-LKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 462 ~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) +|++++++|+.+. ....+.+.+.....+...++++.........+++.. T Consensus 436 ~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 436 EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred HHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 8999888776543 223344444333322222222222222222233333 No 43 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=3.2e-81 Score=461.88 Aligned_cols=455 Identities=12% Similarity=0.037 Sum_probs=335.5 Q ss_pred eeccCCc---ccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHh Q lcl|NC_019916. 9 MNYQEDA---DKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYS 85 (513) Q Consensus 9 ~~~~~~~---~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 85 (513) +.++.+. .+...++|.+++.+| ..+++|++++.+||+|+|++.+..... + ....+.++++||+++||++.++|| T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~-~~~~~r~~~l~~YY~G~~~i~~~~~~~-~-~~~~~~~~v~n~~~~iVd~~~~~l 77 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAF-EDASKDLASNTSYYDAERRPEAIGVTV-P-REMQQLLAHVGYPRLYVDSVAERQ 77 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHH-HHHHHHHHHHHHHhcccCcchhccccc-c-hhHhhhhhccchHHHHHHHHHhhh Confidence 3333332 122234566676665 567799999999999999986554332 2 222355788999999999999999 Q ss_pred hcCCeeecCCc--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc------ee-EEEEEcccceEEEecCC Q lcl|NC_019916. 86 VGNAIAMSGPS--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK------GE-VSVKLDPMECFIIYDRS 156 (513) Q Consensus 86 ~g~p~~~~~~~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~------~~-~~~~~~p~~~~~~~d~~ 156 (513) .+.+++...++ +..++++|+.|+++.++.+++++++++|+||++||.++.+. +. .+..++|.+++++||+. T Consensus 78 ~~~g~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~ 157 (486) T protein:vir:42 78 AVEGFRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPR 157 (486) T ss_pred cccceecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCC Confidence 99998876543 35689999999999999999999999999999999987442 22 22347999999999986 Q ss_pred CCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCc Q lcl|NC_019916. 157 VNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQG 231 (513) Q Consensus 157 ~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~s 231 (513) . +++.+++|+|... ..+.++++++|+++.+++|....++ +......+|+||.||||+|+|+. .|+| T Consensus 158 ~-~~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~~~~~---~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s 228 (486) T protein:vir:42 158 I-NRVSKAIRVAYDK-----EGNEIQAATLYTPMETIGWFRADGE---WAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTS 228 (486) T ss_pred C-CCeEEEEEEEEec-----CCCeEEEEEEEcCCcEEEEEecCCc---EEeecceecCCCCceEEEeccccccCCCCCcc Confidence 5 5799999988632 2344667899999999998764332 33445678999999999999985 4789 Q ss_pred chhH-HHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceee Q lcl|NC_019916. 232 DFEN-VLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMIL 310 (513) Q Consensus 232 d~e~-v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 310 (513) +++. |++|||+||+++|++++.+++|++|+++++|........... .....+ ....++++. T Consensus 229 ~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~-----------------~~~~~~-~~~~~~~~~ 290 (486) T protein:vir:42 229 EITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSE-----------------TGQTLF-DAYLARILA 290 (486) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccc-----------------cccchh-hhhhchhcc Confidence 9985 999999999999999999999999999999975432211000 000001 011112221 Q ss_pred ccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc----ccHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 311 LKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN----SSGVAMKYKVLGTVELASTK 386 (513) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n----~Sg~Ai~~~~~~l~~k~~~~ 386 (513) .++++++|.++ +..+.++++++|+..|+.++.+|++.+..|+++ +||+||++++.+|++||+++ T Consensus 291 ----------~~~~~~~~~q~--~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~ 358 (486) T protein:vir:42 291 ----------FEDAEGKIQQF--SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERK 358 (486) T ss_pred ----------cCCCCceEEee--cccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHH Confidence 23456777554 455688899999999999998888877777654 69999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCCCCHHH Q lcl|NC_019916. 387 RKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNVTDADE 462 (513) Q Consensus 387 ~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~~ 462 (513) ++.|+.+|++++++++++++.. ....+...++++|+++.|+|+++.|++++|++ |++|+||+++++|+++|+.+ T Consensus 359 ~~~f~~~l~~~~~l~~~~~~~~--~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~ 436 (486) T protein:vir:42 359 NLMFGGAWEEAMRIAYRIMKGG--DVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKERE 436 (486) T ss_pred HHHHHHHHHHHHHHHHHHhcCC--CccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHH Confidence 9999999999999998876532 23456678999999999999999999999984 78999999999999999999 Q ss_pred HHHHHHHHHHHHHH-HhhhhcCCCCCCCCCC----CCCCCCCCCCCCCCC Q lcl|NC_019916. 463 IVKMMDKQRKAMLK-TYDTKGGLIINGTSGN----DPEDEGVRGQQGEPE 507 (513) Q Consensus 463 E~~ri~~E~~~~~~-~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~ 507 (513) |++|+++|+.+... ..+.+.+.....+.+. ...+++..+..+.++ T Consensus 437 e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 437 EMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 99999887765433 2333333332222111 111222223334444 No 44 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=5.7e-81 Score=460.50 Aligned_cols=453 Identities=14% Similarity=0.063 Sum_probs=325.0 Q ss_pred cCCHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCC Q lcl|NC_019916. 17 KLTPTRI-AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGP 95 (513) Q Consensus 17 ~~~~~~i-~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~ 95 (513) ..|+.++ ..|+.+ +..+++|++++++||+|+|++.+..... +....++|+++||+++||++.++||.+++++...+ T Consensus 1 ~~t~~~~i~~L~~~-~~~~~~r~~~l~~Yy~G~~~i~~~~~~~--~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d 77 (480) T protein:vir:78 1 MTTYHEHVERLQGL-LARDLPNLLEAEAYRNGTRRLKTIGIGA--PPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) T ss_pred CCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhcccccccccccc--chhHhhhhhhcchHHHHHHHHHhhhccCceecCCC Confidence 4566664 445555 4677899999999999999975543322 22335778999999999999999999999987654 Q ss_pred c--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec------CCCceeEEEEEcccceEEEecCCCCcceEEEEEE Q lcl|NC_019916. 96 S--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD------PSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRY 167 (513) Q Consensus 96 ~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d------~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~ 167 (513) + .+.++++|+.|+++.++.+++++++++|+||++||.+ ++|.+++. .++|.+++++||+...+++.+++|+ T Consensus 78 ~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~-~~~p~~~~~~~D~~~~~~~~~~i~~ 156 (480) T protein:vir:78 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIR-VESPLYMYAELDPRNTRRVTRAVRL 156 (480) T ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEE-EEcccceEEEEcCCCccceEEEEEE Confidence 3 4679999999999999999999999999999999974 45556554 4899999999999988999999999 Q ss_pred EeecccccccceeEEEEEEEcCCcEEEEEeeccCC-ccccccccccccCcccceEEecCCC-----CCCcchhH-HHHHH Q lcl|NC_019916. 168 HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG-SVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFEN-VLSLI 240 (513) Q Consensus 168 ~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~-v~~li 240 (513) |...+. ....+++++|+++.+++|+...+.. .+....+..+|+||.||||+|+|+. .|+|+++. |++|| T Consensus 157 ~~~~~~----~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) T protein:vir:78 157 YTTRDD----VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) T ss_pred EEeecC----CCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHH Confidence 865432 2346788999999999987765432 2333445668999999999999974 57899985 99999 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecccccccccc Q lcl|NC_019916. 241 DLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQ 320 (513) Q Consensus 241 D~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (513) |+||+++|++++.+++|++|+++++|........... ...+.. ..+.++ . T Consensus 233 Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-------------------~~~~~~-~~~~~~----------~ 282 (480) T protein:vir:78 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-------------------NTTLDI-YYGRIL----------T 282 (480) T ss_pred HHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccc-------------------cchhhh-hhhhhc----------c Confidence 9999999999999999999999999975332211000 000100 001111 2 Q ss_pred ccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 321 QTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG----NSSGVAMKYKVLGTVELASTKRKQFERGLNQ 396 (513) Q Consensus 321 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~ 396 (513) .+++++++.+++. +..++++++|+..|+.++.+|++.+..|++ ++||+||++++.+|+.||+++++.|+.+|++ T Consensus 283 ~~~~~~~~~~~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~ 360 (480) T protein:vir:78 283 LASEAAKISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) T ss_pred CCCCCceEEecCc--cCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2356677777664 335555666666666665555444444432 3799999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH----hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_019916. 397 RYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA----GAQIPQEYLYQYLPNVTDADEIVKMMDKQRK 472 (513) Q Consensus 397 ~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl----~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~ 472 (513) ++++++.+++. ....++..++++|+++.|+|.++.+++++|+ .|++|.||+++++||++|+.++++++++|+. T Consensus 361 ~~~l~~~~~g~---~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~ 437 (480) T protein:vir:78 361 AMRIAMQIMGR---EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQET 437 (480) T ss_pred HHHHHHHHcCC---CccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHH Confidence 99999887642 3345667899999999999999999999987 2479999999999999988888887766655 Q ss_pred HHH-HHhhhhcCCCCC-CCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 473 AML-KTYDTKGGLIIN-GTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 473 ~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.. ............ .+.+..++ +....++...+..++.- T Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 479 (480) T protein:vir:78 438 EDMIDTLYSTTKAQADATPKPTVTE-TKTETQTSPSGFNRTKT 479 (480) T ss_pred HHHHHHhhccccccCCCCCCCCCCC-CCCccccccCCCCcccC Confidence 432 122111111111 11111111 11111111122222222 No 45 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=1e-80 Score=459.11 Aligned_cols=450 Identities=14% Similarity=0.061 Sum_probs=322.8 Q ss_pred cCCHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCC Q lcl|NC_019916. 17 KLTPTR-IAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGP 95 (513) Q Consensus 17 ~~~~~~-i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~ 95 (513) .-|+.+ |..++.+ +..+++|++++++||+|+|++.+..... + ....++|+++||+++||++.++||++++++...+ T Consensus 1 ~~t~~d~i~~L~~~-~~~~~~r~~~~~~Yy~G~~~i~~~~~~~-~-~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d 77 (480) T protein:vir:78 1 MTTYHEHVERLQGL-LARDLPNLLEAEAYRNGTRRLKTIGIGA-P-PELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) T ss_pred CCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhccccchhccccc-c-hhhhhhhhhcchHHHHHHHHHhhhccCceecCCC Confidence 335555 5555555 4677899999999999999876544322 2 2334678999999999999999999999987544 Q ss_pred c--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeee------cCCCceeEEEEEcccceEEEecCCCCcceEEEEEE Q lcl|NC_019916. 96 S--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYR------DPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRY 167 (513) Q Consensus 96 ~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~------d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~ 167 (513) + .+.++++|+.|+++.++.+++++++++|+||++||. +++|.+++. .++|.+++++||+...+++.+++|+ T Consensus 78 ~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~-~~~p~~~~~i~D~~~~~~~~~~i~~ 156 (480) T protein:vir:78 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR-VESPLYMYAELDPRNTRRVTRAVRL 156 (480) T ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEE-EEcccceEEEEcCCCccceEEEEEE Confidence 3 467999999999999999999999999999999996 345666655 4899999999999988999999999 Q ss_pred EeecccccccceeEEEEEEEcCCcEEEEEeeccCCc-cccccccccccCcccceEEecCCC-----CCCcchhH-HHHHH Q lcl|NC_019916. 168 HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS-VPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFEN-VLSLI 240 (513) Q Consensus 168 ~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~-v~~li 240 (513) |...+. ....+++++|+++.+++|........ +....+..+|+||.||||+|+|+. .|+|+++. |++|| T Consensus 157 ~~~~d~----~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~ 232 (480) T protein:vir:78 157 YTTRDD----VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) T ss_pred EEeecC----CcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHH Confidence 865432 23357889999999999876654322 233345678999999999999975 47899985 99999 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecccccccccc Q lcl|NC_019916. 241 DLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQ 320 (513) Q Consensus 241 D~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (513) |+||+++|++++.+++|++|+|+++|........... ...+.. ..+.++ . T Consensus 233 Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-------------------~~~~~~-~~~~~~----------~ 282 (480) T protein:vir:78 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-------------------NTTLDI-YYGRIL----------T 282 (480) T ss_pred HHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccc-------------------cchhhh-hhhhhc----------c Confidence 9999999999999999999999999975432211000 000100 011111 2 Q ss_pred ccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 321 QTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG----NSSGVAMKYKVLGTVELASTKRKQFERGLNQ 396 (513) Q Consensus 321 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~ 396 (513) .++++++|.+++. +..++++++++..|+.++.+|++.+..|++ ++||+||++++.+|+.||+++++.|+.+|++ T Consensus 283 ~~~~~~~~~~~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 360 (480) T protein:vir:78 283 LASEAAKISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) T ss_pred CCCCCceEEecCc--cCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2345677777654 334556666666666665555554444433 3799999999999999999999999999999 Q ss_pred HHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_019916. 397 RYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNVTDADEIVKMMDKQRK 472 (513) Q Consensus 397 ~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~~E~~ri~~E~~ 472 (513) ++++++.+++ .....++..++++|+++.|+|+++.+++++|+. |++|.+|+++++||++|+.+|++++++++. T Consensus 361 ~~rl~~~~~~---~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~ 437 (480) T protein:vir:78 361 AMRIAMQIMG---REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQET 437 (480) T ss_pred HHHHHHHHcC---CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHH Confidence 9999988764 233456778999999999999999999999873 368999999999999999999887776655 Q ss_pred HHHHHhhhhcCCC----CCCCCCCCCCCCCCCCCCCC-CCCccCCC Q lcl|NC_019916. 473 AMLKTYDTKGGLI----INGTSGNDPEDEGVRGQQGE-PEDERTSD 513 (513) Q Consensus 473 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 513 (513) +... +.+.... ..++.+..++. +++.... .+-.++-- T Consensus 438 ~~~~--~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 479 (480) T protein:vir:78 438 EDMI--DTLYSTTKAQADATPKPTVTET--KTETQTSPSGFNRTKT 479 (480) T ss_pred HHHH--HHhhccccCCCccccCCCCCCC--CCccCCCcccCCCcCC Confidence 4332 2211111 11111111111 1110000 00011101 No 46 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=5.6e-81 Score=460.55 Aligned_cols=460 Identities=14% Similarity=0.093 Sum_probs=335.9 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |+++-. ..+.++++++.+.|..++..+.+|++++.+||+|+|++.+...... ....+.++++||+++||++ T Consensus 1 ~~~~~~-------~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~--~~~~~~~~~~n~~~~ivd~ 71 (484) T protein:vir:77 1 MTSPLQ-------KQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVP--QQMQKLLAHVGYPRLYIDA 71 (484) T ss_pred CCCccc-------ccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccc--hhHHhhhhhcCcHHHHHHH Confidence 444322 2357899988887777777888999999999999999755443322 2223556789999999999 Q ss_pred HHHHhhcCCeeecCCc--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCcee-------EEEEEcccceEE Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPS--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGE-------VSVKLDPMECFI 151 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~-------~~~~~~p~~~~~ 151 (513) .++||++++++...++ +..++++|+.|+|+.++.+++++++++|+||++||.++++... .+..++|.++++ T Consensus 72 ~~~~l~~~g~~~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~ 151 (484) T protein:vir:77 72 IAARQELEGFRLGGADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYA 151 (484) T ss_pred HHhhhccCceecCCcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEE Confidence 9999999999976543 3579999999999999999999999999999999999887542 234579999999 Q ss_pred EecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC---- Q lcl|NC_019916. 152 IYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE---- 227 (513) Q Consensus 152 ~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~---- 227 (513) +||+. .+++.+++++|.... .....++++|+++.+++|....+ . +...+..+|++|.||||+|+|+. T Consensus 152 ~~D~~-~~~~~~a~~~~~~~~-----~~~~~~~~~y~~~~~~~~~~~~~--~-~~~~~~~~~~~g~vPvv~f~N~~~~~~ 222 (484) T protein:vir:77 152 QIDPR-TRQVMRAIRAIEDEE-----GNEVIGATLYLPNNTVIWNREDG--Q-WVQVANVAHNLEMVPVIPIPNRTRLSD 222 (484) T ss_pred EecCC-CCceEEEEEEEEeec-----CCcEEEEEEEecCeEEEEEecCC--c-eEeeccccCCCCCcceEEeccccccCc Confidence 99986 468999999887542 22356778999999988866432 2 33445678999999999999975 Q ss_pred -CCCcchh-HHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh Q lcl|NC_019916. 228 -YRQGDFE-NVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ 305 (513) Q Consensus 228 -~~~sd~e-~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 305 (513) +|+|+|+ .|++|||+||+++|++++.+++|++|+++++|.......... +.....++. .. T Consensus 223 ~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~-----------------~~~~~~~~~-~~ 284 (484) T protein:vir:77 223 LYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDP-----------------ETGQTLFDA-YL 284 (484) T ss_pred cCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccc-----------------cccchhhhh-hh Confidence 4789998 599999999999999999999999999999997543221100 000000110 01 Q ss_pred cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc----ccHHHHHHHHHHHHH Q lcl|NC_019916. 306 ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN----SSGVAMKYKVLGTVE 381 (513) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n----~Sg~Ai~~~~~~l~~ 381 (513) +.++ ..+++++++.++ +..+.++++++|+..|+.+|.+|++.+..|+++ +||+||++++.+|++ T Consensus 285 ~~~~----------~~~~~~~~~~q~--~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ 352 (484) T protein:vir:77 285 ARIL----------AFEDHESKAQQF--SAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVK 352 (484) T ss_pred hhhc----------ccCCCCceeEee--cCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHH Confidence 1111 123455666544 445567788888888888877776666655433 799999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCC Q lcl|NC_019916. 382 LASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNV 457 (513) Q Consensus 382 k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v 457 (513) ||+++++.|+++|++++++++.+.+.. ....+...++++|+++.|+|.++.|++++|++ |++|.||+++++|++ T Consensus 353 ka~~k~~~f~~~l~~~~~l~~~~~~~~--~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~ 430 (484) T protein:vir:77 353 TVERKNKIFGGAWEQAMRVAYKVMNGG--DIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYS 430 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCC Confidence 999999999999999999998875432 23456678999999999999999999999973 489999999999999 Q ss_pred CCHHHHHHHHHHHHHHHH-HHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 458 TDADEIVKMMDKQRKAML-KTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 458 ~D~~~E~~ri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) +|+.+|++++++|+.+.. ...+.+.+.... .+.++...+.+++...++++..- T Consensus 431 ~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 431 ITEREEMRKWDEEEQAQGLGLMGTMFGTDPS--GGGNPDNPETPEPQPNPAEEAAA 484 (484) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhcccccc--CCCCCCCCCcccccCCCccccCC Confidence 999999999887765432 333333322211 11111111111111111111111 No 47 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=1.3e-80 Score=458.57 Aligned_cols=461 Identities=13% Similarity=0.110 Sum_probs=330.3 Q ss_pred Cccchh-----hceeccCCcccCCHHHHHHHHHH---HHHHHHHHHHHHHHHhcCCCccccccccccCCCCCC-cceeec Q lcl|NC_019916. 1 MIDMQQ-----ANMNYQEDADKLTPTRIAAFIRH---HYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKA-DHRAVH 71 (513) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~i~~~i~~---~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~-~~ri~~ 71 (513) ||-.-. --..+.-+.+.++.+.+..++.+ ++..+++|++++++||+|+|++....... ++..+. ++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~-~~~~~~~~~~~v~ 79 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGA-SDEVKELAKLSVK 79 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccC-ChhhhhhHhhhhc Confidence 432211 11122223345677666655553 23457889999999999999876554433 333444 445778 Q ss_pred chhHHHHHHHHHHhhcCCeeecC-CcHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceE Q lcl|NC_019916. 72 SFARYIADFQTSYSVGNAIAMSG-PSSDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECF 150 (513) Q Consensus 72 n~~~~ivd~~~~~l~g~p~~~~~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~ 150 (513) ||+++||++.++|++.++++... +..+.++++|+.|+|+.++.++++++++||+||++||.++++. ++. .++|+++| T Consensus 80 n~~~~ivd~~a~~l~~~gf~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~-~i~-~~sp~~~~ 157 (501) T protein:vir:25 80 NVLSLVRDSFAQNLSVVGYRNALAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGP-VFR-TRSPRQIL 157 (501) T ss_pred ChHHHHHHHHHhhhcccceecCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCC-eEE-EeccccEE Confidence 99999999999999999988744 3456799999999999999999999999999999999998873 444 58999999 Q ss_pred EEecC-CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeecc------CCcc-------c-----cccccc Q lcl|NC_019916. 151 IIYDR-SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVV------AGSV-------P-----TLEVAE 211 (513) Q Consensus 151 ~~~d~-~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~------~~~~-------~-----~~~~~~ 211 (513) ++|++ ..++++.+++++|...... ....++++|++..++++..... .+.+ . ...... T Consensus 158 ~iy~D~~~~~~~~~ai~~~~~~~~~----~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (501) T protein:vir:25 158 AVYADPSVDAWPQYALETWVAQKDA----KPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGAT 233 (501) T ss_pred EEEecCCCCcceeEEEEEEeecccc----CcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccc Confidence 99965 5566799999998765432 2244677888877777643211 0000 0 111234 Q ss_pred cccCcccceEEecCC----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhh Q lcl|NC_019916. 212 HSAQFGFPMIEYRNN----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDAD 287 (513) Q Consensus 212 ~~~~g~vPvv~~~n~----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~ 287 (513) +|+||.||||+|+|+ .+|+|+|+++++|+|+||+++|++++.++++++|+++++|+...... T Consensus 234 ~~~~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~-------------- 299 (501) T protein:vir:25 234 FEGKPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAE-------------- 299 (501) T ss_pred cCCccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccc-------------- Confidence 789999999999995 45899999999999999999999999999999999999997532211 Q ss_pred hhhccccccchhhhcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCcccccccccccc Q lcl|NC_019916. 288 AMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN 366 (513) Q Consensus 288 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n 366 (513) ......++++.+ +++++++.+++ .+.+++...++++..+|+..|++|+.+++.+++| T Consensus 300 ------------~~~~~~~~i~~~----------~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N 357 (501) T protein:vir:25 300 ------------VLKASALRVWTF----------EDPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMIN 357 (501) T ss_pred ------------hhhhcccceecc----------CCCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCC Confidence 011222333333 23456676666 4667888899999999999999999999988899 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcC-C Q lcl|NC_019916. 367 SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQ-I 445 (513) Q Consensus 367 ~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~-i 445 (513) +||+||++++.+|++||+++++.|+.+|+++++|++.+.+. ....+..+++++|+++.|+|.+++||+++|+.|+ + T Consensus 358 ~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~---~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi 434 (501) T protein:vir:25 358 VSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDD---PDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI 434 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC Confidence 99999999999999999999999999999999999877643 2334556899999999999999999999999765 8 Q ss_pred CHHHHHHhCCCCCCHHHHHHHHHHHHHHHH--HHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 446 PQEYLYQYLPNVTDADEIVKMMDKQRKAML--KTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 446 S~et~~~~l~~v~D~~~E~~ri~~E~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) |.||++.++|++++++ +++++++++++. ...+.+......+....++++ +.+..+.++.+.+ T Consensus 435 s~et~~~~~~g~~~~~--ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 498 (501) T protein:vir:25 435 PIEHLLSMVPGMTQQT--IQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQA----AAQALNEGGVNGN 498 (501) T ss_pred CHHHHHHHcCCCCHHH--HHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCC----CccccccccCCCC Confidence 9999999999998644 455554443322 222222222221111111111 1111111111111 No 48 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=2.7e-79 Score=451.34 Aligned_cols=456 Identities=13% Similarity=0.074 Sum_probs=325.0 Q ss_pred eeccCC--cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 9 MNYQED--ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 9 ~~~~~~--~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) +.-+.+ .+.-++..+.+.|..++..+++|++++++||+|+|++.+....... ...++++++||+++||++.++||+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~--~~~~~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPI--QMQSLLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCCh--hhhhhhhhcCcHHHHHHHHHhhhc Confidence 222222 1223444444444455577889999999999999998665543322 224567889999999999999999 Q ss_pred cCCeeecCCc--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC--------ceeEEEEEcccceEEEecCC Q lcl|NC_019916. 87 GNAIAMSGPS--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ--------KGEVSVKLDPMECFIIYDRS 156 (513) Q Consensus 87 g~p~~~~~~~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~--------~~~~~~~~~p~~~~~~~d~~ 156 (513) +++++...++ +..++++|+.|+|+.++.+++++++++|+||++||.++.+ .+.+ ..++|.+++++||+. T Consensus 79 ~~g~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i-~~~~p~~~~~~~D~~ 157 (485) T protein:vir:10 79 VEGFRFGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPII-RVEPPTRMYAEIDPR 157 (485) T ss_pred ccceecCCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEE-EEEccceeEEEEcCC Confidence 9998875443 3569999999999999999999999999999999998653 3333 357999999999986 Q ss_pred CCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCc Q lcl|NC_019916. 157 VNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQG 231 (513) Q Consensus 157 ~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~s 231 (513) .. ++.+++++|... ...+..++++||++.+++|....++ +......+|++|.||||+|+|+. .|+| T Consensus 158 ~~-~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s 228 (485) T protein:vir:10 158 IG-RVSKAIRVAYDA-----EGNEIQAATLYTPNDIFGWYRVENE---WQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTS 228 (485) T ss_pred CC-ceeEEEEEEEee-----CCCeEEEEEEEeCCeEEEEEEcCCc---eEEeccccCCCCcccEEEeccccccCCCCCcc Confidence 64 566666665432 1234667899999999998765432 33445678999999999999985 3788 Q ss_pred chhH-HHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceee Q lcl|NC_019916. 232 DFEN-VLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMIL 310 (513) Q Consensus 232 d~e~-v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 310 (513) +++. |++|||+||+++|++++.+++|++|+++++|.......... +.....+ ....+.++. T Consensus 229 ~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~-----------------~~~~~~~-~~~~~~i~~ 290 (485) T protein:vir:10 229 EITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDP-----------------ETGQTLF-DAYLARILA 290 (485) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccc-----------------cccchhh-hhcccceec Confidence 9985 99999999999999999999999999999997543221100 0000001 111122222 Q ss_pred ccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc----cccHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 311 LKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG----NSSGVAMKYKVLGTVELASTK 386 (513) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~ 386 (513) .++++++|.++ +.+..++++++|+..|+.++.+|++.+..|++ ++||+||++++.+|+.||+++ T Consensus 291 ----------~~~~d~k~~q~--~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k 358 (485) T protein:vir:10 291 ----------FEDAEGKIQQF--SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERK 358 (485) T ss_pred ----------cCCCCceEEee--cccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHH Confidence 23456777654 44556778888888888887776665555543 379999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCCCCHHH Q lcl|NC_019916. 387 RKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNVTDADE 462 (513) Q Consensus 387 ~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~~ 462 (513) ++.|+.+|++++++++.+.+. .....+...++|+|+++.|+|+++.|++++||. |++|+||+++++|+++|..+ T Consensus 359 ~~~f~~~l~~~~~l~~~~~~~--~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~ 436 (485) T protein:vir:10 359 NSIFGGAWEEAMRLAYRMMKG--GDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAERE 436 (485) T ss_pred HHHHHHHHHHHHHHHHHHhCC--CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHH Confidence 999999999999999887543 334556778999999999999999999999983 48999999999999988888 Q ss_pred HHHHHHHHHHHHHH-HhhhhcCCCC--CCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 463 IVKMMDKQRKAMLK-TYDTKGGLII--NGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 463 E~~ri~~E~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 508 (513) |++++++|+.+... ..+.+.+... +++.+.+++.+.+.+.++.++. T Consensus 437 ~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 437 EMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 88888766544222 2222222211 1111222222222222222222 No 49 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=4.4e-79 Score=450.18 Aligned_cols=452 Identities=13% Similarity=0.074 Sum_probs=324.0 Q ss_pred cCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCee Q lcl|NC_019916. 12 QEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIA 91 (513) Q Consensus 12 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~ 91 (513) ....+++++..+.+.|..++..+++|++++++||+|+|++.+..... + ....++++++||+++||++.++||+.++++ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~-~-~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~ 78 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAV-P-LDMRKYLAHVGYPRTYVDAIAERQELEGFR 78 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCccc-c-hhhhhhhhhcchHHHHHHHHHHhhhcccee Confidence 33345677776655555556777899999999999999986654432 2 233477899999999999999888777665 Q ss_pred ecC---------Cc---HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC--------CCceeEEEEEcccceEE Q lcl|NC_019916. 92 MSG---------PS---SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP--------SQKGEVSVKLDPMECFI 151 (513) Q Consensus 92 ~~~---------~~---~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~--------~~~~~~~~~~~p~~~~~ 151 (513) +.. ++ ...++++|+.|+|+.++.+++++++++|+||++||.++ ++.+.+. .++|.++++ T Consensus 79 ~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~-~~~p~~~~~ 157 (488) T protein:vir:23 79 IPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIR-VEPPTALYA 157 (488) T ss_pred ccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEE-EeccceeEE Confidence 421 12 24689999999999999999999999999999998754 3334443 579999999 Q ss_pred EecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC---- Q lcl|NC_019916. 152 IYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE---- 227 (513) Q Consensus 152 ~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~---- 227 (513) +||+.. +++.+++++|...+ ...++++++|+++.+++|....+ . +......+|+||.||||+|+|+. T Consensus 158 ~~d~~~-~~~~~~~~~~~~~~-----~~~~~~~~~y~~~~~~~~~~~~~--~-~~~~~~~~h~~g~vPvv~f~n~~~~~~ 228 (488) T protein:vir:23 158 EVDPRT-RKVLYAIRAIYGAD-----GNEIVSATLYLPDTTMTWLRAEG--E-WEAPTSTPHGLEMVPVIPISNRTRLSD 228 (488) T ss_pred EEecCC-CceEEEEEEEEecC-----CCcEEEEEEEecCcEEEEEecCC--c-eEeccccccCCCCcceEEeccccccCC Confidence 999764 57888888875432 23356788999999998865432 2 34455678999999999999975 Q ss_pred -CCCcchh-HHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh Q lcl|NC_019916. 228 -YRQGDFE-NVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ 305 (513) Q Consensus 228 -~~~sd~e-~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 305 (513) +|+|+++ .|++|||+||+++|++++.+++|++|+++++|........... ....+..... T Consensus 229 ~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~------------------~~~~~~~~~~ 290 (488) T protein:vir:23 229 LYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAE------------------TGQRMFDAYM 290 (488) T ss_pred cCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccc------------------ccchhhhhhh Confidence 5789997 5899999999999999999999999999999975432211100 0000111111 Q ss_pred cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc----cccHHHHHHHHHHHHH Q lcl|NC_019916. 306 ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG----NSSGVAMKYKVLGTVE 381 (513) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~ 381 (513) +.++.+ ..++++++.+++ ..+.++++++|+..|+.++.+|++.+..|++ ++||+||++++++|+. T Consensus 291 ~~v~~~---------~~g~~~~~~q~~--~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~ 359 (488) T protein:vir:23 291 ARILAF---------EGGEGAHAEQFS--AAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVK 359 (488) T ss_pred hhhccC---------CCCCCceeEecC--CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHH Confidence 222222 234556765544 4556777777777777777666665555543 3699999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCC Q lcl|NC_019916. 382 LASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNV 457 (513) Q Consensus 382 k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v 457 (513) ||+++++.|+.+|++++++++.+++... ...+..+++++|+++.|.|+++.+++++|+. |++|+||+++++|++ T Consensus 360 k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~ 437 (488) T protein:vir:23 360 KVERKNKIFGGAWEQAMRLAYKMVKGGD--IPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYT 437 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCC--cchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCC Confidence 9999999999999999999998765432 2346678999999999999999999999973 479999999999999 Q ss_pred CCHHHHHHHHHHHHHHH-HHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 458 TDADEIVKMMDKQRKAM-LKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 458 ~D~~~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +|+.+|++++++++.+. ....+.+.....+...+. + .+.+++.++.... T Consensus 438 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~---~~~~~~~~~e~~~ 487 (488) T protein:vir:23 438 IVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPG----E---APVGEPPAPEPDA 487 (488) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCC----C---CCCCCCCCCCCCC Confidence 99999999887654432 222333322221111111 0 1111122222222 No 50 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=7.4e-79 Score=448.92 Aligned_cols=435 Identities=13% Similarity=0.071 Sum_probs=315.4 Q ss_pred cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC Q lcl|NC_019916. 15 ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG 94 (513) Q Consensus 15 ~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~ 94 (513) ...+|++++.+.|..++..+++|++++++||+|+|++.+............++|+++||+++||++.++|++|+|+++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 55678866555444445778899999999999999987665555444444578999999999999999999999999865 Q ss_pred Cc----HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEee Q lcl|NC_019916. 95 PS----SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAV 170 (513) Q Consensus 95 ~~----~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~ 170 (513) ++ +..++++|+.|+++.++.+++++++++|+||++||.+++|.+++. .++|.+++++||+...+++.+++|+|.. T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~-~~~p~~~~~i~d~~~~~~~~~~i~~~~~ 159 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATIT-ADSPETMVVSVDPLQPWRIRAAMRWWRD 159 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEE-EEccceeEEEEcCCCCcceEEEEEEEEe Confidence 43 346899999999999999999999999999999999999998876 4899999999999998999999999975 Q ss_pred cccccccceeEEEEEEEcCCcEEE-EE------------eeccCCccccccccccccCcccceEEecCCCCCCcchhHHH Q lcl|NC_019916. 171 QTVVDNITQTKYEVETWTENDYTR-YK------------PIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVL 237 (513) Q Consensus 171 ~~~~~~~~~~~~~ve~yt~~~~~~-~~------------~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~ 237 (513) .+.. . .+ ..+|....+.. ++ ....++ .+......+|.+|.+||++| +|.+|+|+|++++ T Consensus 160 ~d~~--~---~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi 231 (456) T protein:vir:10 160 LDAE--S---DF-AIVWSGDGWQKFARPCFVQSSSRRRLVTRISD-SWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHI 231 (456) T ss_pred cCCc--e---eE-EEEEeccceeEEEEEEEEeecccceeeeecCC-ceeeccccCCCCCceeEEEe-cCCCCCchhhhhH Confidence 4321 1 11 22222222111 11 011111 22333456788898888877 5668999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAP 317 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (513) +|||+||+++|++++.++++++|+++++|.......... .+.. + +.........+.++.+ T Consensus 232 ~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~--------~g~~---~---~~~~~~~~~~~~~~~~------ 291 (456) T protein:vir:10 232 DIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDE--------NGNA---I---DYASIFEAAPGALWEL------ 291 (456) T ss_pred HHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccc--------cccc---c---chhhhhhhhccccccC------ Confidence 999999999999999999999999999997543211100 0000 0 0000000011111111 Q ss_pred cccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR 397 (513) Q Consensus 318 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 397 (513) .+++++..++ +.+.+++...++++...|+..|++|+.+++.+++|+||+||++++.+|++||+++++.|+++|+++ T Consensus 292 ---~~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 367 (456) T protein:vir:10 292 ---PPGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI 367 (456) T ss_pred ---CCCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2334444332 356788888899999999999999999888888899999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHH Q lcl|NC_019916. 398 YTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKA 473 (513) Q Consensus 398 ~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~ 473 (513) +++++.+.+ ..+...++++|+++.|+|.++.||+++|+ +|++|.+++++++|++.+ .++|++|+++|... T Consensus 368 ~rl~~~~~g------~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~ 441 (456) T protein:vir:10 368 LVKALQIEG------ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITL 441 (456) T ss_pred HHHHHHhcC------CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHH Confidence 999876532 22345789999999999999999999998 478999999999987654 33456665555432 Q ss_pred HHHHhhhhcCCCCCCCCCCC Q lcl|NC_019916. 474 MLKTYDTKGGLIINGTSGND 493 (513) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~ 493 (513) ........+ +++++- T Consensus 442 ~~~~~~~~~-----~~~~~~ 456 (456) T protein:vir:10 442 FAGNPVQRP-----QEDGSR 456 (456) T ss_pred HhhhhhhcC-----CCCCCC Confidence 111110000 000000 No 51 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=7.4e-79 Score=448.92 Aligned_cols=435 Identities=13% Similarity=0.071 Sum_probs=315.4 Q ss_pred cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC Q lcl|NC_019916. 15 ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG 94 (513) Q Consensus 15 ~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~ 94 (513) ...+|++++.+.|..++..+++|++++++||+|+|++.+............++|+++||+++||++.++|++|+|+++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 55678866555444445778899999999999999987665555444444578999999999999999999999999865 Q ss_pred Cc----HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEee Q lcl|NC_019916. 95 PS----SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAV 170 (513) Q Consensus 95 ~~----~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~ 170 (513) ++ +..++++|+.|+++.++.+++++++++|+||++||.+++|.+++. .++|.+++++||+...+++.+++|+|.. T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~-~~~p~~~~~i~d~~~~~~~~~~i~~~~~ 159 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATIT-ADSPETMVVSVDPLQPWRIRAAMRWWRD 159 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEE-EEccceeEEEEcCCCCcceEEEEEEEEe Confidence 43 346899999999999999999999999999999999999998876 4899999999999998999999999975 Q ss_pred cccccccceeEEEEEEEcCCcEEE-EE------------eeccCCccccccccccccCcccceEEecCCCCCCcchhHHH Q lcl|NC_019916. 171 QTVVDNITQTKYEVETWTENDYTR-YK------------PIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVL 237 (513) Q Consensus 171 ~~~~~~~~~~~~~ve~yt~~~~~~-~~------------~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~ 237 (513) .+.. . .+ ..+|....+.. ++ ....++ .+......+|.+|.+||++| +|.+|+|+|++++ T Consensus 160 ~d~~--~---~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi 231 (456) T protein:vir:10 160 LDAE--S---DF-AIVWSGDGWQKFARPCFVQSSSRRRLVTRISD-SWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHI 231 (456) T ss_pred cCCc--e---eE-EEEEeccceeEEEEEEEEeecccceeeeecCC-ceeeccccCCCCCceeEEEe-cCCCCCchhhhhH Confidence 4321 1 11 22222222111 11 011111 22333456788898888877 5668999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAP 317 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (513) +|||+||+++|++++.++++++|+++++|.......... .+.. + +.........+.++.+ T Consensus 232 ~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~--------~g~~---~---~~~~~~~~~~~~~~~~------ 291 (456) T protein:vir:10 232 DIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDE--------NGNA---I---DYASIFEAAPGALWEL------ 291 (456) T ss_pred HHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccc--------cccc---c---chhhhhhhhccccccC------ Confidence 999999999999999999999999999997543211100 0000 0 0000000011111111 Q ss_pred cccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR 397 (513) Q Consensus 318 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 397 (513) .+++++..++ +.+.+++...++++...|+..|++|+.+++.+++|+||+||++++.+|++||+++++.|+++|+++ T Consensus 292 ---~~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 367 (456) T protein:vir:10 292 ---PPGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI 367 (456) T ss_pred ---CCCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2334444332 356788888899999999999999999888888899999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHH Q lcl|NC_019916. 398 YTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKA 473 (513) Q Consensus 398 ~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~ 473 (513) +++++.+.+ ..+...++++|+++.|+|.++.||+++|+ +|++|.+++++++|++.+ .++|++|+++|... T Consensus 368 ~rl~~~~~g------~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~ 441 (456) T protein:vir:10 368 LVKALQIEG------ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITL 441 (456) T ss_pred HHHHHHhcC------CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHH Confidence 999876532 22345789999999999999999999998 478999999999987654 33456665555432 Q ss_pred HHHHhhhhcCCCCCCCCCCC Q lcl|NC_019916. 474 MLKTYDTKGGLIINGTSGND 493 (513) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~ 493 (513) ........+ +++++- T Consensus 442 ~~~~~~~~~-----~~~~~~ 456 (456) T protein:vir:10 442 FAGNPVQRP-----QEDGSR 456 (456) T ss_pred HhhhhhhcC-----CCCCCC Confidence 111110000 000000 No 52 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=2.5e-78 Score=446.04 Aligned_cols=451 Identities=15% Similarity=0.141 Sum_probs=312.7 Q ss_pred eeccCCcccCCHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCcceeecchhHHHHHHHHH Q lcl|NC_019916. 9 MNYQEDADKLTPTRIAAFIRH----HYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KADHRAVHSFARYIADFQTS 83 (513) Q Consensus 9 ~~~~~~~~~~~~~~i~~~i~~----~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~ri~~n~~~~ivd~~~~ 83 (513) |.+..+ +.|+.++|.++|.. .+..+.+|++++++||+|+|++++.......... +..+++++||+++||++.++ T Consensus 1 ~~~~p~-~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~ 79 (479) T protein:vir:99 1 MIDLPD-EDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQ 79 (479) T ss_pred CccCCc-ccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHh Confidence 555555 57999988876652 3456789999999999999998876554433322 23445688999999999999 Q ss_pred HhhcCCeeecCCc-HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeee-----cCCCceeEEEEEcccceEEEecCCC Q lcl|NC_019916. 84 YSVGNAIAMSGPS-SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYR-----DPSQKGEVSVKLDPMECFIIYDRSV 157 (513) Q Consensus 84 ~l~g~p~~~~~~~-~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~-----d~~~~~~~~~~~~p~~~~~~~d~~~ 157 (513) |++.++++...++ .+.++++|+.|+++.++.+++++++++|+||++||. |++|.+++. .++|++++++|++.. T Consensus 80 ~l~~~gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~-~~~p~~~~~iydd~~ 158 (479) T protein:vir:99 80 QLIVDGYRKTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIK-CIDPRDAFAIWEDPY 158 (479) T ss_pred hcccccccCCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEE-EechhheEEEecCCc Confidence 9999988875433 456899999999999999999999999999999996 455666554 479999999998865 Q ss_pred Ccc-eEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCC----CCCCcc Q lcl|NC_019916. 158 NPK-PIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNN----EYRQGD 232 (513) Q Consensus 158 ~~~-~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~----~~~~sd 232 (513) .+. ++++++++. ...+++||...++.+.... + .+...+..+|+||+||||+|+|+ .+|+|+ T Consensus 159 ~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~h~~g~vPvv~f~n~~~~~~~g~sd 224 (479) T protein:vir:99 159 WDEWPKYLLERQP-----------NGQYWWWTEEDYSIFEFKQ--G-KFIYRETVSHDYGHIPFVRYVNVMDLRGVCYGD 224 (479) T ss_pred ccceeeEEEeecC-----------ceeEEEEecceEEEEEecC--C-ceeeccccccCCCCcceEEeecCCCcCcCCcch Confidence 443 333332211 1235678887777665432 2 23445677999999999999998 579999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecc Q lcl|NC_019916. 233 FENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLK 312 (513) Q Consensus 233 ~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 312 (513) |+++++|||+||+++|++++.+++|++|+++++|........... . .......+++.. T Consensus 225 ~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-------------~--------~~~~~~~~i~~~- 282 (479) T protein:vir:99 225 VEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQ-------------E--------KMRFAQESMLIS- 282 (479) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccch-------------h--------ccccccccceee- Confidence 999999999999999999999999999999999975422111100 0 001112233322 Q ss_pred ccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 313 TGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFE 391 (513) Q Consensus 313 ~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~ 391 (513) .++++++.+++ .+.+++.+.++.+..+|+..+++|+.+++ +.+|+||+||++++.+|++||+++++.|+ T Consensus 283 ---------~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g-~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~ 352 (479) T protein:vir:99 283 ---------QNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAG-QIVNVAADALAAGTRQTMQKLFEKQATWK 352 (479) T ss_pred ---------cCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHcc-cccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34556776554 23444444455555555555666766554 35789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_019916. 392 RGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDK 469 (513) Q Consensus 392 ~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~ 469 (513) .+|++++++++.+.+ .....+..+++++|+++.|+|.++.|++++|| +|++|.||+++++|++++++ ++++++ T Consensus 353 ~al~~~~~l~~~~~~---~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~--~e~~~~ 427 (479) T protein:vir:99 353 ASHNQTMRLVNKIEG---RTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQST--VNGWKE 427 (479) T ss_pred HHHHHHHHHHHHHcC---CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH--HHHHHH Confidence 999999999988753 23344556789999999999999999999997 57999999999999999754 444444 Q ss_pred HHHHH---HHHhhhhcCCCCCCCCCCCCCCCCCCCCC-----CCCCCccCCC Q lcl|NC_019916. 470 QRKAM---LKTYDTKGGLIINGTSGNDPEDEGVRGQQ-----GEPEDERTSD 513 (513) Q Consensus 470 E~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 513 (513) +++++ ....+.+.. ......+.++.++..++++ ++|.+=..+- T Consensus 428 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (479) T protein:vir:99 428 IYDREGDFGKYMRKLQN-GPDPAEQRGGPNGATNMQQANNKTGEPASLNKSG 478 (479) T ss_pred HHHHHHHHHHHHHHHhc-ccCcccccCCCCCCCCCCCCCCCCcchhccCCCC Confidence 33322 222222211 1111111111111111111 1111111111 No 53 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=5.4e-78 Score=444.20 Aligned_cols=427 Identities=12% Similarity=0.033 Sum_probs=321.1 Q ss_pred cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC Q lcl|NC_019916. 15 ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG 94 (513) Q Consensus 15 ~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~ 94 (513) .++-..++|..|++++ ..+++|++++.+||+|+|++........ ...+++|+++|||++||++.++|+.+++++ . T Consensus 1 ~~~~~~~~i~~l~~~~-~~~~~r~~~l~~Yy~G~~~i~~~~~~~~--~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~--~ 75 (441) T protein:vir:80 1 MNSDELALIEGMYDRI-QRLSSWHCCIEGYYEGSNRVRDLGVAIP--PELQRVQTVVSWPGIAVDALEERLDWLGWT--N 75 (441) T ss_pred CCccHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCcchhcCcccc--hhhhhhhhhcchHHHHHHHHHhhhcccccc--C Confidence 2222334688888876 5667999999999999998755443332 234578999999999999999999777655 5 Q ss_pred CcHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccc Q lcl|NC_019916. 95 PSSDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV 174 (513) Q Consensus 95 ~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~ 174 (513) ++++.++++|+.|+++.++.+++++++++|+||++||.|++|++++. .++|.+++++||+...+...++++|+... T Consensus 76 ~d~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~-~~~p~~~~~i~d~~~~~~~~~~~~~~~~~--- 151 (441) T protein:vir:80 76 GDGYGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVR-PQSPKNCTGKFSADGSRLDAGLVVQQTCD--- 151 (441) T ss_pred CChHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEE-EEccceEEEEEeCCCCceeEEEEEEEEec--- Confidence 56678999999999999999999999999999999999999998775 48999999999988766666655655422 Q ss_pred cccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCC-----CCcchh-HHHHHHHHHHHHHH Q lcl|NC_019916. 175 DNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEY-----RQGDFE-NVLSLIDLYDVAQS 248 (513) Q Consensus 175 ~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~-----~~sd~e-~v~~liD~~~~~~S 248 (513) ....++++|+++.+++|..... ..+...+..+|+||.||||+|+|+.+ |+|+++ .|++|||+||+++| T Consensus 152 ----~~~~~~~vy~~~~~~~~~~~~~--~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s 225 (441) T protein:vir:80 152 ----PEVVEAELLLPDVIVQVERRGS--REWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLL 225 (441) T ss_pred ----CceEEEEEEecCeEEEEEEcCC--cceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHH Confidence 1245678999999988865433 23445567899999999999999853 788886 59999999999999 Q ss_pred HHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeE Q lcl|NC_019916. 249 DTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANY 328 (513) Q Consensus 249 ~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (513) ++++.+++|++|+++++|+........ .+ ....++++.++. ...+..+++ T Consensus 226 ~~~~~~~~~~~~~~~i~G~~~~~~~~~-------------~~-----------~~~~~~i~~~~~------~~~~~~~~~ 275 (441) T protein:vir:80 226 GQSVNRDFYAYPQRWVTGVSADEFSQP-------------GW-----------VLSMASVWAVDK------DDDGDTPNV 275 (441) T ss_pred HHHHHHHhhcCceeeeecCCccccccc-------------hh-----------hhcccccccCCC------CCCCCccee Confidence 999999999999999999753221110 00 111123333321 122223444 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc---c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 329 IHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG---N-SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 329 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~---n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) .+ .+.+..++++++|+..|+.++.+|++++..|++ | .||+||++++.+|+.||+++++.|+++|++++++++.+ T Consensus 276 ~~--~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 353 (441) T protein:vir:80 276 GS--FPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKA 353 (441) T ss_pred Ee--cCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 444566777777877777777666665555433 3 59999999999999999999999999999999999998 Q ss_pred HHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh--cC--CCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 405 EERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG--AQ--IPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDT 480 (513) Q Consensus 405 l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~--iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~ 480 (513) ++..... ...+.+++++|+++.|+|.++.|++++|+. |+ +|++++++++|+++ .|++++++|++++.+.... T Consensus 354 ~~~~~~~-~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~---~e~~~~~~e~~e~~~~~~~ 429 (441) T protein:vir:80 354 LDSRVDE-ADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDD---VQVEAVMRHRAESSDPLAV 429 (441) T ss_pred hcCCCcc-cccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCH---HHHHHHHHHHHHHHHHHHH Confidence 8765433 345678999999999999999999999984 43 68899999999875 4666677776666665555 Q ss_pred hcCCCCCCCCCC Q lcl|NC_019916. 481 KGGLIINGTSGN 492 (513) Q Consensus 481 ~~~~~~~~~~~~ 492 (513) +.+....++++- T Consensus 430 ~~~~~~~~~~~~ 441 (441) T protein:vir:80 430 LAGAISRQTNEV 441 (441) T ss_pred HhhhhhcccccC Confidence 444443333333 No 54 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=2.3e-77 Score=440.78 Aligned_cols=435 Identities=14% Similarity=0.072 Sum_probs=316.4 Q ss_pred cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC Q lcl|NC_019916. 15 ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG 94 (513) Q Consensus 15 ~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~ 94 (513) ....|++++.+.|.+++..+++|++++++||+|+|++.+............++++++||+++||++.++|++|+|+++.. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 80 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 45678887766666666788999999999999999987655444333333456788999999999999999999999865 Q ss_pred Cc----HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEee Q lcl|NC_019916. 95 PS----SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAV 170 (513) Q Consensus 95 ~~----~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~ 170 (513) .+ .+.++++|+.|+|+.++.+++++++++|+||+++|.+++|++++. .++|.+++++||+...+++.+++|+|.. T Consensus 81 ~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~-~~~p~~~~~i~d~~~~~~~~~~~~~~~~ 159 (456) T protein:vir:79 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATIT-ADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) T ss_pred CCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEE-EeccceeEEEEcCCCCCceEEEEEEEEe Confidence 43 346899999999999999999999999999999999999998765 5899999999999998999999999865 Q ss_pred cccccccceeEEEEEEEcCCcEEEEEe-------------eccCCccccccccccccCcccceEEecCCCCCCcchhHHH Q lcl|NC_019916. 171 QTVVDNITQTKYEVETWTENDYTRYKP-------------IVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVL 237 (513) Q Consensus 171 ~~~~~~~~~~~~~ve~yt~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~ 237 (513) .+. ...+..+|+.+.++++.. ...++. +......+|.++.||||+| +|.+|.|+|++++ T Consensus 160 ~d~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~pvv~~-~N~~~~gd~e~v~ 231 (456) T protein:vir:79 160 LDA------ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDS-WVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHI 231 (456) T ss_pred cCC------ceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCc-eeecccccCCCCceeEEEe-cCCCCCchhhhhH Confidence 331 123344555555444321 111111 2223456889999999998 4578999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAP 317 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (513) +|||+||+++|++++.++++++|++++.|........... +... ..........+.++.+ T Consensus 232 ~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~--------g~~i------~~~~~~~~~~~~~~~~------ 291 (456) T protein:vir:79 232 DIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDEN--------GNAI------DYASIFEAAPGALWEL------ 291 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccc--------cccc------chhhhhhhhccccccC------ Confidence 9999999999999999999999999999975432211100 0000 0000000011111111 Q ss_pred cccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR 397 (513) Q Consensus 318 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 397 (513) .+++++..+ .+.+.+.+...++.+..+|+..+++|+.+++.+++|+||+||++++.+|++||+++++.|+++|+++ T Consensus 292 ---~~~~~~~q~-~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~ 367 (456) T protein:vir:79 292 ---PPGVDIWES-QTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI 367 (456) T ss_pred ---CCCcceeee-cccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223333222 2356677777888888888888889988888778899999999999999999999999999999999 Q ss_pred HHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHH Q lcl|NC_019916. 398 YTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKA 473 (513) Q Consensus 398 ~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~ 473 (513) +++++.+.+ ..+..+++++|+++.|.|.++.||+++|+ +|++|.+++++.++++.+ .++|++|+++|... T Consensus 368 ~~l~~~~~g------~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~ 441 (456) T protein:vir:79 368 LVKALQIEG------ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITL 441 (456) T ss_pred HHHHHHhcC------CCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHH Confidence 999877642 23445789999999999999999999997 578999999998887653 33445555544332 Q ss_pred HHHHhhhhcCCCCCCCCCCCCCC Q lcl|NC_019916. 474 MLKTYDTKGGLIINGTSGNDPED 496 (513) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~ 496 (513) . .+...+. ...+++- T Consensus 442 ~-------~~~~~~~-~~~~~~~ 456 (456) T protein:vir:79 442 F-------AGNPVQR-PQEDGSR 456 (456) T ss_pred H-------hhhHhhc-CCCCCCC Confidence 1 1111111 0001110 No 55 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=1.4e-75 Score=430.88 Aligned_cols=477 Identities=9% Similarity=-0.028 Sum_probs=338.0 Q ss_pred CccchhhceeccCCcccCCHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTR---IAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYI 77 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~i 77 (513) ||+.=..+-.|+.....|+++. |..|+.++ ..+++|++++.+||+|+|++.+..... +... .++++++||+++| T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~-~~~~~r~~~l~~YY~G~~~i~~~~~~~-p~~~-~~~~~v~n~~~~i 77 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQL-VDRTPRNLLRASFYDGKYAIRQIGNLI-PPEY-LRTATVLGWSAKA 77 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHH-HHHhHHHHHHHHHHhccccchhccccc-cHHH-HHHhhccCcHHHH Confidence 9999999999999988888875 77787776 556799999999999999876544332 2222 3556889999999 Q ss_pred HHHHHHHhhcCCeeecCCc--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEE-EEEcccceEEEec Q lcl|NC_019916. 78 ADFQTSYSVGNAIAMSGPS--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVS-VKLDPMECFIIYD 154 (513) Q Consensus 78 vd~~~~~l~g~p~~~~~~~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~-~~~~p~~~~~~~d 154 (513) |++.++++..++++...++ +..+++||+.|+|+.+..++++++++||+||++||.+++|++.+. ..++|+++|++|| T Consensus 78 Vd~~a~rl~~~Gf~~~d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD 157 (504) T protein:vir:99 78 VDTLARRCNLESFVWPDGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWN 157 (504) T ss_pred HHHHHhhhccceeeCCCCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEe Confidence 9999999999999875443 346999999999999999999999999999999999999987643 3479999999999 Q ss_pred CCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CC Q lcl|NC_019916. 155 RSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YR 229 (513) Q Consensus 155 ~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~ 229 (513) +.. +++.+++++|..+. + .....+++|+++.++++.... .+.+..+..+|++| ||||+|+|+. +| T Consensus 158 ~~~-~~~~~a~~~~~~d~-~----g~~~~~~~y~~~~~~~~~~~~---~~~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G 227 (504) T protein:vir:99 158 SRR-NAMDSLLSITSRDA-E----GHPTGIALYEDGVTVTADMDD---DGDWHADVRTHKLG-VPVEVLPYKPREDRPLG 227 (504) T ss_pred CCC-CceeEEEEEEEecC-C----CeEEEEEEEcCCcEEEEEEcC---CceeeeccccCCCC-cceEEecccccCccccC Confidence 864 67778887765422 1 235568899999999886542 23345567789997 9999999973 57 Q ss_pred Ccchh-HHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcce Q lcl|NC_019916. 230 QGDFE-NVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANM 308 (513) Q Consensus 230 ~sd~e-~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 308 (513) +|++. +|++|+|++|+++|++++..++|++|+++++|.....+....+... .......+++ T Consensus 228 ~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~------------------~~~~~~~~~i 289 (504) T protein:vir:99 228 SSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMK------------------PAWQIALARV 289 (504) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCcccccccccccc------------------chhhhhhhhh Confidence 88875 8999999999999999999999999999999976443221111100 0011112233 Q ss_pred eeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCccccccccc--cccccHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 309 ILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDNF--SGNSSGVAMKYKVLGTVELAST 385 (513) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~n~Sg~Ai~~~~~~l~~k~~~ 385 (513) +.++..... ....+.++++.+.+ .+.+++...++.+..+|+..|++|..+++.. .+|+||+||++++.+|.+||++ T Consensus 290 ~~~~~~~~~-~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~ 368 (504) T protein:vir:99 290 FALPDDEDE-PDAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEG 368 (504) T ss_pred hcCCCcccc-ccccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHH Confidence 333221111 11223445555444 3455666666666677777788888776533 4678999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcC-----CCHHHHHHhCCCCCCH Q lcl|NC_019916. 386 KRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQ-----IPQEYLYQYLPNVTDA 460 (513) Q Consensus 386 ~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~-----iS~et~~~~l~~v~D~ 460 (513) |++.|+.+|++++++++.+.+.... ...+..+++++|+++.|.|.++.||+++|+.+. .+.+++|+++++. T Consensus 369 k~~~f~~~l~~~~rla~~~~~~~~~-~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~--- 444 (504) T protein:vir:99 369 ATDDWSPAFRRSMIRALAIKNGLDR-IPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLT--- 444 (504) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCc-cccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCC--- Confidence 9999999999999999888765433 345567899999999999999999999998541 3468899999874 Q ss_pred HHHHHHHHHHHHHHH--HHhhhhcCCCCCCCCCCCCCCCCCCCCCC----CCCCccCCC Q lcl|NC_019916. 461 DEIVKMMDKQRKAML--KTYDTKGGLIINGTSGNDPEDEGVRGQQG----EPEDERTSD 513 (513) Q Consensus 461 ~~E~~ri~~E~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 513 (513) +.|++|+++|++++. ...+.+.........+.++.+++..+..+ ..+..-+.+ T Consensus 445 ~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~~ 503 (504) T protein:vir:99 445 PQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTLV 503 (504) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCcccC Confidence 356777776554322 22233322222222222222221111111 111111111 No 56 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=3.7e-71 Score=406.72 Aligned_cols=411 Identities=14% Similarity=0.085 Sum_probs=280.8 Q ss_pred cccccccCCCCCCcc-eeecchhHHHHHHHHHHhhcCCeeecC-CcHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEe Q lcl|NC_019916. 53 LSPASRRNEKGKADH-RAVHSFARYIADFQTSYSVGNAIAMSG-PSSDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYV 130 (513) Q Consensus 53 ~~~~~~~~~~~~~~~-ri~~n~~~~ivd~~~~~l~g~p~~~~~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v 130 (513) +.+.... +..+..+ ++++|||++|||+.++|+.+++++... +.++.++++|+.|+|+.++.+++++++++|+||++| T Consensus 1 ~l~~~~~-~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v 79 (434) T protein:vir:98 1 MLPKNAE-QAFLDFQRKARTNFCGLIANASVHRLLALGVTGPDGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYMLV 79 (434) T ss_pred CCCCCcc-HHHHHhhhhhhccchHHHHHHHHhhhccCceecCCCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEE Confidence 2223222 2233444 457899999999999999999988743 345679999999999999999999999999999999 Q ss_pred eecCCCc-----ee-EEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCc- Q lcl|NC_019916. 131 YRDPSQK-----GE-VSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS- 203 (513) Q Consensus 131 ~~d~~~~-----~~-~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~- 203 (513) |.++++. +. .+..++|+++|++||+..+ ++.+++++|..... + .....+++|+...++++........ T Consensus 80 ~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~-~~~~ai~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (434) T protein:vir:98 80 GAHPTRTEDNGRPSPLITMEHPSECIVEYDPETG-EPLVGLKVWHNDID-G---FGYARVFFDDTSFPYRTRERTGARLP 154 (434) T ss_pred ecCCCcccccCCceeEEEEeccceeEEEEeCCCC-ceEEEEEEEEeccC-C---ceEEEEEEeCcEEEEEEeeccccccc Confidence 9987553 22 2335799999999998764 69999998875432 1 1222333444444444333222110 Q ss_pred -----c---ccccccccccCcccceEEecCC----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccc Q lcl|NC_019916. 204 -----V---PTLEVAEHSAQFGFPMIEYRNN----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTL 271 (513) Q Consensus 204 -----~---~~~~~~~~~~~g~vPvv~~~n~----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~ 271 (513) + .......+|++|+||||+|+|+ ++|+|+|+++++|||+||+++|++++.+++|++|+++++|..... T Consensus 155 ~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~ 234 (434) T protein:vir:98 155 WGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAK 234 (434) T ss_pred cccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccc Confidence 0 1123456799999999999999 679999999999999999999999999999999999999976543 Q ss_pred cccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHH Q lcl|NC_019916. 272 FDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIH 350 (513) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~ 350 (513) ..+....... ...+.....++++. .+++++++.+.+ .+.+++...++.+...|+ T Consensus 235 ~~~~~~~~~~---------------~~~~~~~~~~~i~~----------~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~ 289 (434) T protein:vir:98 235 RTDPATGMTV---------------VDQPFVPSPSAVWA----------SEGENTQFGQLDATDLSGFLKEHASDVRDML 289 (434) T ss_pred ccccccccch---------------hhhhhhcccccccc----------CCCCCceEEEecCcchHHHHHHHHHHHHHHh Confidence 3222111000 00111111222222 234556665543 233444444444444555 Q ss_pred HHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcC Q lcl|NC_019916. 351 KFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTD 430 (513) Q Consensus 351 ~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d 430 (513) ..+++|+.+++...+|+||+||++++.+|+.||+++++.|+++|++++++++.+. + ...+..+++++|+++.|+| T Consensus 290 ~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~----g-~~~~~~~~~v~w~~~~~~s 364 (434) T protein:vir:98 290 TISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQA----G-VPEDYTEAEVRWANPAHVT 364 (434) T ss_pred cccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----C-CChhheeeeEEecCCCCCC Confidence 5555565555544468999999999999999999999999999999999987653 2 2346678999999999999 Q ss_pred HHHHHHHHHHHhcC-CCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 431 DVAIITALVQAGAQ-IPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQG 504 (513) Q Consensus 431 ~~e~a~~~~kl~g~-iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (513) .+++||+++||.++ +|.+++++++|+.+ +|++|+++|+++.............+...+.++.++++. +| T Consensus 365 ~~~~ada~~kl~~~g~~~e~~~~~lg~~~---~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~--dg 434 (434) T protein:vir:98 365 MAVKADAATKLKSIGYPLDVIAEELDESP---ARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGAV--DG 434 (434) T ss_pred HHHHHHHHHHHHhcCCcHHHHHHhCCCCH---HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCCC--CC Confidence 99999999999775 89999999999853 688888887665444333332222222222222111111 11 No 57 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=4.3e-68 Score=389.89 Aligned_cols=406 Identities=8% Similarity=-0.032 Sum_probs=305.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcH Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSS 97 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~ 97 (513) |+...|..|+.+ +..+++|++++.+||+|+|++.+..... ++..+..+++++|||+++|++.++++..++++. ++ T Consensus 1 m~~~~i~~L~~~-~~~~~~r~~~~~~yy~g~~~~~~~~~~~-p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~---~d 75 (422) T protein:vir:97 1 MNYMGMGYLRRK-LALFKTGVDKRYRYYAMDDRDDTRSIVM-PNNVREMYRSVLEWTAKGVDSLADRIIFREFTN---DD 75 (422) T ss_pred CChHHHHHHHHH-HHHHHHHHHHHHHHHhcCCChhhcCccc-cHHHHHHHHhhcchhHHHHHHHHhccccceeeC---Cc Confidence 777777777665 4667889999999999999986655433 334456677888999999999999998888873 23 Q ss_pred HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC-CCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccc Q lcl|NC_019916. 98 DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP-SQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDN 176 (513) Q Consensus 98 ~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~-~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~ 176 (513) ..++++|+.|+|+....++|++|++||+||++||.++ +|.+.+. .++|++++++||+.. +++.+++++|.... + T Consensus 76 ~~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~-~~sp~~~~~i~D~~~-~~~~~a~~~~~~~~---~ 150 (422) T protein:vir:97 76 FNAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQ-VIEASKATGILDPTT-FLLTEGYAILESDS---N 150 (422) T ss_pred hhHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEE-EechhhEEEEEeCCC-CcceeeEEEEEecC---C Confidence 4589999999999999999999999999999999986 4555544 479999999998875 56667777665332 1 Q ss_pred cceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCcch-hHHHHHHHHHHHHHHHH Q lcl|NC_019916. 177 ITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDF-ENVLSLIDLYDVAQSDT 250 (513) Q Consensus 177 ~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~-e~v~~liD~~~~~~S~~ 250 (513) . ....+.+|++..+++++.. +.+ ...+|++|.||||+|+|+. +|+|++ +.|++|+|++|++++++ T Consensus 151 ~--~~~~~~~~~~~~~~~~~~~---~~~----~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~ 221 (422) T protein:vir:97 151 G--NPTLEAYFTDKDIWYYPKK---GKP----YNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERA 221 (422) T ss_pred C--cEEEEEEEcCceEEEEcCC---Ccc----ccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHH Confidence 1 1234556677666665432 221 2347999999999999874 588988 78999999999999999 Q ss_pred HHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe Q lcl|NC_019916. 251 ANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH 330 (513) Q Consensus 251 ~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 330 (513) .+..++|++|+++++|....... ...+ ....++++.++.. .++..+++-+ T Consensus 222 ~~~~e~~a~pqr~i~G~d~d~~~-----------------------~~~~-~~~~~~i~~~~~d------e~~~~~~v~q 271 (422) T protein:vir:97 222 EVTAEFYSFPQKYVLGMDPDAKP-----------------------MEKW-RATVSTLLEISKD------EDGDKPTVGQ 271 (422) T ss_pred HHHHHHhcchhhhhcccCccccc-----------------------Cchh-hhhhhhhhccCCC------CCCCcceeee Confidence 99999999999999997421100 0000 0111233333221 1223344433 Q ss_pred ec-CCHHHHHHHHHHHHHHHHHHhCcccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019916. 331 KE-YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN-SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV 408 (513) Q Consensus 331 ~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~ 408 (513) .+ .+.+++.+.++.+..+|+..|++|..+++..+.| +||+||++++.+|++||+++++.|+.+|++++++++++.+.. T Consensus 272 ~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~ 351 (422) T protein:vir:97 272 FTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEF 351 (422) T ss_pred cCCCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 33 3455566666666666677778887777755555 699999999999999999999999999999999999887654 Q ss_pred ccccccccceeeEEeCCCCCcC---HHHHHHHHHHHh----cCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019916. 409 NGKWDIDPDEIGFIFRDNLPTD---DVAIITALVQAG----AQIPQEYLYQYLPNVTDADEIVKMMDKQRKAM 474 (513) Q Consensus 409 ~~~~~~~~~~i~i~f~~~~p~d---~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~ 474 (513) ... ...+.+++++|.++.|.+ .++.||+++|+. |+++.+++++++|+ ++++.|+.++++++.+- T Consensus 352 ~~~-~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~-~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 352 PYL-RNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGV-KGADKPIPAITEVTTDG 422 (422) T ss_pred ccc-chhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCC-CchhHHHHHHHhhhccC Confidence 332 445678999999888887 788899999974 57899999999987 77889999988875543 No 58 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=8.6e-68 Score=388.26 Aligned_cols=395 Identities=10% Similarity=-0.029 Sum_probs=301.3 Q ss_pred HHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHHHHHHHHHhcCHH Q lcl|NC_019916. 31 YNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDRLDDFNRRNDID 110 (513) Q Consensus 31 ~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l~~~~~~n~~~ 110 (513) ++-+.+|++++.+||+|+|++.+..... +...+.++|+++||++++||+.++++..++++. ++..+++||+.|+|+ T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~-p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~---~d~~l~~i~~~N~ld 76 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITI-PAHIRAKYQAVLGWAAKGVDSLADRLIFRAFAN---DDFNVTEIFDRNNPD 76 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhc-cHHHHhHHHhhcchhHHHHHHhHhhhccccccC---CCchHHHHHhhcChH Confidence 4445788999999999999886655433 334556678899999999999999999999873 334699999999999 Q ss_pred HHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCC Q lcl|NC_019916. 111 TLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTEN 190 (513) Q Consensus 111 ~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~ 190 (513) ....+++++|++||+||++||.+++|.+.+. .++|.+++++||+ ..+++.++++++.... ......+.+|+++ T Consensus 77 ~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~-~~sP~~~~~i~Dp-~~~~~~~al~~~~~~~-----~~~~~~~~~~~~~ 149 (410) T protein:vir:95 77 IFFDSAILSALIGSCSFVYISKGEDDEVRLQ-VIESSNATGVIDP-ITGLLVEGYAVLARDD-----YNRPTLEAYFEPN 149 (410) T ss_pred HHHHHHHHHHHHhCceeEEEecCCCCceEEE-EEcccceEEEEeC-CCCceEEEEEEEEecC-----CCeEEEEEEEeCC Confidence 9999999999999999999999999988776 4899999999998 4578999998875432 1235567899999 Q ss_pred cEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCcch-hHHHHHHHHHHHHHHHHHHHHHHhhhhhhhe Q lcl|NC_019916. 191 DYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDF-ENVLSLIDLYDVAQSDTANYMTDLNEAMLVI 264 (513) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~-e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~ 264 (513) .++++..... . ...+|++|.||||+|+|+. +|+|++ +.|++|+|++|++++++.+..++|++|++++ T Consensus 150 ~~~~~~~~~~---~----~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i 222 (410) T protein:vir:95 150 ATHFIPKDGE---P----YSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYI 222 (410) T ss_pred cEEEEeeCCc---c----ccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhee Confidence 9998865321 1 1347999999999999864 488887 6799999999999999999999999999999 Q ss_pred ecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHH Q lcl|NC_019916. 265 KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKK 343 (513) Q Consensus 265 ~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~ 343 (513) +|....... ...++ ...++++.++.. .++..+++.+.+ .+.+++...++ T Consensus 223 ~G~d~d~~~-----------------------~~~~~-~~~~~i~~~~~~------~~~~~~~v~q~~~~~l~~~~~~l~ 272 (410) T protein:vir:95 223 LGLDPDAEP-----------------------MEKWK-ATVSSLLTISSS------DKGVKPSVGQFTTASMSPFTEQLR 272 (410) T ss_pred eccCCCCCc-----------------------Cchhh-hhhhhheeccCC------CCCCcceEEecCCCChHHHHHHHH Confidence 997421100 00011 112334444321 122344554444 46667777777 Q ss_pred HHHHHHHHHhCcccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEE Q lcl|NC_019916. 344 RLAADIHKFSHTPDLTDDNFSGN-SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFI 422 (513) Q Consensus 344 ~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~ 422 (513) .+..+|+..|++|..+++..+.| +||+||++++.+|..||+++++.|+.+|++++++++.+.+.... ...+..++++. T Consensus 273 ~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~-~~~~~~~~~v~ 351 (410) T protein:vir:95 273 TAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRY-TRSQFVRTAVK 351 (410) T ss_pred HHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-cccccceeeEE Confidence 77788888888998887755555 79999999999999999999999999999999999888654433 24556689999 Q ss_pred eC---CCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 423 FR---DNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLK 476 (513) Q Consensus 423 f~---~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~ 476 (513) |. ++..++.++.||+++|+. |+++.+++++++||+++ ++..++.+|+++.-+ T Consensus 352 W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~--~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 352 WEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGD--MSAKPVVSEGGSNGE 410 (410) T ss_pred eeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChH--HHHHHHHHHHHhCCC Confidence 98 455568999999999973 67899999999999653 333333333332111 No 59 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=1.9e-66 Score=380.87 Aligned_cols=455 Identities=11% Similarity=-0.013 Sum_probs=317.9 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |++-+-++..=..+ + ...+|..|+.++ ..++++++.+++||+|+|++.+..... +...+ ..++++||++++|++ T Consensus 1 ~~~~~~~~~~gl~~-~--~~~~~~~L~~~~-~~~~~~~~~~~~Yy~G~~~~~~~~~~~-p~~~r-~~~~v~nw~~~~Vd~ 74 (474) T protein:vir:81 1 MIQQQTVRIPSLSN-D--ENALINGLLAQI-ENLRWKNLLRTSYYENKRTIQYVGTLI-PPQYF-NLGLVLGWTGKAVDA 74 (474) T ss_pred CcCCCcCcCCCCCh-h--HHHHHHHHHHHH-HHHhhHHHHHHHHhccCCChhhccccc-cHHHH-HHHhhcChHHHHHHH Confidence 77776665542222 1 123466666654 667889999999999999976654433 22223 456889999999999 Q ss_pred HHHHhhcCCeeecCCc--HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEE-EEEcccceEEEecCCC Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPS--SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVS-VKLDPMECFIIYDRSV 157 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~--~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~-~~~~p~~~~~~~d~~~ 157 (513) .++++..++++...++ +..++++|+.|+|+....+++++|++||+||++|+.+++|++.+. ..++|++++++||+.. T Consensus 75 ~a~rl~~~Gf~~~d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~ 154 (474) T protein:vir:81 75 LARRCNLEGFVWPDGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRR 154 (474) T ss_pred HHhhhcccceECCCCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCC Confidence 9999999999975533 356999999999999999999999999999999999988876543 3479999999999865 Q ss_pred CcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCcc Q lcl|NC_019916. 158 NPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGD 232 (513) Q Consensus 158 ~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd 232 (513) +++.++++.|.... + .......+|+++.++++....++. .+..+..+|++| ||||+|+|+. +|+|+ T Consensus 155 -~~~~~al~~~~~~~-~----g~~~~~~ly~~~~~~~~~~~~~~~--~w~~~~~~~~~g-vPvV~~~n~~~~~~~~G~s~ 225 (474) T protein:vir:81 155 -RGLNNLLSIIDKDK-E----GKVLSLALYLDNETVTAQRDKATL--KWQVDRDEHVYG-VPAQVLPYKPAPKRPFGQSR 225 (474) T ss_pred -CcceeeeEEEEEcC-C----CcEEEEEEEeCCcEEEEEEcCccc--eeeeccCCCCCC-cceEEecccccccCcCCccc Confidence 56667776654321 1 123456789999998887654433 334566789997 8999999974 58888 Q ss_pred h-hHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeec Q lcl|NC_019916. 233 F-ENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILL 311 (513) Q Consensus 233 ~-e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (513) + ++|++|+|++|++++++....++|++|+++++|.....+.+..+.....+. ..-++++.+ T Consensus 226 i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~------------------~~~~~i~~~ 287 (474) T protein:vir:81 226 ITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWE------------------ARLGRIKGL 287 (474) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhh------------------hhHHHHhcC Confidence 7 699999999999999999999999999999999865443222111110000 001112222 Q ss_pred cccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCcccccccc--ccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDN--FSGNSSGVAMKYKVLGTVELASTKRK 388 (513) Q Consensus 312 ~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~~~n~Sg~Ai~~~~~~l~~k~~~~~~ 388 (513) +.... ......+.+++.+.+ .+.+++...++.+..+|+..|++|..+++. +.+++||+||++++.+|..||+++++ T Consensus 288 ~~d~d-~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~ 366 (474) T protein:vir:81 288 PDDAD-ADIPQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVD 366 (474) T ss_pred CCccc-ccccccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 11100 001112234444433 455666667777777778888999888763 34568999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcccc-cccccceeeEEeCCCCCcCHHHHHHHHHHHh----cCCCHHHHHHhCCCCCCHHHH Q lcl|NC_019916. 389 QFERGLNQRYTVVAHIEERVNGK-WDIDPDEIGFIFRDNLPTDDVAIITALVQAG----AQIPQEYLYQYLPNVTDADEI 463 (513) Q Consensus 389 ~f~~~l~~~~~li~~~l~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~~E 463 (513) .|+.+|++++++++.+.+..... .......+++.|.++..++.++.||+++|+. |+.+.+++++++++ + +++ T Consensus 367 ~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~-t--~~~ 443 (474) T protein:vir:81 367 DFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGL-T--PQQ 443 (474) T ss_pred HHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCC-C--HHH Confidence 99999999999998886544322 2344568999999999999999999999984 34566777777765 4 356 Q ss_pred HHHHHHHHHHH--HHHhhhhcCCCCCCCCCC Q lcl|NC_019916. 464 VKMMDKQRKAM--LKTYDTKGGLIINGTSGN 492 (513) Q Consensus 464 ~~ri~~E~~~~--~~~~~~~~~~~~~~~~~~ 492 (513) +++++.++.++ ....+.+......+.... T Consensus 444 i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 444 ARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred HHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 66666554332 222232222211111111 No 60 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=3.1e-67 Score=385.19 Aligned_cols=394 Identities=9% Similarity=-0.015 Sum_probs=302.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcH Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSS 97 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~ 97 (513) |+.+.|.+|+.+. ..+.+|++++.+||+|+|++.+..... +...+.++|+++||+++||++.++++..++++. ++ T Consensus 1 ~~~~~i~~L~~~~-~~~~~r~~~~~~yY~g~~~~~~~~~~~-p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~---~d 75 (409) T protein:vir:94 1 MTEKGIGYLRFKL-SVHKRRAEMRYDQYAMKYVDRFKGITI-PQALSQQYRSILGWCAKGVDSLADRLVFREFEN---DD 75 (409) T ss_pred CCHHHHHHHHHHH-HHHhHHHHHHHHHhcccCchhhcChhh-hHHHHHHHhhhcchhHHHHHHhHhhcccCcccC---Cc Confidence 8888888887774 667899999999999999876554432 333455678899999999999999998888762 34 Q ss_pred HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccccc Q lcl|NC_019916. 98 DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNI 177 (513) Q Consensus 98 ~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~ 177 (513) ..+++||+.|+|+....+++++|++||+||+.||.+++|.+++.+ ++|.+++++||+. .+++.++++++..... T Consensus 76 ~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~-~sp~~~~~i~D~~-~~~~~~a~~~~~~d~~---- 149 (409) T protein:vir:94 76 FTVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQV-IEAVNATGIIDPI-TGLLTEGYAVLERDEN---- 149 (409) T ss_pred hHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEE-eccceEEEEEecC-CCceeeeEEEEEecCC---- Confidence 579999999999999999999999999999999999999887764 8999999999985 4679999998753321 Q ss_pred ceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCcch-hHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 178 TQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDF-ENVLSLIDLYDVAQSDTA 251 (513) Q Consensus 178 ~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~-e~v~~liD~~~~~~S~~~ 251 (513) .......+|+++.++++....+. + ...+|++|.||||+|+|+. +|+|++ +.|++|+|++|++++++. T Consensus 150 -~~~~~~~~~~~~~~~~~~~~~~~--~----~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~ 222 (409) T protein:vir:94 150 -NNVVLEAHFLPDRTDYYYRDSRN--N----ISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERAD 222 (409) T ss_pred -CceEEEEEEecCcEEEEEecCce--e----EeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHH Confidence 12345568999999887654322 1 2357999999999999974 578888 679999999999999999 Q ss_pred HHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEee Q lcl|NC_019916. 252 NYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHK 331 (513) Q Consensus 252 ~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 331 (513) +..++|++|+++++|...... . ...++ ...++++.++. ..++.++++.+. T Consensus 223 ~~~e~~a~pqr~i~G~d~d~~---------------~--------~~~~~-~~~~~i~~~~~------d~dg~~~~v~q~ 272 (409) T protein:vir:94 223 VTAEFYSFPQKYVTGLSDDAE---------------P--------METWK-ATVSSMLQFTK------DEDGDKPTLGQF 272 (409) T ss_pred HHHHHhcChhheeEecCCCCc---------------c--------cchhh-hhHHHhhcCCC------CCCCCCceEEec Confidence 999999999999999742110 0 00011 11133333322 122334455444 Q ss_pred c-CCHHHHHHHHHHHHHHHHHHhCcccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019916. 332 E-YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN-SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVN 409 (513) Q Consensus 332 ~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~ 409 (513) + .+.+++...++.+..+++..|++|..+++..+.| +||+||++++.+|..||++|++.|+.+|++++++++.+.+... T Consensus 273 ~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~ 352 (409) T protein:vir:94 273 TQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAP 352 (409) T ss_pred CCCChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 3 3556666677777777777778888777755445 7999999999999999999999999999999999988865543 Q ss_pred cccccccceeeEEeCCCCCcC---HHHHHHHHHHHh--c--CCCHHHHHHhCCCCCCHH Q lcl|NC_019916. 410 GKWDIDPDEIGFIFRDNLPTD---DVAIITALVQAG--A--QIPQEYLYQYLPNVTDAD 461 (513) Q Consensus 410 ~~~~~~~~~i~i~f~~~~p~d---~~e~a~~~~kl~--g--~iS~et~~~~l~~v~D~~ 461 (513) . ...++.+++++|.+..|.+ .++.||+++|+. | +.+.++++.++||.. ++ T Consensus 353 ~-~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~-~d 409 (409) T protein:vir:94 353 Y-LREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEG-GE 409 (409) T ss_pred c-cccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCC-CC Confidence 2 3456678999999776665 688899999984 3 467799999999864 23 No 61 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=1.3e-65 Score=376.35 Aligned_cols=394 Identities=9% Similarity=-0.008 Sum_probs=300.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcH Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSS 97 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~ 97 (513) |+.+.|.+|+.+ +..+.+|++++.+||+|+|++.+..... +...+.+.|+++||+++||++.++++..++++. ++ T Consensus 1 ~~~~~i~~L~~~-~~~~~~r~~~~~~yY~g~~~~~~~~~~~-p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~---~d 75 (409) T protein:vir:16 1 MTEKGIGYLRFK-LSVHKRRAEMRYEQYAMKHVDRFKGITI-PQALSQQYRSILGWCAKGVDSLADRLVFREFEN---DD 75 (409) T ss_pred CCHHHHHHHHHH-HHHHhHHHHHHHHHHhccCchhhcchhh-hHHHHHHHhhhcChhHHHHHHhHhhcccccccC---cc Confidence 888888888766 4677899999999999999875544332 334455678889999999999999998888762 34 Q ss_pred HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccccc Q lcl|NC_019916. 98 DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNI 177 (513) Q Consensus 98 ~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~ 177 (513) ..+++||+.|+|+....+++++|++||+||++||.+++|.+.+.+ ++|.+++++||+. .+++.+++++|.... T Consensus 76 ~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~-~sP~~~~~i~D~~-~~~~~~a~~~~~~d~----- 148 (409) T protein:vir:16 76 FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQV-IEATNATGIIDPI-TGLLTEGYAVLERDE----- 148 (409) T ss_pred hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEE-EcccceEEEeecc-cccceeeeEEEEecC----- Confidence 579999999999999999999999999999999999999887764 8999999999875 568888888875322 Q ss_pred ceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCcch-hHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 178 TQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDF-ENVLSLIDLYDVAQSDTA 251 (513) Q Consensus 178 ~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~-e~v~~liD~~~~~~S~~~ 251 (513) ........+|+++.++++....+. + ...+|++|.||||+|+|+. .|+|++ +.|++|+|++|++++++. T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~--~----~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~ 222 (409) T protein:vir:16 149 NNNVVLEAHFLPDRTDYYYRDSRN--N----ISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERAD 222 (409) T ss_pred CCceEEEEEEecCcEEEEEecCcc--c----cceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHH Confidence 112344568888888877653322 1 2357999999999999974 588988 679999999999999999 Q ss_pred HHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEee Q lcl|NC_019916. 252 NYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHK 331 (513) Q Consensus 252 ~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 331 (513) +..++|++|+++++|....... ...++ ...++++.++.. .++.++++-+. T Consensus 223 ~~~e~~a~pqr~i~G~d~d~~~-----------------------~~~~~-~~~~~i~~~~~d------~~g~~~~v~q~ 272 (409) T protein:vir:16 223 VTAEFYSFPQKYVTGLSDDAEP-----------------------METWK-ATVSSMLQFTKD------EDGDKPTLGQF 272 (409) T ss_pred HHHHHhcChhheeEecCCCCCc-----------------------cchhh-hhhhHhhccCCC------CCCCCceEEec Confidence 9999999999999997421100 00011 112333433221 22233444333 Q ss_pred c-CCHHHHHHHHHHHHHHHHHHhCcccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019916. 332 E-YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN-SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVN 409 (513) Q Consensus 332 ~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~ 409 (513) + .+.+++.+.++.+..+++..|++|..+++..+.| +||+||++++.+|..||+++++.|+.+|++++++++.+.+... T Consensus 273 ~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~ 352 (409) T protein:vir:16 273 TQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVP 352 (409) T ss_pred CCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 3 4556667777777777777788888777755556 6999999999999999999999999999999999998865543 Q ss_pred cccccccceeeEEeCCCCCcC---HHHHHHHHHHHhc----CCCHHHHHHhCCCCCCHH Q lcl|NC_019916. 410 GKWDIDPDEIGFIFRDNLPTD---DVAIITALVQAGA----QIPQEYLYQYLPNVTDAD 461 (513) Q Consensus 410 ~~~~~~~~~i~i~f~~~~p~d---~~e~a~~~~kl~g----~iS~et~~~~l~~v~D~~ 461 (513) . ......+++++|.++.+.+ .++.||+++|+.+ +...+++++++++..+ + T Consensus 353 ~-~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~-d 409 (409) T protein:vir:16 353 Y-LREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGA-E 409 (409) T ss_pred c-cchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCC-C Confidence 2 2334568899999777554 8999999999853 3457899999988642 3 No 62 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=7.1e-58 Score=333.89 Aligned_cols=456 Identities=13% Similarity=0.084 Sum_probs=301.5 Q ss_pred Cccchhhc-eeccCCcccCCHHHHHHHHHHH----HHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 1 MIDMQQAN-MNYQEDADKLTPTRIAAFIRHH----YNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~i~~~i~~~----~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |+..=++. ..+... ....+-|..++.+. ...+..+++++++||+|+|+++.++........+..+++++||++ T Consensus 1 m~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k 78 (496) T protein:vir:38 1 MINQIIAGVKGVMRR--MGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPK 78 (496) T ss_pred ChhHHHHHHHHHHHH--hccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHH Confidence 22211111 000000 01112233333211 123456789999999999998877665555566677889999999 Q ss_pred HHHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) +||++.++||+|+|++++.+++ +.|+++++.|+|...+.+++..++++|.+|+++|+|++|.+++.+ ++|.+++|+ T Consensus 79 ~i~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~-v~~~~~~P~ 157 (496) T protein:vir:38 79 VTAKYMSKLLFNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSF-ATADCMYPL 157 (496) T ss_pred HHHHHHhhhhhCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEE-EcccceEEE Confidence 9999999999999999987765 357888999999999999999999999999999999999988775 799999999 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCC--c-EEE---EEeeccC---Cccc-------cccccccccCc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTEN--D-YTR---YKPIVVA---GSVP-------TLEVAEHSAQF 216 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~--~-~~~---~~~~~~~---~~~~-------~~~~~~~~~~g 216 (513) |++..+-..+++++.|... ....+.++.|+.. . .+. |+..... .... ......-+++. T Consensus 158 ~~~~~~~~~~~f~~~~~~~------~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~ 231 (496) T protein:vir:38 158 SNDSENVDECVIANSFHKN------NKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFT 231 (496) T ss_pred EecCCcEEEEEEEEEEEeC------CeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCC Confidence 9886543445555444321 1233445555421 1 111 2221111 0000 01111224567 Q ss_pred ccceEEecCC---------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhh Q lcl|NC_019916. 217 GFPMIEYRNN---------EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDAD 287 (513) Q Consensus 217 ~vPvv~~~n~---------~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~ 287 (513) +.||++|+++ ..|.|+|+++++|||+||.++|++++.++....++.+=..+-.... T Consensus 232 ~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~--------------- 296 (496) T protein:vir:38 232 RPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAV--------------- 296 (496) T ss_pred cceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccC--------------- Confidence 7888888764 2488999999999999999999999999876555544111100000 Q ss_pred hhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccc-ccccc Q lcl|NC_019916. 288 AMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDD-NFSGN 366 (513) Q Consensus 288 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n 366 (513) +..+..... .+...+.+....+ ...+....++.++.++..+++...++.+.+.|...+++|+..++ ..+|+ T Consensus 297 ---~~~g~~~~~--~~~~~~~~~~~~~---~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~ 368 (496) T protein:vir:38 297 ---NLDGSTTQY--FDSTDEAFFLYQG---DQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGL 368 (496) T ss_pred ---CCCCccccC--CCCccceEEEeec---CCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCcccc Confidence 000000000 0111111111000 01112234666777888899999999999999999888865543 23466 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--h Q lcl|NC_019916. 367 SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--G 442 (513) Q Consensus 367 ~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~ 442 (513) .||.+++++++.|..++..+++.|+.+|++++++++.+.... ..+...+...++++|++++|.|..+.+++++++ + T Consensus 369 ~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~ 448 (496) T protein:vir:38 369 KTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQ 448 (496) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhc Confidence 799999999999999999999999999999999998765422 233345556799999999999999999999986 6 Q ss_pred cCCCHHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCC Q lcl|NC_019916. 443 AQIPQEYLYQYLPNVTDA--DEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDE 497 (513) Q Consensus 443 g~iS~et~~~~l~~v~D~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (513) |++|.+|++..+|+++|. ++|++|+++|+++.++. +.+++...+ ++ T Consensus 449 GiiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~~~~-~d~~~~~~~--------~e 496 (496) T protein:vir:38 449 GMIPLKIALQRAWNITEAEADEWAEMLAKEKQAEMPN-NDMNGIFGE--------EE 496 (496) T ss_pred CCCCHHHHHHhcCCCChHHHHHHHHHHHHhhhccCcc-ccccCCCCC--------CC Confidence 999999999999999874 45888888877644221 111111110 00 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=4.2e-54 Score=313.19 Aligned_cols=444 Identities=14% Similarity=0.112 Sum_probs=299.2 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhcCCCccccccccccCCCC Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHH-----------------YNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG 63 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~-----------------~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~ 63 (513) |++- -..+|+.++++. .+....++.++++||+|+|+.+..+........ T Consensus 1 m~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~ 66 (499) T protein:vir:80 1 MINQ--------------IIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNP 66 (499) T ss_pred ChhH--------------HHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCc Confidence 2220 112223333221 123346788999999999988766555555556 Q ss_pred CCcceeecchhHHHHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeE Q lcl|NC_019916. 64 KADHRAVHSFARYIADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEV 140 (513) Q Consensus 64 ~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~ 140 (513) +.++++++|+++.||++.++|++|+|++++.+++ +.++++++.|+|.....+++..|+++|.+|+++|+|++|++++ T Consensus 67 ~~~~~~s~n~~~~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i 146 (499) T protein:vir:80 67 VNRRQLSMNLPKVTAKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKV 146 (499) T ss_pred cccceeecchHHHHHHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEE Confidence 6788999999999999999999999999988774 4578889999999999999999999999999999999998888 Q ss_pred EEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEc--CCc--EEE-----EEeecc---CCccc--- Q lcl|NC_019916. 141 SVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT--END--YTR-----YKPIVV---AGSVP--- 205 (513) Q Consensus 141 ~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt--~~~--~~~-----~~~~~~---~~~~~--- 205 (513) .+ ++|.+++|+|.++.+-..+++++.+...+ ...+++|.|+ ... .++ |+.... +.... T Consensus 147 ~~-v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~------~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~ 219 (499) T protein:vir:80 147 SF-ATADCMYPLSNDSENVDECLIANSFHKNN------KYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKL 219 (499) T ss_pred EE-EcCCceEEEEecCCCeEEEEEEEEEeecC------eEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhh Confidence 65 79999999987765444455555443321 2223334322 221 111 111111 11110 Q ss_pred ---cc-cccccccCcccceEEecCC---------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccc Q lcl|NC_019916. 206 ---TL-EVAEHSAQFGFPMIEYRNN---------EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLF 272 (513) Q Consensus 206 ---~~-~~~~~~~~g~vPvv~~~n~---------~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~ 272 (513) .. ....-.++++.||++|+++ ..|.|+|+++++|||+||.++|++++.++....++.+-..+-.... T Consensus 220 ~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~ 299 (499) T protein:vir:80 220 LFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAV 299 (499) T ss_pred hccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccC Confidence 00 1111234778889999765 2388999999999999999999999999987777766222111000 Q ss_pred ccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 273 DDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKF 352 (513) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 352 (513) . ..+.....+ +...+.+....+. ....+..++.++.++..+++...++.+.+.|... T Consensus 300 ~------------------~~g~~~~~~--~~~~~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~ 356 (499) T protein:vir:80 300 N------------------LDGSTTQYF--DSTDEAFFLYQGE---QDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQ 356 (499) T ss_pred C------------------CCCCcccCC--CcccceeeEeecc---CCCCcCceeEecCcCChHHHHHHHHHHHHHHHHh Confidence 0 000000000 1111111111100 0111234677778889999999999999999999 Q ss_pred hCccccccc-cccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--cccccccceeeEEeCCCCCc Q lcl|NC_019916. 353 SHTPDLTDD-NFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVN--GKWDIDPDEIGFIFRDNLPT 429 (513) Q Consensus 353 s~~p~~~~~-~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~--~~~~~~~~~i~i~f~~~~p~ 429 (513) +++++..++ ..+|+.||.+++++++.|..++..+++.|+.+|++++++|+.+..... .+...+...++|+|++.+|. T Consensus 357 ~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~ 436 (499) T protein:vir:80 357 VGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQ 436 (499) T ss_pred cCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCC Confidence 988765543 234667999999999999999999999999999999999987654432 22234556899999999999 Q ss_pred CHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCC Q lcl|NC_019916. 430 DDVAIITALVQA--GAQIPQEYLYQYLPNVTDAD--EIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDE 497 (513) Q Consensus 430 d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (513) |..+.+++++++ +|++|.+|++..+++++|.+ +|++|+++|++...+.- +..+..++++ T Consensus 437 d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~ea~~el~~i~~E~~~~~~~~---------d~~g~~ge~e 499 (499) T protein:vir:80 437 DEDTTINRYTTAKNQGMIPLKIALQRAWNITEAEADEWAEMLAKEKQAEIPNN---------DMTGIFGEEE 499 (499) T ss_pred CHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChHHHHHHHHHHHHHhhcCCCCC---------CccccCCCCC Confidence 999999999886 69999999999999998844 56778887765432110 0011111111 No 64 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=4.1e-47 Score=274.88 Aligned_cols=456 Identities=13% Similarity=0.066 Sum_probs=289.7 Q ss_pred ccchhhceeccCC-cccC-CHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 2 IDMQQANMNYQED-ADKL-TPTRIAAFIRH----HYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 2 ~~~~~~~~~~~~~-~~~~-~~~~i~~~i~~----~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |.|-+.......- ...+ ..+-|.++..+ .-.....+++.+++||.|+|+.+..... ....+..+++++|+++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~--~~~~~~~~~~slnl~~ 78 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNS--YGDTQKHELQSVNVTK 78 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcccccccc--CCCccccceeecchHH Confidence 1111111100000 0000 00001111000 0122345678889999999987654332 2334455678889999 Q ss_pred HHHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) .|++..+++++++|++++.+++ +.|+++++.|+|.....+++..++..|.++..+|+|. +++++.+ ++|...+|+ T Consensus 79 ~i~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~~~i~~-v~ad~~~P~ 156 (505) T protein:vir:79 79 LASAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDS-GKIKLAW-ATADQVYPL 156 (505) T ss_pred HHHHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeC-CceEEEE-EcCCeeEEE Confidence 9999999999999999987664 4688899999999999999999999999999999984 5666654 789888998 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCC---cEEE---EEeecc---CCc--------ccccc-cccccc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTEN---DYTR---YKPIVV---AGS--------VPTLE-VAEHSA 214 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~---~~~~---~~~~~~---~~~--------~~~~~-~~~~~~ 214 (513) +.++.+...++++..|...... .....+.+|.|+.+ ..+. |+.... +.. +.... .....+ T Consensus 157 ~~d~~~~~~~a~~~~~~~~~~~--~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g 234 (505) T protein:vir:79 157 QADTNQVNELAIASRTTEVENH--RTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITG 234 (505) T ss_pred EEcCCCeEEEEEEEEEEEecCC--cceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecC Confidence 6555555455555544433222 11223356666521 2212 121111 100 11111 111123 Q ss_pred CcccceEEecC----CC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchh Q lcl|NC_019916. 215 QFGFPMIEYRN----NE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSD 285 (513) Q Consensus 215 ~g~vPvv~~~n----~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~ 285 (513) +.+.+|++|++ +. .|.|+|++++++||++|.++|++++.++....++.+=..+-.... .+... . T Consensus 235 ~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~---~~~~~----~ 307 (505) T protein:vir:79 235 LKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGS---SYGGQ----A 307 (505) T ss_pred CCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccC---CCCcc----c Confidence 44555677754 32 489999999999999999999999999976666555111100000 00000 0 Q ss_pred hhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccc-cc Q lcl|NC_019916. 286 ADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDN-FS 364 (513) Q Consensus 286 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~ 364 (513) ......+ .+.+..++..- .....++.++.++.++..+++...++.+.+.|...++.+...++. .. T Consensus 308 ~~~~~~~---------fd~~~~~y~~~-----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~ 373 (505) T protein:vir:79 308 SETHPPM---------FDPDETVYQAM-----YGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPS 373 (505) T ss_pred ccccccC---------CCccceeeeec-----cCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCcc Confidence 0000000 01111111110 111234567788888888999999999999999999887654432 23 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--------ccccccceeeEEeCCCCCcCHHHHHH Q lcl|NC_019916. 365 GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNG--------KWDIDPDEIGFIFRDNLPTDDVAIIT 436 (513) Q Consensus 365 ~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~--------~~~~~~~~i~i~f~~~~p~d~~e~a~ 436 (513) +..||+++++.++.|..+++.+++.|+.+|+++++.|+.+...... ....+..+++|.|++.+|.|..+.++ T Consensus 374 ~~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~ 453 (505) T protein:vir:79 374 GIQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRA 453 (505) T ss_pred ccchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHH Confidence 5679999999999999999999999999999999999987554322 12333457899999999999999998 Q ss_pred HHHHH--hcCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCC Q lcl|NC_019916. 437 ALVQA--GAQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGL 484 (513) Q Consensus 437 ~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~ 484 (513) ..+++ +|++|.|+++..+|+++| +++|++||++|+...++.....++. T Consensus 454 ~~~~~v~~Gi~s~e~~l~~~~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 454 ADLQAVQAQVMPKKQFLMRNYGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred HHHHHHHcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 88876 689999999999999987 7789999998876533332222222 No 65 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=6.6e-45 Score=262.79 Aligned_cols=451 Identities=12% Similarity=0.035 Sum_probs=285.2 Q ss_pred ccc---hhhce---eccCCcccCCHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeec Q lcl|NC_019916. 2 IDM---QQANM---NYQEDADKLTPTRIAAFIRH----HYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVH 71 (513) Q Consensus 2 ~~~---~~~~~---~~~~~~~~~~~~~i~~~i~~----~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~ 71 (513) |.| -|..+ .+.+.... + |.+++.+ .-.....|++.+++||+|+|+.+..... ....+...+++. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~-~---~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~--~~~~~~~~~~sl 74 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTG-S---LSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQAS--DGIKKKRLKNTI 74 (508) T ss_pred CChHHHHHHHHHHHHHHhcccc-c---hHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccC--CCCccccceeec Confidence 111 00000 00000000 0 1111100 0123456899999999999986543321 222233456789 Q ss_pred chhHHHHHHHHHHhhcCCeeecCCcH----HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 72 SFARYIADFQTSYSVGNAIAMSGPSS----DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 72 n~~~~ivd~~~~~l~g~p~~~~~~~~----~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) |+++.|++..++++++.|++++.+++ +.|+++++.|+|.....+++..++..|.++..+|+|.+ .+++.+ ++|. T Consensus 75 n~~~~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~-~~~i~~-v~ad 152 (508) T protein:vir:15 75 NMAKTAARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGN-HIKIAW-VRAD 152 (508) T ss_pred chHHHHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCC-eeEEEE-EcCC Confidence 99999999999999999999876432 35788999999999999999999999999999999854 555554 7888 Q ss_pred ceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEc--CC--cEEE---EEeecc---CCcc--------cccc- Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT--EN--DYTR---YKPIVV---AGSV--------PTLE- 208 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt--~~--~~~~---~~~~~~---~~~~--------~~~~- 208 (513) ..+|+..+..+..-+++++.+...+ ....+..+++|.|+ .+ ..+. |+.... +... .... T Consensus 153 ~~~P~~~d~~~~~~~af~~~~~~~~--~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~ 230 (508) T protein:vir:15 153 QFYPLQSNTNDISEAAIASRTQRTE--SNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAP 230 (508) T ss_pred eeEEEEEcCCCeEEEEEEEEEEeec--CCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCc Confidence 8889744444333334433333222 11223344556654 21 2222 222111 1111 0011 Q ss_pred ccccccCcccceEEecCC---------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccc Q lcl|NC_019916. 209 VAEHSAQFGFPMIEYRNN---------EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQ 279 (513) Q Consensus 209 ~~~~~~~g~vPvv~~~n~---------~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~ 279 (513) ...-.++.+.|+++|+++ ..|.|+|++++++||++|.++|++++.++....++.+-.++... +..+ T Consensus 231 ~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~---d~~~-- 305 (508) T protein:vir:15 231 QVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRF---DDEH-- 305 (508) T ss_pred ceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcC---CCCC-- Confidence 111134556678888653 24899999999999999999999999997655555553322110 0000 Q ss_pred cccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_019916. 280 MVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLT 359 (513) Q Consensus 280 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 359 (513) .. .+ +.+.+.+..- ......+..++.++.++..+.+...++.+.+.|...++++... T Consensus 306 ---------~~--------~~--~~~~~~~~~~----~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~ 362 (508) T protein:vir:15 306 ---------KP--------TF--DTEQNVYVGV----LSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGT 362 (508) T ss_pred ---------cc--------cc--CCCCeeEEec----cCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchh Confidence 00 00 1111111100 0112234568888889999999999999999999999887655 Q ss_pred ccc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---c-------cccccceeeEEeCCCCC Q lcl|NC_019916. 360 DDN-FSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNG---K-------WDIDPDEIGFIFRDNLP 428 (513) Q Consensus 360 ~~~-~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~---~-------~~~~~~~i~i~f~~~~p 428 (513) ++. ..+..||.++++.++.+..++..+++.|+.+|++++++|+.++..... + ......+++|.|++.++ T Consensus 363 f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~ 442 (508) T protein:vir:15 363 FSYSNDGVKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVF 442 (508) T ss_pred cccccCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCC Confidence 432 235579999999999999999999999999999999999887654321 1 11223468899999999 Q ss_pred cCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCC Q lcl|NC_019916. 429 TDDVAIITALVQA--GAQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPE 495 (513) Q Consensus 429 ~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (513) .|..+.++.++++ +|++|+|+++..+|+++| +++|++|+++|+.+..+ .+......++.+|+ T Consensus 443 ~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~~-----~~~~~~~~~g~~ge 508 (508) T protein:vir:15 443 VNKDKQLEEDAKVLAIGALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDTF-----EGGRSAILNGGDGE 508 (508) T ss_pred CCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccccCc-----cccccccCCCCCCC Confidence 9999999988875 699999999999999976 67789999988653211 11111112222222 No 66 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=3.1e-46 Score=270.05 Aligned_cols=469 Identities=12% Similarity=0.088 Sum_probs=295.5 Q ss_pred chhhceeccCCcccC--CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHH Q lcl|NC_019916. 4 MQQANMNYQEDADKL--TPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQ 81 (513) Q Consensus 4 ~~~~~~~~~~~~~~~--~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 81 (513) |-.-...|-..+ .+ ...-+-..+..|-..|+.+|+.|.+||.|.+.-+....+. ..-.--+++.++..++|+... T Consensus 1 ~~~~~~~~~~~~-~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg--~~~~~~r~~~~ps~~~~~~~~ 77 (527) T protein:vir:10 1 MGQDKRQYGSTQ-QLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRG--GDEGDQRPIYVPNGEKLIEAK 77 (527) T ss_pred CCccccccCCCc-CcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCC--ccccccceeeehhhHHhhCCc Confidence 111112222221 01 0111222356666778899999999999987544332211 111123567788887777775 Q ss_pred HHHhhcCCeeecCC-----cHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC----ceeEEEEEcccceEEE Q lcl|NC_019916. 82 TSYSVGNAIAMSGP-----SSDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ----KGEVSVKLDPMECFII 152 (513) Q Consensus 82 ~~~l~g~p~~~~~~-----~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~----~~~~~~~~~p~~~~~~ 152 (513) ..+++.+..+..+ .++.++.|.+.+++..++.+..+++++.|++..++.+|++. .+++. +++|...||+ T Consensus 78 -~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~-~~DP~~~f~~ 155 (527) T protein:vir:10 78 -MRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLH-EVDPSTYFPY 155 (527) T ss_pred -ceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEe-ecCcceeeee Confidence 4455555554322 23568889999999999999999999999998888778543 34443 5899999999 Q ss_pred ecCCCCcceEEE--EEEEeecccccccce----------------------eEEEEEEEcCCcEEEEEeeccC-C----- Q lcl|NC_019916. 153 YDRSVNPKPIMA--VRYHAVQTVVDNITQ----------------------TKYEVETWTENDYTRYKPIVVA-G----- 202 (513) Q Consensus 153 ~d~~~~~~~~~~--ir~~~~~~~~~~~~~----------------------~~~~ve~yt~~~~~~~~~~~~~-~----- 202 (513) .|+...+.+..+ +.-|...+......+ +.+..+.|+.+.+..-...+.. . T Consensus 156 ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~ 235 (527) T protein:vir:10 156 EDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKL 235 (527) T ss_pred ecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhh Confidence 988765555543 222333221111001 0011112221111100000000 0 Q ss_pred ccccccccccccCcccceEEecCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 203 SVPTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 203 ~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) ....++...+++++.||||+|+|- ..|+|+++++++++|++|+++|+.+.++.+.+.|+.+++|.......+... T Consensus 236 ~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~ 315 (527) T protein:vir:10 236 STLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMV 315 (527) T ss_pred cCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcC Confidence 112234456899999999999663 359999999999999999999999999999999999999975321110000 Q ss_pred cccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 278 LQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 278 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) . +. -..+.++. .++++++..+.-...++.++.|++.|.+.||..|++|. T Consensus 316 -------------~--------~~-VgPG~iwe---------L~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~Pa 364 (527) T protein:vir:10 316 -------------P--------WT-ISPLGMVE---------HGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPD 364 (527) T ss_pred -------------c--------cc-cCCceeEe---------cCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCe Confidence 0 00 00111121 23456666666556788999999999999999999999 Q ss_pred cccccc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH---HhcccccccccceeeEEeCCCCCcCH Q lcl|NC_019916. 358 LTDDNF--SGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYT-VVAHIE---ERVNGKWDIDPDEIGFIFRDNLPTDD 431 (513) Q Consensus 358 ~~~~~~--~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~-li~~~l---~~~~~~~~~~~~~i~i~f~~~~p~d~ 431 (513) +.++.+ ++++||.||+..+++|.+|+.+++..++-..++..+ .+...| .............+.|+|.+++|.|. T Consensus 365 vA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~ 444 (527) T protein:vir:10 365 IAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNS 444 (527) T ss_pred eeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCH Confidence 998844 467899999999999999999999999888877544 222222 22222222334577999999999999 Q ss_pred HHHHHHHHHH--hcCCCHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHH-hhhhcCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 432 VAIITALVQA--GAQIPQEYLYQYL---PNVTDADEIVKMMDKQRKAMLKT-YDTKGGLIINGTSGNDPEDEGVRGQQGE 505 (513) Q Consensus 432 ~e~a~~~~kl--~g~iS~et~~~~l---~~v~D~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (513) ++.++.++++ +|++|.+||+++| +|++|+++|+++|.++++..... ++..+..+. ......|-+++.....+. T Consensus 445 ~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a-~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 445 EKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGA-QMAAEQGIPDEEDDQALN 523 (527) T ss_pred HHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhh-hhccccCCCCCCcccccC Confidence 9999999987 7999999998887 78999999999999887754332 222222111 111111111111111111 Q ss_pred CCCc Q lcl|NC_019916. 506 PEDE 509 (513) Q Consensus 506 ~~~~ 509 (513) +.-- T Consensus 524 ~~~~ 527 (527) T protein:vir:10 524 GQPL 527 (527) T ss_pred CCCC Confidence 1111 No 67 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=3.6e-46 Score=269.72 Aligned_cols=469 Identities=12% Similarity=0.089 Sum_probs=295.6 Q ss_pred chhhceeccCCcccC--CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHH Q lcl|NC_019916. 4 MQQANMNYQEDADKL--TPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQ 81 (513) Q Consensus 4 ~~~~~~~~~~~~~~~--~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 81 (513) |-.-...|-..+ .+ ...-+-..+..|-..|+.+|+.|.+||.|.+.-+....+. ..-.--+++.++..++|+... T Consensus 1 ~~~~~~~~~~~~-~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg--~~~~~~r~~~~ps~~~~~~~~ 77 (527) T protein:vir:10 1 MGQDKRQYGSTQ-QLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRG--GDEGDQRPIYVPNGEKLIEAK 77 (527) T ss_pred CCccccccCCCc-CcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCC--ccccccceeeehhhHHhhCCc Confidence 111112222221 01 0111222356666778899999999999987544332211 111123567788887777775 Q ss_pred HHHhhcCCeeecCC-----cHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC----ceeEEEEEcccceEEE Q lcl|NC_019916. 82 TSYSVGNAIAMSGP-----SSDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ----KGEVSVKLDPMECFII 152 (513) Q Consensus 82 ~~~l~g~p~~~~~~-----~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~----~~~~~~~~~p~~~~~~ 152 (513) ..+++.+..+..+ .++.+..|.+.+++..++.+..+++++.|++..++.+|++. .+++. +++|...||+ T Consensus 78 -~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~-~~DP~~~f~~ 155 (527) T protein:vir:10 78 -MRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLH-EVDPSTYFPY 155 (527) T ss_pred -ceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEe-ecCcceeeee Confidence 4445555554322 23568888999999999999999999999998888778543 34443 5899999999 Q ss_pred ecCCCCcceEEE--EEEEeecccccccce----------------------eEEEEEEEcCCcEEEEEeeccC-C----- Q lcl|NC_019916. 153 YDRSVNPKPIMA--VRYHAVQTVVDNITQ----------------------TKYEVETWTENDYTRYKPIVVA-G----- 202 (513) Q Consensus 153 ~d~~~~~~~~~~--ir~~~~~~~~~~~~~----------------------~~~~ve~yt~~~~~~~~~~~~~-~----- 202 (513) .|+...+.+..+ +.-|...+......+ +.+..+.|+.+.+..-...+.. . T Consensus 156 ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~ 235 (527) T protein:vir:10 156 EDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKL 235 (527) T ss_pred ecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhh Confidence 988765555543 222333221111001 0011112221111100000000 0 Q ss_pred ccccccccccccCcccceEEecCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 203 SVPTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 203 ~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) ....++...+++++.||||+|+|- ..|+|+++++++++|++|+++|+.+.++.+.+.|+.+++|.......+... T Consensus 236 ~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~ 315 (527) T protein:vir:10 236 STLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMV 315 (527) T ss_pred cCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcC Confidence 112234456899999999999663 359999999999999999999999999999999999999975321110000 Q ss_pred cccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 278 LQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 278 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) . +. -..+.++. .++++++..+.-...++.++.|++.|.+.||..|++|. T Consensus 316 -------------~--------~~-VgPG~iwe---------L~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~Pa 364 (527) T protein:vir:10 316 -------------P--------WT-ISPLGMVE---------HGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPD 364 (527) T ss_pred -------------c--------cc-cCCceeEe---------cCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCe Confidence 0 00 00111121 23456666666556788999999999999999999999 Q ss_pred cccccc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH---HhcccccccccceeeEEeCCCCCcCH Q lcl|NC_019916. 358 LTDDNF--SGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYT-VVAHIE---ERVNGKWDIDPDEIGFIFRDNLPTDD 431 (513) Q Consensus 358 ~~~~~~--~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~-li~~~l---~~~~~~~~~~~~~i~i~f~~~~p~d~ 431 (513) +.++.+ ++++||.||+..+++|.+|+.+++..++-..++..+ .+...| .............+.|+|.+++|.|. T Consensus 365 vA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~ 444 (527) T protein:vir:10 365 IAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNN 444 (527) T ss_pred eeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCH Confidence 998844 467899999999999999999999999888877544 222222 22222222334577999999999999 Q ss_pred HHHHHHHHHH--hcCCCHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHH-hhhhcCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 432 VAIITALVQA--GAQIPQEYLYQYL---PNVTDADEIVKMMDKQRKAMLKT-YDTKGGLIINGTSGNDPEDEGVRGQQGE 505 (513) Q Consensus 432 ~e~a~~~~kl--~g~iS~et~~~~l---~~v~D~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (513) ++.++.++++ +|++|.+||+++| +|++|+++|+++|.++++..... ++..+..+. ......|-+++.....+. T Consensus 445 ~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a-~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 445 EKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGA-QMAAEQGIPDEEDDQALN 523 (527) T ss_pred HHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhh-hhccccCCCCCCcccccC Confidence 9999999987 7999999998887 78999999999999887754332 222222111 111111111111111111 Q ss_pred CCCc Q lcl|NC_019916. 506 PEDE 509 (513) Q Consensus 506 ~~~~ 509 (513) +.-- T Consensus 524 ~~~~ 527 (527) T protein:vir:10 524 GQPL 527 (527) T ss_pred CCCC Confidence 1111 No 68 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=1.4e-43 Score=255.61 Aligned_cols=459 Identities=12% Similarity=0.045 Sum_probs=278.3 Q ss_pred ccchhhceecc-CCcccCCHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 2 IDMQQANMNYQ-EDADKLTPTRIAAFIRH----HYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 2 ~~~~~~~~~~~-~~~~~~~~~~i~~~i~~----~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) |.|=+...... .-...+..+-+.+...+ .-.....+++.+++||+|+++.+.... ..+..+..++++.|+++. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~--~~~~~~~~~~~slnl~~~ 78 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLN--TDGETKKRDLNHLPIART 78 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCccccc--CCCCcccCceeecchHHH Confidence 11111110000 00000001111111110 012345679999999999987553322 122334566788999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) |++..+++++++|++++.+++ +.++++++.|+|.....+++..++..|.+|..+|+|. +++++.+ ++|...+|+. T Consensus 79 i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~-v~ad~~~P~~ 156 (500) T protein:vir:30 79 AAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAF-VQAPVFLPLQ 156 (500) T ss_pred HHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEE-EcCCeeEEEE Confidence 999999999999999887764 4588889999999999999999999999999999985 4566654 7999999986 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEc--CCc--EEE---EEeecc---CCcc------ccc-cccccccCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT--END--YTR---YKPIVV---AGSV------PTL-EVAEHSAQF 216 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt--~~~--~~~---~~~~~~---~~~~------~~~-~~~~~~~~g 216 (513) .++.+....+++.. .....++ .....+.+|.|+ .+. .+. |+.... +... ... ......++. T Consensus 157 ~d~~~~~~~a~~~~-~~~~~~~-~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 234 (500) T protein:vir:30 157 SNTQDVSSAAVVIK-SVKTING-KEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVT 234 (500) T ss_pred EcCCCeEEEEEEEE-EeeeecC-CceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCC Confidence 66554443343332 2222222 222333456554 332 122 222111 1111 000 111112344 Q ss_pred ccceEEecC----C-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhh Q lcl|NC_019916. 217 GFPMIEYRN----N-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDAD 287 (513) Q Consensus 217 ~vPvv~~~n----~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~ 287 (513) +.||+.|++ + ..|.|+|++++++||++|.++|++++.++....++.+-..+-........ + T Consensus 235 ~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~---------g- 304 (500) T protein:vir:30 235 RPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTD---------G- 304 (500) T ss_pred CccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCC---------c- Confidence 445666644 3 24889999999999999999999999998766655542222111000000 0 Q ss_pred hhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccc-ccccc Q lcl|NC_019916. 288 AMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDD-NFSGN 366 (513) Q Consensus 288 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n 366 (513) ........+.+.+.+..-+ .....+..++.++.++..+.+...++.+.+.|...++.+...++ ...|. T Consensus 305 -------~~~~~~~~d~~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~ 373 (500) T protein:vir:30 305 -------DVVPRPRFESDQNVYIRMG----GRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM 373 (500) T ss_pred -------cccCCcccCCCcceEEEcC----CCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCcc Confidence 0000001111111111100 11123345777888888888998899888888887777654433 23466 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--h Q lcl|NC_019916. 367 SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--G 442 (513) Q Consensus 367 ~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~ 442 (513) .||.+++++++.+..+++.+++.|+.+|++++++|+.+.... .++......++.|.|++.++.|..+.++.++++ + T Consensus 374 ~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~a 453 (500) T protein:vir:30 374 KTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNA 453 (500) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHc Confidence 799999999999999999999999999999999998765432 122222334689999999999999999988886 7 Q ss_pred cCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCC Q lcl|NC_019916. 443 AQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDE 497 (513) Q Consensus 443 g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (513) |++|.++++.++++++| +++|++++++|+.......++..+... + T Consensus 454 Gi~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g----------~ 500 (500) T protein:vir:30 454 GFGTREMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYG----------E 500 (500) T ss_pred CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccC----------C Confidence 89999999988876665 445566666553221111111111100 0 No 69 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=1.4e-43 Score=255.61 Aligned_cols=459 Identities=12% Similarity=0.045 Sum_probs=278.3 Q ss_pred ccchhhceecc-CCcccCCHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 2 IDMQQANMNYQ-EDADKLTPTRIAAFIRH----HYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 2 ~~~~~~~~~~~-~~~~~~~~~~i~~~i~~----~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) |.|=+...... .-...+..+-+.+...+ .-.....+++.+++||+|+++.+.... ..+..+..++++.|+++. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~--~~~~~~~~~~~slnl~~~ 78 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLN--TDGETKKRDLNHLPIART 78 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCccccc--CCCCcccCceeecchHHH Confidence 11111110000 00000001111111110 012345679999999999987553322 122334566788999999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) |++..+++++++|++++.+++ +.++++++.|+|.....+++..++..|.+|..+|+|. +++++.+ ++|...+|+. T Consensus 79 i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~-v~ad~~~P~~ 156 (500) T protein:vir:98 79 AAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAF-VQAPVFLPLQ 156 (500) T ss_pred HHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEE-EcCCeeEEEE Confidence 999999999999999887764 4588889999999999999999999999999999985 4566654 7999999986 Q ss_pred cCCCCcceEEEEEEEeecccccccceeEEEEEEEc--CCc--EEE---EEeecc---CCcc------ccc-cccccccCc Q lcl|NC_019916. 154 DRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT--END--YTR---YKPIVV---AGSV------PTL-EVAEHSAQF 216 (513) Q Consensus 154 d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt--~~~--~~~---~~~~~~---~~~~------~~~-~~~~~~~~g 216 (513) .++.+....+++.. .....++ .....+.+|.|+ .+. .+. |+.... +... ... ......++. T Consensus 157 ~d~~~~~~~a~~~~-~~~~~~~-~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 234 (500) T protein:vir:98 157 SNTQDVSSAAVVIK-SVKTING-KEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVT 234 (500) T ss_pred EcCCCeEEEEEEEE-EeeeecC-CceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCC Confidence 66554443343332 2222222 222333456554 332 122 222111 1111 000 111112344 Q ss_pred ccceEEecC----C-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhh Q lcl|NC_019916. 217 GFPMIEYRN----N-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDAD 287 (513) Q Consensus 217 ~vPvv~~~n----~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~ 287 (513) +.||+.|++ + ..|.|+|++++++||++|.++|++++.++....++.+-..+-........ + T Consensus 235 ~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~---------g- 304 (500) T protein:vir:98 235 RPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTD---------G- 304 (500) T ss_pred CccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCC---------c- Confidence 445666644 3 24889999999999999999999999998766655542222111000000 0 Q ss_pred hhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccc-ccccc Q lcl|NC_019916. 288 AMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDD-NFSGN 366 (513) Q Consensus 288 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n 366 (513) ........+.+.+.+..-+ .....+..++.++.++..+.+...++.+.+.|...++.+...++ ...|. T Consensus 305 -------~~~~~~~~d~~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~ 373 (500) T protein:vir:98 305 -------DVVPRPRFESDQNVYIRMG----GRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM 373 (500) T ss_pred -------cccCCcccCCCcceEEEcC----CCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCcc Confidence 0000001111111111100 11123345777888888888998899888888887777654433 23466 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--h Q lcl|NC_019916. 367 SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--G 442 (513) Q Consensus 367 ~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~ 442 (513) .||.+++++++.+..+++.+++.|+.+|++++++|+.+.... .++......++.|.|++.++.|..+.++.++++ + T Consensus 374 ~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~a 453 (500) T protein:vir:98 374 KTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNA 453 (500) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHc Confidence 799999999999999999999999999999999998765432 122222334689999999999999999988886 7 Q ss_pred cCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCC Q lcl|NC_019916. 443 AQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDE 497 (513) Q Consensus 443 g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (513) |++|.++++.++++++| +++|++++++|+.......++..+... + T Consensus 454 Gi~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g----------~ 500 (500) T protein:vir:98 454 GFGTREMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYG----------E 500 (500) T ss_pred CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccC----------C Confidence 89999999988876665 445566666553221111111111100 0 No 70 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=1.3e-41 Score=244.68 Aligned_cols=456 Identities=11% Similarity=0.017 Sum_probs=278.8 Q ss_pred CCcccCCHHHHHHHHHH-----------H-----HHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRH-----------H-----YNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~-----------~-----~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) |..-.--.++|++++.+ | ......++...++||+|+++.+... .........++++.|+++. T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~--~~~~~~~~~~~~slnl~~~ 78 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYK--NTDGDIKSRPMNHLPIART 78 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCccccccc--ccCcchhcccceecchHHH Confidence 11000000111111100 0 2344567899999999987654322 1123334456788899999 Q ss_pred HHHHHHHHhhcCCeeecCCcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEe Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIY 153 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~ 153 (513) |++..+++++++|++++.+++ +.++++++.|+|.....+++..++..|.++..+|+|. +++++.+ ++|...+|+. T Consensus 79 i~~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~~~i~~-v~ad~~~P~~ 156 (522) T protein:vir:47 79 ASKKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDG-DKVRVAF-IQAPVFFPLE 156 (522) T ss_pred HHHHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcC-CceEEEE-EcCCceEEEE Confidence 999999999999999987664 4578889999999999999999999999999999974 5676654 7899999975 Q ss_pred cCCCC-cceEEEEEEEeecccccccceeEEEEEEEc---C------------CcEEE---EEeecc---CCc-------- Q lcl|NC_019916. 154 DRSVN-PKPIMAVRYHAVQTVVDNITQTKYEVETWT---E------------NDYTR---YKPIVV---AGS-------- 203 (513) Q Consensus 154 d~~~~-~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt---~------------~~~~~---~~~~~~---~~~-------- 203 (513) .++.. .+...+.+.+.... .. ....+.+|.++ . ...+. |+..+. |.. T Consensus 157 ~~~~~~~e~a~~~~~~~~~~--~~-~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e 233 (522) T protein:vir:47 157 SNTQDVSSAAILTKTIKSEG--RK-NVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDK 233 (522) T ss_pred EcCCceEEEEEEEEEEeecc--cc-eeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccccccccc Confidence 44432 22223333322211 11 11111233321 0 11122 221111 111 Q ss_pred cccccc-cccccCcccceEEecCC---------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccc Q lcl|NC_019916. 204 VPTLEV-AEHSAQFGFPMIEYRNN---------EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFD 273 (513) Q Consensus 204 ~~~~~~-~~~~~~g~vPvv~~~n~---------~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~ 273 (513) +..... ..-.++.+.+|++|+++ ..|.|+|.++++++|++|.++|.+++.++....++.+-..+...... T Consensus 234 ~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~ 313 (522) T protein:vir:47 234 YKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQ 313 (522) T ss_pred ccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCC Confidence 111111 11123344456777653 24899999999999999999999999999877776653222111000 Q ss_pred cccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 274 DSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFS 353 (513) Q Consensus 274 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 353 (513) ...+... ... ..+.+.+++..- . .....+.+++.++.++-.+.+.+.++.+.+.|...+ T Consensus 314 ~~~g~~~---------------~~~--~fd~~~~~f~~~--~--~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~ 372 (522) T protein:vir:47 314 RPDGTID---------------FRP--RFDVEQNVYMQI--G--GSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQI 372 (522) T ss_pred CCCcccc---------------ccc--ccCcccceEeec--C--CCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHh Confidence 0000000 000 001111111110 0 111234467888888888999988999888887777 Q ss_pred Cccccccc-cccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--cccccccceeeEEeCCCCCcC Q lcl|NC_019916. 354 HTPDLTDD-NFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVN--GKWDIDPDEIGFIFRDNLPTD 430 (513) Q Consensus 354 ~~p~~~~~-~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~--~~~~~~~~~i~i~f~~~~p~d 430 (513) +.+...++ ...+..+|.++++..+.+..+++.+++.|..+|+++++.|+.+..... .+.......++|.|++.++.| T Consensus 373 gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D 452 (522) T protein:vir:47 373 GVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTD 452 (522) T ss_pred CCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCC Confidence 76543333 223557899999999999999999999999999999999997764321 222334457899999999999 Q ss_pred HHHHHHHHHHH--hcCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCC Q lcl|NC_019916. 431 DVAIITALVQA--GAQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEG 498 (513) Q Consensus 431 ~~e~a~~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (513) ..+.++..+++ +|++|.++++.++++++| +++|++|+++|+.+..+......+ .+.+.+..+.+++ T Consensus 453 ~~~~~~~~~~~v~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~--~~~~~~~~~d~~~ 522 (522) T protein:vir:47 453 RHAELDYWAKMVAAGFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYG--MHDQNEEKADDKG 522 (522) T ss_pred HHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCC--CCCcccccCCCCC Confidence 99999888885 699999999999888876 677899998876543221111111 0111111111111 No 71 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=1.9e-41 Score=243.77 Aligned_cols=475 Identities=14% Similarity=0.118 Sum_probs=281.9 Q ss_pred hhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHH Q lcl|NC_019916. 5 QQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSY 84 (513) Q Consensus 5 ~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~ 84 (513) +-+|+.--++.....+.-....+..+-..|..+|+.|.+||.|+|--+.-. .+... -.-+..++.+++|++. .+ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~i--l~G~d---r~~~~~ps~r~~V~~~-~~ 74 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLV--LRGDD---SVPILMPSGRKIVEAV-HR 74 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhh--cCCCc---eeeeccchHHHHHHHH-HH Confidence 222222222222222222223345555678899999999999998543211 11111 1123356788999995 56 Q ss_pred hhcCCeeecCCc-----------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCC----CceeEEEEEcccce Q lcl|NC_019916. 85 SVGNAIAMSGPS-----------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPS----QKGEVSVKLDPMEC 149 (513) Q Consensus 85 l~g~p~~~~~~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~----~~~~~~~~~~p~~~ 149 (513) ++|.|++|..+. +..|.+|.+.+++..++.+..+++++.|+|..++-+|++ +..++. +++|... T Consensus 75 ~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~-~vDP~~~ 153 (563) T protein:vir:74 75 FLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVD-EVDPRQI 153 (563) T ss_pred hcCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEe-ecCCcee Confidence 669999994321 235788899999999999999999999999877777753 244443 6899999 Q ss_pred EEEecCCCCcc------------------eEEEEEEEeeccccccc--ceeEEEEEEEcCCcE-------EEEEeeccC- Q lcl|NC_019916. 150 FIIYDRSVNPK------------------PIMAVRYHAVQTVVDNI--TQTKYEVETWTENDY-------TRYKPIVVA- 201 (513) Q Consensus 150 ~~~~d~~~~~~------------------~~~~ir~~~~~~~~~~~--~~~~~~ve~yt~~~~-------~~~~~~~~~- 201 (513) ||+-|++.... -++.+|.|.....+... ....+-++.|+-+.. ..+...+.+ T Consensus 154 fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~ 233 (563) T protein:vir:74 154 FLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQV 233 (563) T ss_pred eeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchh Confidence 99555543211 11222211111111000 001111222221100 000000000 Q ss_pred --CccccccccccccCcccceEEecCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccc Q lcl|NC_019916. 202 --GSVPTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDD 274 (513) Q Consensus 202 --~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~ 274 (513) .....++..-|++++.||||.|+|- ..|+|++++++.+++++|.++|+.+.++..+.+|+.++.|...... T Consensus 234 ~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~-- 311 (563) T protein:vir:74 234 RSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDP-- 311 (563) T ss_pred hhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecccccccc-- Confidence 0011123334789999999998663 3599999999999999999999999999999999999987542211 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHH-HHHHHh Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAA-DIHKFS 353 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~-~i~~~s 353 (513) ......++.- ..+.++-+. ...+.+-+..+.--.+...+..|++.|.. .|+.+| T Consensus 312 -~~g~~~~w~v------------------gpG~i~El~------~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s 366 (563) T protein:vir:74 312 -NTGELTDWNI------------------GPMQIVEIA------GNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGS 366 (563) T ss_pred -cccccccccc------------------CCceeEecc------CCccccceeeecchhhhHHHHHHHHHHHHHHHHhhc Confidence 1111111100 011111111 11122334444444466888999998887 889999 Q ss_pred Ccccccccc--ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhc---------cccccccc-c Q lcl|NC_019916. 354 HTPDLTDDN--FSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ----RYTVVAHIEERV---------NGKWDIDP-D 417 (513) Q Consensus 354 ~~p~~~~~~--~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~----~~~li~~~l~~~---------~~~~~~~~-~ 417 (513) ++|...++. .+..+||.||+..+.+|.+++++|+..+..++++ .+++++.++... .+..++.. . T Consensus 367 ~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~ 446 (563) T protein:vir:74 367 GTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNEC 446 (563) T ss_pred cCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCce Confidence 999999884 4567899999999999999999999988888777 555555444442 11122233 3 Q ss_pred eeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhC---CCC-CCHHHHHHHHHHHHHHHHHHhhhhcCCCCC-CCC Q lcl|NC_019916. 418 EIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYL---PNV-TDADEIVKMMDKQRKAMLKTYDTKGGLIIN-GTS 490 (513) Q Consensus 418 ~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l---~~v-~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~ 490 (513) .|.|+|.+.+|.|.++.++.++.+ +|++|+|||+.+| +|. +|++.|+++|+.++-..+..+....+.... +.. T Consensus 447 ~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~ 526 (563) T protein:vir:74 447 SVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAM 526 (563) T ss_pred EEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceec Confidence 478999999999999999988776 7999999998887 654 378888888887666553333222221111 111 Q ss_pred CCCCCCCCCCCCCCCCCCc------cCCC Q lcl|NC_019916. 491 GNDPEDEGVRGQQGEPEDE------RTSD 513 (513) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~------~~~~ 513 (513) ++.|-+++....+|.|=+. --.| T Consensus 527 ~~~g~~~~~~dd~g~p~~~~~~~~~~~~~ 555 (563) T protein:vir:74 527 DNGGAGEQQFDDQGNPIDQFGNPVEIPPD 555 (563) T ss_pred ccCCCCcccccccCCchhHcCCcccCCcc Confidence 1111111111111222111 0111 No 72 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=3.3e-39 Score=231.53 Aligned_cols=453 Identities=12% Similarity=0.038 Sum_probs=270.1 Q ss_pred CccchhhceeccCCc-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDA-DKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIAD 79 (513) Q Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd 79 (513) .++++++...-..-. +....+.|..+.+...+.. ..+ -+..|.+. .+.....++...++++.|+++.|++ T Consensus 4 ~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~------~w~~~~~~~~~~~~~~~~l~~~i~~ 74 (518) T protein:vir:78 4 WSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQ-KEW--SKDSYLTS------LWAQGYVPTVHDKLMNSGTGNEIVV 74 (518) T ss_pred hhhHHHHHHHhhcCCCCccchhccHHHhhhcccch-hhh--hhhhhhhh------hcccCCCCccccccccCChHHHHHH Confidence 333333333222110 0111123333222211111 000 01112111 1122223445567889999999999 Q ss_pred HHHHHhhcCCeeecC------CcH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceE Q lcl|NC_019916. 80 FQTSYSVGNAIAMSG------PSS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECF 150 (513) Q Consensus 80 ~~~~~l~g~p~~~~~------~~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~ 150 (513) ..++++++++++++. +++ +.++++++.|+|.....+++..++..|.++..+|++. |++++.+ ++|...+ T Consensus 75 ~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~i~~-v~ad~~~ 152 (518) T protein:vir:78 75 VAAEYISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILN-GRPSISV-HSSSQFW 152 (518) T ss_pred HHHHhhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEEC-CeeEEEE-EcCCeeE Confidence 999999999998753 222 3578889999999999999999999999999999874 6676665 7999999 Q ss_pred EEecCCCCcceEEEEEEEeecccccccceeEEEEEEE------------cCCcEEE--EEeeccCCccc----------- Q lcl|NC_019916. 151 IIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETW------------TENDYTR--YKPIVVAGSVP----------- 205 (513) Q Consensus 151 ~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~y------------t~~~~~~--~~~~~~~~~~~----------- 205 (513) |+|++.. +..++ ++......+. ....++++.+ +...+.+ |+.. .+.... T Consensus 153 P~~~~g~---~~~~~-f~~~~~~~~k-~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~-~~~~v~~~~~~~~~~l~ 226 (518) T protein:vir:78 153 IDFKNNE---PFRFN-FFEEIPTSNK-ADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKID-GDKTTPISAERLPEQIT 226 (518) T ss_pred EEeecCc---EEEEE-EEEEeecCCc-ceeEEEEEeeccccccceeecccceeEEEEEeeec-Ccccccccccccccccc Confidence 9998643 33333 2222211111 1111123322 2222221 1111 110000 Q ss_pred ----cccccc---cccCcccceEEecCC-----C-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCc Q lcl|NC_019916. 206 ----TLEVAE---HSAQFGFPMIEYRNN-----E-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDI 268 (513) Q Consensus 206 ----~~~~~~---~~~~g~vPvv~~~n~-----~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~ 268 (513) +....+ -......|++.|.+| . .|.|+|+.++++||++|.++|++++.++....++.+-..+- T Consensus 227 ~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l 306 (518) T protein:vir:78 227 SYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMF 306 (518) T ss_pred cccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHh Confidence 000000 012345677776432 2 28999999999999999999999999987666665543332 Q ss_pred ccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHH Q lcl|NC_019916. 269 DTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAAD 348 (513) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 348 (513) ......... ......+.+.+.+..-.+....+......++-++.++..+.+...++.+.+. T Consensus 307 ~~~~~~~~~-------------------~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~ 367 (518) T protein:vir:78 307 RKKVNKSTD-------------------KEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQK 367 (518) T ss_pred ccCCCCCCC-------------------ccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHH Confidence 111000000 0000111111111110000000111112366777888899999999999999 Q ss_pred HHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----cccccceeeEEeC Q lcl|NC_019916. 349 IHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGK----WDIDPDEIGFIFR 424 (513) Q Consensus 349 i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~----~~~~~~~i~i~f~ 424 (513) |...++++...++.-++..||.++++..+.+.+++..++..+..+|+++++.++.++....+. ...+...+.|.|+ T Consensus 368 ~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~ 447 (518) T protein:vir:78 368 AVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFP 447 (518) T ss_pred HHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeC Confidence 998888876655543467899999999999999999999999999999999999887654332 1223356899999 Q ss_pred CCCCcCHHHHHHHHHHH--hcCCCHHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHH-HhhhhcCCCCCCCCCCCC Q lcl|NC_019916. 425 DNLPTDDVAIITALVQA--GAQIPQEYLYQY-LPNVTD--ADEIVKMMDKQRKAMLK-TYDTKGGLIINGTSGNDP 494 (513) Q Consensus 425 ~~~p~d~~e~a~~~~kl--~g~iS~et~~~~-l~~v~D--~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~ 494 (513) +.++.|..+.+++++++ +|++|.|+++++ +|..+| +++|++||++|+..... .-+++++..+.+ | T Consensus 448 D~i~~D~~~~~~~~~~~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~-----g 518 (518) T protein:vir:78 448 DPMSVNLNELSSTLNNMNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKG-----G 518 (518) T ss_pred CCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCC-----C Confidence 99999999999998875 699999999987 466665 67889999988664321 111212111111 1 No 73 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=2e-35 Score=210.86 Aligned_cols=450 Identities=11% Similarity=-0.036 Sum_probs=273.6 Q ss_pred CCcccCCHHHHHHHHHHH----------------HHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHH----------------YNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~----------------~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) |+.-+--..+|++++.+. -.....|+.++++||+|+++-++.. ......+..++++.|+++. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~--~~~~~~~~~~~~sl~~~~~ 78 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYI--NSQGKIQERDYMTLNLRKL 78 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccc--cccccccccceeecCcHHH Confidence 221111112222211110 1123457888999999998754322 1122334456788999999 Q ss_pred HHHHHHHHhhcCCeeecCCc--------------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEE Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPS--------------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSV 142 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~--------------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~ 142 (513) |+...++++++++++++.++ .+.|+++++.|+|.....+.+..++..|.++..+|+|. +.+++.+ T Consensus 79 i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~~~I~~ 157 (517) T protein:vir:98 79 SADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN-GEIEFSW 157 (517) T ss_pred HHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC-CeeEEEE Confidence 99999999999998876442 24588899999999999999999999999999999985 4455554 Q ss_pred EEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCc--------EEE---EEeeccC---Ccc---- Q lcl|NC_019916. 143 KLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTEND--------YTR---YKPIVVA---GSV---- 204 (513) Q Consensus 143 ~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~--------~~~---~~~~~~~---~~~---- 204 (513) ++|...+|+-.+. .+...+++-+...+...+ ........|.|+.+. ++. |+..... ... T Consensus 158 -v~ad~~~Pl~~~~-~~v~~~ai~~~~~~~~~~-~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~ 234 (517) T protein:vir:98 158 -ALANAFYPLRSNS-NGISEGVMKSVTTKVIGN-KTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEE 234 (517) T ss_pred -EcCCeeEEEEecC-CCeEEEEEEEEEEEeecC-CceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccc Confidence 7888888844333 234444443333322222 122233456555332 111 1211111 100 Q ss_pred --cccc-ccccccCcccceEEecC----C-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccc Q lcl|NC_019916. 205 --PTLE-VAEHSAQFGFPMIEYRN----N-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLF 272 (513) Q Consensus 205 --~~~~-~~~~~~~g~vPvv~~~n----~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~ 272 (513) .... ...-.++.+-+|++|++ + ..|.|+|.++++++|++|.++|.+++.++....++.+=..+-.... T Consensus 235 ~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~~ 314 (517) T protein:vir:98 235 LYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTVP 314 (517) T ss_pred cccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhhhcccc Confidence 0000 00111222222445544 2 3599999999999999999999999999876666554322211110 Q ss_pred ccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 273 DDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKF 352 (513) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 352 (513) +....... .. .+.+.+++..-. +...+..++..+.++-.+.+.+.++.+.+.|... T Consensus 315 ~~~g~~~~-----------------~~--~d~~~~~y~~~~-----~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~ 370 (517) T protein:vir:98 315 DESGMPPP-----------------QV--FDPDVNVYKSIR-----MGTDEEFVKDVTHDIRTEQYKEAINQALRTLEME 370 (517) T ss_pred CCCCcccC-----------------CC--CCcccceeeecc-----CCCCCCceeeeccccchHHHHHHHHHHHHHHHHH Confidence 00000000 00 001111111111 1112344566666777789999999999999999 Q ss_pred hCccccccccc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCc Q lcl|NC_019916. 353 SHTPDLTDDNF-SGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPT 429 (513) Q Consensus 353 s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~ 429 (513) ++.+...++.- .+..+|.++++..+.+..+++.+++.|+.+|++++++|+.+.... .++.......+.|.|.+.++. T Consensus 371 ~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~ 450 (517) T protein:vir:98 371 LKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQ 450 (517) T ss_pred hCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCC Confidence 99886555432 344689999999999999999999999999999999998765432 222223345789999999999 Q ss_pred CHHHHHHHHHHH--hcCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 430 DDVAIITALVQA--GAQIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGE 505 (513) Q Consensus 430 d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (513) |..+.++...++ +|++|.++++.++.++++ +++|+.|+++|..+. ++.+.... ...+. T Consensus 451 D~~~~~~~~~~~v~aG~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~----~~~~~~~~-~~~~~------------- 512 (517) T protein:vir:98 451 DRSALLRFYGQAKTFGFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIEL----DPVTISQR-AQKRM------------- 512 (517) T ss_pred CHHHHHHHHHHHHhcCCCCHHHHHHHhCCCChHHHHHHHHHHHHhcccc----CCCCcccc-ccCCC------------- Confidence 999999998885 789999999988866665 566777777775422 11111100 00000 Q ss_pred CCCcc Q lcl|NC_019916. 506 PEDER 510 (513) Q Consensus 506 ~~~~~ 510 (513) +++++ T Consensus 513 ~gd~e 517 (517) T protein:vir:98 513 FGDEE 517 (517) T ss_pred CCCCC Confidence 11111 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.93 E-value=1.2e-24 Score=151.67 Aligned_cols=448 Identities=12% Similarity=0.073 Sum_probs=262.6 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-------cccccccCCCC--CCcceeecchhHHHHHHHHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGI-------LSPASRRNEKG--KADHRAVHSFARYIADFQTS 83 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~-------~~~~~~~~~~~--~~~~ri~~n~~~~ivd~~~~ 83 (513) |... +++.+..- ........++++.+++-|.|...+. ++...-....+ +..+=+-.|+++.+++..++ T Consensus 1 m~~~--~~~~v~~~-h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G 77 (513) T protein:vir:97 1 MADK--DPKSPATT-SGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSG 77 (513) T ss_pred CCCC--CCCCCCcC-CHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhh Confidence 2211 11221111 1112345677888888888863321 11111000000 00111346999999999999 Q ss_pred HhhcCCeeecCCcHHHHHH-HHH-----hcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc-----------------eeE Q lcl|NC_019916. 84 YSVGNAIAMSGPSSDRLDD-FNR-----RNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK-----------------GEV 140 (513) Q Consensus 84 ~l~g~p~~~~~~~~~~l~~-~~~-----~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~-----------------~~~ 140 (513) ++|-+||+++.+....+.+ +++ -++++.....+.+.++.+|+++++|-....+. -+. T Consensus 78 ~vf~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy 157 (513) T protein:vir:97 78 KPFSEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPY 157 (513) T ss_pred hhhhcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCce Confidence 9999999987766666665 443 36899999999999999999999995432211 123 Q ss_pred EEEEcccceEEEecCC---CCcceEEEEEEEe-ecccccccceeEEEEEEEcCCcEEEEEeeccCCc---cccccccccc Q lcl|NC_019916. 141 SVKLDPMECFIIYDRS---VNPKPIMAVRYHA-VQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS---VPTLEVAEHS 213 (513) Q Consensus 141 ~~~~~p~~~~~~~d~~---~~~~~~~~ir~~~-~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~---~~~~~~~~~~ 213 (513) .+.+.|.+++- |+.. ....+.. +++-. ....|+...+.+..+.+++++.+..|+....+.. .+.....-.| T Consensus 158 ~~~~~~e~Iin-W~~~~v~G~~~L~~-v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~ 235 (513) T protein:vir:97 158 WVMIKPECLLF-ARSEVINGVEVLQH-VRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWAT 235 (513) T ss_pred EEEecHhhhcC-cceeccCcceeeee-EEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCC Confidence 34466766543 3211 1123333 33221 2234555667777778889887766655433322 1223333457 Q ss_pred cCcccceEEecCCC----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhh Q lcl|NC_019916. 214 AQFGFPMIEYRNNE----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAM 289 (513) Q Consensus 214 ~~g~vPvv~~~n~~----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~ 289 (513) +++.||||.|.... .+.+.|.++-.|.-+.=+..|++...+...++|+++++|......+. T Consensus 236 ~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~~--------------- 300 (513) T protein:vir:97 236 GLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSDP--------------- 300 (513) T ss_pred cCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCCc--------------- Confidence 89999999987543 25567888888888888999999999999999999999964321110 Q ss_pred hccccccchhhhcchhcceeeccccccccccccCCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCcccccccccccccc Q lcl|NC_019916. 290 KKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYD-SAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSS 368 (513) Q Consensus 290 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 368 (513) +.+.+.........+++++|+..+.+ .+.....++.+.+.|..++..+- ...+++.| T Consensus 301 -------------------i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~ll---~~~~~~~T 358 (513) T protein:vir:97 301 -------------------VVVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEFL---KRKTGGQT 358 (513) T ss_pred -------------------eEeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHhh---ccCCcccc Confidence 11112222222345788999999854 46688999999999988775441 22346789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCC-CCcC-HHHHHHHHHHH--hcC Q lcl|NC_019916. 369 GVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDN-LPTD-DVAIITALVQA--GAQ 444 (513) Q Consensus 369 g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~-~p~d-~~e~a~~~~kl--~g~ 444 (513) |+|.+.......+.....-..+..++++.++++..+++.-. + .++|+.++. .+.. .++.++++.++ +|. T Consensus 359 a~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~wlg~~~-----~--~~~v~in~dF~~~~~~~~~~~al~~a~~~G~ 431 (513) T protein:vir:97 359 ATARALDSAEATSDLSAMTGLFEDALAQALDITADWLRLGP-----N--GGTVELVKDYDLEEMDAPGLQALQVAREKRD 431 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-----C--ccEEEeccccCcccCCHHHHHHHHHHHhCCC Confidence 99999988888888888889999999999999988875321 1 122332221 1222 24556666664 789 Q ss_pred CCHHHHHHhCC--CC-C---CHHHHHHHHHHHHHHHHHHh----hhhcCCCCCCCCCCCCCC-----CCCCC--CCCCCC Q lcl|NC_019916. 445 IPQEYLYQYLP--NV-T---DADEIVKMMDKQRKAMLKTY----DTKGGLIINGTSGNDPED-----EGVRG--QQGEPE 507 (513) Q Consensus 445 iS~et~~~~l~--~v-~---D~~~E~~ri~~E~~~~~~~~----~~~~~~~~~~~~~~~~~~-----~~~~~--~~~~~~ 507 (513) +|++|.++.|- .| . |.+++.+++.++-+++.-.. ++... .++...+.++++ ++.+. +-+.|+ T Consensus 432 is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) T protein:vir:97 432 ISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQK-NPPEGGEGEGEGEGEGGEGGEGGEGGGNPG 510 (513) T ss_pred CCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCC-CCCCCCCCCCCCCCCCCCCCCccccCCCCC Confidence 99999987652 23 1 34555555554432221110 01010 000000101111 11111 111244 Q ss_pred Ccc Q lcl|NC_019916. 508 DER 510 (513) Q Consensus 508 ~~~ 510 (513) .+. T Consensus 511 ~~~ 513 (513) T protein:vir:97 511 GES 513 (513) T ss_pred CCC Confidence 333 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.93 E-value=2e-25 Score=155.99 Aligned_cols=421 Identities=10% Similarity=0.004 Sum_probs=245.0 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccC---CCCCCc--ce----eecchhHHHHHHHHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRN---EKGKAD--HR----AVHSFARYIADFQTS 83 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~---~~~~~~--~r----i~~n~~~~ivd~~~~ 83 (513) |+...-.+ ......++++..++-|.|...+......+.+ .+.... .| +-.|+++.+++..++ T Consensus 1 m~V~~~hp---------~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G 71 (452) T protein:vir:94 1 MPIETKHP---------EYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSG 71 (452) T ss_pred CCCCCcCH---------HHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhc Confidence 33221111 1234466778888888875432111111111 111111 12 236999999999999 Q ss_pred HhhcCCeeecCCcHHHHHHHH---HhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcc Q lcl|NC_019916. 84 YSVGNAIAMSGPSSDRLDDFN---RRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPK 160 (513) Q Consensus 84 ~l~g~p~~~~~~~~~~l~~~~---~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~ 160 (513) ++|-+||+++.++ .+..+. +.++++.....+.+.++.+|+++++|-....|.-+.++.++|.+++ =|+-..... T Consensus 72 ~vf~k~p~~~~p~--~l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii-~W~~~~~g~ 148 (452) T protein:vir:94 72 MVLDQPPVITHPD--AMSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENIL-NWEEDEDGR 148 (452) T ss_pred hhhcCCceecccH--HHHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhc-CccccccCC Confidence 9999999986543 333332 3478999999999999999999999977766655555668888866 354333344 Q ss_pred eEEE-EEEEee--cccccccceeEEEEEEEc--CCcEEEEEeeccCCccc-----cccccccccCcccceEEecCCC--- Q lcl|NC_019916. 161 PIMA-VRYHAV--QTVVDNITQTKYEVETWT--ENDYTRYKPIVVAGSVP-----TLEVAEHSAQFGFPMIEYRNNE--- 227 (513) Q Consensus 161 ~~~~-ir~~~~--~~~~~~~~~~~~~ve~yt--~~~~~~~~~~~~~~~~~-----~~~~~~~~~~g~vPvv~~~n~~--- 227 (513) +... +|.... +..+....+....+.+++ ++.+...+....++..+ .......++++.||||.|.... T Consensus 149 l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~~ 228 (452) T protein:vir:94 149 LLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPSGLSM 228 (452) T ss_pred eeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCCCCCC Confidence 4333 333221 122223334444444444 44333222222222211 1222335789999999886543 Q ss_pred -CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhc Q lcl|NC_019916. 228 -YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQA 306 (513) Q Consensus 228 -~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 306 (513) .+.+.|.++-.|.-+.-+..|+..+.+...++|++++.|..... T Consensus 229 ~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~----------------------------------- 273 (452) T protein:vir:94 229 TPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS----------------------------------- 273 (452) T ss_pred CCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC----------------------------------- Confidence 35667889999988999999999999999999999999964211 Q ss_pred ceeeccccccccccccCCceeEEeecCCH-HHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 307 NMILLKTGMAPNGQQTSADANYIHKEYDS-AGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELAST 385 (513) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~ 385 (513) -+.+.++........+++++|+..+.+. +..+..++.|.+.+...+.- +-.....++.|++|.......-.+.... T Consensus 274 -~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~--ll~~~~~~~~s~ea~~~~~~~~~s~L~~ 350 (452) T protein:vir:94 274 -TMHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASLSAR--LIDNSTRGSEATETVKLRYMSETASLKS 350 (452) T ss_pred -ceEecccccccCCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHHHHH--hhccCCCcchHHHHHHHHHHHhhHHHHH Confidence 0112222222333457789999988544 77889999999999887652 1222233567887755433333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhC--CCCCCHH Q lcl|NC_019916. 386 KRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYL--PNVTDAD 461 (513) Q Consensus 386 ~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l--~~v~D~~ 461 (513) .-.....++.+++++++.+++. +.+ ..+++.-....+.-..+.++++.++ +|.+|++|++..| ..|-|++ T Consensus 351 ~a~~~e~al~~~l~~~a~w~g~-----~~~-~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~ 424 (452) T protein:vir:94 351 VTRAVEALLNKAYSCIMDMESM-----GGT-LNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPP 424 (452) T ss_pred HHHHHHHHHHHHHHHHHHHcCC-----CCc-eEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCc Confidence 3344556667777777776542 111 1223222222233345667766664 7899999998887 4567888 Q ss_pred HHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 462 EIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 462 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) .|.+++..|.++.. ..+.+ .+.+++. +. T Consensus 425 ~e~~~i~~E~~~~~--------~~~~~-~~~~~~~------------~~ 452 (452) T protein:vir:94 425 GESMGVIPDPPAPE--------PSPSN-TPPNPSS------------KA 452 (452) T ss_pred cCHHHHHHHhhccC--------cccCC-CCCCCcc------------CC Confidence 88888887744310 00000 0001111 11 No 76 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.86 E-value=1.5e-20 Score=129.27 Aligned_cols=432 Identities=11% Similarity=0.045 Sum_probs=232.2 Q ss_pred CC-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-----ccc--cc-ccCCCCC------CcceeecchhHHH Q lcl|NC_019916. 13 ED-ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGI-----LSP--AS-RRNEKGK------ADHRAVHSFARYI 77 (513) Q Consensus 13 ~~-~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~-----~~~--~~-~~~~~~~------~~~ri~~n~~~~i 77 (513) |+ ..--.+ ......++++..++-+.|...+. +.+ .. ....+++ ..+-+-.|+++.+ T Consensus 1 m~~V~~~hp---------~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t 71 (501) T protein:vir:95 1 MPNVSFIRP---------ELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRT 71 (501) T ss_pred CCCCCCCCH---------HHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHH Confidence 22 010011 23445667788888888865321 111 10 0011101 1112346999999 Q ss_pred HHHHHHHhhcCCeeecCCcHHHHHHHHHh-----cCHHHHHHHHHHHHhhCCeEEEEeeecCCC-ce------------- Q lcl|NC_019916. 78 ADFQTSYSVGNAIAMSGPSSDRLDDFNRR-----NDIDTLNYELYLDMTVTGRAYEYVYRDPSQ-KG------------- 138 (513) Q Consensus 78 vd~~~~~l~g~p~~~~~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~-~~------------- 138 (513) ++..++++|-++|+++ ....++.++++ ++++.....+.+.++.+|+++++|-....+ .+ T Consensus 72 ~~~l~G~vf~k~p~~~--~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~r 149 (501) T protein:vir:95 72 LFGLVGQVFMRDPVVK--VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIR 149 (501) T ss_pred HHHHhhhhhcCCccee--CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCC Confidence 9999999999999985 34557777644 689999999999999999999999543221 11 Q ss_pred eEEEEEcccceEEEecCC-C--CcceEEEE-EEEeecccccccceeEEEEEEEc--CCcEEE---EEeeccC-------- Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRS-V--NPKPIMAV-RYHAVQTVVDNITQTKYEVETWT--ENDYTR---YKPIVVA-------- 201 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~-~--~~~~~~~i-r~~~~~~~~~~~~~~~~~ve~yt--~~~~~~---~~~~~~~-------- 201 (513) +.++.+.|.+++- |+.. + ...+..++ |-...+..+....+.+..+.+.+ .+..+. |+....+ T Consensus 150 Py~~~~~~~~Iin-W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~ 228 (501) T protein:vir:95 150 PTLYVYSPTEIIN-WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIP 228 (501) T ss_pred cEEEEecHhhhcC-cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceec Confidence 2334466666432 3321 1 12333332 22222222223333333333333 222222 2211111 Q ss_pred -Cc-----cccccccccccCcccceEEecCCCC----CCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccc Q lcl|NC_019916. 202 -GS-----VPTLEVAEHSAQFGFPMIEYRNNEY----RQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTL 271 (513) Q Consensus 202 -~~-----~~~~~~~~~~~~g~vPvv~~~n~~~----~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~ 271 (513) +. .+.....-.|.++.||||.|..... +.+.|.++-.|.-+.=+..|+....+...++|+++++|..... T Consensus 229 ~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~ 308 (501) T protein:vir:95 229 KGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEW 308 (501) T ss_pred CCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccc Confidence 00 0111122247899999998744322 3445555555554554556788889999999999999975432 Q ss_pred cccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 272 FDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHK 351 (513) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 351 (513) .... ..+.+.+.+. ......++++++|+..+.+.- .+..++.+.+.|.. T Consensus 309 ~~~~-----------------------------~~~~i~~G~~-~~~~lP~~~~~~~ie~~~~~i-~~~~l~~l~~~m~~ 357 (501) T protein:vir:95 309 VTNV-----------------------------LKGSVNFGSR-GGIPLPVGADAKLLQASENTM-LKEAMDTKERQMVA 357 (501) T ss_pred cccC-----------------------------CCCceeeccc-ccccCCCCCceeEEecChhhH-HHHHHHHHHHHHHH Confidence 1111 0111112111 112234678999998765443 36778899998888 Q ss_pred HhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCc-C Q lcl|NC_019916. 352 FSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPT-D 430 (513) Q Consensus 352 ~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~-d 430 (513) ++..+ . ....++-||+|.+.......+.....-..+..++.+++++++.+++.... .++|..++..+. . T Consensus 358 ~Ga~l--l-~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~-------~~~v~i~~df~~~~ 427 (501) T protein:vir:95 358 LGAKL--V-EQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQADS-------GVKFELNTDFDIAR 427 (501) T ss_pred HHHhh--c-cCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-------ceEEEEeccccccc Confidence 75432 1 22235678888877766666666667777888888888888887653211 223333333222 2 Q ss_pred -HHHHHHHHHHH--hcCCCHHHHHHhC---CCCC-CHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 431 -DVAIITALVQA--GAQIPQEYLYQYL---PNVT-DADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQ 503 (513) Q Consensus 431 -~~e~a~~~~kl--~g~iS~et~~~~l---~~v~-D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (513) ..+.++++.++ +|.+|.+|+++.| +.++ |.+.|.++|..|..+... .+ ...+.....+|+++ ..++. T Consensus 428 ~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~-~~----~~~~~~~~~~gg~~-~~~~~ 501 (501) T protein:vir:95 428 MTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMA-LA----TPANVPGDGSGGDN-VGNSE 501 (501) T ss_pred CCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCccc-cc----ccCCCCCCCccccc-ccCCC Confidence 35556776665 7889999996665 4333 345555666554332111 01 11111111222222 11111 No 77 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.85 E-value=3.8e-19 Score=121.56 Aligned_cols=464 Identities=9% Similarity=-0.002 Sum_probs=234.4 Q ss_pred Cccch-------hhceeccCC---cccC--CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccC-------- Q lcl|NC_019916. 1 MIDMQ-------QANMNYQED---ADKL--TPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRN-------- 60 (513) Q Consensus 1 ~~~~~-------~~~~~~~~~---~~~~--~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~-------- 60 (513) |..-+ +.....+.. ...| +-.+|.. ....+....++++..++-+.|...+......+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~-~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~ 79 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGY-QRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRD 79 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCc-CCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCC Confidence 11100 011111110 1111 0011211 1112345567788888888886432211111111 Q ss_pred CCCC------CcceeecchhHHHHHHHHHHhhcCCeeecCCcHHHHHHHHHh-----cCHHHHHHHHHHHHhhCCeEEEE Q lcl|NC_019916. 61 EKGK------ADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDRLDDFNRR-----NDIDTLNYELYLDMTVTGRAYEY 129 (513) Q Consensus 61 ~~~~------~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~~~~~ 129 (513) .+.+ ..+=+-.|+++.+++..++++|-+++.++. ...++.++++ ++++.....+.+.++.+|+++++ T Consensus 80 ~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~--p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iL 157 (535) T protein:vir:80 80 EEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQL--PPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIF 157 (535) T ss_pred cCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceec--cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEE Confidence 0110 111234799999999999999999998753 3566777654 68999999999999999999999 Q ss_pred eeecCCCce------------eEEEEEcccceEEEecCC-C--CcceEEEE-EEEeecccccccceeEEEEEEEcC--Cc Q lcl|NC_019916. 130 VYRDPSQKG------------EVSVKLDPMECFIIYDRS-V--NPKPIMAV-RYHAVQTVVDNITQTKYEVETWTE--ND 191 (513) Q Consensus 130 v~~d~~~~~------------~~~~~~~p~~~~~~~d~~-~--~~~~~~~i-r~~~~~~~~~~~~~~~~~ve~yt~--~~ 191 (513) |-.-..+.. +.++.+.|.+++- |+.. + ...+..++ |-......++...+.+.++.+++. +. T Consensus 158 VD~P~~~~~~t~ade~~~~~rPy~~~y~ae~Iin-W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G 236 (535) T protein:vir:80 158 TDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIIN-WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEG 236 (535) T ss_pred EeecCCCCcccHHHHHhcCCCcEEEEechhhccC-ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCc Confidence 954333321 3444567776543 3322 1 22344332 222222223444444444444443 22 Q ss_pred EEE---EEeeccCCccc-----cccccccccCcccceEEecCCC----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 192 YTR---YKPIVVAGSVP-----TLEVAEHSAQFGFPMIEYRNNE----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNE 259 (513) Q Consensus 192 ~~~---~~~~~~~~~~~-----~~~~~~~~~~g~vPvv~~~n~~----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~ 259 (513) .+. |+....+..+. .......|.++.||||.|.... .+.+.|.++-.|.-+.=+..|+..+.+...++ T Consensus 237 ~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~ 316 (535) T protein:vir:80 237 NYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQ 316 (535) T ss_pred eEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcC Confidence 222 22222221111 1112345789999999885332 24556777777777777788889999999999 Q ss_pred hhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHH Q lcl|NC_019916. 260 AMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTE 339 (513) Q Consensus 260 ~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 339 (513) |+++++|......++.. ....+.+.+..+ ...+++++++|+..+.+.-+. T Consensus 317 P~l~i~G~~~~~~~~~~----------------------------~~~~i~iG~~~~-~~lP~~~~~~~~e~~~~~~a~- 366 (535) T protein:vir:80 317 PTAFFTGLTKDWVEDVF----------------------------KDFKVHLGSRAI-IPLPQGATAGILQITPNSVPF- 366 (535) T ss_pred ceeeeecCchhhhhcCC----------------------------CCcceEecCccc-ccCCCCCCcceeeeccchhHH- Confidence 99999997532211100 001111111111 123457888999887665554 Q ss_pred HHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccee Q lcl|NC_019916. 340 LYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEI 419 (513) Q Consensus 340 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i 419 (513) ..++.+.+.|..+....- ....++.++.+-+...+...+.....-.....++.+++++++.+++.. .+...+ T Consensus 367 ~~l~~~e~qM~~lGa~ll---~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~-----~~~~~~ 438 (535) T protein:vir:80 367 EAMTHKESQMIAMGANLL---VKSGGNRTFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTGI-----VNDETV 438 (535) T ss_pred HHHHHHHHHHHHHHHHhh---ccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCc-----cCCCce Confidence 468888888877754432 122344444433333333333344444556667777777777765422 122233 Q ss_pred eEEeCC-CCCcC-HHHHHHHHHHH--hcCCCHHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHhhhhcCCCCCCC Q lcl|NC_019916. 420 GFIFRD-NLPTD-DVAIITALVQA--GAQIPQEYLYQYL---PNVT---DADEIVKMMDKQRKAMLKTYDTKGGLIINGT 489 (513) Q Consensus 420 ~i~f~~-~~p~d-~~e~a~~~~kl--~g~iS~et~~~~l---~~v~---D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 489 (513) .|.-++ ..... ..+.++++.++ +|.||++|++..| +.++ +.++|..|++.|..+.-.......+ ...+. T Consensus 439 ~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d-~~~~g 517 (535) T protein:vir:80 439 EYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGD-AASGG 517 (535) T ss_pred EEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCC-CCCCC Confidence 333222 12222 34566666665 7899999998776 3331 2355666776664332111110000 00011 Q ss_pred CCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 490 SGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~ 509 (513) .+..+.+++..++ ....+ T Consensus 518 ~~~~~~~~~~~~~--~~~~~ 535 (535) T protein:vir:80 518 TNKAKLNNGNGGG--NQAGN 535 (535) T ss_pred CCcCcccCCcccc--ccCCC Confidence 1111111110000 11111 No 78 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.83 E-value=1.1e-19 Score=124.62 Aligned_cols=437 Identities=11% Similarity=0.003 Sum_probs=222.5 Q ss_pred Ccc-chhhceeccCCcc-------cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccc------cccccccCCCCC Q lcl|NC_019916. 1 MID-MQQANMNYQEDAD-------KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRG--QNDGI------LSPASRRNEKGK 64 (513) Q Consensus 1 ~~~-~~~~~~~~~~~~~-------~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G--~~~i~------~~~~~~~~~~~~ 64 (513) |.. |==.-..+.|... .+.+.|. ++..-.... .-.....|..- .++.. +..........+ T Consensus 1 ~~~~~~~~~~~~~m~V~~~hp~y~a~~~~W~--~~~d~g~~~--~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y 76 (488) T protein:vir:96 1 MLKCLYIKHRGFFMLTPIYHPDYLVNAPQWL--RNLDCVMDN--IKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDW 76 (488) T ss_pred CceeEEEeecceeecccccCHHHHHHhhhhh--HhhhhhhHH--HHHhhhhcCCCCCCccccccCcchhhhhhccchhhh Confidence 100 0000011111211 2223331 111111111 11122233211 00000 000000000001 Q ss_pred Ccc---e-eecchhHHHHHHHHHHhhcCCeeecCCcHHHHHHHHHh-----cCHHHHHHHHHHHHhhCCeEEEEeeecCC Q lcl|NC_019916. 65 ADH---R-AVHSFARYIADFQTSYSVGNAIAMSGPSSDRLDDFNRR-----NDIDTLNYELYLDMTVTGRAYEYVYRDPS 135 (513) Q Consensus 65 ~~~---r-i~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~~~~~v~~d~~ 135 (513) -++ | +-.|+++..++..++++|-++|+++.++...++.++++ ++++.....+.+.++.+|+++++|-..++ T Consensus 77 ~~~~~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~ 156 (488) T protein:vir:96 77 EDLTWRLANYVNIVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPE 156 (488) T ss_pred HhhhhhccccCchhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCC Confidence 111 2 23699999999999999999999988877778888764 68999999999999999999999966543 Q ss_pred Cc----------eeEEEEEcccceEEEecCC--CCcceEEEE-EE-EeecccccccceeEEEEEEEcCCcEEEEEeeccC Q lcl|NC_019916. 136 QK----------GEVSVKLDPMECFIIYDRS--VNPKPIMAV-RY-HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVA 201 (513) Q Consensus 136 ~~----------~~~~~~~~p~~~~~~~d~~--~~~~~~~~i-r~-~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~ 201 (513) +. -+..+.++|.+++----+. ....+..++ |- +...+..+......+.+..+++..+..++...++ T Consensus 157 ~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~ 236 (488) T protein:vir:96 157 SATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDE 236 (488) T ss_pred cCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCC Confidence 32 1344456777754321111 122344332 21 2222222223334444445666544433333332 Q ss_pred Ccccc-ccccccccCcccceEEecCCC----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccc Q lcl|NC_019916. 202 GSVPT-LEVAEHSAQFGFPMIEYRNNE----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDST 276 (513) Q Consensus 202 ~~~~~-~~~~~~~~~g~vPvv~~~n~~----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~ 276 (513) ....+ ....-.++++.||||.|.... .+.+-|.++-.|.-+.=+..|+.-..+.....|++++.+.+....... T Consensus 237 ~~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~- 315 (488) T protein:vir:96 237 YSDEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTMAS- 315 (488) T ss_pred cccceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCccccc- Confidence 22221 112235689999999985432 244556677777666667777787777777788777533211100000 Q ss_pred ccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhC-c Q lcl|NC_019916. 277 LLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSH-T 355 (513) Q Consensus 277 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~-~ 355 (513) ....+.+..+.......+.|+++|+..+.+.- .+..++.|.+.+..++. + T Consensus 316 ----------------------------~~~~~g~~~~~~~~~~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~~Ga~l 366 (488) T protein:vir:96 316 ----------------------------EMNPLGFTLAGRMPYYVKNGDVKVIQAQFSPE-TENKVEKLFEQAVKVGASL 366 (488) T ss_pred ----------------------------ccccceeeecccccccccCCceeecCCchhHH-HHHHHHHHHHHHHHHhHhh Confidence 00001111111112223467788887664433 36678888888877653 3 Q ss_pred cccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCC-CCcC-HHH Q lcl|NC_019916. 356 PDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDN-LPTD-DVA 433 (513) Q Consensus 356 p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~-~p~d-~~e 433 (513) +. . +++-||++.+.....-.+.....-.....++++++++++..++...+..... .++|.-++. .+.. ..+ T Consensus 367 ~~----~-~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~~--~~~~~in~dF~~~~ld~~ 439 (488) T protein:vir:96 367 FT----Q-QSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEGTNLYVNPD--ELVFKLNRDYFDVEVNPQ 439 (488) T ss_pred cc----C-CCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCcc--ceEEEeccCCCCccCCHH Confidence 32 1 2456788877766666666666667788888888888888776543322222 233333321 2222 355 Q ss_pred HHHHHHHH--hcCCCHHHHHHhC--CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCC Q lcl|NC_019916. 434 IITALVQA--GAQIPQEYLYQYL--PNVTDADEIVKMMDKQRKAMLKTYDTKGGLII 486 (513) Q Consensus 434 ~a~~~~kl--~g~iS~et~~~~l--~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~ 486 (513) .++++.++ +|.||.+|.++.| ..|-+++..++.++.+-++ . +.+. T Consensus 440 ~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~-----~---g~~~ 488 (488) T protein:vir:96 440 MLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAE-----L---GFGM 488 (488) T ss_pred HHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhh-----c---CCCC Confidence 67777775 7899999998765 3343222222222222111 0 1111 No 79 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.82 E-value=3.7e-18 Score=116.17 Aligned_cols=438 Identities=10% Similarity=0.021 Sum_probs=234.3 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCC----CC---CC---cceee Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNE----KG---KA---DHRAV 70 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~----~~---~~---~~ri~ 70 (513) |.+ .+... .+|..- ........++++..++-|.|... ...+....+. .. +. .+-+- T Consensus 1 ~~~---------~~~~~---~~V~~~-hp~y~a~~~~W~~ird~~~G~~~-~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~ 66 (489) T protein:vir:78 1 MLT---------ENGQG---SGVKTK-HREWLHYAPKWQKVRHALAGELV-SYLRNVGLNEPDKAYGEARQAEYEAGGIV 66 (489) T ss_pred Ccc---------CCCcc---CCCCcc-CHHHHHHHHHHHHHHHHhcCccc-ccccCCCCCCCCCCCChHHHHHHHhcccc Confidence 111 11111 111111 11124456778889998999532 1111111110 00 11 11134 Q ss_pred cchhHHHHHHHHHHhhcCCeeecCCcHHHHHHHHHh-----cCHHHHHHHHHHHHhhCCeEEEEeeecCCCc-------- Q lcl|NC_019916. 71 HSFARYIADFQTSYSVGNAIAMSGPSSDRLDDFNRR-----NDIDTLNYELYLDMTVTGRAYEYVYRDPSQK-------- 137 (513) Q Consensus 71 ~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~-------- 137 (513) .|+++.+++..++++|-++|.++. ...++.++++ ++++.....+.+.++.+|+++++|-....+. T Consensus 67 ~n~~~~tl~~l~G~vfrk~p~~~~--p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~ 144 (489) T protein:vir:78 67 YNFTRRTLSGMVGSVMRKEPEINI--PKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNA 144 (489) T ss_pred CChHHHHHHHHhchhhcCCcceec--cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHH Confidence 699999999999999999998854 3456666653 6899999999999999999999996644431 Q ss_pred ---eeEEEEEcccceEEEecC-C--CCcceEEEEEEEee----cccccccceeEEEEEEEcCC--cE---EEEEeeccCC Q lcl|NC_019916. 138 ---GEVSVKLDPMECFIIYDR-S--VNPKPIMAVRYHAV----QTVVDNITQTKYEVETWTEN--DY---TRYKPIVVAG 202 (513) Q Consensus 138 ---~~~~~~~~p~~~~~~~d~-~--~~~~~~~~ir~~~~----~~~~~~~~~~~~~ve~yt~~--~~---~~~~~~~~~~ 202 (513) -+..+.+.|.+++- |+. . ....+..+ ++-.. +..++...+.+..+.+++.+ .. ..|+....+. T Consensus 145 ~~~rPy~~~~~~~~Iin-W~~~~v~G~~~Lt~v-~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~ 222 (489) T protein:vir:78 145 GLLNPTIAFYTTENIVN-WRLTRVGSVNRVTMV-VLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGG 222 (489) T ss_pred hcCCcEEEEechhhhcC-ceeeeeCCccceeEE-EEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCc Confidence 13344567777543 321 1 12234433 22221 22234445556666666653 22 2233333332 Q ss_pred ccccc----cccccccCcccceEEecCCC----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccc Q lcl|NC_019916. 203 SVPTL----EVAEHSAQFGFPMIEYRNNE----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDD 274 (513) Q Consensus 203 ~~~~~----~~~~~~~~g~vPvv~~~n~~----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~ 274 (513) ..... ...-.++++.||||.|.... .+.+-|.++-.|.-+.=+..|+.-..+...+.|+++++|........ T Consensus 223 ~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~ 302 (489) T protein:vir:78 223 AQEDVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQA 302 (489) T ss_pred ccceeeEEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCccc Confidence 22211 12234789999999986432 24455777777766666778889999999999999999964321110 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHh- Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFS- 353 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s- 353 (513) . ... ..+.+.+.+... .....+++++|+..+.+.. .+..++.+.+.+..+. T Consensus 303 ~-------------------------~~~-~~~~i~~g~~~~-~~lp~~~~~~~ie~~~~~~-~r~~l~~le~qm~~lGa 354 (489) T protein:vir:78 303 F-------------------------KEA-NPNGIKFGSRRG-HNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGA 354 (489) T ss_pred c-------------------------ccc-CccceeeCCccc-ccCCCCCCcceeccCcchH-HHHHHHHHHHHHHHHhh Confidence 0 000 001111111111 1223578889998876544 3667888888877763 Q ss_pred CccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHH Q lcl|NC_019916. 354 HTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVA 433 (513) Q Consensus 354 ~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e 433 (513) .+.. . +++-||++.+.....-.+.....-.....++.+++++++.+++...+. ... -.+...|.. ..-..+ T Consensus 355 ~l~~----~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~-~~~-i~~n~dF~~--~~~d~~ 425 (489) T protein:vir:78 355 QLIT----P-TQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLGKPEDT-EVE-FRLNMDFFL--EPMTAQ 425 (489) T ss_pred hhcc----C-CcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC-ceE-EEeecccCc--ccCCHH Confidence 3332 2 246788877776666666666666778888888888888876532110 000 012223322 111355 Q ss_pred HHHHHHHH--hcCCCHHHHHHhCC--CCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 434 IITALVQA--GAQIPQEYLYQYLP--NVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEP 506 (513) Q Consensus 434 ~a~~~~kl--~g~iS~et~~~~l~--~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (513) .++++.++ +|.||.+|.+..|- .|-|+. .+.++.|-+. ++.+ .+ ..++++=+.+.++.+. T Consensus 426 ~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~--~e~~~~ei~~-----~~~~-~~-----~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 426 DRAAWMADINAGLLPATAYYAALRKAGVTDWT--DADIKDAVAD-----QPLP-VA-----TEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc--HHHHHHHHhh-----cCCC-cc-----cCCcccCCCCcccccC Confidence 56666665 78999999987652 344322 2222222111 1111 10 0111111111111111 No 80 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.79 E-value=6.1e-18 Score=114.95 Aligned_cols=437 Identities=11% Similarity=0.033 Sum_probs=230.2 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccC----CCCCC------cceee Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRN----EKGKA------DHRAV 70 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~----~~~~~------~~ri~ 70 (513) |. -|| ... .+|..- ........++++..++-|.|... ........+ ..... .+=+- T Consensus 1 ~~---~~~------~~~---~~V~~~-hp~y~a~~~~W~~ird~~~G~~~-~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~ 66 (491) T protein:vir:95 1 ML---TAN------GQG---SGVKTK-HREWLHYAPKWQKVRHALAGDLV-GYLRNVGLNEPDKAYGEARQAEYEAGGIV 66 (491) T ss_pred Cc---ccC------Ccc---CCCCcc-CHHHHHHHHHHHHHHHHhcCcch-hhcccCCCcCCCCCCCHHHHHHHHhcccC Confidence 11 111 111 111110 11124456778888888988531 111111111 01111 11134 Q ss_pred cchhHHHHHHHHHHhhcCCeeecCCcHHHHHHHHHh-----cCHHHHHHHHHHHHhhCCeEEEEeeecCCCc-------- Q lcl|NC_019916. 71 HSFARYIADFQTSYSVGNAIAMSGPSSDRLDDFNRR-----NDIDTLNYELYLDMTVTGRAYEYVYRDPSQK-------- 137 (513) Q Consensus 71 ~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~-------- 137 (513) .|+++.+++..++++|-++|+++.+ ..++.++++ ++++.....+.+.++.+|+++++|-....+. T Consensus 67 ~n~~~~tl~~l~G~vfrk~p~~~~p--~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~ 144 (491) T protein:vir:95 67 YNFTRRTLSGMVGSVMRKEPEINIP--KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNA 144 (491) T ss_pred CChHHHHHHHHhchhhcCCceeecc--HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHH Confidence 6999999999999999999998533 446666653 7899999999999999999999996543321 Q ss_pred ---eeEEEEEcccceEEEecC---CCCcceEEEEEEEee----cccccccceeEEEEEEEcC---Cc--EEEEEeeccCC Q lcl|NC_019916. 138 ---GEVSVKLDPMECFIIYDR---SVNPKPIMAVRYHAV----QTVVDNITQTKYEVETWTE---ND--YTRYKPIVVAG 202 (513) Q Consensus 138 ---~~~~~~~~p~~~~~~~d~---~~~~~~~~~ir~~~~----~~~~~~~~~~~~~ve~yt~---~~--~~~~~~~~~~~ 202 (513) -+..+.+.|.+++- |+. .....+.. +++-.. +..++...+.+..+.+++. .. +..|+....+. T Consensus 145 ~~~rPy~~~~~~~~Iin-W~~~~v~g~~~L~~-v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~ 222 (491) T protein:vir:95 145 GLLNPTIAFYTTENIVN-WRLTRVGSVNRVTM-VVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGG 222 (491) T ss_pred hcCCcEEEEechhhhcC-ceeeeeCCceeeeE-EEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCc Confidence 13344567777543 221 11223333 333222 1123334444444444432 21 22232222222 Q ss_pred cccccc----ccccccCcccceEEecCCC--C--CCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccc Q lcl|NC_019916. 203 SVPTLE----VAEHSAQFGFPMIEYRNNE--Y--RQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDD 274 (513) Q Consensus 203 ~~~~~~----~~~~~~~g~vPvv~~~n~~--~--~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~ 274 (513) .....+ ..-.++++.||||.+.... . +.+-|.++-.|.-+.=+..|+.-..+...+.|+++++|........ T Consensus 223 ~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~ 302 (491) T protein:vir:95 223 AQEEVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQS 302 (491) T ss_pred ceeeeeeeeecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcch Confidence 221111 1224679999999985432 2 3455667766666666777888899999999999999964321111 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHH-h Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKF-S 353 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~-s 353 (513) ... .....+.+.+.... .-..+++++|+..+.+.- .+..++.+...+... + T Consensus 303 ~~~--------------------------~~~~~i~~g~~~~~-~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~~Ga 354 (491) T protein:vir:95 303 FKE--------------------------ANPNGIKFGSRCGH-NLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGA 354 (491) T ss_pred hhc--------------------------cCcceeEecCcCCc-CCCCCCccceeecCcchH-HHHHHHHHHHHHHHHHH Confidence 000 00011111111111 123578889998876554 366677777777665 3 Q ss_pred CccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc-ceeeEEeCCCCCcCHH Q lcl|NC_019916. 354 HTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDP-DEIGFIFRDNLPTDDV 432 (513) Q Consensus 354 ~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~-~~i~i~f~~~~p~d~~ 432 (513) .+.. . +++-||++.+.....-.+.....-.....++.+++++++.+++...+ .+. -.+...|.. ..-.+ T Consensus 355 ~l~~----~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~---~~v~i~~n~dF~~--~~~~~ 424 (491) T protein:vir:95 355 QLIT----P-SQQITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMMLGKPED---SEVEFQLNMDFFL--QPMTA 424 (491) T ss_pred Hhcc----C-CcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC---CceEEEeeccccc--ccCCH Confidence 3322 1 34678888777666666666666677888888888888887643211 110 012222322 22235 Q ss_pred HHHHHHHHH--hcCCCHHHHHHhCC--CCCCH--HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 433 AIITALVQA--GAQIPQEYLYQYLP--NVTDA--DEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEP 506 (513) Q Consensus 433 e~a~~~~kl--~g~iS~et~~~~l~--~v~D~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (513) +.++++.++ +|.+|++|.+..|- .|.|. +++.++|+.|. .+.+.. .+..++- +...++... T Consensus 425 ~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~~~-------~~~~~~-----~~~~~~~-~~~~~~~~~ 491 (491) T protein:vir:95 425 QDRAAWMADINAGLLPATAYYAALRKAGVTDWTDEDILNAIEDAP-------LPSGAV-----TQVAGEI-PQAAQQQQE 491 (491) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHHhcC-------CCCCcc-----ccccccc-hhhhhhccC Confidence 567777665 78999999987652 34432 33333332221 111111 1111111 111111111 No 81 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.79 E-value=2.7e-19 Score=122.43 Aligned_cols=471 Identities=11% Similarity=0.057 Sum_probs=220.2 Q ss_pred Cccc----hhhceeccCCcccCCHHHHHHHHHHHHHH------HHHHHHHHHHHhcCCCccccccccccCCCCCCcceee Q lcl|NC_019916. 1 MIDM----QQANMNYQEDADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAV 70 (513) Q Consensus 1 ~~~~----~~~~~~~~~~~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~ 70 (513) |++. ++.....+.|..+ .-+.+.+++..+... -+....+-.+||.|+|=- ..........++ -.+. T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~--p~~~ 97 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQ-AVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWS-QDEIDELKERGQ--APTV 97 (776) T ss_pred CCCCCcccchhcccCCCCCHH-HHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCC-HHHHHHHHhcCC--ceEE Confidence 3222 2233333333222 223444554432211 122345667899998621 111111122222 2378 Q ss_pred cchhHHHHHHHHHHhhcCCeee--cCCc--H--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCC--C Q lcl|NC_019916. 71 HSFARYIADFQTSYSVGNAIAM--SGPS--S--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPS--Q 136 (513) Q Consensus 71 ~n~~~~ivd~~~~~l~g~p~~~--~~~~--~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~--~ 136 (513) +|.++.+|+..+++...+.+.+ ...+ + ..++.+++.|+++...+.+..+++++|.||+-|+++.+ + T Consensus 98 ~N~i~~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~ 177 (776) T protein:vir:93 98 YNVISQSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDG 177 (776) T ss_pred ecchHHHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCC Confidence 9999999999999988775543 2221 1 23666788899999999999999999999999888754 3 Q ss_pred ceeEEEEEcccceEEEecCCCC------cceEEEEEEEee----------------------------cc---------- Q lcl|NC_019916. 137 KGEVSVKLDPMECFIIYDRSVN------PKPIMAVRYHAV----------------------------QT---------- 172 (513) Q Consensus 137 ~~~~~~~~~p~~~~~~~d~~~~------~~~~~~ir~~~~----------------------------~~---------- 172 (513) .+...-.++|.++++ |+... .+.++ .+.|.. .+ T Consensus 178 ~~~~~~~~~p~~i~~--Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (776) T protein:vir:93 178 EPIYAGAESWRNILW--DSTYRRLDMDDCRYIF-RVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSP 254 (776) T ss_pred CceEeeccChhheee--ccccccCCHHHHhhhh-hhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccc Confidence 332222357776543 32110 11110 000000 00 Q ss_pred ------------cccccceeEEEEEEEcCCcEEEEEee-----------------------------------------c Q lcl|NC_019916. 173 ------------VVDNITQTKYEVETWTENDYTRYKPI-----------------------------------------V 199 (513) Q Consensus 173 ------------~~~~~~~~~~~ve~yt~~~~~~~~~~-----------------------------------------~ 199 (513) ......+.+..+|+|....+...... - T Consensus 255 ~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~ 334 (776) T protein:vir:93 255 EYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIM 334 (776) T ss_pred ccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEE Confidence 00001123334555543221110000 0 Q ss_pred cCCccccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccc Q lcl|NC_019916. 200 VAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDD 274 (513) Q Consensus 200 ~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~ 274 (513) .++. .......+.+++.||+|+|+... .+.|.+..++++++.+|..+|.+.+.+. +.++.+-.|..... T Consensus 335 ~g~~-~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav~~~--- 408 (776) T protein:vir:93 335 TTRD-LMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAVDDI--- 408 (776) T ss_pred ecch-hhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc--CCceeeccccccch--- Confidence 0000 00111223456889999886532 4779999999999999999999988763 34444434422110 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSH 354 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 354 (513) +... . ...+.+.++.+.++.. +.+.+.....-..++...+..+...|..+|+ T Consensus 409 ------------d~~~---~------~~~rp~~vi~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tG 460 (776) T protein:vir:93 409 ------------DEFR---R------EAARPDAVMTVKNGKL-------GAVKMDVDRDLAPAHLELASRSIQMIQQVGG 460 (776) T ss_pred ------------HHHH---H------hcccCCceeeeCCccc-------cccccccCcCccHHHHHHHHHHHHHHHHhhC Confidence 0000 0 0112233444433221 1222222222235677889999999999999 Q ss_pred ccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------ccccc------- Q lcl|NC_019916. 355 TPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKW----------DIDPD------- 417 (513) Q Consensus 355 ~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~----------~~~~~------- 417 (513) +.+...+..+++.||+|+..+...........-..|..+++++.++++.++....... ...+. T Consensus 461 i~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~ 540 (776) T protein:vir:93 461 VTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLP 540 (776) T ss_pred cChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccch Confidence 9988888777779999999887777777777777777788888777777665542210 00010 Q ss_pred ---------eeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHH-------HHHhC--CCCCCHHHHHHHHHH---------- Q lcl|NC_019916. 418 ---------EIGFIFRDNLPTDDVAIITALVQAGAQIPQEY-------LYQYL--PNVTDADEIVKMMDK---------- 469 (513) Q Consensus 418 ---------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et-------~~~~l--~~v~D~~~E~~ri~~---------- 469 (513) +|.|.=.+..+.-..+..+.++.+-+.+..+. +++.. |...+..++++.... T Consensus 541 ~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~ 620 (776) T protein:vir:93 541 ENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPT 620 (776) T ss_pred hhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcc Confidence 11121122222212333334444333222211 12222 222222211111100 Q ss_pred -------HH-HHHHHHhhhhcCCCCCCCCC---CCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 470 -------QR-KAMLKTYDTKGGLIINGTSG---NDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 470 -------E~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +. ...+................ ....+......+. .......+ T Consensus 621 ~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa-~~~~~~a~ 674 (776) T protein:vir:93 621 PEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKA-KHISRMAI 674 (776) T ss_pred hhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhh-hhhhhcch Confidence 00 00000000000000000000 0000000000000 00000000 No 82 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.75 E-value=5.5e-17 Score=109.70 Aligned_cols=475 Identities=10% Similarity=0.058 Sum_probs=228.7 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchh Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHY------NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFA 74 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~ 74 (513) -+.-+| ...+..+.+ -+.+.+.++...+. ...+....+-.+||.|.|=- ..........+. -.+.+|.+ T Consensus 12 ~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~--p~~~~N~i 86 (711) T protein:vir:10 12 QLYAKK-AKVYAKNND-DDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWP-SQVRTERELEQR--PCLVNNVL 86 (711) T ss_pred chhHHH-HHhcccCcc-hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCC-HHHHHHHHhcCC--CcEEEcch Confidence 222222 233333322 23445555554422 12233355668899997611 010111111222 24779999 Q ss_pred HHHHHHHHHHhhcCCeeec--C------------------------CcH--------HHHHHHHHhcCHHHHHHHHHHHH Q lcl|NC_019916. 75 RYIADFQTSYSVGNAIAMS--G------------------------PSS--------DRLDDFNRRNDIDTLNYELYLDM 120 (513) Q Consensus 75 ~~ivd~~~~~l~g~p~~~~--~------------------------~~~--------~~l~~~~~~n~~~~~~~~~~~~a 120 (513) +.+|+..+++-.-+.+.+. . .++ ..++.+.+.|+.+...+.+..++ T Consensus 87 ~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~ 166 (711) T protein:vir:10 87 PTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGA 166 (711) T ss_pred HHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHh Confidence 9999999999987766552 1 111 12455677899999999999999 Q ss_pred hhCCeEEEEeeecC------CCceeEEEEE-cccceEEEecCCC------CcceEEEEEEEeecc--------------- Q lcl|NC_019916. 121 TVTGRAYEYVYRDP------SQKGEVSVKL-DPMECFIIYDRSV------NPKPIMAVRYHAVQT--------------- 172 (513) Q Consensus 121 ~~~G~~~~~v~~d~------~~~~~~~~~~-~p~~~~~~~d~~~------~~~~~~~ir~~~~~~--------------- 172 (513) +++|.||+-|+.|. +|++.+. .+ +|.++ +||+.. +.+-++ ++.|...+ T Consensus 167 ~~~G~G~~ev~~d~~~~d~~~~e~~i~-~v~~p~~v--~~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~yp~~a~~~~~ 242 (711) T protein:vir:10 167 VESGMGYLRVRSDYLADDSFEQDLIIE-AIQNQFSV--TIDPDAKKRDRSDMNWCL-IDDTMSKEKFKALYPDATAEPVY 242 (711) T ss_pred hhcCcceEEEEecccCCCCCCCCeEEe-eecChhhe--eeCccccccChhhhccee-eeecCCHHHHHHhCCchhhhhhh Confidence 99999998776542 3444443 34 68774 455421 112122 22221100 Q ss_pred ----ccc---ccceeEEEEEEEcCCcEEEEEeeccCC---------------------------------------cccc Q lcl|NC_019916. 173 ----VVD---NITQTKYEVETWTENDYTRYKPIVVAG---------------------------------------SVPT 206 (513) Q Consensus 173 ----~~~---~~~~~~~~ve~yt~~~~~~~~~~~~~~---------------------------------------~~~~ 206 (513) .+. .....+..+++|......+......++ .... T Consensus 243 ~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~ 322 (711) T protein:vir:10 243 EDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANV 322 (711) T ss_pred cccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEeccee Confidence 000 001223333444322211100000000 0001 Q ss_pred ccccccccCcccceEEecCC-------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhe-ecCccccccccccc Q lcl|NC_019916. 207 LEVAEHSAQFGFPMIEYRNN-------EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVI-KGDIDTLFDDSTLL 278 (513) Q Consensus 207 ~~~~~~~~~g~vPvv~~~n~-------~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~-~G~~~~~~~~~~~~ 278 (513) .....+.+.+.||+|+|.-. ..+.|.+..+++.|+.+|...|.+...+.-.+.+.+++ .|.... .+ T Consensus 323 L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~-~~----- 396 (711) T protein:vir:10 323 LEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG-RE----- 396 (711) T ss_pred ecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCC-hH----- Confidence 12223455678999887432 23567889999999999999999999987777655443 332110 00 Q ss_pred ccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_019916. 279 QMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDL 358 (513) Q Consensus 279 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 358 (513) +... . ..-+.+.++.++++. ...+.++++....-..++...++.....|-..|++.+. T Consensus 397 --------~~~~---e------~~~~~~~vi~~~~~~-----~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~ 454 (711) T protein:vir:10 397 --------DEWE---Q------ANTKNFSLLTYIPQY-----QGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDA 454 (711) T ss_pred --------HHHH---h------ccccCCCeeEecccc-----cCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChH Confidence 0000 0 001223344444322 12234555554555677888999999999999999988 Q ss_pred ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------ccccc----------- Q lcl|NC_019916. 359 TDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKW----------DIDPD----------- 417 (513) Q Consensus 359 ~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~----------~~~~~----------- 417 (513) ..+..+++.||+||......-.......-..|..+.+++.+++++++....... ..++. T Consensus 455 ~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~ 534 (711) T protein:vir:10 455 SLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) T ss_pred HcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEeccccccccc Confidence 888777889999999887776666666667777777777777777655432110 01100 Q ss_pred --------------eeeEEeCCCCCcCHHHHHHHHHHHhcCCCH------HHHHHhCCCCCCHHHHHHHHHHHHHHHH-- Q lcl|NC_019916. 418 --------------EIGFIFRDNLPTDDVAIITALVQAGAQIPQ------EYLYQYLPNVTDADEIVKMMDKQRKAML-- 475 (513) Q Consensus 418 --------------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~------et~~~~l~~v~D~~~E~~ri~~E~~~~~-- 475 (513) +|.|.=.+..+.-..+.+..++.+.+.+|. ..+++.+++ .+.++-.+++++...... T Consensus 535 G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~-p~~~el~e~lr~~~~~~~~~ 613 (711) T protein:vir:10 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLS 613 (711) T ss_pred ccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCC-CCHHHHHHHHHhhcCcccCc Confidence 112222333333334444555555444433 123444433 232222233321100000 Q ss_pred --------HHhhhhcCCCCC-CCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 476 --------KTYDTKGGLIIN-GTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 476 --------~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ......-..... +........+....+...-..+.+.- T Consensus 614 ~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~ 660 (711) T protein:vir:10 614 KDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADML 660 (711) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000 00000000000000000000000000 No 83 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.74 E-value=4.7e-16 Score=104.62 Aligned_cols=465 Identities=13% Similarity=0.102 Sum_probs=223.9 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHH------HHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |.--.+.+.-.-+ ++++.+...+++..+... -+....+-.+||.|.|=- ..........+.+ .+.+|.++ T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~p--~~~~N~i~ 76 (714) T protein:vir:81 1 MKNETNTMATKND-NGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP-PEVLQVLKDRGQP--MTIHNLIA 76 (714) T ss_pred CCcccccccCCCC-cchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcCCC--cEEeccHH Confidence 2212222222222 345555555555543222 123455777899997621 1111222222222 37899999 Q ss_pred HHHHHHHHHhhcCCeeec--C---CcH---------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC---ce Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMS--G---PSS---------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ---KG 138 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~--~---~~~---------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~---~~ 138 (513) .+|+..+++---+.+.+. . ++. ..++.+++.++++...+.+..+++++|.||+-+|.+.+. .+ T Consensus 77 ~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:81 77 PTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 999999999988776652 2 111 124566778999999999999999999999988887532 23 Q ss_pred eEEEEEcccceEEEecCCC------CcceEEEEEEEeeccc--------------------------------------- Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSV------NPKPIMAVRYHAVQTV--------------------------------------- 173 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~------~~~~~~~ir~~~~~~~--------------------------------------- 173 (513) .+ -.++|.+++ ||+.. +.+- .+++.|...+. T Consensus 157 ~i-~~v~p~~v~--~Dp~a~~~D~sDar~-~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (714) T protein:vir:81 157 KV-STVSRNEVF--WDWLSREADLSDCRW-LMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAW 232 (714) T ss_pred EE-Eecchhhee--eccccccCChhhccc-eeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccch Confidence 33 346888854 34321 1111 11222211000 Q ss_pred -------------ccccceeEEEEEEEcCCcEEEEEeeccCCcc------------------------------------ Q lcl|NC_019916. 174 -------------VDNITQTKYEVETWTENDYTRYKPIVVAGSV------------------------------------ 204 (513) Q Consensus 174 -------------~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~------------------------------------ 204 (513) .......+..+|+|.............++.. T Consensus 233 ~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g 312 (714) T protein:vir:81 233 EEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVG 312 (714) T ss_pred hhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEec Confidence 0000111223444532221111111111000 Q ss_pred --ccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 205 --PTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 205 --~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) .......|.+.+.||+|+|.-.. ...|-+..+++.|+.+|+..|.+...+. ++..++..|.... T Consensus 313 ~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~------- 383 (714) T protein:vir:81 313 PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL------- 383 (714) T ss_pred CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccc------- Confidence 00011123445678888765432 1347788899999999999999877652 3333333332110 Q ss_pred cccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 278 LQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 278 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) .+...... .-+.+.++.+.++. ..+....+.++......-..++...++.....|-..|++-+ T Consensus 384 ------~d~~~~e~----------~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:81 384 ------SDNDLMEQ----------IERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ------cHHHHHHh----------ccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCCh Confidence 00000000 01112233332221 11112222233333233456677788888999999999988 Q ss_pred cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-c------c------c-------- Q lcl|NC_019916. 358 LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWD-I------D------P-------- 416 (513) Q Consensus 358 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~-~------~------~-------- 416 (513) ...+..+++.||+||..+-..-.......-..+..+.+++.+++++++........ + + . T Consensus 447 ~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~ 526 (714) T protein:vir:81 447 AFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGD 526 (714) T ss_pred HHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccC Confidence 88777777899999988766555555555566666777777666665543321100 0 0 0 Q ss_pred -----c-------eeeEEeCCCCCcCHHHHHHHHHHHhcCCCH-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 417 -----D-------EIGFIFRDNLPTDDVAIITALVQAGAQIPQ-------EYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 417 -----~-------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~-------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) + +|.|.=.+..|....+.++.++.+.+.++. ..+++.+++ .+.++-+++|++-. T Consensus 527 ~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~------ 599 (714) T protein:vir:81 527 NGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL------ 599 (714) T ss_pred cceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc------ Confidence 0 122222333444445566666665444433 345566554 55455455554310 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCC---------------CccCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPE---------------DERTSD 513 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~ 513 (513) +.. ....+...+..........-+ ...+.+ T Consensus 600 ----~~~--~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae 644 (714) T protein:vir:81 600 ----GTP--KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEAD 644 (714) T ss_pred ----CCC--CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 000000000000000000000 000000 No 84 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.74 E-value=4.7e-16 Score=104.62 Aligned_cols=465 Identities=13% Similarity=0.102 Sum_probs=223.9 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHH------HHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |.--.+.+.-.-+ ++++.+...+++..+... -+....+-.+||.|.|=- ..........+.+ .+.+|.++ T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~p--~~~~N~i~ 76 (714) T protein:vir:99 1 MKNETNTMATKND-NGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP-PEVLQVLKDRGQP--MTIHNLIA 76 (714) T ss_pred CCcccccccCCCC-cchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcCCC--cEEeccHH Confidence 2212222222222 345555555555543222 123455777899997621 1111222222222 37899999 Q ss_pred HHHHHHHHHhhcCCeeec--C---CcH---------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC---ce Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMS--G---PSS---------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ---KG 138 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~--~---~~~---------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~---~~ 138 (513) .+|+..+++---+.+.+. . ++. ..++.+++.++++...+.+..+++++|.||+-+|.+.+. .+ T Consensus 77 ~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:99 77 PTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 999999999988776652 2 111 124566778999999999999999999999988887532 23 Q ss_pred eEEEEEcccceEEEecCCC------CcceEEEEEEEeeccc--------------------------------------- Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSV------NPKPIMAVRYHAVQTV--------------------------------------- 173 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~------~~~~~~~ir~~~~~~~--------------------------------------- 173 (513) .+ -.++|.+++ ||+.. +.+- .+++.|...+. T Consensus 157 ~i-~~v~p~~v~--~Dp~a~~~D~sDar~-~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (714) T protein:vir:99 157 KV-STVSRNEVF--WDWLSREADLSDCRW-LMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAW 232 (714) T ss_pred EE-Eecchhhee--eccccccCChhhccc-eeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccch Confidence 33 346888854 34321 1111 11222211000 Q ss_pred -------------ccccceeEEEEEEEcCCcEEEEEeeccCCcc------------------------------------ Q lcl|NC_019916. 174 -------------VDNITQTKYEVETWTENDYTRYKPIVVAGSV------------------------------------ 204 (513) Q Consensus 174 -------------~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~------------------------------------ 204 (513) .......+..+|+|.............++.. T Consensus 233 ~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g 312 (714) T protein:vir:99 233 EEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVG 312 (714) T ss_pred hhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEec Confidence 0000111223444532221111111111000 Q ss_pred --ccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 205 --PTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 205 --~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) .......|.+.+.||+|+|.-.. ...|-+..+++.|+.+|+..|.+...+. ++..++..|.... T Consensus 313 ~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~------- 383 (714) T protein:vir:99 313 PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL------- 383 (714) T ss_pred CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccc------- Confidence 00011123445678888765432 1347788899999999999999877652 3333333332110 Q ss_pred cccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 278 LQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 278 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) .+...... .-+.+.++.+.++. ..+....+.++......-..++...++.....|-..|++-+ T Consensus 384 ------~d~~~~e~----------~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:99 384 ------SDNDLMEQ----------IERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ------cHHHHHHh----------ccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCCh Confidence 00000000 01112233332221 11112222233333233456677788888999999999988 Q ss_pred cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-c------c------c-------- Q lcl|NC_019916. 358 LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWD-I------D------P-------- 416 (513) Q Consensus 358 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~-~------~------~-------- 416 (513) ...+..+++.||+||..+-..-.......-..+..+.+++.+++++++........ + + . T Consensus 447 ~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~ 526 (714) T protein:vir:99 447 AFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGD 526 (714) T ss_pred HHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccC Confidence 88777777899999988766555555555566666777777666665543321100 0 0 0 Q ss_pred -----c-------eeeEEeCCCCCcCHHHHHHHHHHHhcCCCH-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 417 -----D-------EIGFIFRDNLPTDDVAIITALVQAGAQIPQ-------EYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 417 -----~-------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~-------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) + +|.|.=.+..|....+.++.++.+.+.++. ..+++.+++ .+.++-+++|++-. T Consensus 527 ~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~------ 599 (714) T protein:vir:99 527 NGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL------ 599 (714) T ss_pred cceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc------ Confidence 0 122222333444445566666665444433 345566554 55455455554310 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCC---------------CccCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPE---------------DERTSD 513 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~ 513 (513) +.. ....+...+..........-+ ...+.+ T Consensus 600 ----~~~--~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae 644 (714) T protein:vir:99 600 ----GTP--KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEAD 644 (714) T ss_pred ----CCC--CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 000000000000000000000 000000 No 85 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.74 E-value=4.7e-16 Score=104.62 Aligned_cols=465 Identities=13% Similarity=0.102 Sum_probs=223.9 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHH------HHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |.--.+.+.-.-+ ++++.+...+++..+... -+....+-.+||.|.|=- ..........+.+ .+.+|.++ T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~p--~~~~N~i~ 76 (714) T protein:vir:10 1 MKNETNTMATKND-NGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP-PEVLQVLKDRGQP--MTIHNLIA 76 (714) T ss_pred CCcccccccCCCC-cchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcCCC--cEEeccHH Confidence 2212222222222 345555555555543222 123455777899997621 1111222222222 37899999 Q ss_pred HHHHHHHHHhhcCCeeec--C---CcH---------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC---ce Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMS--G---PSS---------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ---KG 138 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~--~---~~~---------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~---~~ 138 (513) .+|+..+++---+.+.+. . ++. ..++.+++.++++...+.+..+++++|.||+-+|.+.+. .+ T Consensus 77 ~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:10 77 PTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 999999999988776652 2 111 124566778999999999999999999999988887532 23 Q ss_pred eEEEEEcccceEEEecCCC------CcceEEEEEEEeeccc--------------------------------------- Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSV------NPKPIMAVRYHAVQTV--------------------------------------- 173 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~------~~~~~~~ir~~~~~~~--------------------------------------- 173 (513) .+ -.++|.+++ ||+.. +.+- .+++.|...+. T Consensus 157 ~i-~~v~p~~v~--~Dp~a~~~D~sDar~-~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (714) T protein:vir:10 157 KV-STVSRNEVF--WDWLSREADLSDCRW-LMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAW 232 (714) T ss_pred EE-Eecchhhee--eccccccCChhhccc-eeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccch Confidence 33 346888854 34321 1111 11222211000 Q ss_pred -------------ccccceeEEEEEEEcCCcEEEEEeeccCCcc------------------------------------ Q lcl|NC_019916. 174 -------------VDNITQTKYEVETWTENDYTRYKPIVVAGSV------------------------------------ 204 (513) Q Consensus 174 -------------~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~------------------------------------ 204 (513) .......+..+|+|.............++.. T Consensus 233 ~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g 312 (714) T protein:vir:10 233 EEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVG 312 (714) T ss_pred hhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEec Confidence 0000111223444532221111111111000 Q ss_pred --ccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 205 --PTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 205 --~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) .......|.+.+.||+|+|.-.. ...|-+..+++.|+.+|+..|.+...+. ++..++..|.... T Consensus 313 ~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~------- 383 (714) T protein:vir:10 313 PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL------- 383 (714) T ss_pred CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccc------- Confidence 00011123445678888765432 1347788899999999999999877652 3333333332110 Q ss_pred cccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 278 LQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 278 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) .+...... .-+.+.++.+.++. ..+....+.++......-..++...++.....|-..|++-+ T Consensus 384 ------~d~~~~e~----------~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:10 384 ------SDNDLMEQ----------IERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ------cHHHHHHh----------ccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCCh Confidence 00000000 01112233332221 11112222233333233456677788888999999999988 Q ss_pred cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-c------c------c-------- Q lcl|NC_019916. 358 LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWD-I------D------P-------- 416 (513) Q Consensus 358 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~-~------~------~-------- 416 (513) ...+..+++.||+||..+-..-.......-..+..+.+++.+++++++........ + + . T Consensus 447 ~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~ 526 (714) T protein:vir:10 447 AFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGD 526 (714) T ss_pred HHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccC Confidence 88777777899999988766555555555566666777777666665543321100 0 0 0 Q ss_pred -----c-------eeeEEeCCCCCcCHHHHHHHHHHHhcCCCH-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 417 -----D-------EIGFIFRDNLPTDDVAIITALVQAGAQIPQ-------EYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 417 -----~-------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~-------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) + +|.|.=.+..|....+.++.++.+.+.++. ..+++.+++ .+.++-+++|++-. T Consensus 527 ~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~------ 599 (714) T protein:vir:10 527 NGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL------ 599 (714) T ss_pred cceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc------ Confidence 0 122222333444445566666665444433 345566554 55455455554310 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCC---------------CccCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPE---------------DERTSD 513 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~ 513 (513) +.. ....+...+..........-+ ...+.+ T Consensus 600 ----~~~--~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae 644 (714) T protein:vir:10 600 ----GTP--KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEAD 644 (714) T ss_pred ----CCC--CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 000000000000000000000 000000 No 86 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.74 E-value=4.7e-16 Score=104.62 Aligned_cols=465 Identities=13% Similarity=0.102 Sum_probs=223.9 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHH------HHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |.--.+.+.-.-+ ++++.+...+++..+... -+....+-.+||.|.|=- ..........+.+ .+.+|.++ T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~p--~~~~N~i~ 76 (714) T protein:vir:32 1 MKNETNTMATKND-NGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP-PEVLQVLKDRGQP--MTIHNLIA 76 (714) T ss_pred CCcccccccCCCC-cchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcCCC--cEEeccHH Confidence 2212222222222 345555555555543222 123455777899997621 1111222222222 37899999 Q ss_pred HHHHHHHHHhhcCCeeec--C---CcH---------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC---ce Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMS--G---PSS---------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ---KG 138 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~--~---~~~---------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~---~~ 138 (513) .+|+..+++---+.+.+. . ++. ..++.+++.++++...+.+..+++++|.||+-+|.+.+. .+ T Consensus 77 ~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:32 77 PTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 999999999988776652 2 111 124566778999999999999999999999988887532 23 Q ss_pred eEEEEEcccceEEEecCCC------CcceEEEEEEEeeccc--------------------------------------- Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSV------NPKPIMAVRYHAVQTV--------------------------------------- 173 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~------~~~~~~~ir~~~~~~~--------------------------------------- 173 (513) .+ -.++|.+++ ||+.. +.+- .+++.|...+. T Consensus 157 ~i-~~v~p~~v~--~Dp~a~~~D~sDar~-~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (714) T protein:vir:32 157 KV-STVSRNEVF--WDWLSREADLSDCRW-LMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAW 232 (714) T ss_pred EE-Eecchhhee--eccccccCChhhccc-eeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccch Confidence 33 346888854 34321 1111 11222211000 Q ss_pred -------------ccccceeEEEEEEEcCCcEEEEEeeccCCcc------------------------------------ Q lcl|NC_019916. 174 -------------VDNITQTKYEVETWTENDYTRYKPIVVAGSV------------------------------------ 204 (513) Q Consensus 174 -------------~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~------------------------------------ 204 (513) .......+..+|+|.............++.. T Consensus 233 ~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g 312 (714) T protein:vir:32 233 EEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVG 312 (714) T ss_pred hhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEec Confidence 0000111223444532221111111111000 Q ss_pred --ccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 205 --PTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 205 --~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) .......|.+.+.||+|+|.-.. ...|-+..+++.|+.+|+..|.+...+. ++..++..|.... T Consensus 313 ~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~------- 383 (714) T protein:vir:32 313 PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL------- 383 (714) T ss_pred CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccc------- Confidence 00011123445678888765432 1347788899999999999999877652 3333333332110 Q ss_pred cccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 278 LQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 278 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) .+...... .-+.+.++.+.++. ..+....+.++......-..++...++.....|-..|++-+ T Consensus 384 ------~d~~~~e~----------~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:32 384 ------SDNDLMEQ----------IERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ------cHHHHHHh----------ccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCCh Confidence 00000000 01112233332221 11112222233333233456677788888999999999988 Q ss_pred cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-c------c------c-------- Q lcl|NC_019916. 358 LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWD-I------D------P-------- 416 (513) Q Consensus 358 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~-~------~------~-------- 416 (513) ...+..+++.||+||..+-..-.......-..+..+.+++.+++++++........ + + . T Consensus 447 ~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~ 526 (714) T protein:vir:32 447 AFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGD 526 (714) T ss_pred HHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccC Confidence 88777777899999988766555555555566666777777666665543321100 0 0 0 Q ss_pred -----c-------eeeEEeCCCCCcCHHHHHHHHHHHhcCCCH-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 417 -----D-------EIGFIFRDNLPTDDVAIITALVQAGAQIPQ-------EYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 417 -----~-------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~-------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) + +|.|.=.+..|....+.++.++.+.+.++. ..+++.+++ .+.++-+++|++-. T Consensus 527 ~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~------ 599 (714) T protein:vir:32 527 NGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL------ 599 (714) T ss_pred cceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc------ Confidence 0 122222333444445566666665444433 345566554 55455455554310 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCC---------------CccCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPE---------------DERTSD 513 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~ 513 (513) +.. ....+...+..........-+ ...+.+ T Consensus 600 ----~~~--~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae 644 (714) T protein:vir:32 600 ----GTP--KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEAD 644 (714) T ss_pred ----CCC--CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 000000000000000000000 000000 No 87 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.74 E-value=4.7e-16 Score=104.62 Aligned_cols=465 Identities=13% Similarity=0.102 Sum_probs=223.9 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHH------HHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |.--.+.+.-.-+ ++++.+...+++..+... -+....+-.+||.|.|=- ..........+.+ .+.+|.++ T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~p--~~~~N~i~ 76 (714) T protein:vir:27 1 MKNETNTMATKND-NGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP-PEVLQVLKDRGQP--MTIHNLIA 76 (714) T ss_pred CCcccccccCCCC-cchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcCCC--cEEeccHH Confidence 2212222222222 345555555555543222 123455777899997621 1111222222222 37899999 Q ss_pred HHHHHHHHHhhcCCeeec--C---CcH---------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC---ce Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMS--G---PSS---------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ---KG 138 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~--~---~~~---------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~---~~ 138 (513) .+|+..+++---+.+.+. . ++. ..++.+++.++++...+.+..+++++|.||+-+|.+.+. .+ T Consensus 77 ~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:27 77 PTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 999999999988776652 2 111 124566778999999999999999999999988887532 23 Q ss_pred eEEEEEcccceEEEecCCC------CcceEEEEEEEeeccc--------------------------------------- Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSV------NPKPIMAVRYHAVQTV--------------------------------------- 173 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~------~~~~~~~ir~~~~~~~--------------------------------------- 173 (513) .+ -.++|.+++ ||+.. +.+- .+++.|...+. T Consensus 157 ~i-~~v~p~~v~--~Dp~a~~~D~sDar~-~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (714) T protein:vir:27 157 KV-STVSRNEVF--WDWLSREADLSDCRW-LMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAW 232 (714) T ss_pred EE-Eecchhhee--eccccccCChhhccc-eeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccch Confidence 33 346888854 34321 1111 11222211000 Q ss_pred -------------ccccceeEEEEEEEcCCcEEEEEeeccCCcc------------------------------------ Q lcl|NC_019916. 174 -------------VDNITQTKYEVETWTENDYTRYKPIVVAGSV------------------------------------ 204 (513) Q Consensus 174 -------------~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~------------------------------------ 204 (513) .......+..+|+|.............++.. T Consensus 233 ~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g 312 (714) T protein:vir:27 233 EEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVG 312 (714) T ss_pred hhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEec Confidence 0000111223444532221111111111000 Q ss_pred --ccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 205 --PTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 205 --~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) .......|.+.+.||+|+|.-.. ...|-+..+++.|+.+|+..|.+...+. ++..++..|.... T Consensus 313 ~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~------- 383 (714) T protein:vir:27 313 PHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL------- 383 (714) T ss_pred CcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccc------- Confidence 00011123445678888765432 1347788899999999999999877652 3333333332110 Q ss_pred cccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 278 LQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 278 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) .+...... .-+.+.++.+.++. ..+....+.++......-..++...++.....|-..|++-+ T Consensus 384 ------~d~~~~e~----------~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:27 384 ------SDNDLMEQ----------IERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ------cHHHHHHh----------ccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCCh Confidence 00000000 01112233332221 11112222233333233456677788888999999999988 Q ss_pred cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-c------c------c-------- Q lcl|NC_019916. 358 LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWD-I------D------P-------- 416 (513) Q Consensus 358 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~-~------~------~-------- 416 (513) ...+..+++.||+||..+-..-.......-..+..+.+++.+++++++........ + + . T Consensus 447 ~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~ 526 (714) T protein:vir:27 447 AFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGD 526 (714) T ss_pred HHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccC Confidence 88777777899999988766555555555566666777777666665543321100 0 0 0 Q ss_pred -----c-------eeeEEeCCCCCcCHHHHHHHHHHHhcCCCH-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 417 -----D-------EIGFIFRDNLPTDDVAIITALVQAGAQIPQ-------EYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 417 -----~-------~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~-------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) + +|.|.=.+..|....+.++.++.+.+.++. ..+++.+++ .+.++-+++|++-. T Consensus 527 ~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~------ 599 (714) T protein:vir:27 527 NGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL------ 599 (714) T ss_pred cceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc------ Confidence 0 122222333444445566666665444433 345566554 55455455554310 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCC---------------CccCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPE---------------DERTSD 513 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~ 513 (513) +.. ....+...+..........-+ ...+.+ T Consensus 600 ----~~~--~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae 644 (714) T protein:vir:27 600 ----GTP--KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEAD 644 (714) T ss_pred ----CCC--CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 000000000000000000000 000000 No 88 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.71 E-value=1.4e-15 Score=101.94 Aligned_cols=466 Identities=13% Similarity=0.090 Sum_probs=222.3 Q ss_pred CccchhhceeccCCcc--cCCHHHHHHHHHHHH--HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDAD--KLTPTRIAAFIRHHY--NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~i~~~i~~~~--~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) |-+-...-.+-+.+.. +++.+.+..+..... ..-+....+-.+||.|.|=- ..........+.+ .+.+|.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~-~~~~~~l~~~g~p--~~~~N~i~~ 77 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLA-PEVIQVLKDRGQP--MTIHNLIAP 77 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcCCC--cEEeccHHH Confidence 3332222222222211 244444444433321 11233456778899998611 1111222222222 378999999 Q ss_pred HHHHHHHHhhcCCeeec--C---CcH---------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCC---Ccee Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMS--G---PSS---------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPS---QKGE 139 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~--~---~~~---------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~---~~~~ 139 (513) +|+..+++.--+.+.+. . ++. ..+..+++.++++...+.+..+++++|.||+-++.+.+ +.+. T Consensus 78 ~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~~i~ 157 (714) T protein:vir:10 78 TVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGPEFK 157 (714) T ss_pred HHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCCCeE Confidence 99999999988876652 2 111 12556777899999999999999999999998888754 3333 Q ss_pred EEEEEcccceEEEecCCC------CcceEEEEEEEee------------------------------------------- Q lcl|NC_019916. 140 VSVKLDPMECFIIYDRSV------NPKPIMAVRYHAV------------------------------------------- 170 (513) Q Consensus 140 ~~~~~~p~~~~~~~d~~~------~~~~~~~ir~~~~------------------------------------------- 170 (513) +. .|+|.++++ |+.. +.+-++ ++.|.. T Consensus 158 i~-~v~p~~v~~--Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:10 158 VS-TVSRNEVFW--DWLSREADLSDCRWLM-RRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred EE-ecChhheee--ccccccCChhhhhhhh-hhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccch Confidence 33 468887644 3311 011111 111000 Q ss_pred -------c--ccccccceeEEEEEEEcCCcEEEEEeeccCCcc------------------------------------- Q lcl|NC_019916. 171 -------Q--TVVDNITQTKYEVETWTENDYTRYKPIVVAGSV------------------------------------- 204 (513) Q Consensus 171 -------~--~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~------------------------------------- 204 (513) . .........+..+|+|.............++.. T Consensus 234 ~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:10 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecc Confidence 0 000011122334555543322221111111100 Q ss_pred -ccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccc Q lcl|NC_019916. 205 -PTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLL 278 (513) Q Consensus 205 -~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~ 278 (513) .......|.+.+.+|+|+|+-.. ...|.+..+++.|+.+|...|.+...+. ++..++..|... T Consensus 314 ~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~gav~--------- 382 (714) T protein:vir:10 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQ--------- 382 (714) T ss_pred hhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHHh--CCceeecccccc--------- Confidence 00112224456678888775432 2347788899999999999999877662 222222222110 Q ss_pred ccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_019916. 279 QMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDL 358 (513) Q Consensus 279 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 358 (513) ..+...... ..+.+.++.+.++. ..+....+.++......-..++...++.....|-..|++-+. T Consensus 383 ----~~d~~~~e~----------~~rp~~vi~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~ 447 (714) T protein:vir:10 383 ----LSDNDLMEQ----------LERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSA 447 (714) T ss_pred ----ccHHHHHHh----------ccCCCCeEEecccc-cccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHH Confidence 000000000 01112333332211 111122222333322233456778899999999999999988 Q ss_pred ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-c------c----cceeeEE----- Q lcl|NC_019916. 359 TDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWD-I------D----PDEIGFI----- 422 (513) Q Consensus 359 ~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~-~------~----~~~i~i~----- 422 (513) ..+..+++.||+||..+...-.......-..|..+.+++.+++++++...-.... + + ...+.+. T Consensus 448 ~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~ 527 (714) T protein:vir:10 448 FLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDN 527 (714) T ss_pred HcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCC Confidence 8877777899999988776655556666666777777777777776644321110 0 0 0011111 Q ss_pred -----------------eCCCCCcCHHHHHHHHHHHhcCCCH-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 423 -----------------FRDNLPTDDVAIITALVQAGAQIPQ-------EYLYQYLPNVTDADEIVKMMDKQRKAMLKTY 478 (513) Q Consensus 423 -----------------f~~~~p~d~~e~a~~~~kl~g~iS~-------et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~ 478 (513) =.+..|.-..+.++.+.++.+.++. ..+++.+.+ .+.++-+++|.+-. T Consensus 528 ~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~-p~~~ei~~~ir~~~------- 599 (714) T protein:vir:10 528 GELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL------- 599 (714) T ss_pred ccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-cCHHHHHHHHHHHc------- Confidence 1122222234444555554333322 334555543 44444455554220 Q ss_pred hhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 479 DTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.. ........+.......... ......+ T Consensus 600 ---~~~--~~~~~~~~e~q~~q~~~~~-~~~~q~~ 628 (714) T protein:vir:10 600 ---GTP--KSPDEMTPEEQEVAAQQQA-LQQQQAE 628 (714) T ss_pred ---CCC--CCccccCcchhHHHHHHHH-HHHHHHH Confidence 000 0000000000000000000 0000000 No 89 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.68 E-value=1.2e-15 Score=102.40 Aligned_cols=418 Identities=12% Similarity=0.068 Sum_probs=202.8 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHH--HHHH------------HHHHHHHHHhcCCCccccccccccCCCCCCc Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHY--NNQR------------PRLEMLYDYYRGQNDGILSPASRRNEKGKAD 66 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~------------~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~ 66 (513) |-...+|.-.-...+ ..-...++.-.- ..+. --+..+...|. T Consensus 1 ~~~~~~a~~~~~~~~----a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~-------------------- 56 (461) T protein:vir:80 1 MYSIDKAKQAKIDSK----IVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYA-------------------- 56 (461) T ss_pred Cccchhhhhhhhhhh----hhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHH-------------------- Confidence 444444332221110 111111211100 0000 01122222221 Q ss_pred ceeecchhHHHHHHHHHHhhcCCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc--eeEE Q lcl|NC_019916. 67 HRAVHSFARYIADFQTSYSVGNAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK--GEVS 141 (513) Q Consensus 67 ~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~--~~~~ 141 (513) .+.+++.+|+..++.++-+++.+++++++. ++.+|+..++.....++.+.+..+|.|++++-..+.+. .... T Consensus 57 ---~~~l~r~iVd~~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~ 133 (461) T protein:vir:80 57 ---SNSIAMNIVDIISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLS 133 (461) T ss_pred ---hCCccchhhccchHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCcc Confidence 357888999999999999999999887654 67778777899999999999999999998887643321 1111 Q ss_pred EEEcccc--e---EEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCc Q lcl|NC_019916. 142 VKLDPME--C---FIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQF 216 (513) Q Consensus 142 ~~~~p~~--~---~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g 216 (513) ..+.|.. . +.+|+. .++... .+ ..+.....-......++........+.... ........++ T Consensus 134 ~pl~~~~~~~~~~l~~~~~----~~i~~~-~~-~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~-------~~~~~~~~iH 200 (461) T protein:vir:80 134 TAIDPKTIKSIPYINTFNT----QKVTQL-YL-NQDMFSEHFGEVEFFEVNRVSQLGEEILSG-------TTASTSEQIH 200 (461) T ss_pred CCcccccccceeEEEeccc----cccchh-hh-cccCcCcccccceEEEEecccccccccccc-------ccCccceEEc Confidence 1122222 1 111111 011110 00 001000000000000111100000000000 0000111233 Q ss_pred ccceEEecCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhc Q lcl|NC_019916. 217 GFPMIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKK 291 (513) Q Consensus 217 ~vPvv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (513) .-+++.|.+. -.|.|.++.+.+.+.++++++-..+..+..+..+.+...|..... .... T Consensus 201 ~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~-----------~~~~----- 264 (461) T protein:vir:80 201 RSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALN-----------KDDK----- 264 (461) T ss_pred cccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhh-----------chHH----- Confidence 3456666553 348999999999999999998887776666666555444321100 0000 Q ss_pred cccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccc--cccccccccH Q lcl|NC_019916. 292 LADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLT--DDNFSGNSSG 369 (513) Q Consensus 292 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~~~~~n~Sg 369 (513) ......++....+..+.+ ... +-+|-..+.+.++....++.+.+.|+..+++|-.- ....+++.|| T Consensus 265 --~~~~~~~~~~~~~~g~~~--------~d~--~e~~e~~~~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asg 332 (461) T protein:vir:80 265 --ANLTAMLDFMFRTEALAI--------IKG--DEQLTKESTNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGA 332 (461) T ss_pred --HHHHHHHHHhcCCceEEE--------EcC--CcceEEEecCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccc Confidence 001111222222221211 112 23455566778889999999999999999999743 3334567777 Q ss_pred HH-HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh----- Q lcl|NC_019916. 370 VA-MKYKVLGTVELASTKR-KQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG----- 442 (513) Q Consensus 370 ~A-i~~~~~~l~~k~~~~~-~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----- 442 (513) .. ++. ...+++.+| ..++..+++++++++.-+.......+++..++++.|++-.+.+..|.|++..+.+ T Consensus 333 e~D~~~----yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~ 408 (461) T protein:vir:80 333 QYDVMN----YYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQI 408 (461) T ss_pred hHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHH Confidence 75 433 223334444 5678899999998875444334444566678899999999999999998876643 Q ss_pred ----cCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 443 ----AQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 443 ----g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) |++|.+++.+.+ . .. .+-.......+.+.+.++..+...+...+++-| T Consensus 409 ~~~~g~is~~e~r~~l-------------~----~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 460 (461) T protein:vir:80 409 YIVNGVLDPDEVKETR-------------F----GR------FGLENSSKFSGDSAEIDKLAKLVYDAYAKKNAD 460 (461) T ss_pred HHhcCCCCHHHHHHHH-------------H----Hh------cCCCCCccCCCCCchhhhhhhhccccccccCCC Confidence 444444332211 0 00 000000000001111011000011111111111 No 90 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.62 E-value=1.5e-14 Score=96.36 Aligned_cols=473 Identities=12% Similarity=0.054 Sum_probs=218.3 Q ss_pred Cccc-hhhceeccCC-cccCCHHHHHHHHHHHH--HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDM-QQANMNYQED-ADKLTPTRIAAFIRHHY--NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~-~~~~~~~~~~-~~~~~~~~i~~~i~~~~--~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) +|.. |++....+-+ ...++.+.+..+..... ..-+....+-.+||.|.|=- ..........+.+ .+.+|.++. T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~-~~~~~~l~~~g~p--~~~~N~i~~ 79 (772) T protein:vir:10 3 ITENDRQYLNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLD-TELLRRQQALGIP--PAVEDLIGP 79 (772) T ss_pred cchhhHHhhccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcCCC--cEEEcchHH Confidence 2221 2232222222 23567777766654422 22233456777899998621 1111122222222 378999999 Q ss_pred HHHHHHHHhhcCCeeec--CC---cH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCce--eEE Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMS--GP---SS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKG--EVS 141 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~--~~---~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~--~~~ 141 (513) +|+..+++...+.+.+. .. .+ ..++.+++.++++...+.+..+++++|.||+-++.+++... ..+ T Consensus 80 ~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i 159 (772) T protein:vir:10 80 ALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPYRC 159 (772) T ss_pred HHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCeEE Confidence 99999999988876652 21 11 12566778899999999999999999999999888765321 122 Q ss_pred EEEcccceEEEecCCCCcceE---EEE-EEEee----------------------------------cc----------- Q lcl|NC_019916. 142 VKLDPMECFIIYDRSVNPKPI---MAV-RYHAV----------------------------------QT----------- 172 (513) Q Consensus 142 ~~~~p~~~~~~~d~~~~~~~~---~~i-r~~~~----------------------------------~~----------- 172 (513) -.++|.++ +||+....... +.+ +.|.. .+ T Consensus 160 ~~v~p~~v--~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (772) T protein:vir:10 160 RPIRRDEI--HWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWN 237 (772) T ss_pred EeeCcccc--eecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccc Confidence 34688874 44543211111 111 00000 00 Q ss_pred -----------cccccceeEEEEEEEcCCcEEEEEeeccCC--------------------------------------c Q lcl|NC_019916. 173 -----------VVDNITQTKYEVETWTENDYTRYKPIVVAG--------------------------------------S 203 (513) Q Consensus 173 -----------~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~--------------------------------------~ 203 (513) +.+...+.+.-+|+|-............++ . T Consensus 238 ~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~ 317 (772) T protein:vir:10 238 EARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGP 317 (772) T ss_pred hhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecc Confidence 000011223334444322111110000000 0 Q ss_pred cccccccccccCcccceEEecCCC-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccc Q lcl|NC_019916. 204 VPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLL 278 (513) Q Consensus 204 ~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~ 278 (513) ........|.+.+.||+|+|.-.. ...|.+..+++.|+.+|+..|.+...+...+ +..=.|... T Consensus 318 ~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~--~~~~~gav~--------- 386 (772) T protein:vir:10 318 HCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVAR--VERTKGAVA--------- 386 (772) T ss_pred eeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhccc--ccccCCCcc--------- Confidence 001112233456678888875321 2347788999999999999999887774332 211111110 Q ss_pred ccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_019916. 279 QMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDL 358 (513) Q Consensus 279 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 358 (513) ..+..+.... -+.+.++.++++.- ...++.++......-..++...++.....|-.+|++-+. T Consensus 387 ----~~d~~~~e~~----------arp~~vi~~~~~~~---~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~ 449 (772) T protein:vir:10 387 ----MTDAQFRRQI----------ARPDADIVLDENHM---AKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAG 449 (772) T ss_pred ----chhHHHHHhc----------cCCCCeEEeCCccc---cCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHH Confidence 0000000000 01123333332210 011222333322223467778889999999999999888 Q ss_pred ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccc---------------- Q lcl|NC_019916. 359 TDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWD------IDP---------------- 416 (513) Q Consensus 359 ~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~------~~~---------------- 416 (513) ..+..+++.||+||..+-..-.......-..+..+.+++.+++++++........ .+. T Consensus 450 ~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~ 529 (772) T protein:vir:10 450 FQGRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDP 529 (772) T ss_pred HcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecc Confidence 7777667789999987765555555556666677777777776666544321100 000 Q ss_pred --------ceeeE---Ee-CCCCCcC---HHHHHHHHHHHhcCCCHHHHH-------HhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019916. 417 --------DEIGF---IF-RDNLPTD---DVAIITALVQAGAQIPQEYLY-------QYLPNVTDADEIVKMMDKQRKAM 474 (513) Q Consensus 417 --------~~i~i---~f-~~~~p~d---~~e~a~~~~kl~g~iS~et~~-------~~l~~v~D~~~E~~ri~~E~~~~ 474 (513) ++|.+ .+ -...|.. ..+.++.+.++.+.++.+... +.+.+ ...++-.+++++-.... T Consensus 530 ~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~-p~~~ei~~~ir~~~~~~ 608 (772) T protein:vir:10 530 QTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDV-PFKRDVVEAIRAVDQQQ 608 (772) T ss_pred cccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCC-CChHHHHHHHHHHhccC Confidence 01100 00 0122222 233444455554444444322 22221 22222333333211000 Q ss_pred H-----HH-hhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 475 L-----KT-YDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 475 ~-----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) . .. ............. +-.--.... .....+.+ T Consensus 609 ~peq~~~~~~q~~qq~~~~~~~--el~~~q~~a----~~~~~~A~ 647 (772) T protein:vir:10 609 TPEQIQQQIDQAVQDALAKAGN--DIKLRELEI----KERKADSE 647 (772) T ss_pred ChHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH----HHHHHHHH Confidence 0 00 0000000000000 000000000 00000011 No 91 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.55 E-value=9.1e-14 Score=92.09 Aligned_cols=447 Identities=12% Similarity=0.117 Sum_probs=197.0 Q ss_pred chhhceeccCCcccCCHHHHHHHHHHHHHH-------HH-HHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhH Q lcl|NC_019916. 4 MQQANMNYQEDADKLTPTRIAAFIRHHYNN-------QR-PRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFAR 75 (513) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~-------~~-~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~ 75 (513) |=| ..-...++.+.|..++...... .+ ....+..+||.|+..-. . ..+ -.+++.+... T Consensus 1 ~~k-----~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~-----~--~~~--~s~~~~~~v~ 66 (705) T protein:vir:88 1 MAK-----RRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN-----E--RPG--KSGIVSRDVQ 66 (705) T ss_pred CCc-----ccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc-----c--cCC--CCccccHHHH Confidence 111 1112346777777777655322 21 23456668999985321 1 111 2346677777 Q ss_pred HHHHHHHHHhh----cC--CeeecC---CcHH-------HHHH-HHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCC--- Q lcl|NC_019916. 76 YIADFQTSYSV----GN--AIAMSG---PSSD-------RLDD-FNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPS--- 135 (513) Q Consensus 76 ~ivd~~~~~l~----g~--p~~~~~---~~~~-------~l~~-~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~--- 135 (513) ..|+....+|+ +. .+++.. .|.+ .+.. +.+.|+.........++++++|.|++.||++.. T Consensus 67 ~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~ 146 (705) T protein:vir:88 67 ETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKP 146 (705) T ss_pred HHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccch Confidence 77777777664 32 234432 1211 2333 345567677788999999999999999988532 Q ss_pred ---------------------------------------------CceeEEEEEcccceEEEecCCCCcc-eEEEEEEEe Q lcl|NC_019916. 136 ---------------------------------------------QKGEVSVKLDPMECFIIYDRSVNPK-PIMAVRYHA 169 (513) Q Consensus 136 ---------------------------------------------~~~~~~~~~~p~~~~~~~d~~~~~~-~~~~ir~~~ 169 (513) |.+++ ..|+|.+.++--+...... ...+.|++. T Consensus 147 ~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i-~~V~p~d~~~dp~a~~~~d~~~~~~~~~~ 225 (705) T protein:vir:88 147 TFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKV-LCVKPENFLVDRLATCIDDARFLCHREKY 225 (705) T ss_pred hhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceee-eeccHHHceecCCCCCcccCcEEEEEEec Confidence 22222 2357766543221111111 111122221 Q ss_pred ec-cc-----c---------------------------ccc-------------ceeEEEEEEEcC-----CcEE-EEEe Q lcl|NC_019916. 170 VQ-TV-----V---------------------------DNI-------------TQTKYEVETWTE-----NDYT-RYKP 197 (513) Q Consensus 170 ~~-~~-----~---------------------------~~~-------------~~~~~~ve~yt~-----~~~~-~~~~ 197 (513) +. +. + +.. ...+..+|+|.. +.+. .+.. T Consensus 226 t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~ 305 (705) T protein:vir:88 226 TVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRI 305 (705) T ss_pred cHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEE Confidence 10 00 0 000 000111122210 1110 0010 Q ss_pred eccCCccccccccccccCcccceEEe-----cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccc Q lcl|NC_019916. 198 IVVAGSVPTLEVAEHSAQFGFPMIEY-----RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLF 272 (513) Q Consensus 198 ~~~~~~~~~~~~~~~~~~g~vPvv~~-----~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~ 272 (513) .-.++. +... -+++++||+.+ +..-+|.|.++.+.++++.+|.+++.+.+.+...++|...+-. + .. T Consensus 306 ~~~g~~---il~~--~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~-g-~v- 377 (705) T protein:vir:88 306 LYVGDY---IISN--EPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLD-G-QV- 377 (705) T ss_pred EEeCcc---cccc--ccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccc-c-cc- Confidence 001110 1111 24566676654 4455689999999999999999999999999888887654411 0 00 Q ss_pred ccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 273 DDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKF 352 (513) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 352 (513) . ..+ +...+.++++... ..+.+.++..+.-.......++.+...|... T Consensus 378 ~--------------------~~d---~~~~~pg~vv~~~---------~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~ 425 (705) T protein:vir:88 378 N--------------------LED---LLTNEAAGIVRVK---------SMNSITPLETPQLSGEVYGMLDRLEADRGKR 425 (705) T ss_pred C--------------------ccc---ccccCCCeeEEec---------CCCccccccCCcCcHHHHHHHHHHHHHHHHh Confidence 0 000 0011122233322 1234555555555566778899999999999 Q ss_pred hCccccccc----cccccccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhccccccc----------c-- Q lcl|NC_019916. 353 SHTPDLTDD----NFSGNSSGVAMKYKVLGTVELASTKRKQFE-RGLNQRYTVVAHIEERVNGKWDI----------D-- 415 (513) Q Consensus 353 s~~p~~~~~----~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~-~~l~~~~~li~~~l~~~~~~~~~----------~-- 415 (513) |+++++..+ .+.++.++.|+..+......+.....+.|. .++++++++++.++......... + T Consensus 426 tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~ 505 (705) T protein:vir:88 426 TGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPA 505 (705) T ss_pred hCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchH Confidence 999998765 223456777888777777777777777775 45677777777665544322110 0 Q ss_pred ----cceeeEEeCCCCCcCHHHHHHHHHHH----hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCC Q lcl|NC_019916. 416 ----PDEIGFIFRDNLPTDDVAIITALVQA----GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIIN 487 (513) Q Consensus 416 ----~~~i~i~f~~~~p~d~~e~a~~~~kl----~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 487 (513) ..++.+.-.. ...+..+....+..+ ..+.+. -.+.+.++ +. .+..+.++-.+....... .....+ T Consensus 506 ~~~~~~~v~v~v~~-~~~~~eq~~a~l~~ll~~~q~l~~~---~~~~~~~~-~~-~~~~~~~el~e~~~~k~~-~~~~~~ 578 (705) T protein:vir:88 506 NWRERSDLTVTVGI-GNMNKDQQMLHLMRIWEMAQAVVGG---GGLGVLVS-EQ-NLYNILKEVTENAGYKDP-DRFWTN 578 (705) T ss_pred hhccCCceEEeecc-ccchHHHHHHHHHHHHHHHHHhhcc---cchhhhcC-hH-HHHHHHHHHHHhhhhhhH-HHHhhh Confidence 0011111111 111111211111111 111110 00011111 00 010111100000000000 000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCC-----ccCC-------C Q lcl|NC_019916. 488 GTSGNDPEDEGVRGQQGEPED-----ERTS-------D 513 (513) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~-----~~~~-------~ 513 (513) .............. ..+.+- .... + T Consensus 579 ~~~~e~~~~~~~~~-q~e~~~~~~~~~~q~e~~k~q~e 615 (705) T protein:vir:88 579 PNSPEALQAKAIRE-QKEAQPKPEDIKAQADAQRAQSD 615 (705) T ss_pred hhhHHHHHHHHhhh-hhhhhHHHHHHHHHHHHHHHHHH Confidence 00000000000000 000000 0000 0 No 92 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.53 E-value=2.3e-12 Score=84.39 Aligned_cols=439 Identities=10% Similarity=0.048 Sum_probs=222.5 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCc--------------cee--ecchhHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKAD--------------HRA--VHSFARY 76 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~--------------~ri--~~n~~~~ 76 (513) |+ --+.+.-+++-....++.+.....+-|+|-..-....+ ......++ +.+ ..+|++- T Consensus 1 mn----~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~--~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~ 74 (502) T protein:vir:79 1 MA----ILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKA--RRENRTADQLSQYGAVSLREQARYLDNNHDLVIG 74 (502) T ss_pred Cc----hHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCC--CCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHH Confidence 22 11222222222222222222333344666432111111 11111111 111 3588999 Q ss_pred HHHHHHHHhhcC-CeeecCC----c-------HHHHHHHHHh----------cCHHHHHHHHHHHHhhCCeEEEEeeecC Q lcl|NC_019916. 77 IADFQTSYSVGN-AIAMSGP----S-------SDRLDDFNRR----------NDIDTLNYELYLDMTVTGRAYEYVYRDP 134 (513) Q Consensus 77 ivd~~~~~l~g~-p~~~~~~----~-------~~~l~~~~~~----------n~~~~~~~~~~~~a~~~G~~~~~v~~d~ 134 (513) +|+..+.+++|. ++++... + .+.++..|+. .+|...+..+.+..+..|.+|+.+..++ T Consensus 75 av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~ 154 (502) T protein:vir:79 75 VFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGR 154 (502) T ss_pred HHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecc Confidence 999999999996 6654321 1 1234444442 3678888889999999999999887765 Q ss_pred CCc------eeEEEE-EcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccc Q lcl|NC_019916. 135 SQK------GEVSVK-LDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTL 207 (513) Q Consensus 135 ~~~------~~~~~~-~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~ 207 (513) .+. ..+.+. ++|..+--.+++ ......+|.+ +..+.-..| +++.... ++. T Consensus 155 ~~~~~~g~~~~l~lq~iepd~l~~~~~~--~~~i~~GVe~------d~~Gr~~aY--~i~~~hP--------gd~----- 211 (502) T protein:vir:79 155 INSLTPSAGVHFWLEALEPDFIPMTSDE--SNRLNQGVFV------DDWGRPEKY--LVYKSRP--------VSG----- 211 (502) T ss_pred cCccCCCcccceEEEEecchhcCCCCCC--CCeeEeeeEE------CCCCceEEE--EEeecCC--------CCC----- Confidence 432 111222 566665333332 2345555543 111222222 2332111 110 Q ss_pred cccccccCcccc---eEEecCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccc Q lcl|NC_019916. 208 EVAEHSAQFGFP---MIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQ 279 (513) Q Consensus 208 ~~~~~~~~g~vP---vv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~ 279 (513) ....+.+|| |+++... .+|.|.|..++..+..++....-........+.-..+++......... T Consensus 212 ---~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~----- 283 (502) T protein:vir:79 212 ---RQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEP----- 283 (502) T ss_pred ---cccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccc----- Confidence 011234555 6665443 469999999888887776655443333333333333334321111000 Q ss_pred cccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_019916. 280 MVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLT 359 (513) Q Consensus 280 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 359 (513) ...+ .......+.+.+|.......++-++++.+..-+..++..+++.+.+.|..-.++|-.. T Consensus 284 ---~~~~---------------~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~ 345 (502) T protein:vir:79 284 ---DGNG---------------SKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSS 345 (502) T ss_pred ---ccCC---------------CCCccccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHH Confidence 0000 0001111223333322223456788898888888899999999999999999999322 Q ss_pred -cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccccccc----ccceeeEEeCCCCC--cCH Q lcl|NC_019916. 360 -DDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ-RYTVVAHIEERVNGKWDI----DPDEIGFIFRDNLP--TDD 431 (513) Q Consensus 360 -~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~-~~~li~~~l~~~~~~~~~----~~~~i~i~f~~~~p--~d~ 431 (513) ...++ . |-.++|..+......+...+..|...+.+ +++..+...-..+...-+ ....+.+.|..+-. .|. T Consensus 346 lt~D~s-~-nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP 423 (502) T protein:vir:79 346 TARNYN-G-TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDP 423 (502) T ss_pred Hhcccc-c-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccCh Confidence 22232 2 66677888888888888888877765554 444434333222221111 11234677844433 466 Q ss_pred HHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 432 VAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 432 ~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) ...+++.... +|+.|.+..+...+ .|+++.++++.+|++...+.--.........+ ...+.......+...+++ T Consensus 424 ~Ke~~a~~~~i~~Gl~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~--~~~~~~~~~~e~~~~~~~ 499 (502) T protein:vir:79 424 VKEAEAWKIQIRGGAATESDWVRAGG--RNPDDVKRRRKAEIDENRKLDLVFDTDPASDK--GGSSAATKRQEPQHTDDQ 499 (502) T ss_pred HHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCC Confidence 6667666554 79999999999997 48999999999887765443222111110000 011111111111111122 Q ss_pred cCC Q lcl|NC_019916. 510 RTS 512 (513) Q Consensus 510 ~~~ 512 (513) .++ T Consensus 500 ~e~ 502 (502) T protein:vir:79 500 SEE 502 (502) T ss_pred CCC Confidence 222 No 93 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.53 E-value=8.5e-14 Score=92.23 Aligned_cols=466 Identities=10% Similarity=-0.011 Sum_probs=204.1 Q ss_pred CCcccCCHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHY------NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |..+ ...+.+++..+. ..-+....+-.+||.|.|=- ..........+ +..+|.++.+|+..+++-- T Consensus 1 m~d~---~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~-~~~~~~l~~q~----rp~~N~i~~~v~~v~g~e~ 72 (725) T protein:vir:10 1 MADN---ENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWD-DWLSQYTTLQY----RGQFDVVRPVVRKLVSEMR 72 (725) T ss_pred CCch---HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcC----CCcccchHHHHHHHHhhHH Confidence 3322 333444443322 22233466777899998621 11111222222 3357999999999999987 Q ss_pred cCCeeec--CC--cH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec---CCC-ceeEEEEE-----c Q lcl|NC_019916. 87 GNAIAMS--GP--SS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD---PSQ-KGEVSVKL-----D 145 (513) Q Consensus 87 g~p~~~~--~~--~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d---~~~-~~~~~~~~-----~ 145 (513) -+.+.+. .. ++ ..++.+.+.++++...+.+..+++++|.||+-|..| +++ ...+.+.+ + T Consensus 73 ~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~ 152 (725) T protein:vir:10 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) T ss_pred hCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccC Confidence 6665542 21 11 124556677999999999999999999999877433 332 11121111 2 Q ss_pred ccceEEEecCCCC------cceEEEEEEEeecc--------cc----------cc--------cceeEEEEEEEcCCcE- Q lcl|NC_019916. 146 PMECFIIYDRSVN------PKPIMAVRYHAVQT--------VV----------DN--------ITQTKYEVETWTENDY- 192 (513) Q Consensus 146 p~~~~~~~d~~~~------~~~~~~ir~~~~~~--------~~----------~~--------~~~~~~~ve~yt~~~~- 192 (513) |.+++ ||+... .+- .+++.|.... +. +. ....+..+++|....+ T Consensus 153 ~~~v~--~Dp~a~~~D~sDar~-~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~ 229 (725) T protein:vir:10 153 CSHVI--WDSNSKLMDKSDARH-CTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred HhHcc--cCchhhccChhhhhh-hhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEe Confidence 33333 444210 111 1111221100 00 00 0112222333332111 Q ss_pred -EEEEee-ccCCc-----------------------------------------cccccccccccCcccceEEecCC--- Q lcl|NC_019916. 193 -TRYKPI-VVAGS-----------------------------------------VPTLEVAEHSAQFGFPMIEYRNN--- 226 (513) Q Consensus 193 -~~~~~~-~~~~~-----------------------------------------~~~~~~~~~~~~g~vPvv~~~n~--- 226 (513) ..+... ...+. ..+..+..+.+.+.||+|+|.-. T Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~ 309 (725) T protein:vir:10 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred eEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeec Confidence 000000 00000 00111222344456888876422 Q ss_pred CC----CCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc Q lcl|NC_019916. 227 EY----RQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA 302 (513) Q Consensus 227 ~~----~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 302 (513) .. +.|.+.++++.|+.+|..+|.+...+-..+.-... |..+.. ...... +.. T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~--~~~~~i-----------~~~e~~-----------~~~ 365 (725) T protein:vir:10 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPF--FWPEQI-----------AGFEHM-----------YDG 365 (725) T ss_pred cCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCcccc--ccHhhh-----------hHHHHH-----------Hhc Confidence 12 33888899999999999999988776433222111 110000 000000 000 Q ss_pred chhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHH Q lcl|NC_019916. 303 MRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVEL 382 (513) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k 382 (513) ......+........++....+...+.....-+.++...++.....|-..|++-+...+..+++.||+||..+-...... T Consensus 366 ~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~ 445 (725) T protein:vir:10 366 NDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLE 445 (725) T ss_pred cCCceeeecccccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHH Confidence 00011111111111111112222333333333456777999999999999999888877777789999999887666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccccc------cc----c------------------------ceeeEEeCCCCC Q lcl|NC_019916. 383 ASTKRKQFERGLNQRYTVVAHIEERVNGKWD------ID----P------------------------DEIGFIFRDNLP 428 (513) Q Consensus 383 ~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~------~~----~------------------------~~i~i~f~~~~p 428 (513) ....-..+..+.+++.+++++++...-.... .+ + .++.|.=.+..+ T Consensus 446 l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~ 525 (725) T protein:vir:10 446 TYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQ 525 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcH Confidence 6666677777777777777766544321100 00 0 112222223333 Q ss_pred cCHHHHHHHHHHHhcCCC----H--HHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHh-------------hhhcCCCCC Q lcl|NC_019916. 429 TDDVAIITALVQAGAQIP----Q--EYLYQYLPN--VTDADEIVKMMDKQRKAMLKTY-------------DTKGGLIIN 487 (513) Q Consensus 429 ~d~~e~a~~~~kl~g~iS----~--et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~-------------~~~~~~~~~ 487 (513) .-..+.+..++.+...++ . .+++..++. .+..++-.+++.++........ ......... T Consensus 526 s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~ 605 (725) T protein:vir:10 526 SMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQD 605 (725) T ss_pred HHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhH Confidence 223344444444433222 1 223332322 2223344455543321110000 000000000 Q ss_pred C-CCCCCC--CCCCCCCCCCCCCCcc-CCC Q lcl|NC_019916. 488 G-TSGNDP--EDEGVRGQQGEPEDER-TSD 513 (513) Q Consensus 488 ~-~~~~~~--~~~~~~~~~~~~~~~~-~~~ 513 (513) . ...... .......+...-+..+ ..| T Consensus 606 ~e~~q~~~~~~~~qae~~ka~aE~~k~~~~ 635 (725) T protein:vir:10 606 PAMVQAQGVLLQGQAELAKAQNQTLSLQID 635 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000000 0000000000000000 000 No 94 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.49 E-value=5.4e-12 Score=82.36 Aligned_cols=445 Identities=8% Similarity=-0.032 Sum_probs=233.2 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cccccccCCCCCCcc--------------ee--ecchhH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGI-LSPASRRNEKGKADH--------------RA--VHSFAR 75 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~-~~~~~~~~~~~~~~~--------------ri--~~n~~~ 75 (513) |-..-..+.++.+++..-.-.+..........|+|-.... ...+...+....++. .+ .++|++ T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 4444456666666665433233333344455677533211 011111111111111 11 358899 Q ss_pred HHHHHHHHHhhc-CCeeecCC--------cH---HHHHHHHHh------------cCHHHHHHHHHHHHhhCCeEEEEee Q lcl|NC_019916. 76 YIADFQTSYSVG-NAIAMSGP--------SS---DRLDDFNRR------------NDIDTLNYELYLDMTVTGRAYEYVY 131 (513) Q Consensus 76 ~ivd~~~~~l~g-~p~~~~~~--------~~---~~l~~~~~~------------n~~~~~~~~~~~~a~~~G~~~~~v~ 131 (513) -+|+..+..++| .++++... ++ +.++..|+. .+|...+..+.+..+..|.+|+.+. T Consensus 81 ~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~ 160 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREH 160 (505) T ss_pred HHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEe Confidence 999999999999 68887543 11 234444432 1366778889999999999998887 Q ss_pred ecCCCceeEEEE-EcccceEEEecC--CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccc Q lcl|NC_019916. 132 RDPSQKGEVSVK-LDPMECFIIYDR--SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLE 208 (513) Q Consensus 132 ~d~~~~~~~~~~-~~p~~~~~~~d~--~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~ 208 (513) ..+++...+.+. ++|..+-..++. ........+|.+ +..+.-..| +++.... ++..... T Consensus 161 ~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~------d~~Gr~~aY--~i~~~hP--------gd~~~~~-- 222 (505) T protein:vir:96 161 RGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIEL------DAWERPVAY--HLLVNHP--------GDNSYCY-- 222 (505) T ss_pred ecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEE------CCCCceEEE--EEeecCC--------Ccccccc-- Confidence 666554333333 566665433321 112334556643 111222222 3333211 1000000 Q ss_pred ccccccCcccc---eEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccccc Q lcl|NC_019916. 209 VAEHSAQFGFP---MIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQM 280 (513) Q Consensus 209 ~~~~~~~g~vP---vv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~ 280 (513) ......+.+|| |+++.. -.+|.|.|..++..+..++....-........+.-..+++......... T Consensus 223 ~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~------ 296 (505) T protein:vir:96 223 HYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQP------ 296 (505) T ss_pred ccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCc------ Confidence 00112244555 455433 3479999999888877766655544444433333333444422111000 Q ss_pred ccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccc Q lcl|NC_019916. 281 VDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTD 360 (513) Q Consensus 281 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 360 (513) .. +.....+..+..+.+.. ..++.++++++.+-+..++..+.+.+.+.|..-.++|-... T Consensus 297 --~~---------~~~~~~~~~l~pG~i~~---------L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~l 356 (505) T protein:vir:96 297 --PE---------DDQGEIVEEVEAGTYQL---------LPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRL 356 (505) T ss_pred --cc---------cccCccccccCCceeee---------cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 00 00001111222222222 34567889998888889999999999999999999884332 Q ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccccccccc---ceeeEEeCCCCC--cCHHHH Q lcl|NC_019916. 361 DNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ-RYTVVAHIEERVNGKWDIDP---DEIGFIFRDNLP--TDDVAI 434 (513) Q Consensus 361 ~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~-~~~li~~~l~~~~~~~~~~~---~~i~i~f~~~~p--~d~~e~ 434 (513) ..--+++|-.+.|..+......+...+..|...+.+ +++..+...-..+...-... ..+.+.|..+-- .|.... T Consensus 357 t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke 436 (505) T protein:vir:96 357 AHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKD 436 (505) T ss_pred hcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHH Confidence 211134566678888888888888888888775444 55554544333222111111 124567754333 466666 Q ss_pred HHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 435 ITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 435 a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) +++.... +|+.|.+..+...+ .|+++.++++.+|++...+.-- .... ..........++++..++|+ T Consensus 437 ~~a~~~~i~~G~~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl----~~~~--~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 437 SKAHSESIKNRTRSRSSIIRAAG--DDPEDVFDEIAWEEQLMRDKGV----NPTP--PEQESKDATTDEEDDSASDD 505 (505) T ss_pred HHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCC----CCCC--CCCCCCCCCCCCCCCCCCCC Confidence 7766654 79999999999987 4899999999988776544221 1111 11111111111111122222 No 95 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.48 E-value=4.6e-12 Score=82.76 Aligned_cols=459 Identities=10% Similarity=-0.034 Sum_probs=218.4 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCc--------------cee--ecchhHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKAD--------------HRA--VHSFARY 76 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~--------------~ri--~~n~~~~ 76 (513) |. ++. + ..-..++.......||.|--.--.+-....+....++ +.+ .++|.+- T Consensus 1 ~~----~~~----~---~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~ 69 (530) T protein:vir:38 1 MK----IPS----L---VGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAAN 69 (530) T ss_pred Cc----cce----e---ecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHH Confidence 00 000 0 0001122345667777653110000000001111110 111 3589999 Q ss_pred HHHHHHHHhhcCCeeecCCc-----------H----HHHHHHHHh--------------cCHHHHHHHHHHHHhhCCeEE Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPS-----------S----DRLDDFNRR--------------NDIDTLNYELYLDMTVTGRAY 127 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~-----------~----~~l~~~~~~--------------n~~~~~~~~~~~~a~~~G~~~ 127 (513) +|+..+.+++|.+++..... + +.++..|+. .+|...+..+.+..++.|.+| T Consensus 70 av~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~ 149 (530) T protein:vir:38 70 AVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELC 149 (530) T ss_pred HHHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceE Confidence 99999999999999875421 1 234444432 246677888999999999999 Q ss_pred EEeeecCCCcee--EEEE-EcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcc Q lcl|NC_019916. 128 EYVYRDPSQKGE--VSVK-LDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSV 204 (513) Q Consensus 128 ~~v~~d~~~~~~--~~~~-~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~ 204 (513) +.+..++++... +.+. ++|..+--.++.........+|.+ +..+.-..|+ ++.... .+... T Consensus 150 ~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~------d~~Gr~~aY~--i~~~~~--------~~~~~ 213 (530) T protein:vir:38 150 VQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKI------NDSGAALGYY--VSDDGY--------PGWMA 213 (530) T ss_pred EEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEE------CCCCceEEEE--EeeccC--------CCccc Confidence 888766543221 2222 466654433333334455666644 1122222333 332110 00000 Q ss_pred -ccccccccccCcccceEEecCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccc Q lcl|NC_019916. 205 -PTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLL 278 (513) Q Consensus 205 -~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~ 278 (513) .+........++.--|+++... .+|.|.|..++..+..++...--........+.-..+++......... T Consensus 214 ~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~---- 289 (530) T protein:vir:38 214 QNWTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAM---- 289 (530) T ss_pred cccceeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccc---- Confidence 0000000111222225665443 468999999887776665544333322222222222333221111000 Q ss_pred ccccc-hhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 279 QMVDP-SDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 279 ~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) ..... ...+....+.......... .....+.+.+|.. .....+.++++++.+-+..++..+.+.+.+.|..-.++|- T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~pG~i-~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~y 367 (530) T protein:vir:38 290 DFILGADNKEQQSKLTGWLGEMAAY-YSAAPVRLGGARV-PHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSY 367 (530) T ss_pred cccccCCcccccccccccchhhhhc-ccccceeccCcee-eecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCH Confidence 00000 0000001111111100000 0111223333322 2234567889999888888999999999999999999885 Q ss_pred cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccc------cc-----ceeeEEeCC Q lcl|NC_019916. 358 LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGL-NQRYTVVAHIEERVNGKWDI------DP-----DEIGFIFRD 425 (513) Q Consensus 358 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l-~~~~~li~~~l~~~~~~~~~------~~-----~~i~i~f~~ 425 (513) .....--+++|-.+.|..+......+...+..|...+ +.+++..+... ...+.... ++ ..+.+.|.. T Consensus 368 e~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a-v~~G~i~~p~~~~~~~~~~~~a~~~~~w~~ 446 (530) T protein:vir:38 368 EQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEA-IVRRVVTLPSKARFSFQEARTAWGNANWIG 446 (530) T ss_pred HHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH-HHcCCccCCCCCCCCchhhHHhhhceeeec Confidence 3322111344566788877777777877777776643 33444333321 12221111 11 123466743 Q ss_pred CC--CcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 426 NL--PTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG 501 (513) Q Consensus 426 ~~--p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) +- -.|....+++.... +|+.|.+.++...+ .|+++.++++.+|++...+.--....... .........++.++ T Consensus 447 p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~-~~~~~~~~~~~~~~ 523 (530) T protein:vir:38 447 SGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRESMERRAAGLNPPAWAA-AAFEAGVKKSNEEE 523 (530) T ss_pred CCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCCCcc-cccCCCCCCCCCCC Confidence 33 24666666666554 78999999999987 48999999999888765543211111100 01111111111111 Q ss_pred CCCCCCC Q lcl|NC_019916. 502 QQGEPED 508 (513) Q Consensus 502 ~~~~~~~ 508 (513) +++..+. T Consensus 524 ~d~~~~a 530 (530) T protein:vir:38 524 QDGARAA 530 (530) T ss_pred CCCCCCC Confidence 1111111 No 96 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.45 E-value=2.5e-12 Score=84.20 Aligned_cols=473 Identities=11% Similarity=0.027 Sum_probs=205.3 Q ss_pred cccCCHHHHHHHHHHHHHH------HHHHHHHHHHHh--cCCCcccc---ccccccCCCCCCcceeecchhHHHHHHHHH Q lcl|NC_019916. 15 ADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYY--RGQNDGIL---SPASRRNEKGKADHRAVHSFARYIADFQTS 83 (513) Q Consensus 15 ~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY--~G~~~i~~---~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~ 83 (513) ..+.+.+.+.+++..+... .+.....=.+|| .|+|=-.. .-.......++| .+.+|.++.+|+..++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP--~~~~N~i~~~v~~v~g 78 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIA 78 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCC--ceEEcchHHHHHHHHH Confidence 3344556666666543221 111222223355 57651000 000010111111 3778999999999999 Q ss_pred HhhcCCeeec--C---CcHH--------HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec---CC------CceeEE Q lcl|NC_019916. 84 YSVGNAIAMS--G---PSSD--------RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD---PS------QKGEVS 141 (513) Q Consensus 84 ~l~g~p~~~~--~---~~~~--------~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d---~~------~~~~~~ 141 (513) +-..+.+.+. . +.+. .++.+++.++++...+.+..+++++|.||+-+..| +. ...++. T Consensus 79 ~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~ 158 (708) T protein:vir:10 79 EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIE 158 (708) T ss_pred HHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceE Confidence 9988776652 1 1121 25567788999999999999999999999877554 11 122222 Q ss_pred EEEcccceEEEecCCCC-cce----EEEEEEEeecc---------------c--------ccccceeEEEEEEEcCCc-- Q lcl|NC_019916. 142 VKLDPMECFIIYDRSVN-PKP----IMAVRYHAVQT---------------V--------VDNITQTKYEVETWTEND-- 191 (513) Q Consensus 142 ~~~~p~~~~~~~d~~~~-~~~----~~~ir~~~~~~---------------~--------~~~~~~~~~~ve~yt~~~-- 191 (513) ...+|... +.||+... ..+ ..+++.|...+ . +......+..+++|.... T Consensus 159 ~~~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~~~ 237 (708) T protein:vir:10 159 PIYDPSRS-VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKES 237 (708) T ss_pred Eeecchhh-cccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEEEEE Confidence 22344321 12332210 000 01111110000 0 000001122222222111 Q ss_pred -------------EEEEEeec------------------------------cCCccccccccccccCcccceEEecCC-- Q lcl|NC_019916. 192 -------------YTRYKPIV------------------------------VAGSVPTLEVAEHSAQFGFPMIEYRNN-- 226 (513) Q Consensus 192 -------------~~~~~~~~------------------------------~~~~~~~~~~~~~~~~g~vPvv~~~n~-- 226 (513) +..|.... ..+ ....+...+.+++.+|+|+|.-. T Consensus 238 ~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g-~~~le~~~~~p~~~fP~vP~~g~r~ 316 (708) T protein:vir:10 238 VDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDG-DGFLEKPRRIPGEHIPLIPVYGKRW 316 (708) T ss_pred EEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecc-hhhhccCCCCCCCceeeEEEeeeee Confidence 11110000 000 00122334566788899987532 Q ss_pred -----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhh Q lcl|NC_019916. 227 -----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLE 301 (513) Q Consensus 227 -----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 301 (513) ....|.+.++++.|+.+|+.+|.+...+........++........ ... ...........+. T Consensus 317 ~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~-----------~~~--~~~~~~~~~~~~~ 383 (708) T protein:vir:10 317 FIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGL-----------EKH--WEARNKKRPAFLP 383 (708) T ss_pred ccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhH-----------HHH--Hhhccccchhhhc Confidence 1235778899999999999999998777544333222211100000 000 0000000000000 Q ss_pred cchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHH Q lcl|NC_019916. 302 AMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVE 381 (513) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~ 381 (513) ... +.-+.|....+... ...+....-..++...++.....|-.+|++-+...+. .+|.||+||..+-..-.. T Consensus 384 ~~~----~~~~~G~~~~~~~~---~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~-~sn~SG~aI~~rq~qg~~ 455 (708) T protein:vir:10 384 LRE----VRDKSGNIIAGATP---AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADM 455 (708) T ss_pred ccc----ccccccccccccCC---ccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccC-ccchHHHHHHHHHHHHHH Confidence 000 00000000000001 1111122334567788889999999999998777664 567999999988776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cc--------------cc--------c-------eeeEEeCCC Q lcl|NC_019916. 382 LASTKRKQFERGLNQRYTVVAHIEERVNGKW------DI--------------DP--------D-------EIGFIFRDN 426 (513) Q Consensus 382 k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~------~~--------------~~--------~-------~i~i~f~~~ 426 (513) .....-..+..+.+++.+++++++....+.. .. +. + +|.|.=.+. T Consensus 456 ~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~ 535 (708) T protein:vir:10 456 ASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPS 535 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccC Confidence 7777777788888888887777765532110 00 00 1 122222334 Q ss_pred CCcCHHHHHHHHHHHhcCCCH---HH------HHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHhhhhcCC Q lcl|NC_019916. 427 LPTDDVAIITALVQAGAQIPQ---EY------LYQYLPNVTDADEIVKMMDKQRKA-------------MLKTYDTKGGL 484 (513) Q Consensus 427 ~p~d~~e~a~~~~kl~g~iS~---et------~~~~l~~v~D~~~E~~ri~~E~~~-------------~~~~~~~~~~~ 484 (513) .+.-..+.++.++++.+.++. .+ +++.+. ....++-++++++.... .....+..... T Consensus 536 ~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D-~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~ 614 (708) T protein:vir:10 536 YTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNID-GEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQS 614 (708) T ss_pred chhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcC-CcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHH Confidence 444445566666665443322 11 223332 23333334444332100 00000000000 Q ss_pred CC---CCCCCCCCCCCCCCCCCCC-C-------CCccCCC Q lcl|NC_019916. 485 II---NGTSGNDPEDEGVRGQQGE-P-------EDERTSD 513 (513) Q Consensus 485 ~~---~~~~~~~~~~~~~~~~~~~-~-------~~~~~~~ 513 (513) .. .......-....+..+... . .-....| T Consensus 615 q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~ 654 (708) T protein:vir:10 615 QPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQD 654 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000000000000000000 0 0000000 No 97 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.43 E-value=9.7e-12 Score=80.95 Aligned_cols=462 Identities=13% Similarity=0.138 Sum_probs=199.5 Q ss_pred chhhceeccCCcc-cCCHHHHHHHHHHH----HHHHHH---HH----------HHHHHHhcCCCccccccccccCCCCCC Q lcl|NC_019916. 4 MQQANMNYQEDAD-KLTPTRIAAFIRHH----YNNQRP---RL----------EMLYDYYRGQNDGILSPASRRNEKGKA 65 (513) Q Consensus 4 ~~~~~~~~~~~~~-~~~~~~i~~~i~~~----~~~~~~---~~----------~~~~~YY~G~~~i~~~~~~~~~~~~~~ 65 (513) ++.|.-......+ -++.+.|..+|.++ .+.+.+ +. .++.+||.|...- .....+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~---~~~~~~~~~r- 76 (651) T protein:vir:80 1 MKLATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLR---SVGDVNADWR- 76 (651) T ss_pred CcccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhcccccc---ccCCCCCCCC- Confidence 3333333332222 24555554444443 333221 11 1456677775321 1111111111 Q ss_pred cceeecchhHHHHHHHHHHhhcC-----C-eeecC--CcH------HHHHHHHH----hcCHHHHHHHHHHHHhhCCeEE Q lcl|NC_019916. 66 DHRAVHSFARYIADFQTSYSVGN-----A-IAMSG--PSS------DRLDDFNR----RNDIDTLNYELYLDMTVTGRAY 127 (513) Q Consensus 66 ~~ri~~n~~~~ivd~~~~~l~g~-----p-~~~~~--~~~------~~l~~~~~----~n~~~~~~~~~~~~a~~~G~~~ 127 (513) ++++.+..+..|+..+..|+.. . +.+.. +.+ .++..++. ..+|......+..+++++|.|+ T Consensus 77 -s~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i 155 (651) T protein:vir:80 77 -HKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSV 155 (651) T ss_pred -ccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceE Confidence 3688999999999888887653 1 22211 112 12444443 5678888889999999999999 Q ss_pred EEeeecCC-------------------------------CceeEEEEEcccceEEEecCCCCc--ceEEEEEEEeeccc- Q lcl|NC_019916. 128 EYVYRDPS-------------------------------QKGEVSVKLDPMECFIIYDRSVNP--KPIMAVRYHAVQTV- 173 (513) Q Consensus 128 ~~v~~d~~-------------------------------~~~~~~~~~~p~~~~~~~d~~~~~--~~~~~ir~~~~~~~- 173 (513) +.||++.. |.+.+ ..|+|.++++ |++... .-.+.+|.+..... T Consensus 156 ~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i-~~v~p~~~~~--dp~a~~~~d~~~v~~~~~t~~~l 232 (651) T protein:vir:80 156 LALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDF-EVLDMFDCFY--DPNVTDPNRGAFIRKLTKTKADI 232 (651) T ss_pred EEEeecceeeeeehheeccccccccccceeeeccceeeeceeEE-EEecHHHeee--cCCCcCccccceeeeeeeeHHHH Confidence 99987631 11222 2367777654 443211 11222333211000 Q ss_pred --------c---------c-------------------------ccceeEEEEEEEcC-----CcEEEEEeeccCCcccc Q lcl|NC_019916. 174 --------V---------D-------------------------NITQTKYEVETWTE-----NDYTRYKPIVVAGSVPT 206 (513) Q Consensus 174 --------~---------~-------------------------~~~~~~~~ve~yt~-----~~~~~~~~~~~~~~~~~ 206 (513) . . .....+..+|+|.. .....+.....+... T Consensus 233 ~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~i-- 310 (651) T protein:vir:80 233 LNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEV-- 310 (651) T ss_pred HHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEE-- Confidence 0 0 00001112233321 111111111111111 Q ss_pred cccccccc-CcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccccc Q lcl|NC_019916. 207 LEVAEHSA-QFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQM 280 (513) Q Consensus 207 ~~~~~~~~-~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~ 280 (513) .....++ +..+|++.++. ..+|+|..+.+.+.+..+|.+...+.+.+.-.++|.+.+-..+..... T Consensus 311 -l~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~------- 382 (651) T protein:vir:80 311 -LRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPE------- 382 (651) T ss_pred -ecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHH------- Confidence 1111222 34568766643 357999999999999999999999999999999998765421111000 Q ss_pred ccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_019916. 281 VDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLT 359 (513) Q Consensus 281 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~ 359 (513) .+. ...++++. .+..+++.++... .+.......++.+...+...++++++. T Consensus 383 ------------------~l~-~~pg~vi~---------~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~ 434 (651) T protein:vir:80 383 ------------------DVY-TEPGKVFL---------VSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYV 434 (651) T ss_pred ------------------Hhh-cCCCceEE---------ecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHH Confidence 000 01122222 2234556666544 244556778999999999999998866 Q ss_pred ccc---ccccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcccccc----------------ccccee Q lcl|NC_019916. 360 DDN---FSGNSSGVAMKYKVLGTVELASTKRKQFER-GLNQRYTVVAHIEERVNGKWD----------------IDPDEI 419 (513) Q Consensus 360 ~~~---~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~-~l~~~~~li~~~l~~~~~~~~----------------~~~~~i 419 (513) .+. ..++.++.+++.+...+.......-+.|.. +++.+++.++.++........ ....++ T Consensus 435 ~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl 514 (651) T protein:vir:80 435 GANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDL 514 (651) T ss_pred hCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccce Confidence 442 224456666666666666666655556654 556566555555543321100 011123 Q ss_pred eEEe--CCCCCcCHHHHHHHHHHHh------cCCC---H-----HH---HHHhCCCCCCHHHHHHH------HHHHHHHH Q lcl|NC_019916. 420 GFIF--RDNLPTDDVAIITALVQAG------AQIP---Q-----EY---LYQYLPNVTDADEIVKM------MDKQRKAM 474 (513) Q Consensus 420 ~i~f--~~~~p~d~~e~a~~~~kl~------g~iS---~-----et---~~~~l~~v~D~~~E~~r------i~~E~~~~ 474 (513) ++.+ ...-+....+..+.+.++. +..+ . +. +++.++ +.++..=+.. ...+++. T Consensus 515 ~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g-~~~~~~~l~~~~q~~~~~~~~~~- 592 (651) T protein:vir:80 515 QKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWG-FEEPEAYLKQQDQQAPANPQEAL- 592 (651) T ss_pred eeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcC-CCCcHHhcCCCccchhhhhhHHH- Confidence 2222 1112222222222222221 1111 1 11 122222 2222110000 0000000 Q ss_pred HHHhhhhcC----CCC-CCCCCCCCCCCCC--CCCCCCCCCccCCC Q lcl|NC_019916. 475 LKTYDTKGG----LII-NGTSGNDPEDEGV--RGQQGEPEDERTSD 513 (513) Q Consensus 475 ~~~~~~~~~----~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 513 (513) +..+..... ... .+.....+..... .......+-+++-. T Consensus 593 ~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 638 (651) T protein:vir:80 593 LSQAKDVGGQAMSNMLQNQLQADGGTQMMSEMYGTPNADQMQQELM 638 (651) T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 000 0000000000000 00000000000000 No 98 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.42 E-value=3.2e-13 Score=89.06 Aligned_cols=462 Identities=11% Similarity=-0.006 Sum_probs=200.2 Q ss_pred CCcccCCHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHY------NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |..+ ...+.+++..+. ..-+.....-.+||.|.|=- ..........+ +..+|.++.+|+..+++-- T Consensus 1 m~d~---~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~-~~~~~~l~~q~----rp~~N~i~~~i~~v~g~~~ 72 (725) T protein:vir:77 1 MADN---ENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWD-DWLSQYTTLQY----RGQFDVVRPVVRKLVSEMR 72 (725) T ss_pred CCch---HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCC-HHHHHHHHhcC----CCccccHHHHHHHHHhhHH Confidence 3322 223333333321 22233456667899998621 11111122222 3357999999999999887 Q ss_pred cCCeeec--CC--cH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec---CCC-ceeEEEEE-----c Q lcl|NC_019916. 87 GNAIAMS--GP--SS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD---PSQ-KGEVSVKL-----D 145 (513) Q Consensus 87 g~p~~~~--~~--~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d---~~~-~~~~~~~~-----~ 145 (513) -+.+.+. .. ++ ..++.+.+.++++...+.+..+++++|.||+-|+.| +++ ...+.+.+ + T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~ 152 (725) T protein:vir:77 73 QNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) T ss_pred hCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccC Confidence 7665542 21 11 125566677999999999999999999999877544 221 11121111 2 Q ss_pred ccceEEEecCCCC------cceEEEEEEEeecc---------------------cc-----cccceeEEEEEEEcCCcEE Q lcl|NC_019916. 146 PMECFIIYDRSVN------PKPIMAVRYHAVQT---------------------VV-----DNITQTKYEVETWTENDYT 193 (513) Q Consensus 146 p~~~~~~~d~~~~------~~~~~~ir~~~~~~---------------------~~-----~~~~~~~~~ve~yt~~~~~ 193 (513) |.++ +||+... .+-++ ++.|...+ .. -.....+..+++|....+. T Consensus 153 ~~~v--~~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~ 229 (725) T protein:vir:77 153 CSHV--IWDSNSKLMDKSDARHCT-VIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred hhhc--eeCchhhccChhhHHHHH-HHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEe Confidence 3333 3333211 00011 11111100 00 0001223334444422211 Q ss_pred --EEEeec-cCC-----------------------------------------ccccccccccccCcccceEEecCC--- Q lcl|NC_019916. 194 --RYKPIV-VAG-----------------------------------------SVPTLEVAEHSAQFGFPMIEYRNN--- 226 (513) Q Consensus 194 --~~~~~~-~~~-----------------------------------------~~~~~~~~~~~~~g~vPvv~~~n~--- 226 (513) .+.... ..+ ...+..+..+.+.+.||+|+|.-. T Consensus 230 ~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~ 309 (725) T protein:vir:77 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) T ss_pred eEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeec Confidence 110000 000 000111223344566888876432 Q ss_pred CC----CCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhh-hheecCcccccccccccccccchhhhhhhccccccchhhh Q lcl|NC_019916. 227 EY----RQGDFENVLSLIDLYDVAQSDTANYMTDLNEAM-LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLE 301 (513) Q Consensus 227 ~~----~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~-l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 301 (513) .. +.|-+.++++.++.+|..+|.+...+.....-. .+-.|... ............ T Consensus 310 ~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~--------------~~~~~~~~~~~~------ 369 (725) T protein:vir:77 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIA--------------GFEHMYDGNDDY------ 369 (725) T ss_pred cCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhh--------------HHHHHHHhccCC------ Confidence 22 337788999999999999999886664333211 11111000 000000000000 Q ss_pred cchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHH Q lcl|NC_019916. 302 AMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVE 381 (513) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~ 381 (513) ..+.........|....+.+.......=+.++...++.....|-..|++-+...+..+++.||+||...-..... T Consensus 370 -----~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~ 444 (725) T protein:vir:77 370 -----PYYLLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADL 444 (725) T ss_pred -----ceecccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHH Confidence 000000000111111122222222222234566689999999999999888777777777999999988776776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------ccccc------------------------ceeeEEeCCCC Q lcl|NC_019916. 382 LASTKRKQFERGLNQRYTVVAHIEERVNGK----------WDIDP------------------------DEIGFIFRDNL 427 (513) Q Consensus 382 k~~~~~~~f~~~l~~~~~li~~~l~~~~~~----------~~~~~------------------------~~i~i~f~~~~ 427 (513) .+...-..+..+.+++.+++++++...... ...++ .+|.|.=.+.. T Consensus 445 ~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~ 524 (725) T protein:vir:77 445 ETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSF 524 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccch Confidence 767777777788888877777765443210 00000 11222222232 Q ss_pred CcCHHHHHHHHHHHhcCCCH------HHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhh---------------hcCC Q lcl|NC_019916. 428 PTDDVAIITALVQAGAQIPQ------EYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDT---------------KGGL 484 (513) Q Consensus 428 p~d~~e~a~~~~kl~g~iS~------et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~---------------~~~~ 484 (513) +.=..+.++.++.+...++. -++...++. ....++.++++.++.......... .... T Consensus 525 ~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~ 604 (725) T protein:vir:77 525 QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQ 604 (725) T ss_pred HHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhH Confidence 22223334444444322221 122222221 111233344444322111000000 0000 Q ss_pred C---CCCCC-CCCCCCCCCCCCCCCCCCc-cCCC Q lcl|NC_019916. 485 I---INGTS-GNDPEDEGVRGQQGEPEDE-RTSD 513 (513) Q Consensus 485 ~---~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~ 513 (513) . ...+. ......+.. ...-+.. ...+ T Consensus 605 ~~e~~q~q~~~~~~qa~~~---kaq~e~~k~q~~ 635 (725) T protein:vir:77 605 DPAMVQAQGVLLQGQAELA---KAQNQTLSLQID 635 (725) T ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH Confidence 0 00000 000000000 0000000 0000 No 99 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.42 E-value=9e-13 Score=86.62 Aligned_cols=467 Identities=11% Similarity=-0.005 Sum_probs=196.5 Q ss_pred CCcccCCHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHY------NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |..+ .+.+.+++..+. ..-+....+-.+||.|.|=- ..........+ +..+|.++.+|+..+++-- T Consensus 1 m~d~---~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~-~~~~~~l~~q~----rp~~N~i~~~i~~v~g~e~ 72 (725) T protein:vir:92 1 MADN---ENRLESILSRFDADWTASDEARREAKNDLFFSRISQWD-DWLSQYTTLQY----RGQFDVVRPVVRKLVSEMR 72 (725) T ss_pred CCch---HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC-HHHHHHHHhcC----CCcccchHHHHHHHHhhHH Confidence 3322 233444443322 22233466777899998621 11111122222 3357999999999999877 Q ss_pred cCCeeec--CC--cH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec---CCC-ceeEEEEE----cc Q lcl|NC_019916. 87 GNAIAMS--GP--SS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD---PSQ-KGEVSVKL----DP 146 (513) Q Consensus 87 g~p~~~~--~~--~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d---~~~-~~~~~~~~----~p 146 (513) -+.+.+. .. ++ ..++.+.+.++++...+.+..+++++|.||+-|..| +++ ...+.+.+ +| T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~ 152 (725) T protein:vir:92 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) T ss_pred hCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCC Confidence 6655442 21 11 125566677999999999999999999999877543 221 11122221 22 Q ss_pred cceEEEecCCCC------cceEEEEEEEeecc---------------------ccc-----ccceeEEEEEEEcCCcE-- Q lcl|NC_019916. 147 MECFIIYDRSVN------PKPIMAVRYHAVQT---------------------VVD-----NITQTKYEVETWTENDY-- 192 (513) Q Consensus 147 ~~~~~~~d~~~~------~~~~~~ir~~~~~~---------------------~~~-----~~~~~~~~ve~yt~~~~-- 192 (513) ... +.||+... .+-++ ++.|...+ ... .....+..+++|....+ T Consensus 153 ~~~-V~~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~ 230 (725) T protein:vir:92 153 CSH-VIWDSNSKLMDKSDSRHCT-VIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKE 230 (725) T ss_pred hhh-cccCchhhccChhhHHHHH-HHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEee Confidence 221 12333211 00000 11111000 000 00112223333332111 Q ss_pred EEEEee-ccCC-----------------------------------------ccccccccccccCcccceEEecCC---C Q lcl|NC_019916. 193 TRYKPI-VVAG-----------------------------------------SVPTLEVAEHSAQFGFPMIEYRNN---E 227 (513) Q Consensus 193 ~~~~~~-~~~~-----------------------------------------~~~~~~~~~~~~~g~vPvv~~~n~---~ 227 (513) ..+... ...+ ...+..+..+.+.+.||+|+|.-. . T Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~ 310 (725) T protein:vir:92 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFV 310 (725) T ss_pred eEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeecc Confidence 000000 0000 000111222344456888876432 1 Q ss_pred C----CCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc Q lcl|NC_019916. 228 Y----RQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM 303 (513) Q Consensus 228 ~----~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 303 (513) . +.|-+.++++.++.+|+.+|.+...+-..+.-..+ +..+... ..... +... T Consensus 311 ~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~--~~~~~i~-----------~~~~~-----------~~~~ 366 (725) T protein:vir:92 311 EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPF--FWPEQIA-----------GFEHM-----------YDGN 366 (725) T ss_pred CCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccc--cchhhhh-----------HHHHH-----------Hhcc Confidence 2 33888899999999999999988666433221111 1000000 00000 0000 Q ss_pred hhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHH Q lcl|NC_019916. 304 RQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELA 383 (513) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~ 383 (513) .....+.........|.......++.....-+.++...++.....|-..|++-+-..+..+++.||+||..+-..-.... T Consensus 367 ~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l 446 (725) T protein:vir:92 367 DDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLET 446 (725) T ss_pred CccceeeccccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHH Confidence 00011111111111111122223333333344567779999999999999998777777777899999988766555555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccc----------------------------ceeeEEeCCCCCc Q lcl|NC_019916. 384 STKRKQFERGLNQRYTVVAHIEERVNGKWD------IDP----------------------------DEIGFIFRDNLPT 429 (513) Q Consensus 384 ~~~~~~f~~~l~~~~~li~~~l~~~~~~~~------~~~----------------------------~~i~i~f~~~~p~ 429 (513) ...-..|..+.+++.++++.++........ .+. .++.|.=.+..+. T Consensus 447 ~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s 526 (725) T protein:vir:92 447 YVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQS 526 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHH Confidence 556666666777776666665444321100 000 1111212222222 Q ss_pred CHHHHHHHHHHHhcCCCH------HHHHHhCC--CCCCHHHHHHHHHHHHHHHHH-------------HhhhhcCCCCCC Q lcl|NC_019916. 430 DDVAIITALVQAGAQIPQ------EYLYQYLP--NVTDADEIVKMMDKQRKAMLK-------------TYDTKGGLIING 488 (513) Q Consensus 430 d~~e~a~~~~kl~g~iS~------et~~~~l~--~v~D~~~E~~ri~~E~~~~~~-------------~~~~~~~~~~~~ 488 (513) -..+.+..++.+...++. -++...++ ......+..+++.++...... ............ T Consensus 527 ~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~ 606 (725) T protein:vir:92 527 MKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDP 606 (725) T ss_pred HHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHH Confidence 122333444444322221 11222221 111223334444332111000 000000000000 Q ss_pred -C--CCCCCCCCCCCCCCCCCCCccCC-C Q lcl|NC_019916. 489 -T--SGNDPEDEGVRGQQGEPEDERTS-D 513 (513) Q Consensus 489 -~--~~~~~~~~~~~~~~~~~~~~~~~-~ 513 (513) . ....-.......+...-+.++.. | T Consensus 607 e~~~~qa~~~~~qae~~kaqaE~~k~q~~ 635 (725) T protein:vir:92 607 AMVQAQGVLLQGQAELAKAQNQTLSLQID 635 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 00000000000000000000000 0 No 100 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.41 E-value=2.4e-11 Score=78.76 Aligned_cols=457 Identities=11% Similarity=-0.004 Sum_probs=217.0 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--ccccccccccCCCCCCc--------------cee--ecchh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQN--DGILSPASRRNEKGKAD--------------HRA--VHSFA 74 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~--~i~~~~~~~~~~~~~~~--------------~ri--~~n~~ 74 (513) |. +|-. ..+.- ... .........||.|-- .-....+ .+....++ +.+ .++|. T Consensus 1 ~~----~p~~-~~~~~--~~~-~~~~~~~~~y~~~a~~~~~~~~~w--~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a 70 (533) T protein:vir:34 1 MK----TPTI-PTLLG--PDG-MTSLREYAGYHGGGSGFGGQLRSW--NPPSESVDAALLPNFTRGNARADDLVRNNGYA 70 (533) T ss_pred CC----Cchh-hhhhc--ccc-cchHHHHHhhhhccCCCCCccccc--ccCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Confidence 11 1111 11100 111 122345667776631 1101111 01111111 011 36899 Q ss_pred HHHHHHHHHHhhcCCeeecCCc-----------H----HHHHHHHHh--------------cCHHHHHHHHHHHHhhCCe Q lcl|NC_019916. 75 RYIADFQTSYSVGNAIAMSGPS-----------S----DRLDDFNRR--------------NDIDTLNYELYLDMTVTGR 125 (513) Q Consensus 75 ~~ivd~~~~~l~g~p~~~~~~~-----------~----~~l~~~~~~--------------n~~~~~~~~~~~~a~~~G~ 125 (513) +-+|+..+.+++|.+++..... . +.++..|+. .+|...+..+++..++.|. T Consensus 71 ~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE 150 (533) T protein:vir:34 71 ANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGE 150 (533) T ss_pred HHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCc Confidence 9999999999999999876531 1 223333432 1466778889999999999 Q ss_pred EEEEeeecCCCceeE--EEE-EcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCC Q lcl|NC_019916. 126 AYEYVYRDPSQKGEV--SVK-LDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAG 202 (513) Q Consensus 126 ~~~~v~~d~~~~~~~--~~~-~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~ 202 (513) +|+...+.+.+...+ .+. ++|..+--.++.........+|.+ +..+.-..|+ ++.... .+. T Consensus 151 ~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~------d~~Gr~~aY~--i~~~~~--------~~~ 214 (533) T protein:vir:34 151 LFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQI------NDSGAALGYY--VSEDGY--------PGW 214 (533) T ss_pred eEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEE------CCCCCeEEEE--EeecCC--------CCc Confidence 999887766543221 222 566665444443333445566654 1112222332 332111 000 Q ss_pred c-cccccccccccCcccc---eEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccc Q lcl|NC_019916. 203 S-VPTLEVAEHSAQFGFP---MIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFD 273 (513) Q Consensus 203 ~-~~~~~~~~~~~~g~vP---vv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~ 273 (513) . ..+.... ....+| |+++.. -.+|.|.|..++..+..++....-........+.-..+++........ T Consensus 215 ~~~~~~~~~---~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~ 291 (533) T protein:vir:34 215 MPQKWTWIP---RELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSA 291 (533) T ss_pred cccccceee---eeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccc Confidence 0 0000000 011223 455433 347999999988777666554433332222222222223321111000 Q ss_pred cccccccccch-hhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 274 DSTLLQMVDPS-DADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKF 352 (513) Q Consensus 274 ~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 352 (513) . ...... ..+....+......... ......+.+.+|.. ....++.+++|++..-+..++..+...+.+.|..- T Consensus 292 ~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~pG~i-~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAag 365 (533) T protein:vir:34 292 M----DFILGANSQEQRERLTGWIGEIAA-YYAAAPVRLGGAKV-PHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAG 365 (533) T ss_pred c----ccccCCCcccccccccccchhhhh-ccCcceeeccCcee-eecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhh Confidence 0 000000 00000001000000000 00112223333322 22345677899988888889999999999999999 Q ss_pred hCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccccccc------cc-----ceee Q lcl|NC_019916. 353 SHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ-RYTVVAHIEERVNGKWDI------DP-----DEIG 420 (513) Q Consensus 353 s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~-~~~li~~~l~~~~~~~~~------~~-----~~i~ 420 (513) .++|-.....--++.|-.++|..+......+...+..|...+.+ +++..+...- ..+..+. ++ ..+. T Consensus 366 lGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ai-l~G~i~~p~~~~~~~~~~~~~~~~ 444 (533) T protein:vir:34 366 LGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAI-VRRVVTLPSKARFSFQEARSAWGN 444 (533) T ss_pred cCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HcCcccCCCccCCCchhhHHhhhc Confidence 98884332211134566678887777777777777777665433 3333332211 2222111 11 1235 Q ss_pred EEeCCCC--CcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCC Q lcl|NC_019916. 421 FIFRDNL--PTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPED 496 (513) Q Consensus 421 i~f~~~~--p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (513) +.|..+- -.|....+++.... +|+.|.+..+...+ .|+++.++++.+|++...+.--.. +.........+ T Consensus 445 ~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G--~D~~ev~~q~a~e~~~~~~~gl~~----~~~~~~~~~s~ 518 (533) T protein:vir:34 445 CDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRETMERRAAGLKP----PAWAAAAFESG 518 (533) T ss_pred eeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHhcCCCC----CCCCCcCccCC Confidence 6774333 34666667766654 78999999999997 489999999998887654432111 11000000000 Q ss_pred CCCCCCCCCCCCccCC Q lcl|NC_019916. 497 EGVRGQQGEPEDERTS 512 (513) Q Consensus 497 ~~~~~~~~~~~~~~~~ 512 (513) ...+.+.+.++++-. T Consensus 519 -~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 519 -LRQSTEEEKSDSRAA 533 (533) T ss_pred -CCCCCCCCcccCCCC Confidence 000001111111111 No 101 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.39 E-value=5.9e-12 Score=82.17 Aligned_cols=467 Identities=9% Similarity=0.028 Sum_probs=195.9 Q ss_pred cccCCHHHHHHHHHHHHHH-------HHHHHHHH-HHHhcCCCccccccccccC----CCCCCcceeecchhHHHHHHHH Q lcl|NC_019916. 15 ADKLTPTRIAAFIRHHYNN-------QRPRLEML-YDYYRGQNDGILSPASRRN----EKGKADHRAVHSFARYIADFQT 82 (513) Q Consensus 15 ~~~~~~~~i~~~i~~~~~~-------~~~~~~~~-~~YY~G~~~i~~~~~~~~~----~~~~~~~ri~~n~~~~ivd~~~ 82 (513) ..+.+.+.+.+++..+... +....+.+ .+||.|.|=- ........ ..++| .+.+|.++.+|+..+ T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~-~~~~~~l~~~~q~~~rP--~~~~N~i~~~i~~v~ 77 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWE-GATAAGTKLDEQFEKYP--KFEINKVATELNRII 77 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCC-HHHHHHHHhhhhhcCCC--ceEEcchHHHHHHHH Confidence 2333445555555443211 11111122 3689997611 00000110 11111 367899999999999 Q ss_pred HHhhcCCeee--cCC---cHH--------HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec---CCC------ceeE Q lcl|NC_019916. 83 SYSVGNAIAM--SGP---SSD--------RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD---PSQ------KGEV 140 (513) Q Consensus 83 ~~l~g~p~~~--~~~---~~~--------~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d---~~~------~~~~ 140 (513) ++---+.+.+ ... .+. .++.+.+.++++...+.+..+++++|.||+-+..| +++ ...+ T Consensus 78 g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i 157 (708) T protein:vir:17 78 AEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAI 157 (708) T ss_pred hhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccce Confidence 9987666554 222 111 25566778999999999999999999999766432 221 2222 Q ss_pred EEEEcc-cceEEEecCCCCc-ceE----EEEEEEeec-----------------------ccccccceeEEEEEEEcC-- Q lcl|NC_019916. 141 SVKLDP-MECFIIYDRSVNP-KPI----MAVRYHAVQ-----------------------TVVDNITQTKYEVETWTE-- 189 (513) Q Consensus 141 ~~~~~p-~~~~~~~d~~~~~-~~~----~~ir~~~~~-----------------------~~~~~~~~~~~~ve~yt~-- 189 (513) ....+| .+++ ||+.... .+. .+++.|... ..+-.....+..+++|.. T Consensus 158 ~~~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~ 235 (708) T protein:vir:17 158 EPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVRK 235 (708) T ss_pred Eeeccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEEee Confidence 211233 3443 4443210 000 011111000 000000112222333321 Q ss_pred -------------CcEEEEEeec------------------------------cCCccccccccccccCcccceEEecCC Q lcl|NC_019916. 190 -------------NDYTRYKPIV------------------------------VAGSVPTLEVAEHSAQFGFPMIEYRNN 226 (513) Q Consensus 190 -------------~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~~g~vPvv~~~n~ 226 (513) +.++.|.... ..+. ...+...+.+++.+|+|+|.-. T Consensus 236 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~-~~l~~~~~~p~~~fP~vP~~g~ 314 (708) T protein:vir:17 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGD-GFLEKPRRIPGEHIPLIPVYGK 314 (708) T ss_pred eeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeeccc-ccccCCCCCCCCccceEEEecc Confidence 0111110000 0000 1122334556778898887532 Q ss_pred ---CCC----CcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 227 ---EYR----QGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 227 ---~~~----~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) ..| .|-+.++++.|+.+|..+|.+...+-.......++.-..... ....-.+.+ ........ T Consensus 315 r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g---------~~~~~~~~~--~~~~~~~~ 383 (708) T protein:vir:17 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRG---------LEKHWEARN--KKRPAFLP 383 (708) T ss_pred cccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhh---------hHHhhhhcc--cchhhhhh Confidence 122 366779999999999999998876644433222111100000 000000000 00000000 Q ss_pred hhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHH Q lcl|NC_019916. 300 LEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGT 379 (513) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l 379 (513) +.... +.+-.+..+ ....+.++ ..+++ .++...++.....|-..|++-+...+. .+|.||+||...-..- T Consensus 384 ~~~~~-~~~g~v~~~-----a~~~~~~~--~~~~~-~~~~~llq~~~~~i~~~tGi~d~~~G~-~sn~SG~Ai~~rq~qg 453 (708) T protein:vir:17 384 LREVR-DKYGNIIAG-----ATPAGYTQ--PAVMN-QALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRA 453 (708) T ss_pred hhccC-Ccccccccc-----cCCcccCC--Ccccc-HHHHHHHHHHHHHHHHhcCCChHHccC-ccchHHHHHHHHHHHH Confidence 00000 000000010 11111111 12233 566778999999999999988877775 5679999999877666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccc----------------------ce-------eeEEeC Q lcl|NC_019916. 380 VELASTKRKQFERGLNQRYTVVAHIEERVNGKWD------IDP----------------------DE-------IGFIFR 424 (513) Q Consensus 380 ~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~------~~~----------------------~~-------i~i~f~ 424 (513) .......-..+..+.++..+++++++...-+... .+. ++ |.|.=. T Consensus 454 ~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~ 533 (708) T protein:vir:17 454 DMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVG 533 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecc Confidence 6666666667777777777777666554321100 000 01 111112 Q ss_pred CCCCcCHHHHHHHHHHHhcCCCHH---H------HHHhCCCCCCHHHHHHHHHHHHHH---------------------H Q lcl|NC_019916. 425 DNLPTDDVAIITALVQAGAQIPQE---Y------LYQYLPNVTDADEIVKMMDKQRKA---------------------M 474 (513) Q Consensus 425 ~~~p~d~~e~a~~~~kl~g~iS~e---t------~~~~l~~v~D~~~E~~ri~~E~~~---------------------~ 474 (513) +..+.-..+..+.++++.+.++.. + +++.+++ ...++-.++|.+.... . T Consensus 534 p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~-p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~ 612 (708) T protein:vir:17 534 PSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAA 612 (708) T ss_pred cCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCC-CChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHH Confidence 222222334444555543322211 1 2333322 2223333343322110 0 Q ss_pred HHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 475 LKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +...+...............+-..... ......-| T Consensus 613 q~q~~~~~~eaqa~~~~~qAe~~ka~a----ea~~~q~~ 647 (708) T protein:vir:17 613 QSQPNPEMVLAQAQMVAAQAEAQKATN----ETAQTQIK 647 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHH Confidence 000000000000000000000000000 00000000 No 102 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.37 E-value=1.6e-11 Score=79.82 Aligned_cols=460 Identities=10% Similarity=0.057 Sum_probs=204.6 Q ss_pred CCcccCCHHHHHHHHHHHHH------HHHHHHHHHHHHh--cCCCcccccc---ccccCCCCCCcceeecchhHHHHHHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYN------NQRPRLEMLYDYY--RGQNDGILSP---ASRRNEKGKADHRAVHSFARYIADFQ 81 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~------~~~~~~~~~~~YY--~G~~~i~~~~---~~~~~~~~~~~~ri~~n~~~~ivd~~ 81 (513) |. +-+.+++.+++..+.. ..+.+...-.+|| .|.|=...-. .......+++ .+.+|.++.+|+.. T Consensus 1 m~--e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP--~~~~N~i~~~v~~v 76 (706) T protein:vir:10 1 MA--ESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYP--KFEINKVATELNRI 76 (706) T ss_pred CC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCC--ceEecchHHHHHHH Confidence 33 2234455555554432 2223334444566 4654110000 0010111222 57899999999999 Q ss_pred HHHhhcCCeeec--C---CcH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC-------CCceeEE Q lcl|NC_019916. 82 TSYSVGNAIAMS--G---PSS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP-------SQKGEVS 141 (513) Q Consensus 82 ~~~l~g~p~~~~--~---~~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~-------~~~~~~~ 141 (513) +++.--+.+.+. . .++ ..++.+.+.++++...+.+..+++++|.||+-+..|- .++..+. T Consensus 77 ~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~ 156 (706) T protein:vir:10 77 ISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIA 156 (706) T ss_pred hhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccce Confidence 999887766542 1 111 1255667789999999999999999999998886541 1222222 Q ss_pred EE--EcccceEEEecCCC------CcceEEEEEEEeecc------------c----------ccccceeEEEEEEEcCCc Q lcl|NC_019916. 142 VK--LDPMECFIIYDRSV------NPKPIMAVRYHAVQT------------V----------VDNITQTKYEVETWTEND 191 (513) Q Consensus 142 ~~--~~p~~~~~~~d~~~------~~~~~~~ir~~~~~~------------~----------~~~~~~~~~~ve~yt~~~ 191 (513) +. .+|.+. +.||+.. +.+-++ ++.|...+ . +......+...+.|+... T Consensus 157 i~~v~~p~~~-v~~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~ 234 (706) T protein:vir:10 157 VEPIYDPARS-VWFDPDAKKYDKSDALWAF-CMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVRK 234 (706) T ss_pred eeeeccchhc-eecCchhcccChhhcceEe-eeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccccc Confidence 21 255542 2344421 111111 11111100 0 000011112222333211 Q ss_pred E----EEEEeeccC-------------------Ccc---------------------ccccccccccCcccceEEecCCC Q lcl|NC_019916. 192 Y----TRYKPIVVA-------------------GSV---------------------PTLEVAEHSAQFGFPMIEYRNNE 227 (513) Q Consensus 192 ~----~~~~~~~~~-------------------~~~---------------------~~~~~~~~~~~g~vPvv~~~n~~ 227 (513) . .+|+....+ +.. .......+.+.+.||+|+|.-.+ T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r 314 (706) T protein:vir:10 235 ESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKR 314 (706) T ss_pred eeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeecc Confidence 1 111110000 000 01112233445889999875432 Q ss_pred -------CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhh-ccccccch- Q lcl|NC_019916. 228 -------YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMK-KLADEKMA- 298 (513) Q Consensus 228 -------~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~- 298 (513) ...|.+.++++.|+.+|..+|.+.+.+..... ..-.|...... .....+ ........ T Consensus 315 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~--~~~~~~~~~i~------------~~~~~~~~~~~~~~~~ 380 (706) T protein:vir:10 315 WFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPG--QTPIVDMEQIR------------GLEQHWEGRNRKRPAF 380 (706) T ss_pred ccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCC--cccccchhHHH------------HHHHHhhhcccccccc Confidence 24577889999999999999999887633222 11112110000 000000 00000000 Q ss_pred -hhhcchh-cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHH Q lcl|NC_019916. 299 -QLEAMRQ-ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKV 376 (513) Q Consensus 299 -~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~ 376 (513) .+..+.. .+.+. .......++..+.-..++...++.....|..+|++-+.+.+. .+|.||+||...- T Consensus 381 l~~~~~~~~~g~i~----------~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~-~sn~SG~Ai~~rq 449 (706) T protein:vir:10 381 LPLRTVTDKTGNVV----------APANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQM-PSNVARETVNSLL 449 (706) T ss_pred hhcccccCCCCccc----------ccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCC-ccchHHHHHHHHH Confidence 0000000 00000 001122222232334556777888889999999998877664 4679999999887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------cccc------------------c-------eeeE Q lcl|NC_019916. 377 LGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKW----------DIDP------------------D-------EIGF 421 (513) Q Consensus 377 ~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~----------~~~~------------------~-------~i~i 421 (513) ..........-..|..+.+++.+++++++...-... ..++ + +|.| T Consensus 450 ~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i 529 (706) T protein:vir:10 450 NRSDMASFIYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSV 529 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEE Confidence 777777777788888888888887777765432110 0000 0 1112 Q ss_pred EeCCCCCcCHHHHHHHHHHHhcC-CCH--HH------HHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCC Q lcl|NC_019916. 422 IFRDNLPTDDVAIITALVQAGAQ-IPQ--EY------LYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGN 492 (513) Q Consensus 422 ~f~~~~p~d~~e~a~~~~kl~g~-iS~--et------~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 492 (513) .=.+..+.-..+..+.++.+.+. .+. .+ +++.+.+ +..++-.+++++..... ....+..+. T Consensus 530 ~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~-p~~~e~~e~irk~~~~q---------~~~~~~~~~ 599 (706) T protein:vir:10 530 DVGPSYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEG-EGLDDFKAFNRRQLLTQ---------GIVKPRNQQ 599 (706) T ss_pred ecccCcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCc-cchHHHHHHHHHhhccc---------CCccccchh Confidence 11333444345555566655332 221 12 2333322 22223344443321100 000000000 Q ss_pred CCCC----------------CCCCCCCCCCCCc--cCCC Q lcl|NC_019916. 493 DPED----------------EGVRGQQGEPEDE--RTSD 513 (513) Q Consensus 493 ~~~~----------------~~~~~~~~~~~~~--~~~~ 513 (513) .... .....+-...+.+ +... T Consensus 600 eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~a 638 (706) T protein:vir:10 600 EQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQN 638 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0000000000000 0000 No 103 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.37 E-value=5.2e-11 Score=76.97 Aligned_cols=475 Identities=10% Similarity=-0.032 Sum_probs=217.7 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccccccccC-C---CCCCc-------- Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQ--NDGILSPASRRN-E---KGKAD-------- 66 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~--~~i~~~~~~~~~-~---~~~~~-------- 66 (513) |++.......+... ..-.+ +...-...|+|- +.-....+.... . ..... T Consensus 1 m~~~~~r~~~~~a~---------------~~~~~--~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~Ra 63 (553) T protein:vir:63 1 MTKVTVRKLSEVTS---------------GRPEQ--SASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARG 63 (553) T ss_pred Ccchhhhhhccccc---------------ccchh--hhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHH Confidence 32222222111111 10011 111111235542 211111110000 0 00000 Q ss_pred cee--ecchhHHHHHHHHHHhhcCCeeecCCc------------H----HHHHHHHHh--------------cCHHHHHH Q lcl|NC_019916. 67 HRA--VHSFARYIADFQTSYSVGNAIAMSGPS------------S----DRLDDFNRR--------------NDIDTLNY 114 (513) Q Consensus 67 ~ri--~~n~~~~ivd~~~~~l~g~p~~~~~~~------------~----~~l~~~~~~--------------n~~~~~~~ 114 (513) +.+ .++|++-+|+..+.+++|.+++..... . +.++..|+. .+|...+. T Consensus 64 RdL~rNn~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~ 143 (553) T protein:vir:63 64 RDMADNDGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIR 143 (553) T ss_pred HHHHhcChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHH Confidence 011 358999999999999999999875421 1 123333332 14667788 Q ss_pred HHHHHHhhCCeEEEEeeecCCCcee--EEE-EEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCc Q lcl|NC_019916. 115 ELYLDMTVTGRAYEYVYRDPSQKGE--VSV-KLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTEND 191 (513) Q Consensus 115 ~~~~~a~~~G~~~~~v~~d~~~~~~--~~~-~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~ 191 (513) .+++..+..|.+|+...+.++.... +.+ .++|..+-..++.........+|.+- ..+.-..| +++.... T Consensus 144 l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d------~~Gr~vaY--~i~~~hP 215 (553) T protein:vir:63 144 LGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYD------KRGRPQGY--WIQVAHP 215 (553) T ss_pred HHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEEC------CCCceEEE--EeeccCC Confidence 8999999999999877665543221 122 25776665555444445566676541 12222233 3443222 Q ss_pred EEEEEeeccCCccccccccccccCcccc---eEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_019916. 192 YTRYKPIVVAGSVPTLEVAEHSAQFGFP---MIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLV 263 (513) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~ 263 (513) --.+........+.... .+..|| |+++- .-.+|.|.|..++..+-.++....--.......+.-..+ T Consensus 216 gd~~~~~~~~~~~~r~~-----~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~f 290 (553) T protein:vir:63 216 GDLYQMAPDMYKWKFVQ-----QSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAA 290 (553) T ss_pred Cccccccccccceeeec-----cccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheee Confidence 11110110000000000 011222 34432 235799999998877766665444433333222222223 Q ss_pred eecCcccccccccccccccchhhhhhhccccccchhh-hcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHH Q lcl|NC_019916. 264 IKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQL-EAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYK 342 (513) Q Consensus 264 ~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 342 (513) ++...................... .......... ........+.+.+|.. .....+.++++++..-+..++..+. T Consensus 291 i~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L~pGe~i~~~~p~~p~~~~~~F~ 366 (553) T protein:vir:63 291 IESELPPEFIHSQMSGGSPNADMV---GIFGKYMDALKAYVGGANNIQIDGAKI-PHLFPGTKLNLKPMGTPGGVGSEFE 366 (553) T ss_pred eecCCChhhhhhhccccccccccc---ccccccccccccccccccceeecCcee-eecCCCCeeeecCCCCCCCCHHHHH Confidence 332111100000000000000000 0000000000 0001111222333222 2234567889988888888999999 Q ss_pred HHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccccccc------- Q lcl|NC_019916. 343 KRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ-RYTVVAHIEERVNGKWDI------- 414 (513) Q Consensus 343 ~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~-~~~li~~~l~~~~~~~~~------- 414 (513) +.+.+.|.+-.++|-...-.--+++|-.+.|..+......+...+..|...+.+ +++..+...-. .+..+. T Consensus 367 ~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l-~G~i~~p~~~~~~ 445 (553) T protein:vir:63 367 ASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIA-AGEVPMPPGQTRD 445 (553) T ss_pred HHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-cCCccCCCcccch Confidence 999999999998884322111134555677877777777777777777665544 44443332222 221110 Q ss_pred -------ccceeeEEeCCCCC--cCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcC Q lcl|NC_019916. 415 -------DPDEIGFIFRDNLP--TDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGG 483 (513) Q Consensus 415 -------~~~~i~i~f~~~~p--~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~ 483 (513) ....+.+.|..+-. .|....+++.... +|+.|.+..+...+ .|+++.++++.+|.+...+.--.... T Consensus 446 ~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~ 523 (553) T protein:vir:63 446 LFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLG--GDFRKSFAQRAREDALLKKYGLTFNL 523 (553) T ss_pred hhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCCCC Confidence 01124567754444 3666667666554 78999999999997 48999999999887765443211111 Q ss_pred CCC-CCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 484 LII-NGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 484 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ... ....+.+.+..+.+.....++++.. + T Consensus 524 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-e 553 (553) T protein:vir:63 524 SAKRSLGDGRDAATGIAEDPAAAQTSQQG-E 553 (553) T ss_pred CCccccCCCcccCCCCCCCCCCCCccccc-C Confidence 100 0011111111111111111111111 1 No 104 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.36 E-value=6.8e-12 Score=81.82 Aligned_cols=396 Identities=11% Similarity=0.059 Sum_probs=182.6 Q ss_pred HHHHHHHHHHhcCC-C--ccccccccccCCCCCCcce-----eecchhHHHHHHHHHHhhcCCeeecCCc--H---HHHH Q lcl|NC_019916. 35 RPRLEMLYDYYRGQ-N--DGILSPASRRNEKGKADHR-----AVHSFARYIADFQTSYSVGNAIAMSGPS--S---DRLD 101 (513) Q Consensus 35 ~~~~~~~~~YY~G~-~--~i~~~~~~~~~~~~~~~~r-----i~~n~~~~ivd~~~~~l~g~p~~~~~~~--~---~~l~ 101 (513) ....+-|...--|- + +........ ........ -.+.+++.+|+..+.-++-+++.+++++ . +.++ T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~--~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~~~ 78 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSL--SLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDLFT 78 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCc--cccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHHHH Confidence 11111111111110 0 000000000 00000000 1368889999999999999999997753 2 2477 Q ss_pred HHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC---------ceeEEEEEcccceEEEe-cCCCCcceEEE-EEEEee Q lcl|NC_019916. 102 DFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ---------KGEVSVKLDPMECFIIY-DRSVNPKPIMA-VRYHAV 170 (513) Q Consensus 102 ~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~---------~~~~~~~~~p~~~~~~~-d~~~~~~~~~~-ir~~~~ 170 (513) ..|+.-++.....++.+.+-.+|.|++++-.+..+ .......+++..+.|.. .+.....+-++ ..+|.+ T Consensus 79 ~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~v 158 (437) T protein:vir:52 79 KFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYSI 158 (437) T ss_pred HHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEEE Confidence 77877788999999999999999999988775321 11111112222221110 00000000000 001111 Q ss_pred cccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_019916. 171 QTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDT 250 (513) Q Consensus 171 ~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~ 250 (513) ... . ....+.+.++++|.. ..+| ...++-.|.|.++.+.+-+..++++.-.. T Consensus 159 ~~~----~----~~~~iH~SRii~~~~------------------~~~~--~~~~~~~G~s~le~~~~~i~~~~~~~~~~ 210 (437) T protein:vir:52 159 LGG----S----QSITVHHSRLIILNA------------------NDAP--LSDNDIWGVSDLEKIIDVLKRFDSASVNV 210 (437) T ss_pred ecC----C----cceeEccceeEEecC------------------ccCC--CccccccCCchHHHHHHHHHHHHHHHHHH Confidence 000 0 000011111211110 0012 11234458999999999998888888777 Q ss_pred HHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCCceeEE Q lcl|NC_019916. 251 ANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSADANYI 329 (513) Q Consensus 251 ~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l 329 (513) +..+..+..+.+.+.|......... ... .......+..++. .+++.++ .+.+|- T Consensus 211 ~~l~~~~~~~v~k~~~l~~~l~~~~----------~~~----~~~~~~~~~~~~~~~~~~~~d-----------~~~~~e 265 (437) T protein:vir:52 211 GDLIFESKIDIFKIAGLSDKIAAGM----------ENE----VASVISAVQEIKSATNSLLLD-----------AENEYD 265 (437) T ss_pred HHHHHHcCCCceecchHHHHhcCCc----------HHH----HHHHHHHHHHhcCCCceEEEc-----------CCcceE Confidence 7766666666555555321111100 000 0001111222222 2233321 223455 Q ss_pred eecCCHHHHHHHHHHHHHHHHHHhCcccccc-cc-ccccccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 330 HKEYDSAGTELYKKRLAADIHKFSHTPDLTD-DN-FSGNSSGVAMKYKVLGTVELASTKR-KQFERGLNQRYTVVAHIEE 406 (513) Q Consensus 330 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~-~~~n~Sg~Ai~~~~~~l~~k~~~~~-~~f~~~l~~~~~li~~~l~ 406 (513) +.+.+.++....++...+.|+..+++|-.-+ +. .+|=.||..=..-|. ..++..| ..+...+++++++++.-. T Consensus 266 ~~~~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yy---d~i~~~Qe~~l~p~le~l~~~i~~~~- 341 (437) T protein:vir:52 266 RKELTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYH---EAIRRLQETRLRPIFEIIDPLICNEL- 341 (437) T ss_pred EEecCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh- Confidence 5567778888999999999999999996443 22 222245554333333 2333333 567888888888765321 Q ss_pred hcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh---------cCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 407 RVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG---------AQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 407 ~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~---------g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 477 (513) . +..+ .+++++|++-...+..+.|++..+.+ |++|.+.+.+. +.+. -. T Consensus 342 -~-g~~~---~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~-------------L~~~-----g~ 398 (437) T protein:vir:52 342 -F-GGLP---ADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANE-------------LRES-----GL 398 (437) T ss_pred -c-CCCC---CcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHH-------------HHhc-----CC Confidence 1 2111 25789999999899999988765532 34444333222 2110 00 Q ss_pred hhhhcCC----CCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 478 YDTKGGL----IINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 478 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) +...... ..+.....+..+++.......+++.+.+ T Consensus 399 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 399 FANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred CCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 0000000 0000000111111111111122222222 No 105 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.34 E-value=8.6e-12 Score=81.26 Aligned_cols=410 Identities=11% Similarity=0.049 Sum_probs=184.6 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc-CCCCC-Cccee--ecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRR-NEKGK-ADHRA--VHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~-~~~~~-~~~ri--~~n~~~~ 76 (513) |..-++... +.+-+...+.+.-.. .++.... ..... ....+ .+.+++. T Consensus 5 m~~~~~~~~---------------------------~~D~~~~~~~~~~g~-~~~~~~~~~~~~~~~l~~~Y~~~~l~~~ 56 (435) T protein:vir:79 5 MSDKVKAIT---------------------------KEDGYNEIFGSKDGT-FRPNAFYMQRAAFKALSQFYEEDGMARR 56 (435) T ss_pred cccccccch---------------------------hhcchhhhhcccccc-cccCcccCCcCCHHHHHHHHhcCchhhh Confidence 221111100 111111112221110 0000000 00000 00111 3678899 Q ss_pred HHHHHHHHhhcCCeeecCCc-HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccc---eEEE Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPS-SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPME---CFII 152 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~-~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~---~~~~ 152 (513) +|+..+.-++.+++.+++++ .+.++..|+.-++.....++.+.+..+|.|++++-..++.. ..-++.+.. .+.+ T Consensus 57 ~Vd~~aed~~r~g~~i~g~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~--~~~Pl~~~g~i~~i~v 134 (435) T protein:vir:79 57 IVDVIPEEMVTPGFKVDGVKNEKSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKM--LKSPVKPGAQLEDIRV 134 (435) T ss_pred hhccchHHhhcCCceecCCChHHHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCC--cccccccCCceeeEEe Confidence 99999999999999998764 46678888877888999999999999999998887643321 111222221 1222 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCccc-----ceEE---ec Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGF-----PMIE---YR 224 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v-----Pvv~---~~ 224 (513) +|..- +.... +...........-..|.| ...++. ....-|+--.| |+-. .. T Consensus 135 ~d~~~---i~~~~-~~~dp~sp~fg~P~~y~v-------------~~~~~~----~~~~iH~SRli~~~g~~~p~~~~~~ 193 (435) T protein:vir:79 135 YDRYQ---ITIHE-RETNARSVRYGEPKLYKI-------------SPGGDI----PEFFVHYSRICIIDGERVSNEKRRQ 193 (435) T ss_pred echhh---ccchh-hccCCcccccCcceEEEE-------------ecCCCC----CceEEcceeEEEecCCcchhhhccc Confidence 22210 00000 000000000000001111 000000 00011211111 1111 12 Q ss_pred CCCCCCcch-hHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc Q lcl|NC_019916. 225 NNEYRQGDF-ENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM 303 (513) Q Consensus 225 n~~~~~sd~-e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 303 (513) ++-.|.|.+ +.+.+-+..++++....+..+..+..+.+.+.|.......+... .... .+...+... T Consensus 194 ~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~--------~~~~-----~r~~~~~~~ 260 (435) T protein:vir:79 194 NDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGR--------YAAR-----LRLAQVDDE 260 (435) T ss_pred cCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccch--------HHHH-----HHHHHHHHh Confidence 334467765 67778777888877777776666665555554432211111000 0000 001111111 Q ss_pred hh-cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccc-cc-cccc-cccHHHHHHHHHHH Q lcl|NC_019916. 304 RQ-ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLT-DD-NFSG-NSSGVAMKYKVLGT 379 (513) Q Consensus 304 ~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~-~~~~-n~Sg~Ai~~~~~~l 379 (513) +. .+.+.+ .+.+-+|-..+.+.++....++...+.|+..+++|-.- ++ ..+| |.||..-..-|... T Consensus 261 ~~~~~~~~i----------~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~ 330 (435) T protein:vir:79 261 SGVGKAIGI----------DATDEEYEVLNSDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKL 330 (435) T ss_pred cCCCCceeE----------ecCCcceEEEecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHH Confidence 11 122222 12223444556777889999999999999999999643 22 2233 46676544444433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCC Q lcl|NC_019916. 380 VELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTD 459 (513) Q Consensus 380 ~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D 459 (513) +.. .++..+...+++++++++. . .++.++|++-...+..|.|++..+.+...+. +++ .+. -+ T Consensus 331 i~~--~Qe~~l~p~l~~l~~li~~-----s-------~d~~~~f~pL~~~sekEkAei~~~~a~a~~~--~~~-~g~-i~ 392 (435) T protein:vir:79 331 IDR--KRVEDYKPILEFLLPFMIS-----E-------TEWSIEFEPLSVPSDKDKAEIMAKNVESVVK--LKA-EQA-IN 392 (435) T ss_pred HHH--HHHHHHHHHHHHHHHHhhc-----C-------CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHh-cCC-CC Confidence 322 3356678888887777541 1 2568999999999999999887665332211 111 222 23 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 460 ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 460 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) +++..+.+.. .....+.......+-+..++...+...+.++++ T Consensus 393 ~~e~r~~L~~--------~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 393 LKETRDTLRS--------ICPDLKIMDNDNIELPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred HHHHHHHHHH--------hccccCCCCcccccCCccccCCCCCCCCCCCCC Confidence 3333332211 010111111110110111111111122222222 No 106 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.29 E-value=2.1e-11 Score=79.15 Aligned_cols=406 Identities=13% Similarity=0.103 Sum_probs=181.4 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcce--eecchhHHHHHHHHHHhhcCCeeecCCc-HHHHHHH Q lcl|NC_019916. 27 IRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHR--AVHSFARYIADFQTSYSVGNAIAMSGPS-SDRLDDF 103 (513) Q Consensus 27 i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~r--i~~n~~~~ivd~~~~~l~g~p~~~~~~~-~~~l~~~ 103 (513) +..+. .+-|.+..-|.++-..... .....+..... -.+.+++.+|+..+.-++.+++.+++++ ++.++.. T Consensus 1 ~~~~~------~d~~~~~~~~~~~~~~~~~-~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~~~~~~~ 73 (427) T protein:vir:10 1 MKIVK------HDGYNDIFNGGADGSPKPF-FMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKDEKEFKSL 73 (427) T ss_pred CCccc------cchHHHHhhcCCCCcccCc-cccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccHHHHHHHH Confidence 11111 0111111222211111110 00000000001 1367788999999999999999998864 4567778 Q ss_pred HHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccc---eEEEecCCCCcceEEEEEEEeeccccccccee Q lcl|NC_019916. 104 NRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPME---CFIIYDRSVNPKPIMAVRYHAVQTVVDNITQT 180 (513) Q Consensus 104 ~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~---~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~ 180 (513) |+.-++.....++.+.+..+|.|++++-++.+.. ...++.+.. .+.++|... +... .+.......+..... T Consensus 74 ~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~--l~~p~~~~g~l~~l~v~d~~~---~~~~-~~~~dp~s~~fg~P~ 147 (427) T protein:vir:10 74 WDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRM--LTSQAKPGAKLEGVRVYDRFA---ITVE-KRVTNARSPRYGEPE 147 (427) T ss_pred HHHhhHHHHHHHHHHhccccceeEEEEEecCCCc--cccccCCCcceeEEEEechhc---cccc-ccccCccccccCcce Confidence 8888899999999999999999999887653321 111111111 112222110 0000 000000000000111 Q ss_pred EEEEEEEcCCcEEEEEeeccCCccccccccccccCc-----ccceEE---ecCCCCCCcchhH-HHHHHHHHHHHHHHHH Q lcl|NC_019916. 181 KYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQF-----GFPMIE---YRNNEYRQGDFEN-VLSLIDLYDVAQSDTA 251 (513) Q Consensus 181 ~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g-----~vPvv~---~~n~~~~~sd~e~-v~~liD~~~~~~S~~~ 251 (513) .|.| ...++. ....-|+-. +-|+.. ..++-.|.|.+.. +.+-+..++++.-..+ T Consensus 148 ~y~v-------------~~~~~~----~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~ 210 (427) T protein:vir:10 148 IYKV-------------SPGDNM----QPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLAT 210 (427) T ss_pred EEEE-------------ecCCCC----cceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHH Confidence 1111 000000 000112111 111111 1233457777754 6676777777777766 Q ss_pred HHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCCceeEEe Q lcl|NC_019916. 252 NYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSADANYIH 330 (513) Q Consensus 252 ~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~ 330 (513) ..+..+....+.+.|........... .... . +...+...+ ..+.+.+ .+.+-+|-+ T Consensus 211 ~l~~k~~~~v~k~~~l~~~~~~~~~~--------~~~~--~---r~~~~~~~~~~~~~~~l----------~~~~e~~e~ 267 (427) T protein:vir:10 211 QILRRKQQAVWKVKGLAEMCDDDDAQ--------YAAR--L---RLAQVDDNSGVGRAIGI----------DAETEEYDV 267 (427) T ss_pred HHHHHhccccccchhHHHHhcCccch--------HHHH--H---HHHHHHHhcCcccceee----------ecCCCceeE Confidence 66666666665555543211111000 0000 0 000011111 1111221 112234555 Q ss_pred ecCCHHHHHHHHHHHHHHHHHHhCcccccc-c-cccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 331 KEYDSAGTELYKKRLAADIHKFSHTPDLTD-D-NFSG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEER 407 (513) Q Consensus 331 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~-~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~ 407 (513) .+.+.++....++...+.|+..+++|-.-+ + ..+| |.||..=..-|...+. ..++..+.+.+++++++++. T Consensus 268 ~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~--~~Qe~~l~p~l~~l~~~i~~---- 341 (427) T protein:vir:10 268 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLLEFLLPFIVD---- 341 (427) T ss_pred EecccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhc---- Confidence 567788899999999999999999996432 2 2222 5666753333333332 23345688888888777541 Q ss_pred cccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCC Q lcl|NC_019916. 408 VNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIIN 487 (513) Q Consensus 408 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 487 (513) . .+++++|++-...+..|.|+...+.+...+. +++ . ++-++++..+.+...- ....+. + T Consensus 342 -s-------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~--~~~-~-gvi~~~e~r~~L~~~~-----~~~~~~----~ 400 (427) T protein:vir:10 342 -E-------EEWSIEFEPLSVPSKKEESEITKNNVESVTK--AIT-E-QIIDLEEARDTLRSIA-----PEFKLK----D 400 (427) T ss_pred -C-------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHh-c-CCCCHHHHHHHHHhhh-----ccccCC----C Confidence 1 2578999999999999999876664322111 111 1 1223333333332210 001110 0 Q ss_pred CCCCCCCCCCCCCCCCCCC--CCccCCC Q lcl|NC_019916. 488 GTSGNDPEDEGVRGQQGEP--EDERTSD 513 (513) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~--~~~~~~~ 513 (513) ..+. +.++.+..++...+ +++..+| T Consensus 401 ~~~~-~~e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 401 GNNI-NIREPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred Cccc-cccccchhcCCCCCCCCCCCCCC Confidence 0011 11111111111111 2222222 No 107 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.28 E-value=6.9e-11 Score=76.31 Aligned_cols=399 Identities=13% Similarity=0.103 Sum_probs=184.2 Q ss_pred HHHHHHHHHHhcCCCccccccccccCCCCCCc------ceeecchhHHHHHHHHHHhhcCCeeecCCcH-HHHHHHHHhc Q lcl|NC_019916. 35 RPRLEMLYDYYRGQNDGILSPASRRNEKGKAD------HRAVHSFARYIADFQTSYSVGNAIAMSGPSS-DRLDDFNRRN 107 (513) Q Consensus 35 ~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~------~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~-~~l~~~~~~n 107 (513) ..+.+-+...+-|-++- ..........+ .=-.+.+++.+|+..+.-++-+++.++++++ .+++.-|+.- T Consensus 1 ~~~~D~~~n~~~gg~~~----~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~~~~~~~~~~l 76 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDG----SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDL 76 (422) T ss_pred CccchhhHHHHcCCCCC----ccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHHHHHHHHHHHh Confidence 11112222223332211 00000000000 0013678899999999999999999988765 4666677777 Q ss_pred CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc---ceEEEecCCCCcceEEEEEEEeecccccccceeEEEE Q lcl|NC_019916. 108 DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM---ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEV 184 (513) Q Consensus 108 ~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~---~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~v 184 (513) ++.....++.+.+..+|.|++++-.+++.. ..-++.+. ..+.++|... +....++......+......+.| T Consensus 77 ~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~--~~~Pl~~~g~~~~l~v~d~~~----i~~~~~~~dp~s~~fg~P~~y~v 150 (422) T protein:vir:10 77 EMTQNINDAWSWARLFGGAAIVAIVKDNRA--LTSPVREGAELETVRVYDRTQ----VKVQTREENPRNARFGEPLTYRI 150 (422) T ss_pred hHHHHHHHHHHhhccccceEEEEEecCCCC--ccccccccCceeeEEeecccc----ccchhcccCccccccCcceEEEE Confidence 888999999999999999998887643221 11122211 1122222210 00000111000000011111111 Q ss_pred EEEcCCcEEEEEeeccCCccccccccccccCcc-------cc-eEEecCCCCCCcchhH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 185 ETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFG-------FP-MIEYRNNEYRQGDFEN-VLSLIDLYDVAQSDTANYMT 255 (513) Q Consensus 185 e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-------vP-vv~~~n~~~~~sd~e~-v~~liD~~~~~~S~~~~~~~ 255 (513) .-.+... ...-|+-.. +| +..+.++-.|.|.++. +.+-+..++++.-..+..+. T Consensus 151 ~~~~~~~-----------------~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~ 213 (422) T protein:vir:10 151 TTNESDM-----------------FYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLK 213 (422) T ss_pred ecCCCCc-----------------ceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1000000 000111111 11 1122344457888876 66777778887777776666 Q ss_pred HhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCCceeEEeecCC Q lcl|NC_019916. 256 DLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSADANYIHKEYD 334 (513) Q Consensus 256 ~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 334 (513) .+....+.+.|.......... ...+. .+...+...+. .+.+.+ .+.+-+|-..+.+ T Consensus 214 ~~~~~v~~~~~l~~~~~~~~~------------~~~~~-~r~~~~~~~~~~~~~~~l----------~~~~e~~e~~~~~ 270 (422) T protein:vir:10 214 RKQQAVWKAKGLAELCDDSEG------------FGAAR-LRLAQVDNNSGVGQAIGI----------DAESEEYSVLNSD 270 (422) T ss_pred HhccccccchhHHHhcCCccc------------hHHHH-HHHHHHHHhcCCccceeE----------ecCCcceEEEecc Confidence 666665555543211111000 00000 00000000000 111111 1122345555677 Q ss_pred HHHHHHHHHHHHHHHHHHhCcccccc-c-cccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019916. 335 SAGTELYKKRLAADIHKFSHTPDLTD-D-NFSG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGK 411 (513) Q Consensus 335 ~~~~~~~~~~l~~~i~~~s~~p~~~~-~-~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~ 411 (513) .++....++...+.|+..+++|-.-+ + ..+| |.||..-..-|...+. ..++..++..+++++++++. . T Consensus 271 lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~--~~Qe~~l~p~l~~l~~~i~~-----s-- 341 (422) T protein:vir:10 271 IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVD--RKRNAELLPILEFLIPFIVN-----A-- 341 (422) T ss_pred cCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhcc-----c-- Confidence 88899999999999999999996432 2 2222 3466654433333332 23346678888888887642 1 Q ss_pred cccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCC Q lcl|NC_019916. 412 WDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSG 491 (513) Q Consensus 412 ~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 491 (513) .+++++|++-...+..|.|+...+.+...+ ++++ .+ +-++++..+.+... ....+..... T Consensus 342 -----~~~~~~f~pL~~~sekekaei~~~~a~a~~--~~~~-~g-~i~~~e~r~~L~~~--------~~~~~~~~~~--- 401 (422) T protein:vir:10 342 -----EEWSVEFNPLAQESSKDKAEILEKNVNSIA--ALIA-AG-AMDIDEARDTLRTI--------APEVKINDGS--- 401 (422) T ss_pred -----CCcEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHh-cC-CCCHHHHHHHhhhh--------cccccCCCCC--- Confidence 257899999999999999998766543221 1222 22 22333333333211 0001111100 Q ss_pred CCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 492 NDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .+++.+..+-...|.+..++| T Consensus 402 -~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 402 -VETEVTISETSNDPLEVPTDD 422 (422) T ss_pred -CccccchhhcCCCCCCCCCCC Confidence 001111111011122222222 No 108 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.27 E-value=2.3e-10 Score=73.45 Aligned_cols=443 Identities=11% Similarity=0.021 Sum_probs=220.8 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcc--------------ee--ecchhHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADH--------------RA--VHSFARY 76 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~--------------ri--~~n~~~~ 76 (513) |+ -.+.+...+.-....++.+-.....-|+|-..--..... .....++. .+ .++|++- T Consensus 1 Mn----~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~--~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~ 74 (548) T protein:vir:95 1 MN----LIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAK--RQPLGADTSLQKSAVSMREQCRKLDEDHDLVTG 74 (548) T ss_pred Cc----hHHhHhhhcchHHHHHHHHhHHHhccccccCcccccccc--CCCCChHHHHHHHHHHHHHHHHHHHhcChHHHH Confidence 22 111111112211122222222333446653221100000 00111110 11 3578888 Q ss_pred HHHHHHHHhhcC-CeeecC----CcH-------HHHHHHHHh----------cCHHHHHHHHHHHHhhCCeEEEEeeecC Q lcl|NC_019916. 77 IADFQTSYSVGN-AIAMSG----PSS-------DRLDDFNRR----------NDIDTLNYELYLDMTVTGRAYEYVYRDP 134 (513) Q Consensus 77 ivd~~~~~l~g~-p~~~~~----~~~-------~~l~~~~~~----------n~~~~~~~~~~~~a~~~G~~~~~v~~d~ 134 (513) +|+..+.+++|. ++.+.. .+. +.++..|+. .+|...+..+.+..+..|.+|+...++. T Consensus 75 av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~ 154 (548) T protein:vir:95 75 LLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGR 154 (548) T ss_pred HHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecc Confidence 999999999983 444322 111 223334432 2477888889999999999998887765 Q ss_pred CCc--------eeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcC--CcEEEEEeeccCCcc Q lcl|NC_019916. 135 SQK--------GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTE--NDYTRYKPIVVAGSV 204 (513) Q Consensus 135 ~~~--------~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~--~~~~~~~~~~~~~~~ 204 (513) ... .++. .++|..+-..++.. ...+..+|.+ +..+.-..|+ ++.. ....... . T Consensus 155 ~~~~~~g~~~~~~lq-liepd~l~~~~~~~-~~~i~~GIE~------D~~Grp~aY~--i~~~hPgd~~~~~-----~-- 217 (548) T protein:vir:95 155 VPNYTFATSVPFALE-LLEPDYLPFSYNNL-SKGIVQGIER------DTWRRKRAYH--LLKDHPGNLQTLG-----G-- 217 (548) T ss_pred cccccCCcccceEEE-EechhhcCCCCCCC-CCceeeeeEE------CCCCceEEEE--EeecCCCcccccc-----c-- Confidence 432 1222 25666653333332 2345556543 1122223333 3332 2111100 0 Q ss_pred ccccccccccCcccc---eEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccc Q lcl|NC_019916. 205 PTLEVAEHSAQFGFP---MIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDST 276 (513) Q Consensus 205 ~~~~~~~~~~~g~vP---vv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~ 276 (513) ...+-+|| |+++-. -.+|.|.|..++..+..++....--.....-.+.-..+++......... T Consensus 218 -------~~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~-- 288 (548) T protein:vir:95 218 -------SLAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTV-- 288 (548) T ss_pred -------ccceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccC-- Confidence 01122333 344332 3468999999887776666554443333332232223333221110000 Q ss_pred ccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_019916. 277 LLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTP 356 (513) Q Consensus 277 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 356 (513) ... .......+.+.+|........+-+++|++.+-+..++..+...+.+.|..-.++| T Consensus 289 -----~~~-----------------~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGip 346 (548) T protein:vir:95 289 -----EPG-----------------KDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRST 346 (548) T ss_pred -----CCC-----------------cccccccccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCC Confidence 000 0011122223333322223445678998888788899999999999999999998 Q ss_pred ccc-cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccccccc----ccceeeEEeCCCCC-- Q lcl|NC_019916. 357 DLT-DDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ-RYTVVAHIEERVNGKWDI----DPDEIGFIFRDNLP-- 428 (513) Q Consensus 357 ~~~-~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~-~~~li~~~l~~~~~~~~~----~~~~i~i~f~~~~p-- 428 (513) -.. ...+ + .|-.+.|..+......+...+..|...+.+ +++..+...-..+...-+ ....+.+.|..+-. T Consensus 347 Ye~ltgD~-s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~ 424 (548) T protein:vir:95 347 YSSVSRAY-D-GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPW 424 (548) T ss_pred HHHHhccc-c-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccc Confidence 432 2223 2 366678888877777777777777766655 555544433222211111 11235778854332 Q ss_pred cCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCC--CCCCCCCCCCCCCCCC Q lcl|NC_019916. 429 TDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIING--TSGNDPEDEGVRGQQG 504 (513) Q Consensus 429 ~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 504 (513) .|....+++...+ +|+.|.+..+...+ .|+++.++++.+|.+...+.--.+....... ..+.+..+....+..+ T Consensus 425 iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G--~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (548) T protein:vir:95 425 INPMHEANAWELLVKAGFADEAEVARARG--RDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLG 502 (548) T ss_pred cChHHHHHHHHHHHHcCCCCHHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccc Confidence 4777777776654 68999999999987 4899999999988876655332221111110 1111222211111111 Q ss_pred ----CCCCccCCC Q lcl|NC_019916. 505 ----EPEDERTSD 513 (513) Q Consensus 505 ----~~~~~~~~~ 513 (513) .++++.++. T Consensus 503 ~~~~~~~~~~~~~ 515 (548) T protein:vir:95 503 VGKMLTADEAREL 515 (548) T ss_pred cccccccchhHHh Confidence 244554444 No 109 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.20 E-value=5.9e-10 Score=71.18 Aligned_cols=438 Identities=13% Similarity=0.054 Sum_probs=216.1 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYI 77 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~i 77 (513) |++-=+-...|... +. ....|.+.+....+.|.+ +..++++||.+...- .....+.. + -+++-+|.+.-+ T Consensus 1 ~~~~~~~~~~~~~~-~~-~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~---~~~~~~~~-~-r~~~~~~k~~~~ 73 (584) T protein:vir:95 1 MSVKVAELNSLLVR-DS-SAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTT---TTSNQGLP-W-KNSTTLPKLCQI 73 (584) T ss_pred CCcchhhhhhhccc-cc-hHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhh---hhhhcccc-c-ccccchhHHHHH Confidence 33222222222211 11 223444444444444433 346888998885421 11111111 1 246778999999 Q ss_pred HHHHHHHhhcCC------ee---ecCCcH-----HHHHHHH----HhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC--- Q lcl|NC_019916. 78 ADFQTSYSVGNA------IA---MSGPSS-----DRLDDFN----RRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ--- 136 (513) Q Consensus 78 vd~~~~~l~g~p------~~---~~~~~~-----~~l~~~~----~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~--- 136 (513) ++..+.+|+.-= +. +..++. ++++... ...++.....++.++++++|-|+..+++...- T Consensus 74 ~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~ 153 (584) T protein:vir:95 74 RDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEM 153 (584) T ss_pred HHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceee Confidence 999988885421 11 222222 2344443 44588999999999999999999998876431 Q ss_pred ---------ceeEEEEEcccceEEEecCCCC--cceEEEEEEEe------------------------------------ Q lcl|NC_019916. 137 ---------KGEVSVKLDPMECFIIYDRSVN--PKPIMAVRYHA------------------------------------ 169 (513) Q Consensus 137 ---------~~~~~~~~~p~~~~~~~d~~~~--~~~~~~ir~~~------------------------------------ 169 (513) ....+..++|.++| ||++-. ...-+.+|.+. T Consensus 154 ~e~~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~ 231 (584) T protein:vir:95 154 TDGTLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYS 231 (584) T ss_pred eccccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCc Confidence 01233468998877 565431 11111122211 Q ss_pred ecccccccceeE----EEEEEEcCCcEEEEE-------------------eeccCCccccccccccccCcccceEEecCC Q lcl|NC_019916. 170 VQTVVDNITQTK----YEVETWTENDYTRYK-------------------PIVVAGSVPTLEVAEHSAQFGFPMIEYRNN 226 (513) Q Consensus 170 ~~~~~~~~~~~~----~~ve~yt~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~ 226 (513) ..+.+....... .-.+.|....+..+. ..-.++. -.....-+.+.+.+|++.+..- T Consensus 232 ~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~-iIR~~~np~~~~~~PF~~~~~~ 310 (584) T protein:vir:95 232 VEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRST-EVRNESIPTWFGSAPIYHVGWR 310 (584) T ss_pred ccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccE-EEEeeecCCCCCCCCEEEEcce Confidence 111110000000 001112221111111 0000000 0011122345688898776543 Q ss_pred -----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhh Q lcl|NC_019916. 227 -----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLE 301 (513) Q Consensus 227 -----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 301 (513) -+|+|....+.++|+.+|.+.-.+.+.+.-+.+|.+...+..... T Consensus 311 p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~~~------------------------------ 360 (584) T protein:vir:95 311 FRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEVEEF------------------------------ 360 (584) T ss_pred eeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeeccccchh------------------------------ Confidence 369999999999999999999999999999999965544421100 Q ss_pred cchhcceeeccccccccccccCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCccccccccc-cccccHHHHHHHHHHH Q lcl|NC_019916. 302 AMRQANMILLKTGMAPNGQQTSADANYIHKEY-DSAGTELYKKRLAADIHKFSHTPDLTDDNF-SGNSSGVAMKYKVLGT 379 (513) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l 379 (513) . ...+.+.-++..++++++.++. +..+.-+.+..+...+-..|++|..+.+.- .++.++..+..+..++ T Consensus 361 --~-------~~pg~~~~~~~~~~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa 431 (584) T protein:vir:95 361 --V-------WGPGAEIHLDQGGDVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAA 431 (584) T ss_pred --c-------ccCCceeecCCCCCcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHH Confidence 0 0112233345667788888774 445555678888899999999998776532 2344555567777778 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHhcccccc-c---------------ccceeeEEe--CCCCCcCHHHHHHHHHH Q lcl|NC_019916. 380 VELASTKRKQFERGL-NQRYTVVAHIEERVNGKWD-I---------------DPDEIGFIF--RDNLPTDDVAIITALVQ 440 (513) Q Consensus 380 ~~k~~~~~~~f~~~l-~~~~~li~~~l~~~~~~~~-~---------------~~~~i~i~f--~~~~p~d~~e~a~~~~k 440 (513) -.....+.+.|..++ ++++.++..+....-...+ . ...+++-.| ....-.-+++.++..+. T Consensus 432 ~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~ 511 (584) T protein:vir:95 432 GRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQN 511 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHH Confidence 777888888888875 8888887766433111100 0 000111111 11111112333333333 Q ss_pred Hh------------cCCCHHHHHH------hCCC---CC-C----HHHHHHHHHHHHHHHHHHhhhhcCCCCC Q lcl|NC_019916. 441 AG------------AQIPQEYLYQ------YLPN---VT-D----ADEIVKMMDKQRKAMLKTYDTKGGLIIN 487 (513) Q Consensus 441 l~------------g~iS~et~~~------~l~~---v~-D----~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 487 (513) +. +.++...... .+|. .+ + .+.|.+..-.+.++.......+..-++- T Consensus 512 l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 512 LVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred HHHHHHhhhhhhccccchHHHHHHHHHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 21 1123322211 1331 11 1 2222222221111111111111111111 No 110 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.20 E-value=2.2e-10 Score=73.55 Aligned_cols=442 Identities=10% Similarity=0.060 Sum_probs=188.8 Q ss_pred Cccchhhce-eccCCcccCCHHHHHHHHHH--HH-HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANM-NYQEDADKLTPTRIAAFIRH--HY-NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~i~~~i~~--~~-~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) |..-.+-+- .|..- . .+-...++.- -+ ..+-.+. ...||....-+-+ ..- ... -.+.+++. T Consensus 35 ~~~~~~~~~~~~~~~-~---~~~~~~~~a~~~g~~~~~~~~~--~~~~~~~~~~~~~-~l~-------a~Y-~~~~l~r~ 99 (532) T protein:vir:94 35 LATAHEIDPTAYSPY-E---RNAAQNAMAMDYGLQTGRNGRN--ALSFVEATSWPGF-PTL-------ALL-AQLPEYRT 99 (532) T ss_pred hhhhhhhcccccccc-c---ccccccccccccccCccccccc--ccccccccccchH-HHH-------HHH-HcCchhhh Confidence 111100000 00000 0 0000000000 00 0000000 0012221110000 000 000 02566788 Q ss_pred HHHHHHHHhhcCCeeecCCcH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEE---EEEc Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGPSS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVS---VKLD 145 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~---~~~~ 145 (513) +|+..+.=++-+++.+.++.+ ..++..|+.-++.....++.+.+..||.|++++-.+.++..... ..++ T Consensus 100 ~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~ 179 (532) T protein:vir:94 100 MHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLS 179 (532) T ss_pred hhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCcccccccccccc Confidence 999999988889999866322 23555566667888899999999999999988877644421100 0011 Q ss_pred ccc-------eEEEecCCCCcceEEEEEEEeecccc--cccceeEEEE---EEEcCCcEEEEEeeccCCccccccccccc Q lcl|NC_019916. 146 PME-------CFIIYDRSVNPKPIMAVRYHAVQTVV--DNITQTKYEV---ETWTENDYTRYKPIVVAGSVPTLEVAEHS 213 (513) Q Consensus 146 p~~-------~~~~~d~~~~~~~~~~ir~~~~~~~~--~~~~~~~~~v---e~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (513) |.. .+.++|+.. +.. ..|...+.. +...-..+.+ .-+.+.++++|... T Consensus 180 ~~~I~~g~~~~l~vld~~~----v~p-~~~~~~dp~sp~fg~P~~y~v~~g~~iH~SRli~f~g~--------------- 239 (532) T protein:vir:94 180 PSFVQRGCLIGFATIEPMW----LSP-NAYNATDPTLPSFYKPDSWIATSGKKIHSSRIHTVVGR--------------- 239 (532) T ss_pred ccccccceeeEEEeechhe----ecc-cccccccccccccCCceeEEEccCeeeccceEEEecCC--------------- Confidence 111 122222110 000 000000000 0000000000 01112222222110 Q ss_pred cCcccceEEe-cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcc Q lcl|NC_019916. 214 AQFGFPMIEY-RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKL 292 (513) Q Consensus 214 ~~g~vPvv~~-~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l 292 (513) .+|-... .++-.|.|.++.+..-+..++++.-..+..+..+....+.. +....... .. ...+ T Consensus 240 ---~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~-~~a~~ls~---------~~----~~~~ 302 (532) T protein:vir:94 240 ---PVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLAT-DMAQLLAP---------GG----AQSL 302 (532) T ss_pred ---CchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee-chHHhhcc---------hh----HHHH Confidence 0111110 12235889999888888888887777766555555443322 22111100 00 0000 Q ss_pred ccccchhhhcchhc-ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccc-cc-cccc-ccc Q lcl|NC_019916. 293 ADEKMAQLEAMRQA-NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLT-DD-NFSG-NSS 368 (513) Q Consensus 293 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~-~~~~-n~S 368 (513) ......+...+.+ +++.+. .+.-+|-+...+.+.....++...+.|+..+++|-.- ++ ..+| |.+ T Consensus 303 -~~r~~~~~~~~~n~g~~~id----------~~~e~~e~~~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~Glnst 371 (532) T protein:vir:94 303 -DARLQLFNLYRDNRNIGALD----------KGTEEIQQTNTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNAS 371 (532) T ss_pred -HHHHHHHHhhcCCccceEEc----------CCCceeEEEecccCCHHHHHHHHHHHHHhHhCCCeeeeecCCccccccc Confidence 0111111221111 122221 1122444555778888999999999999999999653 22 2222 456 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH------- Q lcl|NC_019916. 369 GVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA------- 441 (513) Q Consensus 369 g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl------- 441 (513) |+.=..-|...+. ...+..+...+++++++++.-. . +..+ .+++++|++-...+..|.|+...+. T Consensus 372 Ge~D~~~yyd~I~--s~Qe~~l~p~le~l~~~l~~s~--~-g~~~---~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~ 443 (532) T protein:vir:94 372 SDGEIRVWYDFIA--GYQATNLTPLMEWIIDLIQLSE--Y-GQID---PGLAWEWSPLMELDDKELAEVRQLNASTDSTL 443 (532) T ss_pred chHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh--c-CCCC---CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 6643333333331 2334557788888888775321 1 2222 2578999998888999888765432 Q ss_pred --hcCCCHHHHHHhCCCCC------C--HHHHHHHHHHHHHHHHH-HhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 442 --GAQIPQEYLYQYLPNVT------D--ADEIVKMMDKQRKAMLK-TYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 442 --~g~iS~et~~~~l~~v~------D--~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) .|++|.+.+.+.+..-. + ...+++....+..+... ..++.......+..+.+.++|+.+.++..+.+.- T Consensus 444 ~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 523 (532) T protein:vir:94 444 MELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPA 523 (532) T ss_pred HhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCcccc Confidence 47899888877663211 1 01111111111111000 1111111111111222333333333333333333 Q ss_pred CCC Q lcl|NC_019916. 511 TSD 513 (513) Q Consensus 511 ~~~ 513 (513) +.+ T Consensus 524 ~~~ 526 (532) T protein:vir:94 524 QND 526 (532) T ss_pred ccC Confidence 333 No 111 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.18 E-value=8.3e-10 Score=70.36 Aligned_cols=448 Identities=9% Similarity=-0.012 Sum_probs=205.5 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cccccc-cCC-CCCCc--------cee- Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGI-LSPASR-RNE-KGKAD--------HRA- 69 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~-~~~~~~-~~~-~~~~~--------~ri- 69 (513) |++-..+..-+. ...+-.....-|+|-...- .+.+.. ..+ ..... +.+ T Consensus 1 m~~~~~~~~a~~--------------------~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~ 60 (495) T protein:vir:10 1 MNMTPSGYQSLA--------------------SGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNV 60 (495) T ss_pred CCcccccccccc--------------------hhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHH Confidence 222111111000 0011111223355532111 010000 000 00000 011 Q ss_pred -ecchhHHHHHHHHHHhhcCCeeecCC--cH---HHHHHHHHh----------cCHHHHHHHHHHHHhhCCeEEEEeeec Q lcl|NC_019916. 70 -VHSFARYIADFQTSYSVGNAIAMSGP--SS---DRLDDFNRR----------NDIDTLNYELYLDMTVTGRAYEYVYRD 133 (513) Q Consensus 70 -~~n~~~~ivd~~~~~l~g~p~~~~~~--~~---~~l~~~~~~----------n~~~~~~~~~~~~a~~~G~~~~~v~~d 133 (513) .++|++-+|+..+.+++|.+++.... ++ ..++..|+. .+|...+..+++..+..|.+|+.+.+. T Consensus 61 rNn~~a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~ 140 (495) T protein:vir:10 61 RNNPWATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPR 140 (495) T ss_pred hcChHHHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeec Confidence 35899999999999999999987653 21 234433322 257778888999999999999876555 Q ss_pred CCCc---eeEEEE-EcccceEEEecC---CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccc Q lcl|NC_019916. 134 PSQK---GEVSVK-LDPMECFIIYDR---SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPT 206 (513) Q Consensus 134 ~~~~---~~~~~~-~~p~~~~~~~d~---~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~ 206 (513) +... ..+.+. ++|..+--.++. .....+..+|.+- ..+....|++.--.+...... .....+ T Consensus 141 ~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d------~~Gr~vaY~i~~~hpgd~~~~-----~~~~~~ 209 (495) T protein:vir:10 141 PLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFS------NGGKRKAYCFYRNHPAESSLI-----GDPVDT 209 (495) T ss_pred ccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEEC------CCCceEEEEEeecCCCccccc-----ccccce Confidence 4322 112222 577765433322 1223456666541 112233333211111111100 000000 Q ss_pred ccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhh Q lcl|NC_019916. 207 LEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDA 286 (513) Q Consensus 207 ~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~ 286 (513) .....++.++.+| ..+...+|.|.+..++.|-|.-+..-+.+.... ..+.-..+++......... ....... T Consensus 210 ~rvpA~~vlH~f~--~r~gQ~RGis~la~i~~l~~l~~y~dael~~a~-i~A~~~~fi~~~~~~~~~~-----~~~~~~~ 281 (495) T protein:vir:10 210 VWIKAEHVLHVTV--LTVRSDAGAPWFQLLLRLNELDQYEDAELVRKK-TAALFAAFIQEATADSTGG-----PTIGQPK 281 (495) T ss_pred eeechhheEeccc--cCCCcccCcchhHHHHHHHHhhHHHHHHHHHHH-HhhhheeeeecCCCccccc-----cccCccc Confidence 0000011122222 123445788888776665432222222222222 1221122233211111000 0000000 Q ss_pred hhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc Q lcl|NC_019916. 287 DAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN 366 (513) Q Consensus 287 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n 366 (513) .............+.+.. ...+.++++++..-+..++..+...+.+.|..-.++|-.....--++ T Consensus 282 ------~~~~~~~~~~l~pG~i~~---------L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~ 346 (495) T protein:vir:10 282 ------RSKGGKRITGLNPGTLQY---------LQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRG 346 (495) T ss_pred ------cccCcccceecCCceeee---------cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc Confidence 000000111122222222 34567788888887888999999999999999988884322111133 Q ss_pred ccHHHHHHHHHHHHHHHHHHHH-HHHHHH-HHHHHHHHHHHHhccccccccc-----ceeeEEeCCCCC--cCHHHHHHH Q lcl|NC_019916. 367 SSGVAMKYKVLGTVELASTKRK-QFERGL-NQRYTVVAHIEERVNGKWDIDP-----DEIGFIFRDNLP--TDDVAIITA 437 (513) Q Consensus 367 ~Sg~Ai~~~~~~l~~k~~~~~~-~f~~~l-~~~~~li~~~l~~~~~~~~~~~-----~~i~i~f~~~~p--~d~~e~a~~ 437 (513) +|-.++|..+......+...+. .+...+ +.+++..+...-..+...-+++ ..+.+.|..+-. .|....+++ T Consensus 347 ~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A 426 (495) T protein:vir:10 347 VNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLA 426 (495) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHH Confidence 4555778777777777766554 455544 3355544443322222111111 124567754432 467777777 Q ss_pred HHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCC-CCCCCCCCCCCCCCc Q lcl|NC_019916. 438 LVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDP-EDEGVRGQQGEPEDE 509 (513) Q Consensus 438 ~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 509 (513) .... +|+.|.+..+...+ .|+++.++++.+|++...+.--. ....+....+ +....+.++..+++| T Consensus 427 ~~~~i~~G~~s~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~----~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 427 DLGDVRAGFAPISDKQAERG--YDMEELFDMISDANQLIDEYDLR----LDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCC----CCCCCCcCCCccCCCCCCCCCCCCCC Confidence 6654 79999999999997 48999999888887765443211 1111111111 111111111112222 No 112 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.16 E-value=1.6e-10 Score=74.35 Aligned_cols=444 Identities=10% Similarity=0.038 Sum_probs=198.4 Q ss_pred CccchhhceeccCCcccCCH--------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecc Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTP--------TRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHS 72 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~--------~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n 72 (513) ++..++.+.....-.+.+.+ .....+-.. ... .+-..+..||-... .+..... ... -.+. T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~-~~~~~l~-------a~Y-~~~~ 114 (537) T protein:vir:10 47 MMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAY-ANP--NLSEGLVLWYAQQA-FIGHQMC-------ALI-ATHW 114 (537) T ss_pred cCCCCCccCcccccccccccchhccccccchhhhhhh-ccc--cccchhhhhccccC-CccHHHH-------HHH-HhCc Confidence 44444433333322221111 111111111 000 01111222222211 1110000 000 1368 Q ss_pred hhHHHHHHHHHHhhcCCeeecCCcH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEE Q lcl|NC_019916. 73 FARYIADFQTSYSVGNAIAMSGPSS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKL 144 (513) Q Consensus 73 ~~~~ivd~~~~~l~g~p~~~~~~~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~ 144 (513) +++.+|+..+.-++-+++.+++++. ..++..|+..++.....++.+.+..||.+++++..+..+.....-++ T Consensus 115 l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl 194 (537) T protein:vir:10 115 LVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPF 194 (537) T ss_pred hhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCccccccc Confidence 8999999999999999999877542 24566667778889999999999999999988876532211111111 Q ss_pred cccc-------eEEEecCCCCcceEEEEEEEeec-ccccccceeEEEE--EEEcCCcEEEEEeeccCCcccccccccccc Q lcl|NC_019916. 145 DPME-------CFIIYDRSVNPKPIMAVRYHAVQ-TVVDNITQTKYEV--ETWTENDYTRYKPIVVAGSVPTLEVAEHSA 214 (513) Q Consensus 145 ~p~~-------~~~~~d~~~~~~~~~~ir~~~~~-~~~~~~~~~~~~v--e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (513) .+.. .+.++|+.-. .+.. +.++... ...+...-..+.+ ..|.+.++++|.... T Consensus 195 ~~~~i~kg~~k~l~vidp~~~-~~~~-~~~~~~dp~sp~fg~P~~y~v~g~~iH~SRli~f~g~~--------------- 257 (537) T protein:vir:10 195 NIDGVMPGAYKGIVQIDPYWC-APLL-DAQASSNPVSMHFYEPTYWLINGKKYHRSHLAIYINDE--------------- 257 (537) T ss_pred ccccccccceeEEEEechhhc-cccc-chhhhccCCccccCCceeeeecCeEecceeEEEecCCC--------------- Confidence 1111 1222221100 0000 0000000 0000000000111 011222222221100 Q ss_pred CcccceEEe-cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccc Q lcl|NC_019916. 215 QFGFPMIEY-RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLA 293 (513) Q Consensus 215 ~g~vPvv~~-~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~ 293 (513) +|-+.- .++-.|.|.++.+.+-+..++++.-..+..+..+..+.+.+.|..... . ... + T Consensus 258 ---~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~----------~--~~~----~- 317 (537) T protein:vir:10 258 ---VVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLA----------N--KQQ----F- 317 (537) T ss_pred ---CchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhc----------C--HHH----H- Confidence 111000 122358899999888888888888877777766666655554431100 0 000 0 Q ss_pred cccchhhhcchhc-ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccc-ccc-cc-ccccH Q lcl|NC_019916. 294 DEKMAQLEAMRQA-NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLT-DDN-FS-GNSSG 369 (513) Q Consensus 294 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~-~~-~n~Sg 369 (513) ......+...+.+ +++.+. ..+-+|-....+.+.....++...+.|+..+++|-.- ++. .+ -|.|| T Consensus 318 ~~r~~~~~~~r~n~g~~~id----------~e~e~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatG 387 (537) T protein:vir:10 318 DETMSWWTATRDNYQVRVVD----------KDNEDVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTG 387 (537) T ss_pred HHHHHHHHhhcCCcceeEec----------CCCceeEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccch Confidence 0111122222222 233322 1123555666778888999999999999999999653 232 22 24667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHH-------H- Q lcl|NC_019916. 370 VAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQ-------A- 441 (513) Q Consensus 370 ~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~k-------l- 441 (513) ..=..-|...+ +.+|..++..+++++++++... .... .+++++|++-...|..|.|++..+ + T Consensus 388 e~D~~~yyd~I---~~~Qe~l~p~l~~l~~ll~~~~----~~~~---~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~ 457 (537) T protein:vir:10 388 DYEEASYHEEC---ESTQDDMRPLIDRHHQLVCRSH----LRKR---IRVKVEFPPMDAPKESERADTFLKKMQAAKLAF 457 (537) T ss_pred hHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhc----CCCC---cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Confidence 75444444443 3333457888888888876432 1122 257899999999999998876443 2 Q ss_pred -hcCCCHHHHHHhCCCCCCH-HHHH-HHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 442 -GAQIPQEYLYQYLPNVTDA-DEIV-KMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 442 -~g~iS~et~~~~l~~v~D~-~~E~-~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +|++|.+.+.+.|....+. ...+ ..+..|..+....-.........+..+..++..+.+.....+.+++++. T Consensus 458 ~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (537) T protein:vir:10 458 EMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSG 532 (537) T ss_pred HcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccCc Confidence 4788887777665321100 0000 0011111111000000000000000000000001111101111111111 No 113 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.14 E-value=2.3e-10 Score=73.39 Aligned_cols=441 Identities=10% Similarity=0.024 Sum_probs=192.8 Q ss_pred Cccchh---------------hceecc-------CCcc--cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019916. 1 MIDMQQ---------------ANMNYQ-------EDAD--KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPA 56 (513) Q Consensus 1 ~~~~~~---------------~~~~~~-------~~~~--~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~ 56 (513) |..+.+ ..+.+. ||.- +...+-+..+.-- ...... ...+..||....-+-+ .. T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~f~gy-ql 113 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGG-QNPYVV-PTMLQDWYNSQGFIGY-QA 113 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhc-cCccch-hhHHHhhhcccCCccH-HH Confidence 111110 111111 2211 0011111111100 011111 1223344433211101 00 Q ss_pred cccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcH-------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEE Q lcl|NC_019916. 57 SRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSS-------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEY 129 (513) Q Consensus 57 ~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~ 129 (513) . .. =-.+.+++.+|+..+.-++.+++.++++++ ..++..|+.-++...+.++.+.+-.||.+|++ T Consensus 114 ~-------al-Y~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~ 185 (765) T protein:vir:96 114 C-------AI-ISQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIAL 185 (765) T ss_pred H-------HH-HHhCchhhhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEE Confidence 0 00 013678899999999999999999877542 34666777778899999999999999999988 Q ss_pred eeecCCCceeEEEEEcccc-------eEEEecCCCCcceEEEEEEEeec-ccccccceeEEEEE--EEcCCcEEEEEeec Q lcl|NC_019916. 130 VYRDPSQKGEVSVKLDPME-------CFIIYDRSVNPKPIMAVRYHAVQ-TVVDNITQTKYEVE--TWTENDYTRYKPIV 199 (513) Q Consensus 130 v~~d~~~~~~~~~~~~p~~-------~~~~~d~~~~~~~~~~ir~~~~~-~~~~~~~~~~~~ve--~yt~~~~~~~~~~~ 199 (513) +-.+.++.....-.+++.. .+.++|+... .+.. +.++..+ .......-..+.+. -+.+.++++ T Consensus 186 i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~-~~~~-v~e~~~Dp~sp~fg~P~~y~i~g~~IH~SRli~----- 258 (765) T protein:vir:96 186 FVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWA-MPQL-TAESTADPSAEHFYEPDFWIISGKKYHRSHLVV----- 258 (765) T ss_pred EEecccCcchhhccccccccccceeeEEEEechhhc-cccc-chhccccccccccCcceeeeecCceeccceEEE----- Confidence 7665332211111122211 1222221000 0000 0000000 00000000000000 000111111 Q ss_pred cCCccccccccccccCcccceEEe---cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccc Q lcl|NC_019916. 200 VAGSVPTLEVAEHSAQFGFPMIEY---RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDST 276 (513) Q Consensus 200 ~~~~~~~~~~~~~~~~g~vPvv~~---~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~ 276 (513) +.+-|+-.+ .++-.|.|.++.+..-+..++++.-..+..+..+....+.+.+.... T Consensus 259 ---------------~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l------ 317 (765) T protein:vir:96 259 ---------------VRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAI------ 317 (765) T ss_pred ---------------ecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhh------ Confidence 111111111 12234889999999988888888877777666666555544332110 Q ss_pred ccccccchhhhhhhccccccchhhhcchhc-ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_019916. 277 LLQMVDPSDADAMKKLADEKMAQLEAMRQA-NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHT 355 (513) Q Consensus 277 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~ 355 (513) ..... + ......+...+.+ +++.+ +.+-+|-+.+.+.+.....++...+.|+..+++ T Consensus 318 ------~~~~~----l-~~r~~~~~~~r~n~g~~~i-----------d~ee~~e~~s~~lsgl~d~l~~~~~~iAaas~I 375 (765) T protein:vir:96 318 ------ANEDA----F-NARLAFWIANRDNHGVKVI-----------GIDETMEQFDTNLSDFDSVIMNQYQLVAAIAKT 375 (765) T ss_pred ------ccHHH----H-HHHHHHHHHhcCCceeEEe-----------cCCcceeEEecccCCHHHHHHHHHHHHHhhhCC Confidence 00000 0 0111222222222 22222 122345556678889999999999999999999 Q ss_pred cccc-cc-ccc-ccccHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCH Q lcl|NC_019916. 356 PDLT-DD-NFS-GNSSGVA-MKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDD 431 (513) Q Consensus 356 p~~~-~~-~~~-~n~Sg~A-i~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~ 431 (513) |-.- ++ ..+ -|.||.. ++.-+..+. ..++..+...+++++.+++.- +..+ .+++++|++-...+. T Consensus 376 P~t~LfGqsp~GlnATGe~D~~nYyD~I~---s~Qe~~l~p~le~L~~li~~s-----~~i~---~d~~i~FnpL~~~se 444 (765) T protein:vir:96 376 PATKLLGTSPKGFNATGEHETISYHEELE---SIQEHIFDPLLERHYLLLAKS-----ESID---VQLEIVWNPVDSTTS 444 (765) T ss_pred CeeeeccCCcccccCcchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-----cCCC---CcceEEeCCCCCCCH Confidence 9633 23 212 3677774 443333332 233466788899888887642 2222 258999999999999 Q ss_pred HHHHHHHHHH---------hcCCCHHHHHHhCC------CCCCHHHHHH---HHHHHHHHHHHHhh--hhcCCCCC---- Q lcl|NC_019916. 432 VAIITALVQA---------GAQIPQEYLYQYLP------NVTDADEIVK---MMDKQRKAMLKTYD--TKGGLIIN---- 487 (513) Q Consensus 432 ~e~a~~~~kl---------~g~iS~et~~~~l~------~v~D~~~E~~---ri~~E~~~~~~~~~--~~~~~~~~---- 487 (513) .+.|++..+. +|++|...+.+.|. +-...+.+.+ -+..|..+..+... ........ T Consensus 445 kEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~ 524 (765) T protein:vir:96 445 QQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAE 524 (765) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCCCcccccccCcccccc Confidence 9988875442 47888877777652 1111111111 00000000000000 00000000 Q ss_pred -CCCCCCCCCCCC---C-----------CCCCCCCCccCCC Q lcl|NC_019916. 488 -GTSGNDPEDEGV---R-----------GQQGEPEDERTSD 513 (513) Q Consensus 488 -~~~~~~~~~~~~---~-----------~~~~~~~~~~~~~ 513 (513) +.....+..+++ + ++.+....+.+.+ T Consensus 525 a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~~ 565 (765) T protein:vir:96 525 AQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSRP 565 (765) T ss_pred CCCCccCCCCcccccCCcccCCccccccccCccccCccccc Confidence 000000000000 0 0000000000000 No 114 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.14 E-value=1.3e-09 Score=69.34 Aligned_cols=459 Identities=12% Similarity=0.053 Sum_probs=199.8 Q ss_pred cccCCHHHHHHHHHHHHHH------HHHHHHHHHHHhc--CCCc---ccccccc-ccCCCCCCcceeecchhHHHHHHHH Q lcl|NC_019916. 15 ADKLTPTRIAAFIRHHYNN------QRPRLEMLYDYYR--GQND---GILSPAS-RRNEKGKADHRAVHSFARYIADFQT 82 (513) Q Consensus 15 ~~~~~~~~i~~~i~~~~~~------~~~~~~~~~~YY~--G~~~---i~~~~~~-~~~~~~~~~~ri~~n~~~~ivd~~~ 82 (513) ..+.+.+++.++...+... -+.....-.+||. |+|= ++. ..+ .....++| .+.+|.++.+|+..+ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~-~~~~~l~~~~~P--~~~~N~i~~~v~~v~ 77 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAA-GSELGKHFEKYP--KFEINKISTELNRII 77 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHH-HHHHHHhhCCCC--eEEEccHHHHHHHHH Confidence 3334556666665543221 1122333445664 6541 000 000 01111222 377899999999999 Q ss_pred HHhhcCCeee--cCC---cHH--------HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC----CC--ce-eEEE Q lcl|NC_019916. 83 SYSVGNAIAM--SGP---SSD--------RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP----SQ--KG-EVSV 142 (513) Q Consensus 83 ~~l~g~p~~~--~~~---~~~--------~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~----~~--~~-~~~~ 142 (513) ++---+.+.+ ... .+. .++.+.+.++.+...+.+..+++++|.||+-|+.|- ++ .. .+++ T Consensus 78 g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i 157 (720) T protein:vir:35 78 SEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICL 157 (720) T ss_pred hHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeE Confidence 9997776554 221 111 255667789999999999999999999999887642 11 11 1111 Q ss_pred E--Ecc-cceEEEecCCCC-cce----EEEEEEEeec----------------------ccccccceeEEEEEEEcCCcE Q lcl|NC_019916. 143 K--LDP-MECFIIYDRSVN-PKP----IMAVRYHAVQ----------------------TVVDNITQTKYEVETWTENDY 192 (513) Q Consensus 143 ~--~~p-~~~~~~~d~~~~-~~~----~~~ir~~~~~----------------------~~~~~~~~~~~~ve~yt~~~~ 192 (513) . .+| .++ .||+... ..+ ..+++.|... ..+......+..+|+|.-..+ T Consensus 158 ~~v~~~~~~v--~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~~~ 235 (720) T protein:vir:35 158 EPIYDPARSV--WFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVKKE 235 (720) T ss_pred ecccCchhhe--eecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEEEEE Confidence 1 122 222 2332210 000 0111111000 000001111222222221111 Q ss_pred ---------------EEEEee------------------------------ccCCccccccccccccCcccceEEecCCC Q lcl|NC_019916. 193 ---------------TRYKPI------------------------------VVAGSVPTLEVAEHSAQFGFPMIEYRNNE 227 (513) Q Consensus 193 ---------------~~~~~~------------------------------~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 227 (513) +.|... ..++. .......+.+++.||+|+|.-.+ T Consensus 236 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~-~~l~~~~~~p~~~fP~vP~~g~r 314 (720) T protein:vir:35 236 SVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGE-GFLEKAQRIPGEHIPLIPVYGKR 314 (720) T ss_pred EEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccc-hhcccCCCCCCCccceEEEEeee Confidence 111000 00000 11122344567778998875321 Q ss_pred ---C----CCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhh Q lcl|NC_019916. 228 ---Y----RQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQL 300 (513) Q Consensus 228 ---~----~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 300 (513) . ..|.+.++++.+|.+|+.+|.++..+. ..+...-.|...... ................+ T Consensus 315 ~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~--~~~~~~~~~a~~~~~-----------~~~~~~a~~~~~~~~~l 381 (720) T protein:vir:35 315 WFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSAT--QDTGSIPIVGKSQIK-----------TLEKYWANRNKNRPAFL 381 (720) T ss_pred eccCCCcccceeeecchhHHHHHHHHHHHHHHHHH--cCCccccccCcchHH-----------HHHHHhhcccccccccc Confidence 1 247788899999999999999998884 333333223211000 00000000001111111 Q ss_pred hcchhcceeecccccccccc--ccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHH Q lcl|NC_019916. 301 EAMRQANMILLKTGMAPNGQ--QTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLG 378 (513) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~ 378 (513) . +.......|. ...+.+.+.....-..+....+..-...|-..|++-+...+.. +|.||+||..+-.. T Consensus 382 ~---------~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~-sn~SG~Ai~~rq~q 451 (720) T protein:vir:35 382 P---------LNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMP-SNIAKETVNHLMHR 451 (720) T ss_pred c---------cccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcc-cchHHHHHHHHHHH Confidence 0 0000000110 0112233333333345566788888889999999888777654 56999999886655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------c----ccc------------------ce-------eeEEe Q lcl|NC_019916. 379 TVELASTKRKQFERGLNQRYTVVAHIEERVNGKW------D----IDP------------------DE-------IGFIF 423 (513) Q Consensus 379 l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~------~----~~~------------------~~-------i~i~f 423 (513) -.......-..+..+.+++.+++++++....+.. . .+. ++ |.+.= T Consensus 452 g~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~ 531 (720) T protein:vir:35 452 SDMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDV 531 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEec Confidence 5555555666666677777776666654432100 0 000 11 12222 Q ss_pred CCCCCcCHHHHHHHHHHHhcCCCHHH---------HHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCC Q lcl|NC_019916. 424 RDNLPTDDVAIITALVQAGAQIPQEY---------LYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDP 494 (513) Q Consensus 424 ~~~~p~d~~e~a~~~~kl~g~iS~et---------~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 494 (513) .+..+.-..+..+.++.+.+.++.+. +++.+++ +..++-.+++.+....... ..+... .. T Consensus 532 ~p~~~s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~-p~~~e~~erirk~~~~~~~---------~~~~~~-e~ 600 (720) T protein:vir:35 532 GPSYTARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEG-EGLDEFKEYNRKQLLTQGV---------VKPRNT-EE 600 (720) T ss_pred ccCcccHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCc-hhHHHHHHHHHhhcchhcc---------cCccCh-hH Confidence 23333334444555665544433221 2333322 2233334444332211000 000000 00 Q ss_pred CCCCCCC----CCCCCCCccC-CC Q lcl|NC_019916. 495 EDEGVRG----QQGEPEDERT-SD 513 (513) Q Consensus 495 ~~~~~~~----~~~~~~~~~~-~~ 513 (513) ....... +....+-.+. .+ T Consensus 601 qq~~a~~qq~~qq~~~e~~~aqa~ 624 (720) T protein:vir:35 601 EQMVAQMIQQAQQPNAELVAAQGV 624 (720) T ss_pred HHHHHHHHHHHHhHhHHHHHHHHH Confidence 0000000 0000000000 00 No 115 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.01 E-value=2.4e-09 Score=67.83 Aligned_cols=440 Identities=10% Similarity=0.028 Sum_probs=189.0 Q ss_pred Cccchhhcee---ccCC----cccCCHHHHHHHHHHHHHHHHH----HHHHHHHHhcCCCccccccccccCCCCCCccee Q lcl|NC_019916. 1 MIDMQQANMN---YQED----ADKLTPTRIAAFIRHHYNNQRP----RLEMLYDYYRGQNDGILSPASRRNEKGKADHRA 69 (513) Q Consensus 1 ~~~~~~~~~~---~~~~----~~~~~~~~i~~~i~~~~~~~~~----~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri 69 (513) =..+.++.+. ...+ ...+..+-|..++...-..... .-..+.++|...--+ .+ ...... - T Consensus 74 ~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~-----gy---ql~alY-~ 144 (862) T protein:vir:99 74 AKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFI-----GH---QACALI-A 144 (862) T ss_pred chhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcc-----cH---HHHHHH-H Confidence 0011111110 0000 0011112222222111000000 001111222110000 00 000000 1 Q ss_pred ecchhHHHHHHHHHHhhcCCeeecCCc---------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeE Q lcl|NC_019916. 70 VHSFARYIADFQTSYSVGNAIAMSGPS---------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEV 140 (513) Q Consensus 70 ~~n~~~~ivd~~~~~l~g~p~~~~~~~---------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~ 140 (513) .+.+++.+|+..+.-++-+++.+.+.. ...++..|+..++.....++.+.+-.||.+++++-.+.+....+ T Consensus 145 ~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~L 224 (862) T protein:vir:99 145 QHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYY 224 (862) T ss_pred hCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhh Confidence 367889999999999999999997632 13466677777888888899999999998887765543221111 Q ss_pred EEEEcccc-------eEEEecCCCCcceEEEEEEEeecccccccceeEEEEE--EEcCCcEEEEEeeccCCccccccccc Q lcl|NC_019916. 141 SVKLDPME-------CFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVE--TWTENDYTRYKPIVVAGSVPTLEVAE 211 (513) Q Consensus 141 ~~~~~p~~-------~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve--~yt~~~~~~~~~~~~~~~~~~~~~~~ 211 (513) .-+++|.. .|.++|+.-.. +.................-..+.+. -+.+.++++|. T Consensus 225 sqPLn~e~I~kG~lkgl~vlDp~w~~-p~~v~~~~~Dp~sp~yGkP~~y~I~g~~IH~SRliif~--------------- 288 (862) T protein:vir:99 225 EKPFNPDGITPGSYRGISQIDPYWMM-PMLTAESTADPSSQFFYEPEFWIISGQKYHRSHLIIAR--------------- 288 (862) T ss_pred hcCcCcccccccceeEEEEechhhhc-ccccccccccccccccCCceeeeecCeeeccceeEEec--------------- Confidence 11222221 12233321100 0000000000000000000111110 01111222111 Q ss_pred cccCcccceEEe---cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhh Q lcl|NC_019916. 212 HSAQFGFPMIEY---RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADA 288 (513) Q Consensus 212 ~~~~g~vPvv~~---~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~ 288 (513) .-|+..+ .++-.|.|.++.+.+.+..++++....+..+..+....+.+.+..... . .... T Consensus 289 -----g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~----------~--ed~l 351 (862) T protein:vir:99 289 -----GPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIA----------N--EDKF 351 (862) T ss_pred -----CCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhc----------c--HHHH Confidence 1111111 122358899999999888888888777777766665555444432100 0 0000 Q ss_pred hhccccccchhhhcchhc-ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccc-cccc-cc Q lcl|NC_019916. 289 MKKLADEKMAQLEAMRQA-NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLT-DDNF-SG 365 (513) Q Consensus 289 ~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~-~~ 365 (513) . .....+...+.+ +++.++ .+-+|-+.+.+.+.....++...+.|+..+++|-.- ++.. +| T Consensus 352 ~-----~r~~~~~~~rdN~Gi~liD-----------~eEe~e~ls~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaG 415 (862) T protein:vir:99 352 I-----QRLMFWVRYRDNHAVKVLG-----------TDETMEQFDTSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKG 415 (862) T ss_pred H-----HHHHHHHhccCcceeEEec-----------CCCceeEEecccCChHHHHHHHHHHHHhhhCCCceeecccCccc Confidence 0 111122222222 233321 223455566788889999999999999999999653 3322 23 Q ss_pred -cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--- Q lcl|NC_019916. 366 -NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--- 441 (513) Q Consensus 366 -n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--- 441 (513) |.||..=..-|...+. ...+..+...+++++.++..- . + .+ .+++++|++-...+..|.|++..+. T Consensus 416 lnATGE~D~~nYyD~I~--s~QE~~L~P~LerL~~li~~~---l-g-~~---~d~~ieFnpL~~~sekEkAEi~kk~Aea 485 (862) T protein:vir:99 416 FNSTGEFETISYHEELE--SIQEHVYMPFLQRHYLISRLS---L-G-IQ---HEIDVVMEPVASMTAQQQADLNKTKAEG 485 (862) T ss_pred ccCchHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh---c-C-CC---CcceEEeCCCCCCCHHHHHHHHHHHHHH Confidence 5677743333333332 223456778888776654321 1 1 11 3588999999999999998775442 Q ss_pred ------hcCCCHHHHHHhC--------CCCCCHHHHHHH-HHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCC----- Q lcl|NC_019916. 442 ------GAQIPQEYLYQYL--------PNVTDADEIVKM-MDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRG----- 501 (513) Q Consensus 442 ------~g~iS~et~~~~l--------~~v~D~~~E~~r-i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 501 (513) +|++|.+.+..+| +.++|.+.|-.. ...+...+.+ ..+..... .+.++...++.. T Consensus 486 ~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e---~~g~a~~~--ap~de~~aga~~~~~e~ 560 (862) T protein:vir:99 486 GKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQ---KAGAAQET--ASAKETQAGAAVTTAEG 560 (862) T ss_pred HHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccc---cCCccccc--ccccccccccCCccccC Confidence 4778877776653 222221111000 0011100000 00000000 000000000000 Q ss_pred --------CCCCC-CCccCCC Q lcl|NC_019916. 502 --------QQGEP-EDERTSD 513 (513) Q Consensus 502 --------~~~~~-~~~~~~~ 513 (513) .++.+ +...+.+ T Consensus 561 d~~~~p~~~~~~~g~~~~~t~ 581 (862) T protein:vir:99 561 DQPNVQMVPSMKPGQMVGPEV 581 (862) T ss_pred CcccccccCCCCCCCcccccc Confidence 00001 0111111 No 116 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.99 E-value=7.4e-09 Score=65.16 Aligned_cols=448 Identities=10% Similarity=0.058 Sum_probs=175.2 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHH-------HHHHHHHHHHHHHh--cCCCccccccccccCCCCCCcceeec Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHY-------NNQRPRLEMLYDYY--RGQNDGILSPASRRNEKGKADHRAVH 71 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-------~~~~~~~~~~~~YY--~G~~~i~~~~~~~~~~~~~~~~ri~~ 71 (513) |.+++.--.... ..+++-+.+...|..++ +..+.+...+.+|| +|+.+ .....+.. +++. T Consensus 8 ~~~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~grs----~vv~ 76 (763) T protein:vir:95 8 MVPLPDPSQATK--LTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK-----PPKVKGRS----QVQP 76 (763) T ss_pred cCCCccccchhc--CCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc-----ccccCCCc----cccC Confidence 333332211111 12343334433343332 23334445566664 45432 11112222 3455 Q ss_pred chhHHHHHHHHHHh----hcCC--eeecC---CcH-------HHHHH-HHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC Q lcl|NC_019916. 72 SFARYIADFQTSYS----VGNA--IAMSG---PSS-------DRLDD-FNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP 134 (513) Q Consensus 72 n~~~~ivd~~~~~l----~g~p--~~~~~---~~~-------~~l~~-~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~ 134 (513) +-.+..|+.....| ++.+ |.+.. .|. ..++. ++..|+-.......+++++++|.|++.||++. T Consensus 77 ~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~ 156 (763) T protein:vir:95 77 KLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNR 156 (763) T ss_pred HHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeee Confidence 55555555544433 4432 24432 111 12333 45567766777899999999999999998751 Q ss_pred C---------------------------------------------------------C--------------------- Q lcl|NC_019916. 135 S---------------------------------------------------------Q--------------------- 136 (513) Q Consensus 135 ~---------------------------------------------------------~--------------------- 136 (513) . | T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~ 236 (763) T protein:vir:95 157 EIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLAN 236 (763) T ss_pred eeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecC Confidence 0 0 Q ss_pred ceeEEEEEcccceEEEecCCCC---cceEE-EEEEEeecc-c-------c--------------------------c--- Q lcl|NC_019916. 137 KGEVSVKLDPMECFIIYDRSVN---PKPIM-AVRYHAVQT-V-------V--------------------------D--- 175 (513) Q Consensus 137 ~~~~~~~~~p~~~~~~~d~~~~---~~~~~-~ir~~~~~~-~-------~--------------------------~--- 175 (513) .+++. .|+|.+.++ |++-. ...-+ +.+++.... . + + T Consensus 237 ~p~ie-~V~p~d~~i--Dp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 313 (763) T protein:vir:95 237 HPTVE-MLNPENIII--DPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISD 313 (763) T ss_pred ceEEE-eecHHHhee--cCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCC Confidence 00111 145555543 33211 11112 122221100 0 0 0 Q ss_pred ccceeEEEEEEEcC-----CcEEEEEeeccCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHH Q lcl|NC_019916. 176 NITQTKYEVETWTE-----NDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDV 245 (513) Q Consensus 176 ~~~~~~~~ve~yt~-----~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~ 245 (513) .....+...|+|.. +.+..+......+.........+.+.+.+||+.|+. ..+|.|.+..++++++.+|. T Consensus 314 ~~~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~ 393 (763) T protein:vir:95 314 PMRKRVVAYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGA 393 (763) T ss_pred cccceEEEEEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHH Confidence 00011112233321 222222211111111111122233456777765543 34688999999999999999 Q ss_pred HHHHHHHHHHHhhhhhhhe-ecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCC Q lcl|NC_019916. 246 AQSDTANYMTDLNEAMLVI-KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSA 324 (513) Q Consensus 246 ~~S~~~~~~~~~~~~~l~~-~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (513) .++.+.+.+.-.++|...+ .|... .. + ....+.+.++.+.++.... . T Consensus 394 ~~~~~~d~l~~~~~~~~~v~~gav~--~~-----------d--------------~~~~~pg~v~~v~~g~~~~-----~ 441 (763) T protein:vir:95 394 VMRGMIDLLGRSANGQRGMPKGMLD--AL-----------N--------------SRRYREGEDYEYNPTQNPA-----Q 441 (763) T ss_pred HHHHHHHHHHhhcCCcEEeeccccc--ch-----------h--------------hhcccCCceEEeeCCCChh-----h Confidence 9999999998888875432 22110 00 0 0001112233332221111 1 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc----ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 325 DANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS----GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTV 400 (513) Q Consensus 325 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~----~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~l 400 (513) ...++..+....+....+..+...+-..|++++.+.+..+ +..||+ ..+......+....-+.|..+++.+++. T Consensus 442 ~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v--~~l~qa~~~~~~~~~r~~~~~~k~l~~~ 519 (763) T protein:vir:95 442 MIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGI--RGVLDAASKREMAILRRLAKGMSEIGNK 519 (763) T ss_pred hcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222223345666777777777788888877644221 122333 3333334444445556666777777777 Q ss_pred HHHHHHhccccccc------cc-----------ceeeEEeCCCCCcCH-HHHHHHHHHH----hcCCCHHHHHHhCCCCC Q lcl|NC_019916. 401 VAHIEERVNGKWDI------DP-----------DEIGFIFRDNLPTDD-VAIITALVQA----GAQIPQEYLYQYLPNVT 458 (513) Q Consensus 401 i~~~l~~~~~~~~~------~~-----------~~i~i~f~~~~p~d~-~e~a~~~~kl----~g~iS~et~~~~l~~v~ 458 (513) ++.++......... ++ .+|.+.-. |.+. .+.+..+..+ ...++.......+.- T Consensus 520 ~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~---~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~-- 594 (763) T protein:vir:95 520 IIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS---TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAE-- 594 (763) T ss_pred HHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc---cchHHHHHHHHHHHHHHHhccccChHHHHHHHHH-- Confidence 77766554221100 00 01222111 1121 1222222221 111111100000000 Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCC-------------------CCCccCCC Q lcl|NC_019916. 459 DADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGE-------------------PEDERTSD 513 (513) Q Consensus 459 D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~ 513 (513) ..++.++.+ ....+....+. +.+. ....+..+... -....+.+ T Consensus 595 --~~d~~~~~~-------~~~~lr~~q~~-~d~~--~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e 656 (763) T protein:vir:95 595 --IADLKRMPK-------LAHDLRTWQPQ-PDPV--QEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERD 656 (763) T ss_pred --HHhhhchhh-------hHHHHHhcCCC-ccch--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 00000000000 0000 00000000000 00000000 No 117 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=98.92 E-value=1.4e-08 Score=63.57 Aligned_cols=445 Identities=11% Similarity=0.067 Sum_probs=220.9 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCc--ceeecchhH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKAD--HRAVHSFAR 75 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~--~ri~~n~~~ 75 (513) .+++++-...|..+.. +. ..|.++...+.+.|.. ..+++++|-... ..+.....+.+ +++.+|-.- T Consensus 5 ~~~~~~~~~~~~~~~~-~~-~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~-------~tr~t~~~~~~w~~s~t~~k~~ 75 (599) T protein:vir:31 5 IKTLQKMLEGRDDDRA-FI-DELVVLFTNMENARAQKDREDKELMDYIDAT-------DTRKTSNSKLPFKNSTTINKLA 75 (599) T ss_pred hHHHHHHhhccCchHH-HH-HHHHHHHHhhhhhhhhhhcccHHHHHHHhhh-------cccccccCCCCcccccchHHHH Confidence 4455554443433211 11 1233333444444433 356677773321 11112222233 356667777 Q ss_pred HHHHHHHHHhhcCC------eee---cCCcHH-----HHHHH----HHhcCHHHHHHHHHHHHhhCCeEEEEeeec---- Q lcl|NC_019916. 76 YIADFQTSYSVGNA------IAM---SGPSSD-----RLDDF----NRRNDIDTLNYELYLDMTVTGRAYEYVYRD---- 133 (513) Q Consensus 76 ~ivd~~~~~l~g~p------~~~---~~~~~~-----~l~~~----~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d---- 133 (513) -+++..+.++++-- +.+ ..+++. .++.+ +...++......+..+.+.+|-|+..+-.. T Consensus 76 ~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~ 155 (599) T protein:vir:31 76 HLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMT 155 (599) T ss_pred HHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcce Confidence 78999998887632 122 222221 23333 344678888889999999999988776422 Q ss_pred --CCCc------eeEEEEEcccceEEEecCCCCcceEEEEEEEeecccc------------------------------- Q lcl|NC_019916. 134 --PSQK------GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV------------------------------- 174 (513) Q Consensus 134 --~~~~------~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~------------------------------- 174 (513) ++|. ++....++|.++|+=-+.+....--+.+|.+.+.... T Consensus 156 ~~~d~~v~~~~~~P~~ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~ 235 (599) T protein:vir:31 156 VTAENQVIKNYSGTVTERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREAL 235 (599) T ss_pred eecccccccccccceEEeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccc Confidence 2221 1233457888776532222222333345543211000 Q ss_pred --cccc---------eeEEEE-EEEcCCcEE--EE----EeeccCC------------ccccccccccccCcccceEEec Q lcl|NC_019916. 175 --DNIT---------QTKYEV-ETWTENDYT--RY----KPIVVAG------------SVPTLEVAEHSAQFGFPMIEYR 224 (513) Q Consensus 175 --~~~~---------~~~~~v-e~yt~~~~~--~~----~~~~~~~------------~~~~~~~~~~~~~g~vPvv~~~ 224 (513) +... ...+.+ +.|.+..+. .| .....++ ......+..|.+.|..|++... T Consensus 236 ~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~ 315 (599) T protein:vir:31 236 ADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAV 315 (599) T ss_pred cchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEE Confidence 0000 000000 000110000 00 0000000 0111112223456667876654 Q ss_pred C-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 225 N-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 225 n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) . .-+|.|.+..+.++++.+|.+.-.+.+.+.-+..|++...|..... +... T Consensus 316 ~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~e--D~~~---------------------- 371 (599) T protein:vir:31 316 YEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREK--GMRG---------------------- 371 (599) T ss_pred eeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccccccccccccc--CccC---------------------- Confidence 3 3468899999999999999999999999999999988877752110 0000 Q ss_pred hhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccc-ccccccHHHHHHHHHH Q lcl|NC_019916. 300 LEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDN-FSGNSSGVAMKYKVLG 378 (513) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Ai~~~~~~ 378 (513) .++...-+...+++.++.++.+.......+.++...+-+.|++|..+.+. ..+...+..++.+..+ T Consensus 372 -------------~P~~v~~~~d~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~na 438 (599) T protein:vir:31 372 -------------GPNHVFEVEETGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQG 438 (599) T ss_pred -------------CCCcceeecCCCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhh Confidence 01222334677888999988888888888999999999999999877553 3355677778888888 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcccccc-----------cccce-----e--eEEeCCCCCcCHHHHHHHHH Q lcl|NC_019916. 379 TVELASTKRKQFERGL-NQRYTVVAHIEERVNGKWD-----------IDPDE-----I--GFIFRDNLPTDDVAIITALV 439 (513) Q Consensus 379 l~~k~~~~~~~f~~~l-~~~~~li~~~l~~~~~~~~-----------~~~~~-----i--~i~f~~~~p~d~~e~a~~~~ 439 (513) .-....++.+.|..++ +.+++-+++.....-...+ +.+.+ + .+.+.+.--.-.++.++.++ T Consensus 439 a~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q 518 (599) T protein:vir:31 439 QNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQ 518 (599) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHH Confidence 8888888888888864 4466655544332211100 00001 1 12222222233566677666 Q ss_pred HHhc---------C---CCHHHH---HHh---C--CCC-C-C-----HHHHHHHHHHHHHHHHH--HhhhhcCCCCCCCC Q lcl|NC_019916. 440 QAGA---------Q---IPQEYL---YQY---L--PNV-T-D-----ADEIVKMMDKQRKAMLK--TYDTKGGLIINGTS 490 (513) Q Consensus 440 kl~g---------~---iS~et~---~~~---l--~~v-~-D-----~~~E~~ri~~E~~~~~~--~~~~~~~~~~~~~~ 490 (513) ++.+ + ++.+.. ++. + +.+ . . -+.+..+++++-++..+ ..+.+-+....++. T Consensus 519 ~l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~ 598 (599) T protein:vir:31 519 NLNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTTDTG 598 (599) T ss_pred HHHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCcccC Confidence 6532 2 233222 211 1 011 1 1 12222222222222111 22222222111111 Q ss_pred C Q lcl|NC_019916. 491 G 491 (513) Q Consensus 491 ~ 491 (513) + T Consensus 599 ~ 599 (599) T protein:vir:31 599 Q 599 (599) T ss_pred C Confidence 1 No 118 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.72 E-value=8.6e-08 Score=59.31 Aligned_cols=442 Identities=10% Similarity=0.055 Sum_probs=164.4 Q ss_pred ceeccCCcc-cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee--ecchhHHHHHHHHHH Q lcl|NC_019916. 8 NMNYQEDAD-KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA--VHSFARYIADFQTSY 84 (513) Q Consensus 8 ~~~~~~~~~-~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~~ 84 (513) -|+|...-- -.+++-|.+ +....+.-......+||+. +. ......++ ..++...+|+..+.. T Consensus 1 ~~~~~~~i~s~~~~~~i~~---~~~~s~~~~~~~~~~~~~p--------p~----~~~~la~l~~~n~~v~scI~~ia~~ 65 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKR---EEVESQALGETRFEEYVEP--------KV----NPLVLLSLLQVNPYHASACSIKAND 65 (542) T ss_pred Cccccccccccccchhhhh---ccccccccccccCCccccC--------CC----CHHHHHHHHhhcHHHHHHHHHHHHH Confidence 334333311 112222211 0000000000111111110 00 00000111 235667899999999 Q ss_pred hhcCCeeecCCcHHHHHHHHHhcC--HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceE Q lcl|NC_019916. 85 SVGNAIAMSGPSSDRLDDFNRRND--IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPI 162 (513) Q Consensus 85 l~g~p~~~~~~~~~~l~~~~~~n~--~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~ 162 (513) +.+-|+++..+....+..++-..+ .......+..+.+.+|.||+.+..+..|.+.-.+.++|..+.+..|... T Consensus 66 IA~l~~~~~~~~~~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~----- 140 (542) T protein:vir:41 66 IIRTGYILEGDDEGVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSR----- 140 (542) T ss_pred HhhCceeeecccchhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCe----- Confidence 999999988777777766654322 4455667888999999999999888888776566678887766554321 Q ss_pred EEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC-----CCCcchhHHH Q lcl|NC_019916. 163 MAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE-----YRQGDFENVL 237 (513) Q Consensus 163 ~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e~v~ 237 (513) +++++. + ....+...|.....+.... +.. ...+..=-|++|++.. .|.|.+..+. T Consensus 141 -~~~~~~-----~---~~~~~~~~y~~~~~~~~~~----g~~-------~~~~~~~eIiHir~~~~~~~~~Glspi~~~~ 200 (542) T protein:vir:41 141 -YRQTWD-----G---VNITHFKDYRYEGEINPET----GED-------QDSVGANELVFIHIPSPVCSYYGVPRYVSAA 200 (542) T ss_pred -eEeeec-----C---CcceeEEeecccccccccc----ccc-------ccccCcccEEEecCCCCCCCcccccHHHHHH Confidence 111111 0 0011122222221111100 000 0001111245555332 4666666555 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhh--hheecCcccccccccccccccchhhhhhhccccccchhhhcc--hhcceeeccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAM--LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM--RQANMILLKT 313 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~--l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~ 313 (513) .-++....+.....+.+...+.|- +.+.|....... ................+ ...+... -.++.+.+.. T Consensus 201 ~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~--~~~~~~~e~~~~lk~~~----~~~~~g~~~n~gk~~vL~~ 274 (542) T protein:vir:41 201 PAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELE--EDPDGNPTGRTVIQALI----EDNFKHLKEAPHTPLVFSI 274 (542) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccc--cccccCHHHHHHHHHHH----HHHHhhhhcccCceeEeec Confidence 444333222222222222222232 333332111000 00000000000000000 0000000 0012222221 Q ss_pred cccccccccCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCccccccccccccc-cHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 314 GMAPNGQQTSADANYIHKE--YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNS-SGVAMKYKVLGTVELASTKRKQF 390 (513) Q Consensus 314 ~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~~~~l~~k~~~~~~~f 390 (513) . .+.+++++|.... .....+....+...+.|+..-++|+...+...++. ++.-++... ...+ T Consensus 275 ~-----~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~----------~~f~ 339 (542) T protein:vir:41 275 P-----GGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTR----------RTYY 339 (542) T ss_pred c-----CCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHH----------HHHH Confidence 1 1123444554433 34455667778888999999999987654332221 111111111 1122 Q ss_pred HHHHHHHHHHHHHHHHhcccccccccceeeEEeC--CCCCcCHHHHHHHHHHHhcCCCHHHHHHhCCCCCCHHHHH---- Q lcl|NC_019916. 391 ERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFR--DNLPTDDVAIITALVQAGAQIPQEYLYQYLPNVTDADEIV---- 464 (513) Q Consensus 391 ~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~--~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~---- 464 (513) ...+..+++.+...++..-. .... ..+.+.|+ ..+..|..+.++.+. .+|+++...+.+.++.++--++.. T Consensus 340 ~~tL~P~~~~ie~~ln~~L~-~~~~-~~~~~~f~~~~ll~~d~~~~~~~~v-~~GilT~NE~Re~L~g~~pgdd~~l~p~ 416 (542) T protein:vir:41 340 ESVVRPQQNIISSILTDFFQ-VKFN-PKTRFKFNDETLLESDSVRNCALLV-QSGVLTPAEARERLFGLDGGPDIFMVPS 416 (542) T ss_pred HHHHHHHHHHHHHHHHhhcc-cccC-CceEEEecchhhcchHHHHHHHHHH-hCCCCCHHHHHHhhCCCCCCCccccccc Confidence 33333333333333322111 1111 23455664 333444444443332 268888877777665443111110 Q ss_pred ----HHHHHHHH-------HHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 465 ----KMMDKQRK-------AMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 465 ----~ri~~E~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.++.... .+.+..+...+...+.......+.........+|+.+-..+ T Consensus 417 ~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (542) T protein:vir:41 417 KGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAEFRAE 476 (542) T ss_pred cccccccccCCcCCCCCchhhhhhcccccCccccccccccccchhhcccccchhhhhHHh Confidence 00000000 00000000000000000000000000000000011111111 No 119 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.53 E-value=3.5e-07 Score=55.95 Aligned_cols=425 Identities=10% Similarity=-0.003 Sum_probs=164.2 Q ss_pred HHHHHHHHHHHHH-HHHH-HHHHhcCCCcccc--ccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC-C--- Q lcl|NC_019916. 24 AAFIRHHYNNQRP-RLEM-LYDYYRGQNDGIL--SPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG-P--- 95 (513) Q Consensus 24 ~~~i~~~~~~~~~-~~~~-~~~YY~G~~~i~~--~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~-~--- 95 (513) .-+++..+....+ .... ....|......+. ..........-+..-+.+.-...+|+..+.-+-+-|+++-. . T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 1111111100000 0000 0000000000000 00000000000000011122333566666666667877521 1 Q ss_pred ----cHHHHHHHHH-hcC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEE Q lcl|NC_019916. 96 ----SSDRLDDFNR-RND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRY 167 (513) Q Consensus 96 ----~~~~l~~~~~-~n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~ 167 (513) +...+..++. -|+ .......+..+.+.+|.||+.+-. .+|.+.-.+.+.|..+.+.-+.... ......+. T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~-~~g~~~~l~~l~p~~v~v~~~~~~~-~~~~~~~~ 158 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRW-AGPNIAGLDVLDPTKIHVHMVMVDG-LRRKVFEA 158 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-CCCcEEEEEEEcCcceEEEEeccCC-ccceeEEE Confidence 1122333332 233 345666788889999999988844 4555544556788877664433221 11111222 Q ss_pred EeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHH Q lcl|NC_019916. 168 HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQ 247 (513) Q Consensus 168 ~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~ 247 (513) |.... . .. ...+..|+++.+++++.. ++.+. ..|.|.++.+...++....+. T Consensus 159 y~~~~-~--g~--~~~~~~~~~~eiih~r~~--------------~~~~~---------~~G~sp~~~~~~~i~~~~~~~ 210 (457) T protein:vir:62 159 YDIDA-D--GN--EVLLGWFTPRDVLHIPGM--------------MLPGD---------FVGCSPISYARESIGLALAAQ 210 (457) T ss_pred EEEcc-C--Cc--eeEEEeeCccceEEecCC--------------CCCCc---------eecccHHHHHHHHHHHHHHHH Confidence 22211 1 11 122234556666555321 11111 146677766666665544444 Q ss_pred HHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCCce Q lcl|NC_019916. 248 SDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSADA 326 (513) Q Consensus 248 S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 326 (513) .-..+.+...+.|-.+++-.... .... ...++..-........ .++++.+ ..+.++ T Consensus 211 ~~~~~~f~ng~~p~gil~~~~~l----------s~e~----~~~~~~~~~~~~~G~~nag~~~vl---------~~g~~~ 267 (457) T protein:vir:62 211 KYGAHFFRNGAMPGAVVEVPGTM----------SEEG----LARAREAWRAANSGVDNAHRVALL---------TEGAKF 267 (457) T ss_pred HHHHHHHhccCCcceEEEcCCCC----------CHHH----HHHHHHHHHHHhcCccccCcceec---------CCCceE Confidence 44444444444455444432110 0000 1111110000000000 1122222 223444 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 327 NYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIE 405 (513) Q Consensus 327 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l 405 (513) ..++.......+....+..+..|+..-++|+...+...+ +.++..++-..... +..+|.-.++.+...+ T Consensus 268 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f----------~~~~l~P~~~~ie~~l 337 (457) T protein:vir:62 268 SKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAF----------TMFSLRPWLERIEAGF 337 (457) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHH----------HHHHHHHHHHHHHHHH Confidence 455444444556677778889999999999866543332 22232232221111 1122222222222222 Q ss_pred Hhc-ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCC--CCCCH--HHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 406 ERV-NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLP--NVTDA--DEIVKMMDKQRKAMLKTY 478 (513) Q Consensus 406 ~~~-~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~--~v~D~--~~E~~ri~~E~~~~~~~~ 478 (513) ... -.........+++.+..-+-.|..+.++++.++ +|+++.-.+.++++ -+++. +.-+....--........ T Consensus 338 n~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~ 417 (457) T protein:vir:62 338 NRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEP 417 (457) T ss_pred HhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccc Confidence 211 011111223445555566667899999998886 67899877777754 33332 111111100000000000 Q ss_pred hhhcCCC---CCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 479 DTKGGLI---INGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 479 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .+.+... ....++.++ .+..+.++.|++++++. T Consensus 418 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~d~~~~~~ 453 (457) T protein:vir:62 418 EPAPAPPAIDPPAEEPADD--EEPDNAEGDPDEGETED 453 (457) T ss_pred cccCCCccCCCCccCCCCC--CCCCCCCCCCccccccc Confidence 0001000 101111111 11122233333333222 No 120 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.53 E-value=3.6e-07 Score=55.94 Aligned_cols=429 Identities=9% Similarity=0.052 Sum_probs=202.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc---ccccc-CCCCCCcceeecchhHHHHHHHHHHhhcC--C-- Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILS---PASRR-NEKGKADHRAVHSFARYIADFQTSYSVGN--A-- 89 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~---~~~~~-~~~~~~~~ri~~n~~~~ivd~~~~~l~g~--p-- 89 (513) ++.+.|.+-.+.....|.+....+++||+=-.+.... ..... ....+.+.++..+-+...+++.++.|++. | T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 7888888877777777755444444443321111100 00000 00113456777888999999999888753 2 Q ss_pred --e-eecCCc-----H-----------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcc-cce Q lcl|NC_019916. 90 --I-AMSGPS-----S-----------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDP-MEC 149 (513) Q Consensus 90 --~-~~~~~~-----~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p-~~~ 149 (513) + ++...+ . ..+...+...+|.....++.++..++|.|.+++-.+++....+.+..-| .+. T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~ 160 (547) T protein:vir:10 81 TKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDS 160 (547) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceE Confidence 1 222211 1 1233445567899999999999999999977766555433333333334 444 Q ss_pred EEEecCCCCcceEEEEEEEeeccc--------c-----------cccceeEEEEEEEc----C--Cc------------- Q lcl|NC_019916. 150 FIIYDRSVNPKPIMAVRYHAVQTV--------V-----------DNITQTKYEVETWT----E--ND------------- 191 (513) Q Consensus 150 ~~~~d~~~~~~~~~~ir~~~~~~~--------~-----------~~~~~~~~~ve~yt----~--~~------------- 191 (513) ++.-|. .+++...+|.++.... + .........+++|+ . .. T Consensus 161 ~v~~d~--~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~ 238 (547) T protein:vir:10 161 YFEEDS--RGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTE 238 (547) T ss_pred EEeeCC--CcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccc Confidence 444443 3456666654433110 0 00000011122221 1 00 Q ss_pred ----EEEEEeeccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_019916. 192 ----YTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAML 262 (513) Q Consensus 192 ----~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l 262 (513) .+++. ..+... ...+..|..+|++.++ ++.+|+|-.++..+-+..+|.+.-..+...+...+|.+ T Consensus 239 ~p~~s~~~e-~~~~~~-----~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 312 (547) T protein:vir:10 239 RPFGKKWIL-KEGAVQ-----LGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAI 312 (547) T ss_pred cceeEEEEE-ecCcee-----eeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 01111 111000 0122345668887765 34679999999999999999998889999999999887 Q ss_pred heecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHH Q lcl|NC_019916. 263 VIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYK 342 (513) Q Consensus 263 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 342 (513) .+--.+... + ++ ...++.+. .+...+++.+....+.......+ T Consensus 313 ~v~~~g~~~--~-------------------------~~-~~pgg~~~---------~~~~~~v~pl~~~~~~~~~~~~i 355 (547) T protein:vir:10 313 MVTERGLIS--D-------------------------ID-LGASGLTV---------VRDMESMKPFESRARFDVSSIQL 355 (547) T ss_pred ecccccccc--c-------------------------ce-ecCCeeee---------cCCcccceeeecccchHHHHHHH Confidence 543110000 0 00 00011111 12344566666666777777888 Q ss_pred HHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHhccccccc Q lcl|NC_019916. 343 KRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERG--------LNQRYTVVAHIEERVNGKWDI 414 (513) Q Consensus 343 ~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~--------l~~~~~li~~~l~~~~~~~~~ 414 (513) +.++..|-..-....+... -+...++.-++.. +++++..++.. +.-+++-++.++...+.-... T Consensus 356 ~~~~~rI~~af~~d~~~~~-~~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~ 427 (547) T protein:vir:10 356 TDLRSAVRRIYYVDQLQMK-DSPAMTATEVQVR-------YELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGEL 427 (547) T ss_pred HHHHHHHHHHhhhhhhhcC-CCccccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC Confidence 8888877543221111111 1233455544433 23333334433 333333344445443321111 Q ss_pred -------ccceeeEEeCCCCCcCHH--------HHHHHHHHHhcC-------CCHHHHHHhC---CCCC----CHHHHHH Q lcl|NC_019916. 415 -------DPDEIGFIFRDNLPTDDV--------AIITALVQAGAQ-------IPQEYLYQYL---PNVT----DADEIVK 465 (513) Q Consensus 415 -------~~~~i~i~f~~~~p~d~~--------e~a~~~~kl~g~-------iS~et~~~~l---~~v~----D~~~E~~ 465 (513) ....++|++..++-+... ..++.+..++++ +....++..+ -+|+ -.++|++ T Consensus 428 p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~ 507 (547) T protein:vir:10 428 PSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVT 507 (547) T ss_pred chhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHH Confidence 234566777655554311 111222223332 2223333222 1232 1357777 Q ss_pred HHHHHHHHHHHHhh------hhcCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 466 MMDKQRKAMLKTYD------TKGGLIINGTSGNDPEDEGVRGQQG 504 (513) Q Consensus 466 ri~~E~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (513) .+.+++++.++... ..+..+.....++ .+=.++. T Consensus 508 ~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~-----a~~~~~~ 547 (547) T protein:vir:10 508 SIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQ-----AALKENQ 547 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc-----cchhccC Confidence 66666554333221 1222222211111 1100000 No 121 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.51 E-value=4.1e-07 Score=55.58 Aligned_cols=455 Identities=11% Similarity=0.063 Sum_probs=165.1 Q ss_pred Cccchhh----ceeccCCcc--c--CCHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCc-cee Q lcl|NC_019916. 1 MIDMQQA----NMNYQEDAD--K--LTPTRI-AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KAD-HRA 69 (513) Q Consensus 1 ~~~~~~~----~~~~~~~~~--~--~~~~~i-~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~-~ri 69 (513) .-.++++ ...+.++.. . ++-..+ ...+++..+....-+..-.-+....+.....++..++... ... ... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~ 83 (547) T protein:vir:63 4 FESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKF 83 (547) T ss_pred hhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHHHHh Confidence 1112222 222222211 0 111111 1112222111111111100111111111100000000000 000 011 Q ss_pred -ecchhHHHHHHHHHHhhc-----------C--CeeecC-------CcH---HHHHHHHHh-c--------CHHHHHHHH Q lcl|NC_019916. 70 -VHSFARYIADFQTSYSVG-----------N--AIAMSG-------PSS---DRLDDFNRR-N--------DIDTLNYEL 116 (513) Q Consensus 70 -~~n~~~~ivd~~~~~l~g-----------~--p~~~~~-------~~~---~~l~~~~~~-n--------~~~~~~~~~ 116 (513) ..++.+.+|+..+..+.+ - .+++.. .+. ..+.+++.. | .+......+ T Consensus 84 ~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~l 163 (547) T protein:vir:63 84 GGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKI 163 (547) T ss_pred hcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHH Confidence 124455555555443321 1 122211 111 134455443 1 133556667 Q ss_pred HHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEE Q lcl|NC_019916. 117 YLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYK 196 (513) Q Consensus 117 ~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~ 196 (513) ..+.+.+|.+|+.+..+.+|.+.-.+.++|..+.++.++... .....++|+.... + . ....+..+.+++++ T Consensus 164 v~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~~--~---~---~~~~~~~~eiih~r 234 (547) T protein:vir:63 164 VRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGK-IPDNGNRFVQVID--Q---K---IVATFNAREMAFAV 234 (547) T ss_pred HHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccc-cccCceEEEEEcC--C---c---EEEEeccccEEEec Confidence 889999999999998898887765667899888777665431 1111223322211 0 0 01123444444433 Q ss_pred eeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhe--ecCccccccc Q lcl|NC_019916. 197 PIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVI--KGDIDTLFDD 274 (513) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~--~G~~~~~~~~ 274 (513) .... ..+ .....|.|.++.+...+.....+..-....+...+.|--+| .|... T Consensus 235 ~n~~---------------~~~-----~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~----- 289 (547) T protein:vir:63 235 RNPR---------------SDI-----YATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQ----- 289 (547) T ss_pred ccCC---------------CCc-----ccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCC----- Confidence 2110 000 00114677776666665544444333333344333343222 22110 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSH 354 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 354 (513) .. .+....++..=...+.....++.+ +.....+.++.-++.......+....+...+.|+..-+ T Consensus 290 -----ls----~e~~~~lk~~~~~~~~G~~nagk~-------~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afg 353 (547) T protein:vir:63 290 -----QS----QHALEIFKREWKNSLSGINGSWQI-------PVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYG 353 (547) T ss_pred -----CC----HHHHHHHHHHHHHHhcCccccccc-------ccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhC Confidence 00 000011111000000000111111 11112223333333334445566777888899999999 Q ss_pred ccccccccccccc----cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcC Q lcl|NC_019916. 355 TPDLTDDNFSGNS----SGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTD 430 (513) Q Consensus 355 ~p~~~~~~~~~n~----Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d 430 (513) +|++..+-...+. ++..+-. +.+. ......+...|.-+++.+...+...=- ..+. ..+.+.|......+ T Consensus 354 VPP~~lG~~~~~~~~~~~~~s~t~--sn~e---~~~~~~~~~tL~P~~~~ie~~ln~~L~-~~~~-~~~~~~f~~~~~~~ 426 (547) T protein:vir:63 354 IDPAEINIPNNGGATGSKGGSLNE--GNSA---EKNQASKNKGLQPLLGFIEDFINKHIV-AEFG-DKYTFQFVGGDIKS 426 (547) T ss_pred CCHHHcCcccccccccccccccch--hhHH---HHHHHHHHHHHHHHHHHHHHHHHhhcc-cccC-CceEEEeecccccc Confidence 9987654221110 1111110 0000 111223344444444444433332111 1111 34678888888888 Q ss_pred HHHHHHHHHH-HhcCCCHHHHHHhCCC---CCCHHHHH-----HHH----HHHH---HHHHHHhhhhcCCCCCCCCCCCC Q lcl|NC_019916. 431 DVAIITALVQ-AGAQIPQEYLYQYLPN---VTDADEIV-----KMM----DKQR---KAMLKTYDTKGGLIINGTSGNDP 494 (513) Q Consensus 431 ~~e~a~~~~k-l~g~iS~et~~~~l~~---v~D~~~E~-----~ri----~~E~---~~~~~~~~~~~~~~~~~~~~~~~ 494 (513) .++.+..... .+|+++.-.+.++++. ++.-+.-+ ..+ .+++ +......+....... ...+.+. T Consensus 427 ~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 505 (547) T protein:vir:63 427 ELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTG-NRVSTDV 505 (547) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccC-CCCCCCC Confidence 8777664433 2588888777766643 22111000 000 0000 000011111111000 0010111 Q ss_pred CCCC---CCCC-CCCCCCccCCC Q lcl|NC_019916. 495 EDEG---VRGQ-QGEPEDERTSD 513 (513) Q Consensus 495 ~~~~---~~~~-~~~~~~~~~~~ 513 (513) ++.+ ...+ .++++..++++ T Consensus 506 ~~~~~~~~~~~~~~~d~~~~~~~ 528 (547) T protein:vir:63 506 EDIPDGKDTTGDIGKDGQRKDKD 528 (547) T ss_pred CCCCCCcccCCCcCccccccCcc Confidence 1100 0111 11112222222 No 122 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.37 E-value=9.8e-07 Score=53.53 Aligned_cols=425 Identities=10% Similarity=0.011 Sum_probs=163.3 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHhcCCCcccccc--ccccCCCCC---CcceeecchhHHHHHHHHHHhhcCCeeecC--- Q lcl|NC_019916. 24 AAFIRHHYNNQR-PRLEMLYDYYRGQNDGILSP--ASRRNEKGK---ADHRAVHSFARYIADFQTSYSVGNAIAMSG--- 94 (513) Q Consensus 24 ~~~i~~~~~~~~-~~~~~~~~YY~G~~~i~~~~--~~~~~~~~~---~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~--- 94 (513) .-+++..+.... +....... .+..+.-... .......+. +..-+.+.-...+|+..+.-+-+-|+++-. T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~ 78 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEA--RAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRG 78 (457) T ss_pred Cchhhhhhccccccccccccc--ccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 111111110000 00000000 0000000000 000000000 000111222334667777777777877521 Q ss_pred Cc-----HHHHHHHHH-hcC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEE Q lcl|NC_019916. 95 PS-----SDRLDDFNR-RND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAV 165 (513) Q Consensus 95 ~~-----~~~l~~~~~-~n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~i 165 (513) +. ...+..++. .++ .......+..+.+.+|.||+.+-.+ +|.+.-.+.++|..+.+..+....... ... T Consensus 79 ~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~-~~~ 156 (457) T protein:vir:13 79 GSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRR-KVF 156 (457) T ss_pred CcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccc-eeE Confidence 11 112333332 222 2345666778899999999888544 566554556788877665443221111 111 Q ss_pred EEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHH Q lcl|NC_019916. 166 RYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDV 245 (513) Q Consensus 166 r~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~ 245 (513) +.|.... . ........|.++.+++++.. ++.+ ...|.|.++.+...|+.... T Consensus 157 ~~y~~~~-~----~~~~~~~~~~~~diih~~~~--------------~~~~---------~~~G~s~i~~~~~~i~~~~~ 208 (457) T protein:vir:13 157 EAYDIDA-D----GNEVLLGWFTPRDVLHIPGM--------------MLPG---------DFVGCSPISYARESIGLALA 208 (457) T ss_pred EEEEEec-C----CceeeEEeeCccceEEecCC--------------CCCC---------ccccccHHHHHHHHHHHHHH Confidence 1222111 0 01122334556666554321 1111 12477777666666655444 Q ss_pred HHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCC Q lcl|NC_019916. 246 AQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSA 324 (513) Q Consensus 246 ~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 324 (513) +..-..+.+...+.|-.+++-.... .. +....++..-........ .++++.+ ..+. T Consensus 209 ~~~~~~~~f~ng~~p~gil~~~~~l----------s~----e~~~~~~~~~~~~~~g~~nag~~~vl---------~~g~ 265 (457) T protein:vir:13 209 AQKYGSKFFANGAMPGAVVEVPGTM----------SE----EGLARAREAWRAANSGVDNAHRVALL---------TEGA 265 (457) T ss_pred HHHHHHHHHhcCCCcceEEEcCCCC----------CH----HHHHHHHHHHHHHhcCccccCcceec---------CCCc Confidence 4444444444445555555432110 00 001111110000010000 1222332 2334 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 325 DANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN-SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAH 403 (513) Q Consensus 325 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~ 403 (513) +++.++.......+....+...+.|+..-++|++..+...++ .++..++-+.... +...|...++.+.. T Consensus 266 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f----------~~~tl~P~~~~ie~ 335 (457) T protein:vir:13 266 KFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAF----------TMFSLRPWLERIEA 335 (457) T ss_pred eEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHH----------HHHHHHHHHHHHHH Confidence 555554444444566677788889999999998665433322 2222222221111 12222222222222 Q ss_pred HHHhc-ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCH--HHHHHHHHHHHHHHHH Q lcl|NC_019916. 404 IEERV-NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDA--DEIVKMMDKQRKAMLK 476 (513) Q Consensus 404 ~l~~~-~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~--~~E~~ri~~E~~~~~~ 476 (513) -+... -.........+++.+..-+-.|..+.++++.++ +|+++.-.+.++++. +++. +.-+....-....... T Consensus 336 ~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~ 415 (457) T protein:vir:13 336 GFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGEEP 415 (457) T ss_pred HHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccccccccccc Confidence 22211 111111223455666676777899999988886 688887766666533 2332 1111000000000000 Q ss_pred HhhhhcCCCCCCCCC----CCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 477 TYDTKGGLIINGTSG----NDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 477 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .....+.....++.. .+.+.++.++.++..++.+++| T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~ 456 (457) T protein:vir:13 416 EPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDD 456 (457) T ss_pred cccccCCCCCCCCCccccCCCCCCCCCCccccCCCCccccc Confidence 000000000000000 0111111111111112222222 No 123 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.28 E-value=1.7e-06 Score=52.20 Aligned_cols=445 Identities=11% Similarity=0.057 Sum_probs=198.6 Q ss_pred CC-cccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC Q lcl|NC_019916. 13 ED-ADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN 88 (513) Q Consensus 13 ~~-~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~ 88 (513) |. .+.+..+.+.+..+...++|.+ +.+.+.+|..-. ....... .......++..+-....++..++.|++. T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~---~~~~~~~--~~~~~~~~~~dst~~~a~~~Las~l~~~ 75 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPS---LFPKESD--NSSTEYTTPWQAVGARCLNNLAAKLMLA 75 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCCC--cccccccccccccHHHHHHHHHHHHHhh Confidence 33 3456777888877776666644 455555554331 1111111 1112233466777788888888877652 Q ss_pred --C----eeecCCc------------HHH-----------HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCcee Q lcl|NC_019916. 89 --A----IAMSGPS------------SDR-----------LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGE 139 (513) Q Consensus 89 --p----~~~~~~~------------~~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~ 139 (513) | +++...+ ... +...+..++|.....++.++..++|.|.+++-.+..+.+. T Consensus 76 ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~ 155 (522) T protein:vir:94 76 LFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYS 155 (522) T ss_pred cCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCcee Confidence 2 1222110 111 2233445789999999999999999988766555444432 Q ss_pred EEEEEcccceEEEecCCCCcceEEEEEEEeecccc----------cccceeEEEEEEEc-----CCcEEEEEeeccCCcc Q lcl|NC_019916. 140 VSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV----------DNITQTKYEVETWT-----ENDYTRYKPIVVAGSV 204 (513) Q Consensus 140 ~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~----------~~~~~~~~~ve~yt-----~~~~~~~~~~~~~~~~ 204 (513) . +..-|..-+++--+. ..++...+|.++..... ....+....+++|+ .++..++.... +.. T Consensus 156 ~-~~~~pl~~y~v~~d~-~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~-g~~- 231 (522) T protein:vir:94 156 P-MRMYRLVSYVVQRDA-FGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVE-GIE- 231 (522) T ss_pred e-EEEEEcceEEEeeCC-CcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeecc-Cce- Confidence 2 233444445554443 34565656554331100 00001122344443 23333332211 111 Q ss_pred ccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccc Q lcl|NC_019916. 205 PTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQ 279 (513) Q Consensus 205 ~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~ 279 (513) .. ...-..++..+|++.++- +.+|+|-.++..+-+..+|.+.-......+...+|.+.+.-.+.... T Consensus 232 ~~-~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~------- 303 (522) T protein:vir:94 232 VT-GTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQP------- 303 (522) T ss_pred ec-ccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccc------- Confidence 11 111123567789877653 46799999999999999999999999999999998866531100000 Q ss_pred cccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_019916. 280 MVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKFSHTPD 357 (513) Q Consensus 280 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~ 357 (513) . .......+... .+..++++.+. ...+.......++.++..|...-..-. T Consensus 304 ---------------------~-----~~~~~~~g~~v--~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~ 355 (522) T protein:vir:94 304 ---------------------R-----RLNKAATGEFV--AGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNS 355 (522) T ss_pred ---------------------h-----heeccCCceee--cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh Confidence 0 00000011111 12223333332 333555667777777777654322111 Q ss_pred cccccccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcC-HHHHH Q lcl|NC_019916. 358 LTDDNFSGNSSGVAMKYKVLG-TVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTD-DVAII 435 (513) Q Consensus 358 ~~~~~~~~n~Sg~Ai~~~~~~-l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d-~~e~a 435 (513) +... -+.+.++.-++.+-.. +...-....+.-.+.+.-+++.++.++...+.-.......+++++.-++..- ..+-+ T Consensus 356 ~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~ 434 (522) T protein:vir:94 356 AVQR-NAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDL 434 (522) T ss_pred hccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHH Confidence 1111 1233455544332211 1111122222223333344444455554444333344445677765544431 11111 Q ss_pred HHHHH----HhcC--------CCHHH----HHHhCCC-CCC---HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCC Q lcl|NC_019916. 436 TALVQ----AGAQ--------IPQEY----LYQYLPN-VTD---ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPE 495 (513) Q Consensus 436 ~~~~k----l~g~--------iS~et----~~~~l~~-v~D---~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (513) +.+.. ++++ +.... +...++. ... .++|++.+.+++.+........... ..+.....+. T Consensus 435 ~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~-~~~~~a~~~~ 513 (522) T protein:vir:94 435 EKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAA-GANMGAAVGQ 513 (522) T ss_pred HHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhhhhhc Confidence 11111 1111 22222 2233322 111 2556665555443322221111111 1111111111 Q ss_pred CCCCCCCCC Q lcl|NC_019916. 496 DEGVRGQQG 504 (513) Q Consensus 496 ~~~~~~~~~ 504 (513) .-...-+.+ T Consensus 514 ~~~~~~~~~ 522 (522) T protein:vir:94 514 GAGEDMAQA 522 (522) T ss_pred ccchhhhcC Confidence 111111111 No 124 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.26 E-value=2e-06 Score=51.89 Aligned_cols=451 Identities=12% Similarity=0.087 Sum_probs=174.3 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCc-cee-ecchhHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KAD-HRA-VHSFARYI 77 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~-~ri-~~n~~~~i 77 (513) ....++-- .+..+..+..++.|.+........ ..+-..--..+.+.+.. + +..++... ... ..+ ..+..+.+ T Consensus 22 ~~~~~~~~-~~~~~~~~~~~~~~~k~~~~~~~a-~~~~~~~~~~~~~~~~~--r-~~~~~~~~l~~~~~~~~~npiv~~~ 96 (551) T protein:vir:80 22 VKHIEVDD-NYSIAIQQREQEQISKAMNNKEVA-YSQPVIGSMSANPGFKT--K-PSIRNNQDLHGVLKKFGGNIILNAI 96 (551) T ss_pred cccccccc-ceeeecccccHHHHHHhhccCcce-eecccccceecCccccc--C-ccccChhHHHHHHHHhhcCHHHHHH Confidence 22222211 222555667788888876642111 01111000111111110 0 00000000 000 011 12444555 Q ss_pred HHHHHHHhh-----------cCCeeecCC---------cH---HHHHHHHHh-c--------CHHHHHHHHHHHHhhCCe Q lcl|NC_019916. 78 ADFQTSYSV-----------GNAIAMSGP---------SS---DRLDDFNRR-N--------DIDTLNYELYLDMTVTGR 125 (513) Q Consensus 78 vd~~~~~l~-----------g~p~~~~~~---------~~---~~l~~~~~~-n--------~~~~~~~~~~~~a~~~G~ 125 (513) |+..+..+. |.++.+... +. ..+.+++.. | .+......+..+.+.+|. T Consensus 97 I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gn 176 (551) T protein:vir:80 97 INTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQ 176 (551) T ss_pred HHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCC Confidence 555544332 122222111 11 134555443 1 123455667888999999 Q ss_pred EEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccc Q lcl|NC_019916. 126 AYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVP 205 (513) Q Consensus 126 ~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~ 205 (513) ||+.+..+.+|.+.-.+.++|..+.++.++... .....++|+.... + . ....|..+.+++++..... T Consensus 177 ay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~-~~~~~~~y~~~~~--g---~---~~~~~~~~eiiH~~~n~~~---- 243 (551) T protein:vir:80 177 VNFEKVFNRNQSMVRFVAKDPTTIFFATTADGK-IPDNGNRFVQVID--Q---K---IVATFNAREMAFAVRNPRS---- 243 (551) T ss_pred EEEEEEECCCCcEEEEEEeCCceeEEEECCccc-cccCceEEEEEeC--C---c---EEEEEcccceEEecccCCC---- Confidence 999988898888766677899988887765431 1111223332211 0 0 0112344444443321000 Q ss_pred cccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhe--ecCcccccccccccccccc Q lcl|NC_019916. 206 TLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVI--KGDIDTLFDDSTLLQMVDP 283 (513) Q Consensus 206 ~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~--~G~~~~~~~~~~~~~~~~~ 283 (513) +.. ....|.|.++.+...++....+..-..+.+...+.|-.+| .|... ... T Consensus 244 -------~~~---------~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~----------lt~- 296 (551) T protein:vir:80 244 -------DIY---------ATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQ----------QSQ- 296 (551) T ss_pred -------Ccc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCC----------CCH- Confidence 000 0114667666666666554444444444444444444333 22110 000 Q ss_pred hhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccc Q lcl|NC_019916. 284 SDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNF 363 (513) Q Consensus 284 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 363 (513) +....++..=...+.....++.+ +...+.+.+++-++.......+....+...+.|+..-++|++..+-. T Consensus 297 ---e~~~~lk~~~~~~~~G~~nag~~-------~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~ 366 (551) T protein:vir:80 297 ---HALEIFKREWKNSLSGINGSWQI-------PVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIP 366 (551) T ss_pred ---HHHHHHHHHHHHHhcCccccCcc-------ccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcc Confidence 00011111000000000111111 11112223334443334445566778888899999999998665421 Q ss_pred cccc----cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHH Q lcl|NC_019916. 364 SGNS----SGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALV 439 (513) Q Consensus 364 ~~n~----Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~ 439 (513) ..+. ++..+-. +... ......+...|.-+++.+...+...=. ..+ ...+.+.|......+.++.+.... T Consensus 367 ~~~~~~~~~~~s~t~--sn~e---~~~~~f~~~tL~P~~~~ie~~ln~~L~-~~~-~~~~~f~f~~~~~~~~~~~~~~~~ 439 (551) T protein:vir:80 367 NNGGATGSKGGSLNE--GNSA---EKNQASKNKGLQPLLGFIEDFINKHIV-AEF-GDKYTFQFVGGDIKSELESVKILA 439 (551) T ss_pred cccccccccccccch--hhHH---HHHHHHHHHHHHHHHHHHHHHHHhhhc-ccc-CCceEEEeeccChhhHHHHHHHHH Confidence 1110 0111100 0000 111123333444444444333322110 111 235678888777777777766443 Q ss_pred H-HhcCCCHHHHHHhCCC---CCCHHHHH---------HHHHH---HHHHHHHHhhhhcCCCCCC--CCCCC-CCCCCCC Q lcl|NC_019916. 440 Q-AGAQIPQEYLYQYLPN---VTDADEIV---------KMMDK---QRKAMLKTYDTKGGLIING--TSGND-PEDEGVR 500 (513) Q Consensus 440 k-l~g~iS~et~~~~l~~---v~D~~~E~---------~ri~~---E~~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~ 500 (513) . .+|+++.-.+.++++. ++.-+.-+ ....+ +.+......+......... +.+.+ ..+.+.. T Consensus 440 ~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 519 (551) T protein:vir:80 440 EKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTT 519 (551) T ss_pred HHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccC Confidence 3 2588888777777643 22111000 00111 1111111111111110000 00000 0000011 Q ss_pred CCCCCCCCccCCC Q lcl|NC_019916. 501 GQQGEPEDERTSD 513 (513) Q Consensus 501 ~~~~~~~~~~~~~ 513 (513) ...++++..++++ T Consensus 520 ~~~~~~~~~~~~~ 532 (551) T protein:vir:80 520 GDIGKDGQRKDKD 532 (551) T ss_pred CCccccccccCcc Confidence 1111112222222 No 125 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.26 E-value=2e-06 Score=51.89 Aligned_cols=451 Identities=11% Similarity=0.082 Sum_probs=154.8 Q ss_pred Cccchhhcee---ccCCc--------cc------------CCHHHHHHHHHHHHHHHHHHHHHH----HHHhcCCCcccc Q lcl|NC_019916. 1 MIDMQQANMN---YQEDA--------DK------------LTPTRIAAFIRHHYNNQRPRLEML----YDYYRGQNDGIL 53 (513) Q Consensus 1 ~~~~~~~~~~---~~~~~--------~~------------~~~~~i~~~i~~~~~~~~~~~~~~----~~YY~G~~~i~~ 53 (513) |.++=+.... |-.++ ++ ...++|.+.++........-+..+ ..||.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~-- 77 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRS-Y-- 77 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccccc-C-- Confidence 4444333222 22111 11 111222222211100000001111 01111100 0 Q ss_pred ccccccCCCC-C-Ccceee-cchhHHHHHHHHHHhh-------------cCCeeecCC-----cH-----HHHHHHHHh- Q lcl|NC_019916. 54 SPASRRNEKG-K-ADHRAV-HSFARYIADFQTSYSV-------------GNAIAMSGP-----SS-----DRLDDFNRR- 106 (513) Q Consensus 54 ~~~~~~~~~~-~-~~~ri~-~n~~~~ivd~~~~~l~-------------g~p~~~~~~-----~~-----~~l~~~~~~- 106 (513) ..+... . ..+.+. ....+.+|++.+.... |-++++... .. ..+..++.. T Consensus 78 ----~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~ 153 (563) T protein:vir:95 78 ----MKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNT 153 (563) T ss_pred ----CCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhc Confidence 000000 0 001111 2344444544443322 123333111 11 123333321 Q ss_pred -----c---CHHHHHHHHHHHHhhCCeEEEEee--ecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccc Q lcl|NC_019916. 107 -----N---DIDTLNYELYLDMTVTGRAYEYVY--RDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDN 176 (513) Q Consensus 107 -----n---~~~~~~~~~~~~a~~~G~~~~~v~--~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~ 176 (513) . .+......+..+.+.+|.||+++. .+..|.+.-.+.++|..+.+..++... ......+|+.... + T Consensus 154 ~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~--g- 229 (563) T protein:vir:95 154 GKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD--K- 229 (563) T ss_pred CCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC--C- Confidence 1 234566678899999999998765 444555555566889988887765431 1111222222111 0 Q ss_pred cceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 177 ITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTD 256 (513) Q Consensus 177 ~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~ 256 (513) ..+..+....++++..... .+.. ....|.|.++.+...+.....+..-..+.+.. T Consensus 230 -----~~~~~~~~~evI~~~~~~~-----------~d~~---------~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~n 284 (563) T protein:vir:95 230 -----RVVASFTSRELAMGIRNPR-----------TELS---------SSGYGLSEVEIAMKEFIAYNNTESFNDRFFSH 284 (563) T ss_pred -----ceeEEecCcceEEEeccCC-----------CCcc---------cCcccchHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 0111233333332211000 0000 01246777766666555444444444444444 Q ss_pred hhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHH Q lcl|NC_019916. 257 LNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSA 336 (513) Q Consensus 257 ~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 336 (513) .+.|-.+++-..... .... ....++..=...+.....++.+ +.....+.++.-++.+.... T Consensus 285 g~~p~giL~~~~~~~--------ls~e----~~~~~~~~~~~~~~G~~nagk~-------~~vl~~G~~~~~l~~~~~d~ 345 (563) T protein:vir:95 285 GGTTRGILQIRSDQQ--------QSQH----ALENFKREWKSSLSGINGSWQI-------PVVMADDIKFVNMTPTANDM 345 (563) T ss_pred cCCCceEEEeCCCCC--------CCHH----HHHHHHHHHHHHhccccccccc-------eEEcCCCceEEeccCChhHH Confidence 444543333110000 0000 0001100000001000011100 11112333444444444555 Q ss_pred HHHHHHHHHHHHHHHHhCcccccccccc-c----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019916. 337 GTELYKKRLAADIHKFSHTPDLTDDNFS-G----NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGK 411 (513) Q Consensus 337 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~----n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~ 411 (513) .+....+...+.|+..-++|+...+-.. + ...|..+... .+ .......+...|..+++.+...+...=- T Consensus 346 qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~s--n~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~- 419 (563) T protein:vir:95 346 QFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEA--DP---GKKQQQSQNKGLQPLLRFIEDLVNRHII- 419 (563) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhc--cH---HHHHHHHHHHHHHHHHHHHHHHHHhhhc- Confidence 6778888899999999999986554221 1 1111111110 00 0111122333344443333333322100 Q ss_pred cccccceeeEEeCCCCCcCHHHHHHHHHH-HhcCCCHHHHHHhCCC--CCCHHHHH-----------HHHH-HHHHHHHH Q lcl|NC_019916. 412 WDIDPDEIGFIFRDNLPTDDVAIITALVQ-AGAQIPQEYLYQYLPN--VTDADEIV-----------KMMD-KQRKAMLK 476 (513) Q Consensus 412 ~~~~~~~i~i~f~~~~p~d~~e~a~~~~k-l~g~iS~et~~~~l~~--v~D~~~E~-----------~ri~-~E~~~~~~ 476 (513) ..+ ...+.+.|.+.-+.+..+..+.... .+|+++.-.+.+.++. +++-+.=+ ..-. .+.+.... T Consensus 420 ~~~-~~~~~~~f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (563) T protein:vir:95 420 SEY-GDKYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKE 498 (563) T ss_pred hhc-ccccEEEeccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccch Confidence 011 1245677877766655554443222 2578887666666533 22111000 0000 00000001 Q ss_pred HhhhhcCCCCCC-----CCCCCCCCCC--CCCCCCCCCCc------------------cCCC Q lcl|NC_019916. 477 TYDTKGGLIING-----TSGNDPEDEG--VRGQQGEPEDE------------------RTSD 513 (513) Q Consensus 477 ~~~~~~~~~~~~-----~~~~~~~~~~--~~~~~~~~~~~------------------~~~~ 513 (513) ..+........+ +.+.+.+.++ ..+.++.+.++ +++| T Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 560 (563) T protein:vir:95 499 RLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEKSSD 560 (563) T ss_pred hhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcCccc Confidence 111111000000 0000000000 00000011000 0000 No 126 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.26 E-value=2e-06 Score=51.89 Aligned_cols=451 Identities=11% Similarity=0.082 Sum_probs=154.8 Q ss_pred Cccchhhcee---ccCCc--------cc------------CCHHHHHHHHHHHHHHHHHHHHHH----HHHhcCCCcccc Q lcl|NC_019916. 1 MIDMQQANMN---YQEDA--------DK------------LTPTRIAAFIRHHYNNQRPRLEML----YDYYRGQNDGIL 53 (513) Q Consensus 1 ~~~~~~~~~~---~~~~~--------~~------------~~~~~i~~~i~~~~~~~~~~~~~~----~~YY~G~~~i~~ 53 (513) |.++=+.... |-.++ ++ ...++|.+.++........-+..+ ..||.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~-- 77 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRS-Y-- 77 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccccc-C-- Confidence 4444333222 22111 11 111222222211100000001111 01111100 0 Q ss_pred ccccccCCCC-C-Ccceee-cchhHHHHHHHHHHhh-------------cCCeeecCC-----cH-----HHHHHHHHh- Q lcl|NC_019916. 54 SPASRRNEKG-K-ADHRAV-HSFARYIADFQTSYSV-------------GNAIAMSGP-----SS-----DRLDDFNRR- 106 (513) Q Consensus 54 ~~~~~~~~~~-~-~~~ri~-~n~~~~ivd~~~~~l~-------------g~p~~~~~~-----~~-----~~l~~~~~~- 106 (513) ..+... . ..+.+. ....+.+|++.+.... |-++++... .. ..+..++.. T Consensus 78 ----~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~ 153 (563) T protein:vir:99 78 ----MKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNT 153 (563) T ss_pred ----CCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhc Confidence 000000 0 001111 2344444544443322 123333111 11 123333321 Q ss_pred -----c---CHHHHHHHHHHHHhhCCeEEEEee--ecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccc Q lcl|NC_019916. 107 -----N---DIDTLNYELYLDMTVTGRAYEYVY--RDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDN 176 (513) Q Consensus 107 -----n---~~~~~~~~~~~~a~~~G~~~~~v~--~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~ 176 (513) . .+......+..+.+.+|.||+++. .+..|.+.-.+.++|..+.+..++... ......+|+.... + T Consensus 154 ~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~--g- 229 (563) T protein:vir:99 154 GKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD--K- 229 (563) T ss_pred CCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC--C- Confidence 1 234566678899999999998765 444555555566889988887765431 1111222222111 0 Q ss_pred cceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 177 ITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTD 256 (513) Q Consensus 177 ~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~ 256 (513) ..+..+....++++..... .+.. ....|.|.++.+...+.....+..-..+.+.. T Consensus 230 -----~~~~~~~~~evI~~~~~~~-----------~d~~---------~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~n 284 (563) T protein:vir:99 230 -----RVVASFTSRELAMGIRNPR-----------TELS---------SSGYGLSEVEIAMKEFIAYNNTESFNDRFFSH 284 (563) T ss_pred -----ceeEEecCcceEEEeccCC-----------CCcc---------cCcccchHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 0111233333332211000 0000 01246777766666555444444444444444 Q ss_pred hhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHH Q lcl|NC_019916. 257 LNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSA 336 (513) Q Consensus 257 ~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 336 (513) .+.|-.+++-..... .... ....++..=...+.....++.+ +.....+.++.-++.+.... T Consensus 285 g~~p~giL~~~~~~~--------ls~e----~~~~~~~~~~~~~~G~~nagk~-------~~vl~~G~~~~~l~~~~~d~ 345 (563) T protein:vir:99 285 GGTTRGILQIRSDQQ--------QSQH----ALENFKREWKSSLSGINGSWQI-------PVVMADDIKFVNMTPTANDM 345 (563) T ss_pred cCCCceEEEeCCCCC--------CCHH----HHHHHHHHHHHHhccccccccc-------eEEcCCCceEEeccCChhHH Confidence 444543333110000 0000 0001100000001000011100 11112333444444444555 Q ss_pred HHHHHHHHHHHHHHHHhCcccccccccc-c----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019916. 337 GTELYKKRLAADIHKFSHTPDLTDDNFS-G----NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGK 411 (513) Q Consensus 337 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~----n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~ 411 (513) .+....+...+.|+..-++|+...+-.. + ...|..+... .+ .......+...|..+++.+...+...=- T Consensus 346 qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~s--n~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~- 419 (563) T protein:vir:99 346 QFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEA--DP---GKKQQQSQNKGLQPLLRFIEDLVNRHII- 419 (563) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhc--cH---HHHHHHHHHHHHHHHHHHHHHHHHhhhc- Confidence 6778888899999999999986554221 1 1111111110 00 0111122333344443333333322100 Q ss_pred cccccceeeEEeCCCCCcCHHHHHHHHHH-HhcCCCHHHHHHhCCC--CCCHHHHH-----------HHHH-HHHHHHHH Q lcl|NC_019916. 412 WDIDPDEIGFIFRDNLPTDDVAIITALVQ-AGAQIPQEYLYQYLPN--VTDADEIV-----------KMMD-KQRKAMLK 476 (513) Q Consensus 412 ~~~~~~~i~i~f~~~~p~d~~e~a~~~~k-l~g~iS~et~~~~l~~--v~D~~~E~-----------~ri~-~E~~~~~~ 476 (513) ..+ ...+.+.|.+.-+.+..+..+.... .+|+++.-.+.+.++. +++-+.=+ ..-. .+.+.... T Consensus 420 ~~~-~~~~~~~f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (563) T protein:vir:99 420 SEY-GDKYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKE 498 (563) T ss_pred hhc-ccccEEEeccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccch Confidence 011 1245677877766655554443222 2578887666666533 22111000 0000 00000001 Q ss_pred HhhhhcCCCCCC-----CCCCCCCCCC--CCCCCCCCCCc------------------cCCC Q lcl|NC_019916. 477 TYDTKGGLIING-----TSGNDPEDEG--VRGQQGEPEDE------------------RTSD 513 (513) Q Consensus 477 ~~~~~~~~~~~~-----~~~~~~~~~~--~~~~~~~~~~~------------------~~~~ 513 (513) ..+........+ +.+.+.+.++ ..+.++.+.++ +++| T Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 560 (563) T protein:vir:99 499 RLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEKSSD 560 (563) T ss_pred hhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcCccc Confidence 111111000000 0000000000 00000011000 0000 No 127 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.26 E-value=2e-06 Score=51.87 Aligned_cols=390 Identities=12% Similarity=0.047 Sum_probs=153.8 Q ss_pred cceee--cchhHHHHHHHHHHhhcCCeeecCC--------cH---HHHHHHHHh---c-----------CHHHHHHHHHH Q lcl|NC_019916. 66 DHRAV--HSFARYIADFQTSYSVGNAIAMSGP--------SS---DRLDDFNRR---N-----------DIDTLNYELYL 118 (513) Q Consensus 66 ~~ri~--~n~~~~ivd~~~~~l~g~p~~~~~~--------~~---~~l~~~~~~---n-----------~~~~~~~~~~~ 118 (513) .+.++ .++...+|+..++.+.|-|+.+... .. +.+..++.. | -+......+.. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 11222 5788889999999999999876311 11 122233321 2 12345567888 Q ss_pred HHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCc------- Q lcl|NC_019916. 119 DMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTEND------- 191 (513) Q Consensus 119 ~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~------- 191 (513) +.+.+|.||+.+..+..|.+.-.+.++|..+.+.-|... ++.... . ... ++.+|.... T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~---------~~~~~~--~---~~~-~~~~~~~~~~~~~~~~ 145 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERG---------FVQLLE--E---KEK-YFGVAGDRYQTNGNGD 145 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecce---------eEeecC--C---cee-eEEeccccceeecccc Confidence 999999999988888888766556678877766554321 110000 0 000 000111100 Q ss_pred -EEEEEeeccCCccccccccccccCcccceEEecCC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhe- Q lcl|NC_019916. 192 -YTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVI- 264 (513) Q Consensus 192 -~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~- 264 (513) ...+...... .......+..=-|++|+.. ..|.|.+.....-++....+..-....+...+.|-.++ T Consensus 146 ~~~~~~~~~~~------~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 219 (467) T protein:vir:31 146 LDPVFVDADDG------STGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAII 219 (467) T ss_pred eeeeeeeeccc------cccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 0000000000 0000011111124555433 24666665544444333322222222222222233222 Q ss_pred -ecCcccccccccccccccchhhhhhhcccccc----chhhhcchhcc-eeeccccccccccccCCceeEEee---cCCH Q lcl|NC_019916. 265 -KGDIDTLFDDSTLLQMVDPSDADAMKKLADEK----MAQLEAMRQAN-MILLKTGMAPNGQQTSADANYIHK---EYDS 335 (513) Q Consensus 265 -~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~----~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~---~~~~ 335 (513) +|.... +.. ............... .........++ ...+..+. .....+++|... .... T Consensus 220 ~~~~~l~---~e~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~----~~~~~~~~~~~ls~~~~~d 287 (467) T protein:vir:31 220 VKGAELT---EKG-----REEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGA----DRSDVEIRLEPLTVGIDEE 287 (467) T ss_pred ecCcCCC---HHH-----HHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCC----cccccceeEEeccccChhh Confidence 221100 000 000000000000000 00000000000 01110000 001112222221 1123 Q ss_pred HHHHHHHHHHHHHHHHHhCcccccccccc-ccc-c-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--cc Q lcl|NC_019916. 336 AGTELYKKRLAADIHKFSHTPDLTDDNFS-GNS-S-GVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV--NG 410 (513) Q Consensus 336 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~-S-g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~--~~ 410 (513) ..+....+...+.|+..-++|+...+... ++. | ..+.... .+..+++-+++.+...++.. .. T Consensus 288 ~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~-------------f~~~~l~P~~~~ie~~ln~~l~~~ 354 (467) T protein:vir:31 288 ASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKE-------------FAEETIQPKQHDFGELLYELVHKQ 354 (467) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHH-------------HHHHHHHHHHHHHHHHHHHhhcch Confidence 45667778888899999999976543221 221 1 2221111 11222333333333222211 11 Q ss_pred ccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCC Q lcl|NC_019916. 411 KWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIING 488 (513) Q Consensus 411 ~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 488 (513) ........+++.+...+..|..+.++++.++ .|+++.-.+.++++.-.-++.++. . ..........+ T Consensus 355 ~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~----------~-~~~~~~~~~~~ 423 (467) T protein:vir:31 355 GLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVY----------G-GETLVAEVTGG 423 (467) T ss_pred hhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccc----------C-Ccccccccccc Confidence 1111223466777788888999999988876 689999888888754221111110 0 00000000001 Q ss_pred CCCCCCCCCCCCCC--------------------CCCCCCccCC Q lcl|NC_019916. 489 TSGNDPEDEGVRGQ--------------------QGEPEDERTS 512 (513) Q Consensus 489 ~~~~~~~~~~~~~~--------------------~~~~~~~~~~ 512 (513) ..+.++.+++...+ .-+.+++.++ T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 424 SGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred cCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhhccccCC Confidence 11111111111000 0011111111 No 128 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.25 E-value=2e-06 Score=51.82 Aligned_cols=385 Identities=11% Similarity=0.065 Sum_probs=159.5 Q ss_pred HHHHHHHHHHHHHH---HHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHHHH Q lcl|NC_019916. 24 AAFIRHHYNNQRPR---LEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDRL 100 (513) Q Consensus 24 ~~~i~~~~~~~~~~---~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l 100 (513) ..+.+......... -..+-.++.+..... .. ....-+.+.-...+|+..+.-+-+-|++...+ .+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~v----~~~~al~~~~V~~~v~~ia~~ia~~p~~~~~~---~~ 68 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLTGGEAQK-----YV----SADTALKNSDIFSLIMQLSGDLAMVRYTSESD---RS 68 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhcCCcCCc-----ee----chHHhhccHHHHHHHHHHHHHHhhCccccccc---HH Confidence 11111100000000 001111111110000 00 00001222333445666665565667664322 23 Q ss_pred HHHHHh-c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccc Q lcl|NC_019916. 101 DDFNRR-N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDN 176 (513) Q Consensus 101 ~~~~~~-n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~ 176 (513) ..++.. | ........+..+.+.+|.||+.+-.+.+|.+.-.+.++|..+-+..+... ..+.+.+ ... ... T Consensus 69 ~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~-~~~~y~~---~~~--~~~ 142 (397) T protein:vir:38 69 QSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDG-SGLIYNI---NFD--EPA 142 (397) T ss_pred HHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceEEEEE---Eec--ccc Confidence 344432 2 23456677888999999999988888888766566788888877665433 1221111 110 000 Q ss_pred cceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 177 ITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTD 256 (513) Q Consensus 177 ~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~ 256 (513) .. ....+....+++++... + .+...|.|.+..+...++....+..-..+.+.. T Consensus 143 ~~----~~~~~~~~eiih~~~~~--------------~---------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~n 195 (397) T protein:vir:38 143 IG----YMENVPAADVIHIRLLS--------------K---------NGGKTGISPLSALINEQQIKDASNELTLKALKQ 195 (397) T ss_pred cc----ceeEecCccEEEecCCC--------------C---------CCccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 00 01123444444432110 0 011257777777777766555555555555555 Q ss_pred hhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHH Q lcl|NC_019916. 257 LNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSA 336 (513) Q Consensus 257 ~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 336 (513) .+.|-.+++-...... ... ..++..-.........++.+. ...+.++.-++...... T Consensus 196 g~~~~~il~~~~~~~~----------e~~----~~~~~~~~~~~~~~n~~~~~v---------l~~g~~~~~l~~~~~d~ 252 (397) T protein:vir:38 196 SVTASAVLTIQKGGLL----------DAE----TRIARSKEISKQIHNSDGPVV---------IDALEDYKPLEVKGNIA 252 (397) T ss_pred cCCccEEEEeCCCCCH----------HHH----HHHHHHHHHHhcccccCCcee---------cCCCceEEecCCChhHH Confidence 5555555543221100 000 000000000000000111122 22334444444444556 Q ss_pred HHHHHHHHHHHHHHHHhCccccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_019916. 337 GTELYKKRLAADIHKFSHTPDLTDDNFSG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDID 415 (513) Q Consensus 337 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~ 415 (513) .+....+...+.|+..-++|+...+...+ +.+..+.+ ..+..+++..++.+..-+...-- .+ T Consensus 253 ~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~ie~~ln~~l~-~~-- 315 (397) T protein:vir:38 253 SLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQIS--------------GQYAKSLNRYVQAIVGELNDKLH-AN-- 315 (397) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhcc-Ch-- Confidence 66778889999999999998765543222 21211111 12233444444444333322110 01 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCC Q lcl|NC_019916. 416 PDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSG 491 (513) Q Consensus 416 ~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 491 (513) +++.+...+-.|..+.++++.++ .|+++.-.+.+.++. +..- ++ ...+ .............+ T Consensus 316 ---~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~--d~--~~~~-------~~~~~~~~~~~~~~ 381 (397) T protein:vir:38 316 ---ISANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAK--DL--PDPE-------KEPQQAIQLIQQEG 381 (397) T ss_pred ---hcccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--cc--cccc-------cccccccccccccc Confidence 11222223345677788887776 578888777776542 2110 00 0000 00000111111111 Q ss_pred CCCCCCCCCCCCCCCC Q lcl|NC_019916. 492 NDPEDEGVRGQQGEPE 507 (513) Q Consensus 492 ~~~~~~~~~~~~~~~~ 507 (513) .+..+.+.++..++|+ T Consensus 382 g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 382 GENDGNNSDERGSDPE 397 (397) T ss_pred CCCCCCCCCCCCCCCC Confidence 1111222222222222 No 129 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=98.23 E-value=2.3e-06 Score=51.54 Aligned_cols=406 Identities=13% Similarity=0.055 Sum_probs=165.2 Q ss_pred HHHHHHHHHHHHHH-HHHHhcCCCcccccc---------ccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee-c-- Q lcl|NC_019916. 27 IRHHYNNQRPRLEM-LYDYYRGQNDGILSP---------ASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM-S-- 93 (513) Q Consensus 27 i~~~~~~~~~~~~~-~~~YY~G~~~i~~~~---------~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~-~-- 93 (513) +.+-+.+...+... +..| .|..-..... .........+..=+.+.-...+|+..++-+-+-|+.+ . T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~~ 79 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKW-LGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQTK 79 (437) T ss_pred CCcchhhhhhhhHHhhhhh-cCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEEc Confidence 11111111111111 1122 2221000000 0000000001111223334557777777777778764 1 Q ss_pred CC------cHHHHHHHHH-h-c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceE Q lcl|NC_019916. 94 GP------SSDRLDDFNR-R-N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPI 162 (513) Q Consensus 94 ~~------~~~~l~~~~~-~-n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~ 162 (513) .+ .+..+..++. . | ........+..+++.+|.||+++-.+ .|.+.-.+.++|..+.+..+.+. .+. T Consensus 80 ~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~g--~~~ 156 (437) T protein:vir:10 80 PDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTSG--ALQ 156 (437) T ss_pred CCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCCC--eEE Confidence 11 1223444432 2 3 23455667888999999999998877 46655456688888777665432 111 Q ss_pred EEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHH Q lcl|NC_019916. 163 MAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDL 242 (513) Q Consensus 163 ~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~ 242 (513) | ......+ ....+..+.+++++... + +...|.|.++.+...++. T Consensus 157 ----y-~~~~~~g-------~~~~~~~~dIih~r~~~---------------~---------d~~~G~spi~~~~~~i~~ 200 (437) T protein:vir:10 157 ----Y-TYRNVDG-------TVSTLAEDDVFHVRGFS---------------L---------DGLMGLTPIQYAREVLGN 200 (437) T ss_pred ----E-EEEecCc-------eEEEEccccEEEecCcC---------------C---------CCcccccHHHHHHHHHHH Confidence 1 1111111 01234455555442110 0 112466766666555554 Q ss_pred HHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccc Q lcl|NC_019916. 243 YDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQ 321 (513) Q Consensus 243 ~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 321 (513) ...+..-..+.+...+.|-.+++..... ...........+.. .+.... .++.+.+ . T Consensus 201 ~~~~~~~~~~~f~ng~~p~gil~~~~~l----------~~e~~~~~~~~~~~----~~~g~~nag~~~vl---------~ 257 (437) T protein:vir:10 201 STAANKTSASVFRNGLRPSGVLSTDQIL----------QKEKRAEIRTDLAE----QFGGAMQAGKTMVL---------E 257 (437) T ss_pred HHHHHHHHHHHHhccCCccEEEEcCCCC----------CHHHHHHHHHHHHH----HhcCccccCcceec---------c Confidence 4444444444444445555555432110 00111111111100 000000 0112222 2 Q ss_pred cCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 322 TSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTV 400 (513) Q Consensus 322 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~l 400 (513) .+.+++-++.......+....+...+.|+..-++|+...+...+ +..+..++.... ..+...|...+.. T Consensus 258 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~----------~f~~~tl~P~~~~ 327 (437) T protein:vir:10 258 AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTL----------GFLTFTLRPWLTR 327 (437) T ss_pred CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHH----------HHHHHHHHHHHHH Confidence 23334444433444556667778888999999999866543322 222222222111 1223333333333 Q ss_pred HHHHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHH Q lcl|NC_019916. 401 VAHIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAML 475 (513) Q Consensus 401 i~~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~ 475 (513) |...+...- .........+++.+...+..|..+.++++.++ +|+++.-.+.+.++. ++.-. +.-.+.. ... T Consensus 328 ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~-~~~~~~~---~~~ 403 (437) T protein:vir:10 328 IEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNA-AVLTVQS---ALL 403 (437) T ss_pred HHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCc-ceEeecC---ccc Confidence 333332211 11111112345555666777889999988876 578888777766643 22111 1100000 000 Q ss_pred HHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 476 KTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) ..+..+.........+...+.+.+++++..++|+ T Consensus 404 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 404 -PIDKLGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred -chhhccCcCCCcchhccccccCCCCCCCCccccC Confidence 0011111111111111111122222222233333 No 130 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.22 E-value=2.4e-06 Score=51.36 Aligned_cols=443 Identities=11% Similarity=0.068 Sum_probs=191.5 Q ss_pred CCc--ccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhc Q lcl|NC_019916. 13 EDA--DKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVG 87 (513) Q Consensus 13 ~~~--~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g 87 (513) |.. ..+..+.+.+..+...++|.+ +.+.+.+|..-. ...... ........++..+-....++..++.|++ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDS--DNASTDYQTPWQAVGARGLNNLASKLML 75 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCC--CcccccccccccccHHHHHHHHHHHHHh Confidence 332 245677888887776666644 455555554331 111111 1111233456677788888888887765 Q ss_pred C--Ce----eecCCc-----------------------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCce Q lcl|NC_019916. 88 N--AI----AMSGPS-----------------------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKG 138 (513) Q Consensus 88 ~--p~----~~~~~~-----------------------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~ 138 (513) . |. ++...+ +..+...+..++|.....++.++..++|.|.+++-.+.++.. T Consensus 76 ~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~ 155 (536) T protein:vir:10 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) T ss_pred hhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCce Confidence 2 31 122111 012333445678999999999999999988755533322222 Q ss_pred eEEEEEcccceEEEecCCCCcceEEEEEEEeeccc------------ccccceeEEEEEEEc-----C--CcEEEEEeec Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTV------------VDNITQTKYEVETWT-----E--NDYTRYKPIV 199 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~------------~~~~~~~~~~ve~yt-----~--~~~~~~~~~~ 199 (513) ..+..-|..-+.+--+. ..++...+|.++.... .....+....+++|+ + ..+.+|... T Consensus 156 -~~~~~~pl~~~~v~~d~-~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~- 232 (536) T protein:vir:10 156 -NPMKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEV- 232 (536) T ss_pred -eeEEEEEcCeEEEeeCC-CCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEee- Confidence 22334455555554443 3456666655433210 000111112233332 1 122222221 Q ss_pred cCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccc Q lcl|NC_019916. 200 VAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDD 274 (513) Q Consensus 200 ~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~ 274 (513) .+... ....-..+|..+|++.++- +.+|+|-.++..+-+-.+|.+.-...........+...+.=.+.. T Consensus 233 ~g~~v--~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~---- 306 (536) T protein:vir:10 233 EGMEV--QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT---- 306 (536) T ss_pred cCccc--cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccccc---- Confidence 11111 1111123567789877753 457999999999999888887666666666655554333100000 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSH 354 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 354 (513) + .. ......++....+...+.++..+....+.......++.++..|...-. T Consensus 307 ----------------~--------~~-----~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~ 357 (536) T protein:vir:10 307 ----------------Q--------PR-----RLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM 357 (536) T ss_pred ----------------c--------hh-----hhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHh Confidence 0 00 000000111111111122222223334555566777777776633221 Q ss_pred ccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHhcccccccccceeeEEeCCC Q lcl|NC_019916. 355 TPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERG--------LNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDN 426 (513) Q Consensus 355 ~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~--------l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~ 426 (513) .-.+..- -+...++.-++.+ +.++...+|.. +.-+++.++.++...+.-.......+++.+.-+ T Consensus 358 ~~~l~~~-~~~r~TAtEV~~r-------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~ 429 (536) T protein:vir:10 358 LNSAVQR-TGERVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTG 429 (536) T ss_pred hhhcccC-CCCCccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEec Confidence 1111111 1233455544433 23333333333 333444455555444333333333455665433 Q ss_pred CCcCHHHHHHHHHH-------HhcC--------CCHHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHh---hh Q lcl|NC_019916. 427 LPTDDVAIITALVQ-------AGAQ--------IPQEYLY----QYLPNVT----DADEIVKMMDKQRKAMLKTY---DT 480 (513) Q Consensus 427 ~p~d~~e~a~~~~k-------l~g~--------iS~et~~----~~l~~v~----D~~~E~~ri~~E~~~~~~~~---~~ 480 (513) +. .++..+.+.+ ++++ +....++ ..++..+ -.++|++.+.+++++.+... .. T Consensus 430 l~--~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~ 507 (536) T protein:vir:10 430 LE--AIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAA 507 (536) T ss_pred HH--HHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 2332222222 2222 2223333 2333212 13667777766554433322 11 Q ss_pred hcCCCCCCCCCCCCCCCCCCCCCC-CCCC Q lcl|NC_019916. 481 KGGLIINGTSGNDPEDEGVRGQQG-EPED 508 (513) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 508 (513) .+.....+.......-.+...+.| .|+- T Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 508 LAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HHHHHHHHHhcCchhHHhhhhccccCCCC Confidence 111111111100000000001111 1111 No 131 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.19 E-value=2.8e-06 Score=51.03 Aligned_cols=462 Identities=11% Similarity=0.064 Sum_probs=185.8 Q ss_pred CccchhhceeccCC----cccCCHHHHHHHHHHHHHH----HH---HHHHHHHHHhcCCCcc----ccccccccCC-CCC Q lcl|NC_019916. 1 MIDMQQANMNYQED----ADKLTPTRIAAFIRHHYNN----QR---PRLEMLYDYYRGQNDG----ILSPASRRNE-KGK 64 (513) Q Consensus 1 ~~~~~~~~~~~~~~----~~~~~~~~i~~~i~~~~~~----~~---~~~~~~~~YY~G~~~i----~~~~~~~~~~-~~~ 64 (513) ||-.. ---..+| +.+++.+.+...|.+.+.. |. ++.+.+.+||...... ..+....... ... T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (641) T protein:vir:94 1 MTIEM--PTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDAD 78 (641) T ss_pred CccCC--CcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhc Confidence 22111 1111122 2356666665555544332 21 3466777777553221 1111111111 111 Q ss_pred CcceeecchhHHHHHHHHHHhhcC--C----eeec---CCcHHH-------HHHHHHhcCHHHHHHHHHHHHhhCCeEEE Q lcl|NC_019916. 65 ADHRAVHSFARYIADFQTSYSVGN--A----IAMS---GPSSDR-------LDDFNRRNDIDTLNYELYLDMTVTGRAYE 128 (513) Q Consensus 65 ~~~ri~~n~~~~ivd~~~~~l~g~--p----~~~~---~~~~~~-------l~~~~~~n~~~~~~~~~~~~a~~~G~~~~ 128 (513) .-+||..+.+...++..+..|++. | +++. .++.+. +...+..+++........++++.+|.+++ T Consensus 79 ~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv 158 (641) T protein:vir:94 79 WRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTY 158 (641) T ss_pred ccccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEE Confidence 124688888888888888777653 1 2332 122222 22233456777777899999999999999 Q ss_pred EeeecC------------CCce--------------e-EEEEEcccceEEEecCCCCcceEEEEEEEeecc--------- Q lcl|NC_019916. 129 YVYRDP------------SQKG--------------E-VSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQT--------- 172 (513) Q Consensus 129 ~v~~d~------------~~~~--------------~-~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~--------- 172 (513) .++++. ++.. . .+..++|.++ ++|++.+..-..++++..... T Consensus 159 ~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di--~~dps~~~~~~~f~~~r~t~~t~~~l~~eg 236 (641) T protein:vir:94 159 RLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDV--WLDTSGGKNTGTFVRLRHTREELHELVTSG 236 (641) T ss_pred EeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhhe--eecCCCCcccccceehhhhHHHHHHHHhcC Confidence 887641 1110 0 0112344443 344443222112222211100 Q ss_pred c-----------cccc--------------ceeEEEEEEEc----CCc-EEEEEeeccCCccccccccccccCcccceEE Q lcl|NC_019916. 173 V-----------VDNI--------------TQTKYEVETWT----END-YTRYKPIVVAGSVPTLEVAEHSAQFGFPMIE 222 (513) Q Consensus 173 ~-----------~~~~--------------~~~~~~ve~yt----~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 222 (513) + .+.. ......+++|. ++. ...+...-.++. .+.......|..+|++. T Consensus 237 ~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~--il~~~~~~~~d~~Pf~~ 314 (641) T protein:vir:94 237 YYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQ--LIRLSDSKYWCGSPFVT 314 (641) T ss_pred CCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCE--EeecccccccCcCCeEE Confidence 0 0000 00000112221 111 111111111111 11111122356778877 Q ss_pred ecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccc Q lcl|NC_019916. 223 YRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKM 297 (513) Q Consensus 223 ~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 297 (513) ++- .-+|+|....+.+.+..+|.+.-...+.+....+|.+.+......... T Consensus 315 ~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~------------------------ 370 (641) T protein:vir:94 315 TTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKRE------------------------ 370 (641) T ss_pred ecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccc------------------------ Confidence 654 357999999999999999999999999999998887654332110000 Q ss_pred hhhhcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCcccccccc---ccccccHHHHH Q lcl|NC_019916. 298 AQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDN---FSGNSSGVAMK 373 (513) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~~~n~Sg~Ai~ 373 (513) -+.+.+|+. ...+..++++++... .+.......++.+...|-....+..+.... .+.+.++..+. T Consensus 371 ----------~l~~~PG~i-i~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~ 439 (641) T protein:vir:94 371 ----------DVKAKPGAV-FKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQ 439 (641) T ss_pred ----------eeeccCCcc-eeeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHH Confidence 000111111 112233445555432 222333445666665554444433322111 11234555566 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcc----------------cccccccceeeEEeCCCCCcC---HHH Q lcl|NC_019916. 374 YKVLGTVELASTKRKQFE-RGLNQRYTVVAHIEERVN----------------GKWDIDPDEIGFIFRDNLPTD---DVA 433 (513) Q Consensus 374 ~~~~~l~~k~~~~~~~f~-~~l~~~~~li~~~l~~~~----------------~~~~~~~~~i~i~f~~~~p~d---~~e 433 (513) .+......+....-+.|. ++++.+++-+++++.... +.......++...|.- .|-. .++ T Consensus 440 ~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~ 518 (641) T protein:vir:94 440 GVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVE 518 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHH Confidence 556666666666666666 366666666666554321 1111222233333321 2333 223 Q ss_pred HHHHHHHHhcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCC------------- Q lcl|NC_019916. 434 IITALVQAGAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVR------------- 500 (513) Q Consensus 434 ~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 500 (513) .++.+..+.++++ .+...|-+-|. --+.++-++-.+.+ .++.... --...+...++.. T Consensus 519 ~~~~i~~l~~~~~---~~a~~P~v~d~-~d~~~~~~~~~~~~----g~~~p~~-~ir~~~~~~~~~~~~~~~~q~~~~~~ 589 (641) T protein:vir:94 519 RERMVTDLLQLLD---ISGRVPQIGQS-LDYALILEDLLRQM----RFTDPMR-YIKKAEAPPAAPPIAPAEPGALPPEM 589 (641) T ss_pred HHHHHHHHHHHHH---HhhcChhhhhc-CCHHHHHHHHHHHh----CCCCchh-hccCccCchhHHHHHHHHHHHHHHHH Confidence 3333433333221 11111111100 00111111100000 0000000 0000000000000 Q ss_pred CC------CCCCCCcc-CCC Q lcl|NC_019916. 501 GQ------QGEPEDER-TSD 513 (513) Q Consensus 501 ~~------~~~~~~~~-~~~ 513 (513) .+ .....++. ..| T Consensus 590 a~~~~~~~~~~a~~~~~~~~ 609 (641) T protein:vir:94 590 MNSVGGGLNDQAIAGMTPED 609 (641) T ss_pred HHHHHhhhHHHHHHHhhHHH Confidence 00 00000000 011 No 132 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.12 E-value=4.2e-06 Score=50.05 Aligned_cols=451 Identities=11% Similarity=0.071 Sum_probs=197.4 Q ss_pred eccCCcccCCHHHHHHHHHHHHHHHHHH---HHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 10 NYQEDADKLTPTRIAAFIRHHYNNQRPR---LEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 10 ~~~~~~~~~~~~~i~~~i~~~~~~~~~~---~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) -..+..+.+..+.+.+..+...+.|.+. .+.+.+|..-. +..... ........++..+-....++..++.|+ T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPS---LFPKDS--DNSSTDYTTPWQAVGARGLNNLSAKVM 75 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccc---cCCCCC--CcccccccccccchHHHHHHHHHHHHH Confidence 1112235667777777777766666554 45555554331 111110 011112234666777788888888776 Q ss_pred cC--Cee----ecCCc------------HHHHH-----------HHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc Q lcl|NC_019916. 87 GN--AIA----MSGPS------------SDRLD-----------DFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK 137 (513) Q Consensus 87 g~--p~~----~~~~~------------~~~l~-----------~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~ 137 (513) +. |.+ +...+ ...++ ..+..++|.....++.++..++|.|.+++-.+.+.. T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~ 155 (543) T protein:vir:88 76 LALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASS 155 (543) T ss_pred HhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCcccc Confidence 52 322 11111 01222 334457899999999999999999975554333222 Q ss_pred eeEE-EEEcccceEEEecCCCCcceEEEEEEEeecccc-----------cccceeEEEEEEEcC-----Cc-EEEEEeec Q lcl|NC_019916. 138 GEVS-VKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV-----------DNITQTKYEVETWTE-----ND-YTRYKPIV 199 (513) Q Consensus 138 ~~~~-~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~-----------~~~~~~~~~ve~yt~-----~~-~~~~~~~~ 199 (513) .++. +.+-|...+++--+. .+++...+|.++..... ......-..+++|+. +. .+.... . T Consensus 156 ~~~~~~~~~pl~~y~v~~d~-~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~-~ 233 (543) T protein:vir:88 156 NSYNPMKLYTLHNHVVQRDA-FGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGDFLSYQ-E 233 (543) T ss_pred ceecceEEeEcceEEEeeCC-CCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeecCCCcccccc-c Confidence 2211 223455555555443 34566666654332100 000111112344431 11 010000 1 Q ss_pred cCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccc Q lcl|NC_019916. 200 VAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDD 274 (513) Q Consensus 200 ~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~ 274 (513) ..+. ......-..++..+|++.++ ++.+|+|-.++..+-+-.+|.+.-......+...+|.+.+.-.+..... T Consensus 234 ~~~~-~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~- 311 (543) T protein:vir:88 234 IEGV-EVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVR- 311 (543) T ss_pred ccCe-eeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchh- Confidence 1111 11111112345678877765 3467999999999999999999888999999998888664211100000 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKF 352 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~ 352 (513) .+ . +.+......+..+++..+. ...+.......++.++..|... T Consensus 312 ----------------~~-----------~-------~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~a 357 (543) T protein:vir:88 312 ----------------RL-----------V-------KAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYV 357 (543) T ss_pred ----------------hc-----------c-------cCCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHH Confidence 00 0 0000001112234444333 3346666777788777777432 Q ss_pred hCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhcccccccccceeeEEeC Q lcl|NC_019916. 353 SHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLN--------QRYTVVAHIEERVNGKWDIDPDEIGFIFR 424 (513) Q Consensus 353 s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~--------~~~~li~~~l~~~~~~~~~~~~~i~i~f~ 424 (513) -..-.+.. --+...++.-++.+ +.++...+|..+. -+++.++.++...+.-.......+++.+. T Consensus 358 f~~~~~~~-~~~~r~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~v 429 (543) T protein:vir:88 358 FMLNSAVQ-RSGERVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVT 429 (543) T ss_pred Hhhhhhcc-CCCCcccHHHHHHH-------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEE Confidence 21111110 11233455544332 3444444444333 33444455554444333333345566664 Q ss_pred CCCC-cCHHHHHHHHHHH---hcC---------CCHHHHHHhC---CCC-C----CHHHHHHHHHHHHHHHHHHhhh--- Q lcl|NC_019916. 425 DNLP-TDDVAIITALVQA---GAQ---------IPQEYLYQYL---PNV-T----DADEIVKMMDKQRKAMLKTYDT--- 480 (513) Q Consensus 425 ~~~p-~d~~e~a~~~~kl---~g~---------iS~et~~~~l---~~v-~----D~~~E~~ri~~E~~~~~~~~~~--- 480 (513) -.+. -..++.++.+... .+. +....++..+ -+| . -.++|++++.++++.++..... T Consensus 430 s~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~ 509 (543) T protein:vir:88 430 TGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAG 509 (543) T ss_pred ecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 3222 1122222222221 111 2223333322 133 1 2356666666555433332211 Q ss_pred hcCCCCCCCCCCCCC-CCCCCC--CCCCCCCccC Q lcl|NC_019916. 481 KGGLIINGTSGNDPE-DEGVRG--QQGEPEDERT 511 (513) Q Consensus 481 ~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~ 511 (513) .+............. +...++ ....|.+..- T Consensus 510 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 510 IGSGVAAQATASPEAMESAMDTAGVQPGPIATQV 543 (543) T ss_pred HhhchhhhhccChHHHHHHhhhcCCCCCCCCCCC Confidence 111111111100000 000000 0111111111 No 133 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.08 E-value=5e-06 Score=49.62 Aligned_cols=442 Identities=12% Similarity=0.082 Sum_probs=203.8 Q ss_pred CC---cccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 ED---ADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~---~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |. .+.+-.+.+.+..+...+.|.+ +.+.+.+|..-. +..... ........++..+-....++..++.|+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPS---LFPKES--DNESTDYTTPWQAVGARGLNNLASKLM 75 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCC--CcccccccccccccHHHHHHHHHHHHH Confidence 33 2456777777777776666544 455555553331 111111 111122234566777788888887776 Q ss_pred cC--Ce----eecCCc-----------------------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc Q lcl|NC_019916. 87 GN--AI----AMSGPS-----------------------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK 137 (513) Q Consensus 87 g~--p~----~~~~~~-----------------------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~ 137 (513) +. |. ++...+ +..+...+..++|.....++.++..++|.|.+++-.++++. T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 155 (535) T protein:vir:15 76 LALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSY 155 (535) T ss_pred HhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCc Confidence 42 32 122111 01133335567899999999999999999876665454444 Q ss_pred eeEEEEEcccceEEEecCCCCcceEEEEEEEeeccc------------ccccceeEEEEEEEc-----CCc--EEEEEee Q lcl|NC_019916. 138 GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTV------------VDNITQTKYEVETWT-----END--YTRYKPI 198 (513) Q Consensus 138 ~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~------------~~~~~~~~~~ve~yt-----~~~--~~~~~~~ 198 (513) .++.. + |...+.+-.+. ..++...+|.++.... +....+....+++|+ .+. +..+... T Consensus 156 ~~f~~-~-pl~~~~v~~d~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~ 232 (535) T protein:vir:15 156 NPMKL-Y-RLSSYVVQRDA-YGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEV 232 (535) T ss_pred eeeEE-E-EcCeeEEeeCC-CCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEe Confidence 44432 3 44444433332 3456666665443210 000111111233333 221 2222222 Q ss_pred ccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccc Q lcl|NC_019916. 199 VVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFD 273 (513) Q Consensus 199 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~ 273 (513) . +.... ......++..+|++.++ ++.+|+|-.++..+-+..+|.+.-......+...+|.+.+--.+..... T Consensus 233 ~-g~~~~--~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~ 309 (535) T protein:vir:15 233 E-DVEID--GSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPR 309 (535) T ss_pred e-Ccccc--ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccch Confidence 1 11111 11122346678887765 3467999999999999999999888999999888888654211000000 Q ss_pred cccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 274 DSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHK 351 (513) Q Consensus 274 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~ 351 (513) .+ .....+ ....+..++++.+. ...+.......++.++..|.. T Consensus 310 -----------------~l----------------~~~~~g--~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~ 354 (535) T protein:vir:15 310 -----------------RL----------------TKAQTG--DFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSY 354 (535) T ss_pred -----------------hc----------------ccCCce--eeecCCcccceeeecccccchhHHHHHHHHHHHHHHH Confidence 00 000000 01112233344443 234556677777777777744 Q ss_pred HhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcccccccccceeeEEe Q lcl|NC_019916. 352 FSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ--------RYTVVAHIEERVNGKWDIDPDEIGFIF 423 (513) Q Consensus 352 ~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~--------~~~li~~~l~~~~~~~~~~~~~i~i~f 423 (513) .- ..+.....-++..++.-++.+ +.++...+|..+.+ +++.++.++...+.-.......++++| T Consensus 355 af-~~~~~~~~~~~r~TAtEV~~r-------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~y 426 (535) T protein:vir:15 355 AF-MLNSAVQRTGERVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTI 426 (535) T ss_pred HH-hhhhcccCCCccccHHHHHHH-------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEE Confidence 21 111111011233455444332 34444444444433 455555666555444444455678887 Q ss_pred CCCCCcCH-HHHHHHHH----HHhcC--------CCHHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHH---Hhh Q lcl|NC_019916. 424 RDNLPTDD-VAIITALV----QAGAQ--------IPQEYLYQYL---PNVT-----DADEIVKMMDKQRKAMLK---TYD 479 (513) Q Consensus 424 ~~~~p~d~-~e~a~~~~----kl~g~--------iS~et~~~~l---~~v~-----D~~~E~~ri~~E~~~~~~---~~~ 479 (513) .-++..-. .+.++.+. .++++ +....++..+ -+|+ -.++|++++.+++.+.+. .+. T Consensus 427 is~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~ 506 (535) T protein:vir:15 427 STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAA 506 (535) T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHH Confidence 66655321 11112221 12222 2222222221 1222 135555555544333222 222 Q ss_pred hhcCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 480 TKGGLIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) ..+......+......-....+..|-+.+ T Consensus 507 ~~g~~~~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 507 TGGAGVGALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred HHHhhccchhccChHHHHHHHhccCCCCC Confidence 32222222211111111122233333333 No 134 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.07 E-value=5.3e-06 Score=49.53 Aligned_cols=442 Identities=12% Similarity=0.076 Sum_probs=202.3 Q ss_pred CCc---ccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDA---DKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~---~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |.. +.+..+.+.+..+...+.|.+ +.+.+.+|..-. +..... ........++..+-....+++.++.|+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPS---LFPKES--DNESTDYTTPWQAVGARGLNNLASKLM 75 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCC--CcccccccccccccHHHHHHHHHHHHH Confidence 442 456778887777776666654 445555553331 111110 111112234556677778888887776 Q ss_pred cC--Cee----ecCCc------------H-----------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc Q lcl|NC_019916. 87 GN--AIA----MSGPS------------S-----------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK 137 (513) Q Consensus 87 g~--p~~----~~~~~------------~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~ 137 (513) +- |.+ +...+ . ..+...+..++|.....++.++..++|.|.+++-.++++. T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 155 (535) T protein:vir:33 76 LALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSY 155 (535) T ss_pred HhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCc Confidence 52 221 21111 0 1123335567899999999999999999877765554444 Q ss_pred eeEEEEEcccceEEEecCCCCcceEEEEEEEeeccc------------ccccceeEEEEEEEc-----CCc--EEEEEee Q lcl|NC_019916. 138 GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTV------------VDNITQTKYEVETWT-----END--YTRYKPI 198 (513) Q Consensus 138 ~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~------------~~~~~~~~~~ve~yt-----~~~--~~~~~~~ 198 (513) .++. .-|...+.+-.+. .+++...+|.++.... +....+....+++|+ .+. +..+... T Consensus 156 ~~f~--~~pl~~~~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~ 232 (535) T protein:vir:33 156 NPMK--LYRLSSYVVQRDA-YGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEV 232 (535) T ss_pred eeeE--EEEcCeeEEeeCC-CCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEE Confidence 4433 3355555555443 3455556655443210 000000111122222 211 2222211 Q ss_pred ccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccc Q lcl|NC_019916. 199 VVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFD 273 (513) Q Consensus 199 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~ 273 (513) . +..... .....+++.+|++.++ ++.+|+|-.++..+-+..+|.+.-......+...+|.+.+--.+.... T Consensus 233 ~-~~~~~~--~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~- 308 (535) T protein:vir:33 233 E-DVEIDG--SDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQP- 308 (535) T ss_pred e-Cccccc--cccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccch- Confidence 1 111111 1112346678887765 346799999999999999999988899999988888865421100000 Q ss_pred cccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 274 DSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHK 351 (513) Q Consensus 274 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~ 351 (513) ... .....+ ....+..++++.+. ...+.......++.++..|.. T Consensus 309 ---------------------------~~~-----~~~~~g--~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~ 354 (535) T protein:vir:33 309 ---------------------------RRL-----TKAQTG--DFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSY 354 (535) T ss_pred ---------------------------hhc-----ccCCce--eeecCCcccceeeecccccchhHHHHHHHHHHHHHHH Confidence 000 000000 01112233344443 234556677777777777744 Q ss_pred HhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcccccccccceeeEEe Q lcl|NC_019916. 352 FSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ--------RYTVVAHIEERVNGKWDIDPDEIGFIF 423 (513) Q Consensus 352 ~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~--------~~~li~~~l~~~~~~~~~~~~~i~i~f 423 (513) .- ..+.....-++..++.-++.+ +.++...++..+.+ +++.++.++...+.-.......++++| T Consensus 355 af-~~~~~~~~~~~r~TAtEV~~r-------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~y 426 (535) T protein:vir:33 355 AF-MLNSAVQRTGERVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTI 426 (535) T ss_pred HH-hhhhcccCCCccccHHHHHHH-------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEE Confidence 21 111110011233455444332 34444444444433 455555666555444445555678888 Q ss_pred CCCCCcCH-HHHHHHHH----HHhcC--------CCHHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHhhhh- Q lcl|NC_019916. 424 RDNLPTDD-VAIITALV----QAGAQ--------IPQEYLYQYL---PNVT-----DADEIVKMMDKQRKAMLKTYDTK- 481 (513) Q Consensus 424 ~~~~p~d~-~e~a~~~~----kl~g~--------iS~et~~~~l---~~v~-----D~~~E~~ri~~E~~~~~~~~~~~- 481 (513) .-++..-. .+.++.+. .++++ +....++..+ -+|+ -.++|++.+.+++.+.+...+.+ T Consensus 427 is~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~ 506 (535) T protein:vir:33 427 STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAA 506 (535) T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHH Confidence 66655321 11112211 12221 2222222221 1222 13556666655544333322222 Q ss_pred --cCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 482 --GGLIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 482 --~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) +.............-.+.-..-|-+.+ T Consensus 507 ~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 507 AGGAGVGALATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred hhhhhhcchhhcCChhHHHHHHhccCCCC Confidence 211111111110000111111111222 No 135 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.07 E-value=5.3e-06 Score=49.49 Aligned_cols=398 Identities=10% Similarity=0.004 Sum_probs=161.1 Q ss_pred HHHHHHHHHHHH-H---------HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec Q lcl|NC_019916. 24 AAFIRHHYNNQR-P---------RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS 93 (513) Q Consensus 24 ~~~i~~~~~~~~-~---------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~ 93 (513) ..+++.+....+ + ....+.++.-...... ... ...=+.+.-...+|+..+.-+-+-|+.+- T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~-------~v~--~~~al~~~~v~~~i~~ia~~ia~l~~~~~ 71 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTI-------SVK--GKNALKVATVFACIKILSESVSKLPLKIY 71 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcc-------eec--hhhhhccHHHHHHHHHHHHhhccCceEEE Confidence 222222221100 0 0011111111100000 000 00001233444567777776667787741 Q ss_pred --C-C-----cHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcc Q lcl|NC_019916. 94 --G-P-----SSDRLDDFNRR--N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPK 160 (513) Q Consensus 94 --~-~-----~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~ 160 (513) . + .+..+..++.. | ........+..+.+.+|.||+++-.+..|.+.-.+.++|..+-+..++..... T Consensus 72 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~ 151 (429) T protein:vir:10 72 QEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLN 151 (429) T ss_pred EecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccc Confidence 1 1 11234444432 3 23456677888999999999999999888765556678888777666432111 Q ss_pred eEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHH Q lcl|NC_019916. 161 PIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLI 240 (513) Q Consensus 161 ~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~li 240 (513) .. ...+|.... . . . .-.+.++.+++++.. ++ .+...|.|.++.+...+ T Consensus 152 ~~-~~~~~~~~~-~--g-~----~~~~~~~evih~~~~--------------~~---------~~~~~G~s~i~~~~~~i 199 (429) T protein:vir:10 152 SK-TKMWYVVNT-G--G-Q----QRVLKPEEILHFKNG--------------IT---------LDGLVGVPTMEYLKSTL 199 (429) T ss_pred cc-ceEEEEEcc-C--C-e----EEEEccccEEEecCC--------------CC---------CCCcccccHHHHHHHHH Confidence 10 111111110 0 0 0 012333333333210 00 01124667676666666 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccc Q lcl|NC_019916. 241 DLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNG 319 (513) Q Consensus 241 D~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 319 (513) +....+..-....+...+.|-.+++..... ...........+. ....... .++++.+ T Consensus 200 ~~~~~~~~~~~~~~~ng~~~~~il~~~~~l----------~~e~~~~~~~~~~----~~~~g~~n~~~~~vl-------- 257 (429) T protein:vir:10 200 ENSASADKFINNFYKQGLQVKGLVQYVGDL----------NEDAKKVFRENFE----SMSSGLQNSHRIALM-------- 257 (429) T ss_pred HHHHHHHHHHHHHHhccCCccEEEEcCCCC----------CHHHHHHHHHHHH----HHhccccccCceeec-------- Confidence 555444444444444444455454432110 0000000000000 0000000 1122222 Q ss_pred cccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRY 398 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~ 398 (513) ..+.+++.+........+....+...+.|+..-++|+...+... ++-|+ ++. .....+...|+..+ T Consensus 258 -~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn--~e~----------~~~~f~~~~l~P~~ 324 (429) T protein:vir:10 258 -PVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQ----------QQQQFYTDTLQATL 324 (429) T ss_pred -CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHH----------HHHHHHHHHHHHHH Confidence 22334444443333445556677888899999999986654222 22222 111 11112233344444 Q ss_pred HHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHH Q lcl|NC_019916. 399 TVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRK 472 (513) Q Consensus 399 ~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~ 472 (513) +.|..-+... ..........+++.+..-+-.|..+.++++.++ +|+++.-.+.++++. +++.+.-+... T Consensus 325 ~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~----- 399 (429) T protein:vir:10 325 TMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNG----- 399 (429) T ss_pred HHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc----- Confidence 4333333221 000001112344444566667899999998887 578888777777643 22111100000 Q ss_pred HHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 473 AMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) .+...+..+.. ... .+++++....++..++ T Consensus 400 -n~~~~d~~~~~---~~k--~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 400 -NMLPIDMAGQA---YLK--GGDTNGEVSKEGNEGN 429 (429) T ss_pred -cccchhhcccc---ccC--CCCCCCCCCCCCCCCC Confidence 00000000000 000 0111111111111111 No 136 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.04 E-value=6.2e-06 Score=49.13 Aligned_cols=439 Identities=10% Similarity=0.047 Sum_probs=154.4 Q ss_pred ceeccCCcccCCHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee--ecchhHHHHHHHHHH Q lcl|NC_019916. 8 NMNYQEDADKLTPT-RIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA--VHSFARYIADFQTSY 84 (513) Q Consensus 8 ~~~~~~~~~~~~~~-~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~~ 84 (513) -|.|.++-..+..- .|.+- .....-......+||+ ++. ......++ ...+...+|+..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--------pp~----~~~~La~~~~~n~~v~scI~~ia~~ 64 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGD----TDSQALKEDRFEEYVE--------PKV----HPLVLLSLLQVNPYHASACSIKAND 64 (540) T ss_pred CCCcccChhhccchhhhhcc----ccccccccCCCCcccc--------CCC----CHHHHHHHHHhcHHHHHHHHHHHHH Confidence 34444442221110 11110 0000000011111110 000 00000112 245667889999999 Q ss_pred hhcCCeeecCCcHHHHHHHHHh--cCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceE Q lcl|NC_019916. 85 SVGNAIAMSGPSSDRLDDFNRR--NDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPI 162 (513) Q Consensus 85 l~g~p~~~~~~~~~~l~~~~~~--n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~ 162 (513) +.+.|+.+..++..... +.-. -........+..+.+.+|.||+.+..+..|.+.-.+.++|..+-+..+... T Consensus 65 ia~~~~~i~~~~~~~~~-~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~----- 138 (540) T protein:vir:41 65 ILRTGYLIDGDDGGVEE-LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSR----- 138 (540) T ss_pred HhcCCceEecCccchhh-hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCce----- Confidence 99999988666544332 2211 123455667888999999999999888888766556678887766544321 Q ss_pred EEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCC-----CCCCcchhHHH Q lcl|NC_019916. 163 MAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVL 237 (513) Q Consensus 163 ~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~ 237 (513) ++...+. ....+...|.....+... .+. ....+..=-|+++++. ..|.|.+.... T Consensus 139 ----~~~~~d~-----~~~~~~~~~~~~~~~~~~----~g~-------~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~ 198 (540) T protein:vir:41 139 ----YMQTWDG-----IHVTYFKDYRYEGEVNPD----NGE-------DQDGVGANEIIFIHLPSPICSYYGVPRYLSAA 198 (540) T ss_pred ----eEeeecC-----ceeeeeecccccceeecc----ccc-------cceeecccceEEecCCCCCCCcccccHHHHHH Confidence 1111110 111122222211111100 000 0001111124555432 25677666544 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc--chhcceeeccccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA--MRQANMILLKTGM 315 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~ 315 (513) .-+.....+..-..+.+...+.|-.+++-.... .+..... ..........++..-...+.. .-.++.+.+... T Consensus 199 ~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l-~~e~~~~---~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~- 273 (540) T protein:vir:41 199 PSILAMQKIDEYNYAFFDNYTIPSYVITVTGEF-EDEMELG---SDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIP- 273 (540) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCccc-Cchhccc---hHHHHHHHHHHHHHHHHHhccccccccceEEEecC- Confidence 444433333222233333333343333211100 0000000 000000000000000000000 000112222110 Q ss_pred cccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCcccccccccc---cc-ccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 316 APNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS---GN-SSGVAMKYKVLGTVELASTKRKQ 389 (513) Q Consensus 316 ~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~---~n-~Sg~Ai~~~~~~l~~k~~~~~~~ 389 (513) . ....+++|.. .......+....+...+.|+..-++|+...+... .+ .+.......+ T Consensus 274 --~--~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f------------- 336 (540) T protein:vir:41 274 --G--GDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTY------------- 336 (540) T ss_pred --C--CcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHH------------- Confidence 0 1123444443 3344455677788889999999999986554221 11 1122221111 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHH-H- Q lcl|NC_019916. 390 FERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIV-K- 465 (513) Q Consensus 390 f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~-~- 465 (513) +...+..+++.|...++..-. .... ..+.+.|+...... .+.+..+.++ +|+++.-.+.+.++.++--..+. . T Consensus 337 ~~~tL~P~~~~ie~~ln~~L~-~~~~-~~~~i~f~~~~ll~-~D~~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p 413 (540) T protein:vir:41 337 YESVVRPQQEIVSSVLTDFIQ-LKLD-PGARFVFNEEILME-SEFVHNYALLVQCGVLTPSEVREKLFGLDGGPDMFMVP 413 (540) T ss_pred HHHHHHHHHHHHHHHHHHhhh-hccC-CceEEEecchhhcc-hHHHHHHHHHHhCCCCCHHHHHHHhCcCcCCCcccccc Confidence 111122222222222211100 0111 13456665433322 1233333333 68888766766554333111111 0 Q ss_pred ------HHHHHHH-HHHHHhhhhcC--CCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 466 ------MMDKQRK-AMLKTYDTKGG--LIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 466 ------ri~~E~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .+..... .......+... ....+..+.+.+++...++.+..-++...| T Consensus 414 ~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (540) T protein:vir:41 414 SSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSD 470 (540) T ss_pred cccccccccccccccCCCCccccccccchhcccccCccccccccccccccccccccc Confidence 0000000 00000000000 000000000000000000000000111111 No 137 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.03 E-value=6.5e-06 Score=49.03 Aligned_cols=443 Identities=11% Similarity=0.086 Sum_probs=190.4 Q ss_pred eeccCCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHh Q lcl|NC_019916. 9 MNYQEDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYS 85 (513) Q Consensus 9 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 85 (513) |...+-.+++..+-..+..+...++|.+ +.+.+.+|..-. +..... ........++..+-....++..++.| T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l 75 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPS---LFPKDS--DNASTDYTTPWQAVGARGLNNLASKL 75 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---cCCCCC--CccccccCCcccccHHHHHHHHHHHH Confidence 3333334556666676666666665544 455555554321 111111 11112234566777788888888777 Q ss_pred hcC--Ce----eecCCc------------HHHHHH-----------HHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC Q lcl|NC_019916. 86 VGN--AI----AMSGPS------------SDRLDD-----------FNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ 136 (513) Q Consensus 86 ~g~--p~----~~~~~~------------~~~l~~-----------~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~ 136 (513) ++. |. ++...+ ...++. .+..++|.....++.++..++|.|.+++-.+++. T Consensus 76 ~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~ 155 (535) T protein:vir:94 76 MLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGT 155 (535) T ss_pred HhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCc Confidence 652 22 122111 012333 3445789999999999999999987665444333 Q ss_pred ceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccc-----------ccccceeEEEEEEEcC-----Cc--EEEEEee Q lcl|NC_019916. 137 KGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTV-----------VDNITQTKYEVETWTE-----ND--YTRYKPI 198 (513) Q Consensus 137 ~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~-----------~~~~~~~~~~ve~yt~-----~~--~~~~~~~ 198 (513) ..++. .-|...+++-.+. ..++...+|.++.... .....+....+++|+. +. +..+... T Consensus 156 ~~~f~--~~pl~~y~v~~d~-~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~ 232 (535) T protein:vir:94 156 YNPMK--LYRLSSYVVQRDA-FGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKYEEI 232 (535) T ss_pred ccceE--EEEcCeEEEeeCC-CCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEeeCCCCcEEEEEEe Confidence 23332 3355555555443 3456555554433110 0001111223444431 11 1111111 Q ss_pred ccCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccc Q lcl|NC_019916. 199 VVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFD 273 (513) Q Consensus 199 ~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~ 273 (513) .+... .....+.++..+|++.++- +.+|+|-.++..+-+-.+|.+.-...........|.+.+.-.+.. T Consensus 233 -~g~~~--~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~--- 306 (535) T protein:vir:94 233 -DGVEV--EGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGIT--- 306 (535) T ss_pred -cCeee--ccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccccc--- Confidence 11111 1111234677889887753 467999888888888888877666666666555555433210000 Q ss_pred cccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 274 DSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHK 351 (513) Q Consensus 274 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~ 351 (513) + ... .....++.... +..+++..+. ...+.......++.++..|.. T Consensus 307 -----------------~--------~~~-----~~~~~~g~~v~--g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~ 354 (535) T protein:vir:94 307 -----------------Q--------VRR-----LTKAQTGDFVS--GRPEDISFLQLEKAADFSVARAVSEQIEGRLSY 354 (535) T ss_pred -----------------c--------hhh-----cccCCCceeec--CCcccceeeecccccchhHHHHHHHHHHHHHHH Confidence 0 000 00000011111 1223333332 224556666777777776643 Q ss_pred HhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhcccccccccceeeEEe Q lcl|NC_019916. 352 FSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLN--------QRYTVVAHIEERVNGKWDIDPDEIGFIF 423 (513) Q Consensus 352 ~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~--------~~~~li~~~l~~~~~~~~~~~~~i~i~f 423 (513) .-..-... ..-+...++.-++.. +.+++..+|..+. -+++.++.++...+.-.......+++.+ T Consensus 355 af~~~~~~-~~d~~rvTAtEV~~r-------~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~ 426 (535) T protein:vir:94 355 AFMLNSAV-QRTGERVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTI 426 (535) T ss_pred HHhHhhhc-cCCCCCccHHHHHHH-------HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceE Confidence 22111111 111233455544432 3344444444333 3344445555444333333333355555 Q ss_pred CCCCCcCHHHHHHHHHH-------HhcC--------CCHHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHh-- Q lcl|NC_019916. 424 RDNLPTDDVAIITALVQ-------AGAQ--------IPQEYLYQYL---PNVT-----DADEIVKMMDKQRKAMLKTY-- 478 (513) Q Consensus 424 ~~~~p~d~~e~a~~~~k-------l~g~--------iS~et~~~~l---~~v~-----D~~~E~~ri~~E~~~~~~~~-- 478 (513) .-++. .+...+.+.+ ++++ +....++..+ -+|+ -.++|++.+.+++++.+... T Consensus 427 vs~la--~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~ 504 (535) T protein:vir:94 427 STGME--ALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNA 504 (535) T ss_pred eehHH--HHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 43332 2222222222 1222 2222222222 1222 13556665555444333321 Q ss_pred -hhhcCCCCCCCCCCCCCCC-CCCCCCCCCCC Q lcl|NC_019916. 479 -DTKGGLIINGTSGNDPEDE-GVRGQQGEPED 508 (513) Q Consensus 479 -~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 508 (513) ...+.... .....+.+.. ...++-|-..+ T Consensus 505 ~~~~g~~~~-~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 505 AASAGAGAG-TMATASPENMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHHhhh-cccccChHHHHHHHHHhccCCC Confidence 11111111 1111111110 01111111111 No 138 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.00 E-value=7.5e-06 Score=48.69 Aligned_cols=378 Identities=11% Similarity=0.064 Sum_probs=154.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCC-CccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHHHH-H Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQ-NDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDRL-D 101 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~-~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l-~ 101 (513) ..+.+.......++ ..-..++.+- ......... .........-+.+.-...+|+..+.-+-+-|+.+.....+.| . T Consensus 1 M~~f~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~ 78 (386) T protein:vir:49 1 MPIFNITNLATESP-PINQESFFDIADSDFLASLN-SSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQLQGIVD 78 (386) T ss_pred CchhhhhccCCCCc-ccchhhhhhhhhcccccccc-CCceechhhhhccHHHHHHHHHHHHHhhhCceeeccchhhhhhh Confidence 11211111000000 0001111110 000000000 000000111122333445667777777778887654443322 1 Q ss_pred HHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeE Q lcl|NC_019916. 102 DFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTK 181 (513) Q Consensus 102 ~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~ 181 (513) .-............+..+.+.+|.||+.+-.+.+|.+.-.+.++|..+-+..++.. ..+.+ + |... +.... T Consensus 79 ~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~-~~~~y--~-~~~~--~~~~~--- 149 (386) T protein:vir:49 79 NPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYY--N-ITFD--DPHIA--- 149 (386) T ss_pred ccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCC-ceEEE--E-EEEc--Ccccc--- Confidence 11111233456667888999999999998888888776566678888776665432 11111 1 1110 10000 Q ss_pred EEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019916. 182 YEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAM 261 (513) Q Consensus 182 ~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~ 261 (513) ....+....+++++.. ++.+ .-.|.|.+..+...++....+..-..+.+...+.|- T Consensus 150 -~~~~~~~~evih~~~~--------------~~~~---------~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~ 205 (386) T protein:vir:49 150 -PKQHVPQNDILHFRLL--------------SVDG---------GLTSVSPLMALGREFNIQKASDKLTISALKNALNAN 205 (386) T ss_pred -ceeEEccccEEEecCC--------------CCCC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcc Confidence 0112334444443221 0001 124677777777766655554444445555555565 Q ss_pred hheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHH Q lcl|NC_019916. 262 LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELY 341 (513) Q Consensus 262 l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 341 (513) .+++-......+ .. ..+.... .... .-.++++.+ ..+.+++-+........+... T Consensus 206 ~il~~~~~~~~~-----------~~---~~~~~~~-~~~~-~n~g~~~vl---------~~g~~~~~l~~~~~d~~~~e~ 260 (386) T protein:vir:49 206 GILKIKGGGLLD-----------FK---TKVSRSR-QAMK-QMQGGPLVL---------DDLEDFTPLEIKSNVAQLLSQ 260 (386) T ss_pred EEEEeCCCCChH-----------HH---HHHHHHH-HHhc-cCCCCceec---------CCCceEEEccCChhHHHHHHH Confidence 555432111110 00 0000000 0000 001122222 223345555444555566778 Q ss_pred HHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceee Q lcl|NC_019916. 342 KKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIG 420 (513) Q Consensus 342 ~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~ 420 (513) .+.+.+.|+..-++|+.-.+... +..++..++..+ ...++..++.+..-+...-. ..++ T Consensus 261 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~--------------~~~i~~~l~~i~~~~~~~l~------~~~~ 320 (386) T protein:vir:49 261 ADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNIY--------------FKSVSRYLRPFVSEMSKKLS------CEVD 320 (386) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHHH--------------HHHHHHHHHHHHHHHHHHhc------chhc Confidence 88899999999999986654222 223343333322 22223333222222211100 1223 Q ss_pred EEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCC---CCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCC Q lcl|NC_019916. 421 FIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLP---NVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPE 495 (513) Q Consensus 421 i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~---~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (513) +.....+-.|..+.+..+.++ +|+++.-.+.+++. +..+ ++ ..+.+.. T Consensus 321 ~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~---~~------------------------~~~~~~~ 373 (386) T protein:vir:49 321 VDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPK---EL------------------------PDGKNPN 373 (386) T ss_pred ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCC---cC------------------------cchhccC Confidence 333333444556666666665 56777655554431 1111 00 0000000 Q ss_pred CCCCCCCCCCCCCccCCC Q lcl|NC_019916. 496 DEGVRGQQGEPEDERTSD 513 (513) Q Consensus 496 ~~~~~~~~~~~~~~~~~~ 513 (513) ..... | +++.++| T Consensus 374 ~~~~~---g--Gd~~~~~ 386 (386) T protein:vir:49 374 RTSLK---G--GEINEQD 386 (386) T ss_pred CCCCC---C--CCCCCCC Confidence 00000 1 1111111 No 139 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=97.99 E-value=8e-06 Score=48.53 Aligned_cols=431 Identities=10% Similarity=0.014 Sum_probs=163.6 Q ss_pred Cc------cchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccc---cccccCCCCCCcceee Q lcl|NC_019916. 1 MI------DMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDG-ILS---PASRRNEKGKADHRAV 70 (513) Q Consensus 1 ~~------~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i-~~~---~~~~~~~~~~~~~ri~ 70 (513) ++ -|+|..+.-+-... .-+....+.. +.|+... +.. +.............+. T Consensus 62 ~~~~~~~~~~kk~~i~~pfkkk-----------------~~~~~~d~f~-~s~es~s~vtsls~pdaf~~vnVs~~~Alk 123 (945) T protein:vir:10 62 IIIFRKNQVLKKEKIIVPYNHQ-----------------EPPFKFNLFE-YSPESLMYLPSISDPDAFFLINLFRKYRFN 123 (945) T ss_pred eeeehhhhHHHhhccccccccc-----------------ccchhhhhhh-ccCccceecccccCccceeeehhhhhhhhc Confidence 11 11222222222211 1111111111 2222110 000 0000000000111222 Q ss_pred cchhHHHHHHHHHHhhcCCeeec-----CC---------cHHHHHHHHHh-cCH-------HHHHHHHHHHHhhCCeEEE Q lcl|NC_019916. 71 HSFARYIADFQTSYSVGNAIAMS-----GP---------SSDRLDDFNRR-NDI-------DTLNYELYLDMTVTGRAYE 128 (513) Q Consensus 71 ~n~~~~ivd~~~~~l~g~p~~~~-----~~---------~~~~l~~~~~~-n~~-------~~~~~~~~~~a~~~G~~~~ 128 (513) ..-....|+..++-+-+-|+++- +. ....+..++.. |.. ......+..+.+.+|.+|+ T Consensus 124 nsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYi 203 (945) T protein:vir:10 124 NDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAI 203 (945) T ss_pred cHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEE Confidence 34455577777777788888641 11 12235555543 322 1244567789999999999 Q ss_pred EeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccc Q lcl|NC_019916. 129 YVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLE 208 (513) Q Consensus 129 ~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~ 208 (513) .+.++.+|.+.-.+.++|..+.+..++... ..+ + |... .++ . ....+.....+++... T Consensus 204 eIiRd~~G~ii~L~pLdPs~Vti~~ddDG~--~~y--~-Yv~~-idG---~---~~~~v~a~DvIlhirn---------- 261 (945) T protein:vir:10 204 VKIRDEQGNLVAITPVDGTTIKPILSEDTG--IVV--G-YVQE-VDG---A---IVAHFDKRDVVLFRQN---------- 261 (945) T ss_pred EEEECCCCcEEEEEEECCcceEEEEcCCCc--EEE--E-EEEe-cCC---c---eEEEecCCceEEEecc---------- Confidence 999988888765567899888877765432 111 1 1111 010 0 0112233332221110 Q ss_pred ccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhh----hhh--hheecCccccccccccccccc Q lcl|NC_019916. 209 VAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLN----EAM--LVIKGDIDTLFDDSTLLQMVD 282 (513) Q Consensus 209 ~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~----~~~--l~~~G~~~~~~~~~~~~~~~~ 282 (513) +++.|.. ...|.|.++.+...+ ...++-......+|. .|- +.+.|.... .. ..+. T Consensus 262 ---~s~DG~~-------~GyGlSPIeaa~~aI---~~alAaek~aar~FskNGa~PsGILsvkg~~~~--d~-k~~~--- 322 (945) T protein:vir:10 262 ---LTPDVYM-------YGYSLPPIEILYKVI---LSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYK--EG-DIYP--- 322 (945) T ss_pred ---CCCCccc-------ccCCchHHHHHHHHH---HHHHHHHHHHHHHHHhCCCccceEEEecCcccc--cc-cccc--- Confidence 0001110 012445454443333 333322222223332 232 222221110 00 0000 Q ss_pred chhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccc Q lcl|NC_019916. 283 PSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDN 362 (513) Q Consensus 283 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 362 (513) .+..+....++....... .....+.+.....+.++.-++.......+....+...+.|+..-++|+...+. T Consensus 323 --------~LseEq~erlKe~wee~~-sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~ 393 (945) T protein:vir:10 323 --------QLSREQLESIQRQLQAIM-MGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGI 393 (945) T ss_pred --------ccCHHHHHHHHHHHHHHh-CCcccccceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccc Confidence 111111111111100000 00000111111233344444444445566777888889999999999866543 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH- Q lcl|NC_019916. 363 FSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA- 441 (513) Q Consensus 363 ~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl- 441 (513) ..+ .++..++.... ..+..+++..++.+...++..- ........+.+.|+.....+..+.++++.++ T Consensus 394 ~e~-st~SNiEqq~~----------~Fv~~tL~Pil~~IEqeLNrkL-l~~~eg~~i~fdFd~ldl~D~ksraEal~kli 461 (945) T protein:vir:10 394 LEG-SNKATAEVMAS----------LTKAKGLEPLMATISKGFDEVV-SEFRNEKDIKLWFKEDDLEKERDWWNIIQGQL 461 (945) T ss_pred CCC-CCcchHHHHHH----------HHHHHHHHHHHHHHHHHHHHhc-cccccCceeEEEecchhccCHHHHHHHHHHHH Confidence 222 22222221111 1122233333333333222211 0112234578888777777888889888876 Q ss_pred -hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc---CCC Q lcl|NC_019916. 442 -GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER---TSD 513 (513) Q Consensus 442 -~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 513 (513) +|+++.-.+.++++. +++-+.-+-....-.... .......+..+++. .....+.+.+++.+.+++.. +.+ T Consensus 462 ~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d-~~~ka~~ga~p~q~-aq~~~dqp~~kGGe~dEns~~psE~k 537 (945) T protein:vir:10 462 NTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPED-EQAKAQQGAMPPQL-AQAMADQPSQQGGGVDENSSVPSEQK 537 (945) T ss_pred hCCCcCHHHHHHHhCCCCCCCcceeeeccccccccc-cccccccCCCCccc-ccCCCCCCCCCCCCCCCCCCCCCccc Confidence 688988777777643 221111110000000000 00000000011000 00111111111111111111 111 No 140 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=97.97 E-value=8.8e-06 Score=48.30 Aligned_cols=443 Identities=11% Similarity=0.073 Sum_probs=191.0 Q ss_pred CCc--ccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhc Q lcl|NC_019916. 13 EDA--DKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVG 87 (513) Q Consensus 13 ~~~--~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g 87 (513) |.. ..+..+.+.+..+...++|.+ +.+.+.+|..-. ...... ........++..+-....+++.++.|++ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDS--DNASTDYQTPWQAVGARGLNNLASKLML 75 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCC--CcccccccccccccHHHHHHHHHHHHHH Confidence 332 245677888887776666644 455555553321 111111 1111233456677788888888877764 Q ss_pred C--Ce----eecCCc-----------------------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCce Q lcl|NC_019916. 88 N--AI----AMSGPS-----------------------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKG 138 (513) Q Consensus 88 ~--p~----~~~~~~-----------------------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~ 138 (513) . |. ++...+ +..+...+..++|.....++.++..++|.|.+++-.+.++.. T Consensus 76 ~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~ 155 (536) T protein:vir:21 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) T ss_pred hhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCce Confidence 2 31 122111 012333445678999999999999999988755533332222 Q ss_pred eEEEEEcccceEEEecCCCCcceEEEEEEEeeccc------------ccccceeEEEEEEEc-----CC--cEEEEEeec Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTV------------VDNITQTKYEVETWT-----EN--DYTRYKPIV 199 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~------------~~~~~~~~~~ve~yt-----~~--~~~~~~~~~ 199 (513) ..+..-|..-+.+--+. ..++...+|.++.... .....+....+++|+ ++ ...+|... T Consensus 156 -~~f~~~pl~~~~v~~d~-~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~- 232 (536) T protein:vir:21 156 -NPMKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEV- 232 (536) T ss_pred -eeEEEEEcCeEEEeeCC-CCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEecc- Confidence 22334455555554443 3456666655433210 000011112233332 11 12222111 Q ss_pred cCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccc Q lcl|NC_019916. 200 VAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDD 274 (513) Q Consensus 200 ~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~ 274 (513) ++. ......-..+|..+|++.++- +.+|+|-.++..+-+-.+|.+.-...........+...+.=.+.. T Consensus 233 -~g~-~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~---- 306 (536) T protein:vir:21 233 -EGM-EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT---- 306 (536) T ss_pred -CCe-eeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccccc---- Confidence 111 111111223567789887753 457999999999999888887666666666655554333100000 Q ss_pred ccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019916. 275 STLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSH 354 (513) Q Consensus 275 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 354 (513) + .. ......++....+...+.++..+....+.......++.++..|...-. T Consensus 307 ----------------~--------~~-----~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~ 357 (536) T protein:vir:21 307 ----------------Q--------PR-----RLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM 357 (536) T ss_pred ----------------c--------hh-----hhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHh Confidence 0 00 000001111111111222222223334555566777777776633221 Q ss_pred ccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHhcccccccccceeeEEeCCC Q lcl|NC_019916. 355 TPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERG--------LNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDN 426 (513) Q Consensus 355 ~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~--------l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~ 426 (513) .-.+..- -+...++.-++.+ +.++...+|.. +.-+++.++.++...+.-.......+++.+.-+ T Consensus 358 ~~~l~~~-~~~r~TAtEV~~r-------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~ 429 (536) T protein:vir:21 358 LNSAVQR-TGERVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTG 429 (536) T ss_pred hhhcccC-CCCCccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEec Confidence 1111111 1233455544433 33333333333 333444455555444433333333456665443 Q ss_pred CCcCHHHHHHHHHH-------HhcC--------CCHHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHh---hh Q lcl|NC_019916. 427 LPTDDVAIITALVQ-------AGAQ--------IPQEYLY----QYLPNVT----DADEIVKMMDKQRKAMLKTY---DT 480 (513) Q Consensus 427 ~p~d~~e~a~~~~k-------l~g~--------iS~et~~----~~l~~v~----D~~~E~~ri~~E~~~~~~~~---~~ 480 (513) +. .++..+.+.+ ++++ +....++ ..++..+ -.++|++.+.+++++.+... .. T Consensus 430 l~--~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~ 507 (536) T protein:vir:21 430 LE--AIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAA 507 (536) T ss_pred HH--HHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 2332222222 1222 2222232 2333212 13667777766554433322 11 Q ss_pred hcCCCCCCCCCCCCCCCCCCCCCC-CCCC Q lcl|NC_019916. 481 KGGLIINGTSGNDPEDEGVRGQQG-EPED 508 (513) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 508 (513) .+.....+.........+...+.| .|+- T Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 508 LAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HHHHHHHHHhcChhhHHhhhhccccCCCC Confidence 111111111100000000001111 1111 No 141 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.95 E-value=9.6e-06 Score=48.09 Aligned_cols=399 Identities=11% Similarity=0.015 Sum_probs=165.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc------cCC-CCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASR------RNE-KGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS 96 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~-~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~ 96 (513) .-+++..+.+..+. ......+.....+....... ... ......-+..+-...+|+..++-+-+-|+.+-... T Consensus 1 MG~f~~lf~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~ 79 (422) T protein:vir:13 1 MGFLRGLFNKKNNN-DEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDK 79 (422) T ss_pred CchhhhhhhccCCc-cchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecC Confidence 11222111111000 00000111000000000000 000 00001112334445567777777777888762211 Q ss_pred ----HHHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEE Q lcl|NC_019916. 97 ----SDRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRY 167 (513) Q Consensus 97 ----~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~ 167 (513) +..+..++.. |. .......+..+.+.+|.||+.+..+..|.+.-.+.++|..+.+++++........-+ + T Consensus 80 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~-~ 158 (422) T protein:vir:13 80 EEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKV-W 158 (422) T ss_pred cccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceE-E Confidence 2234444432 33 336667788899999999999998988887666778999998888765321111111 1 Q ss_pred EeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHH Q lcl|NC_019916. 168 HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQ 247 (513) Q Consensus 168 ~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~ 247 (513) |......+ . ...+.++.+.+++.. +.. +...|.|.++.+...++....+. T Consensus 159 y~~~~~~g--~-----~~~~~~~eiih~~~~--------------~~~---------~~~~G~s~~~~~~~~i~~~~~~~ 208 (422) T protein:vir:13 159 YVVTDKNG--K-----EHKLLPDEMLHFIGD--------------ITL---------DGLIGIKPLDYLRCTIENGRATQ 208 (422) T ss_pred EEEEeCCC--e-----EEEEcccceEEEcCC--------------CCC---------CCcccccHHHHHHHHHHHHHHHH Confidence 21111110 0 012233333333210 000 11246777777776666555554 Q ss_pred HHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCCce Q lcl|NC_019916. 248 SDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSADA 326 (513) Q Consensus 248 S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 326 (513) .-..+.+...+.|-.+++-.... .... ...++..-........ .++++.+ ..+.++ T Consensus 209 ~~~~~~f~ng~~p~gil~~~~~l----------~~e~----~~~~~~~~~~~~~g~~n~~~~~vl---------~~g~~~ 265 (422) T protein:vir:13 209 EFINKFFKNGLSIKGIVQYVGDL----------DEKA----KKIFKKEFESMSNGLENAHSISLL---------PFGYQF 265 (422) T ss_pred HHHHHHHhccCCccEEEEeCCCC----------CHHH----HHHHHHHHHHHhcCccccCCceec---------CCCcee Confidence 44444455444555555432110 0000 0111110000011001 1122222 223344 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 327 NYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEE 406 (513) Q Consensus 327 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~ 406 (513) +.++.......+....+...+.|+..-++|+...+...+ .+...++-.. ...+...|...++.|...+. T Consensus 266 ~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~-~~~sn~e~~~----------~~f~~~~l~P~~~~ie~~l~ 334 (422) T protein:vir:13 266 QPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLER-ATFNNLTEQQ----------KDFYVTTLQSSLTVYEQEIQ 334 (422) T ss_pred eeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHH----------HHHHHHHHHHHHHHHHHHHH Confidence 445444444556667778888999999999866543221 1111111111 11223333333333332222 Q ss_pred hcc-cccc-cccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 407 RVN-GKWD-IDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNV--TDADEIVKMMDKQRKAMLKTYDT 480 (513) Q Consensus 407 ~~~-~~~~-~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v--~D~~~E~~ri~~E~~~~~~~~~~ 480 (513) ..- .... .....+++.+..-+-.|..+.++++.++ +|+++.-.+.++++.- ++-+.-+.. ....+ T Consensus 335 ~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~---------~n~~~ 405 (422) T protein:vir:13 335 DKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGDRLLVN---------GNMIP 405 (422) T ss_pred HhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec---------cCccc Confidence 110 0000 0112344444555666888899998886 5789887777776432 211100000 00000 Q ss_pred hcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 481 KGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) .. ....+....+++ +.+ T Consensus 406 l~--~~~~~~~~~g~~----------~g~ 422 (422) T protein:vir:13 406 IE--MAGEQYKKGGEK----------GGK 422 (422) T ss_pred hh--hcccccccCCCc----------CCC Confidence 00 000000000000 000 No 142 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=97.94 E-value=1e-05 Score=47.96 Aligned_cols=389 Identities=10% Similarity=0.005 Sum_probs=162.0 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPR-LEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~-~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |-+. +.|..+.... ...... -..+-.+.-|.. ..+..-+.+.-...+|+. T Consensus 1 MG~~---------------~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~-------------~~~~~al~~~~V~~~v~~ 51 (411) T protein:vir:81 1 MGWW---------------SRLTRFFRPR-NETVDMTNPLLLQWLGVDP-------------DTPRNQLSEATYFACLKI 51 (411) T ss_pred CchH---------------HHHHhhccCc-ccccccchHHHHHHhcCcc-------------cChhhhhccHHHHHHHHH Confidence 1000 1111110000 000000 000111111110 001111223344557777 Q ss_pred HHHHhhcCCeeec---CC-----cHHHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 81 QTSYSVGNAIAMS---GP-----SSDRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 81 ~~~~l~g~p~~~~---~~-----~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) .++-+-+-|+++- .+ .+..+..++.. |. .......+..+.+.+|.||+++..+ +|.+.-.+.++|. T Consensus 52 Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~ 130 (411) T protein:vir:81 52 LSESLGKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQ 130 (411) T ss_pred HHHhHhhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCc Confidence 7777777787751 11 12234444432 32 3455667888899999999988877 4555555678999 Q ss_pred ceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE 227 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 227 (513) .+-++.++........ ..+|......+. . ...+..+.+++++.. +.+ +.. T Consensus 131 ~v~~~~~~~~~~~~~~-~~~~~~~~~~~g--~----~~~~~~~eiih~k~~--------------~~~---------~~~ 180 (411) T protein:vir:81 131 YVTIVVDDRGLLGEKN-AIWYRYNDPYDG--K----MYVFRNDEILHFKTS--------------VTF---------DGI 180 (411) T ss_pred eEEEEEcCcccccccc-eEEEEEEecCCc--e----EEEEccccEEEEcCC--------------CCC---------CCc Confidence 9888877643211111 111211110000 0 012344444443210 000 112 Q ss_pred CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hc Q lcl|NC_019916. 228 YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QA 306 (513) Q Consensus 228 ~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~ 306 (513) .|.|.+..+...++....+..-..+.+...+.|-.+++...... .. ....++..=........ .+ T Consensus 181 ~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e----~~~~~~~~~~~~~~g~~n~g 246 (411) T protein:vir:81 181 TGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLN----------QE----ARDRLVKGFEQFANGSKNAG 246 (411) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC----------HH----HHHHHHHHHHHHhcCccccC Confidence 46676666666665555544444444444445655554422110 00 00111100000011100 01 Q ss_pred ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 307 NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELAST 385 (513) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~ 385 (513) +++.+ ..+.+++.+........+....+...+.|+..-++|+...+... ++-|. ++.. T Consensus 247 ~~~vl---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n--~e~~---------- 305 (411) T protein:vir:81 247 KIIPV---------PLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYAS--AEAQ---------- 305 (411) T ss_pred Cceec---------CCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchh--HHHH---------- Confidence 12222 22334444443334445566778888999999999976654332 22111 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCH- Q lcl|NC_019916. 386 KRKQFERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDA- 460 (513) Q Consensus 386 ~~~~f~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~- 460 (513) ....+..++...++.+..-+... ..........+++.+..-+-.|..+.++++.++ +|+++.-.+.++++.-..+ T Consensus 306 ~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~g 385 (411) T protein:vir:81 306 NLAFYVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDY 385 (411) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 11223334444444444333321 110001112345555666677899999998886 5788877777766542211 Q ss_pred -HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCC Q lcl|NC_019916. 461 -DEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPED 496 (513) Q Consensus 461 -~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (513) +.-+... ...++..... +....|+. T Consensus 386 gD~~~~~~---------n~~pl~~~~~--~~~kgGd~ 411 (411) T protein:vir:81 386 GNNLMANG---------NYIPLSMLGA--NYGKGGDS 411 (411) T ss_pred CCeeeecc---------Cccchhhhhh--hhccCCCC Confidence 0000000 0000000000 00000000 No 143 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=97.89 E-value=1.3e-05 Score=47.45 Aligned_cols=404 Identities=11% Similarity=0.014 Sum_probs=164.8 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP-------RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYS 85 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 85 (513) |. -.+.|.++..-+ .+..+ -...+..+.-+..... ... ...-+.+.-...+|+..+.-+ T Consensus 1 M~----~~~r~~~~~~~~-~r~~~~~~~~~~~~~~~~~~~g~~~~~~-------~v~--~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 1 MK----IVDSVKKFFNFE-KRQTSQVIELNKDDEKLLEWLGISPSTI-------SVK--GKNALKVATVFACIKILSESV 66 (432) T ss_pred CC----hHHHHHHhcCcc-ccCcccccccCCchHHHHHHhCCCcCcc-------ccc--hhhhhccHHHHHHHHHHHHhh Confidence 00 111111111100 00000 0011111111100000 000 000122333445667777766 Q ss_pred hcCCeeec-CC-------cHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 86 VGNAIAMS-GP-------SSDRLDDFNRR--N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 86 ~g~p~~~~-~~-------~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) -+-|+.+- .. .+..+..++.. | ........+..+.+.+|.||+++..+..|.+.-.+.++|..+-+. T Consensus 67 a~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 146 (432) T protein:vir:10 67 SKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVY 146 (432) T ss_pred ccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 67787751 11 12235555432 3 234566778889999999999999998888766667888888777 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGD 232 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd 232 (513) .|+....... ...+|.... . . . ...+.+..+++++.. ++. +...|.|. T Consensus 147 ~d~~~~~~~~-~~~~y~~~~-~--g-~----~~~~~~~eiih~r~~--------------~~~---------~~~~G~s~ 194 (432) T protein:vir:10 147 IDDVGLLNSK-TKMWYVVNT-G--G-Q----QRVLKPEEILHFKNG--------------ITL---------DGLVGVPT 194 (432) T ss_pred EcCccccccc-ceEEEEEec-C--C-e----EEEEccccEEEecCC--------------CCC---------CCcccccH Confidence 6653211100 111221110 0 0 0 012334444433210 000 11246677 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeec Q lcl|NC_019916. 233 FENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILL 311 (513) Q Consensus 233 ~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~ 311 (513) +..+...++....+..-....+...+.|-.+++-.... ...........+. ....... .++++.+ T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l----------~~e~~~~~~~~~~----~~~~g~~n~~~~~vl 260 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL----------NEDAKKVFRENFE----SMSSGLQNSHRIALM 260 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC----------CHHHHHHHHHHHH----HHhcccccCCcceec Confidence 77666666655554444444444444565555432110 0000000000000 0000000 1122222 Q ss_pred cccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQF 390 (513) Q Consensus 312 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f 390 (513) ..+.+++.+........+....+...+.|+..-++|+...+... ++-|. ++.. ....+ T Consensus 261 ---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e~~----------~~~~~ 319 (432) T protein:vir:10 261 ---------PVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQ----------QQQFY 319 (432) T ss_pred ---------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHH----------HHHHH Confidence 22334444444334445566777888999999999986654322 22222 1111 11122 Q ss_pred HHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCC--CCHHHHH Q lcl|NC_019916. 391 ERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNV--TDADEIV 464 (513) Q Consensus 391 ~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v--~D~~~E~ 464 (513) ...++..++.|...+... ..........+++.+..-+..|..+.++++.++ .|+++.-.+.+.++.- ++-+.-+ T Consensus 320 ~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~ 399 (432) T protein:vir:10 320 TDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEe Confidence 334444444443333221 110001112345555566777899999998887 5788887777776432 2111000 Q ss_pred HHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 465 KMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 465 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) -... +...+..+.. ...+ +++++...+.+..++ T Consensus 400 ~~~n------~~~~~~~~~~---~~k~--~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 VNGN------MLPIDMAGQA---YLKG--GDTNGEVSKEGNEGN 432 (432) T ss_pred eccc------ccchhhcccc---ccCC--CCCCCCCCCCCCCCC Confidence 0000 0000000000 0000 000000000111111 No 144 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=97.89 E-value=1.3e-05 Score=47.45 Aligned_cols=404 Identities=11% Similarity=0.014 Sum_probs=164.8 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP-------RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYS 85 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 85 (513) |. -.+.|.++..-+ .+..+ -...+..+.-+..... ... ...-+.+.-...+|+..+.-+ T Consensus 1 M~----~~~r~~~~~~~~-~r~~~~~~~~~~~~~~~~~~~g~~~~~~-------~v~--~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 1 MK----IVDSVKKFFNFE-KRQTSQVIELNKDDEKLLEWLGISPSTI-------SVK--GKNALKVATVFACIKILSESV 66 (432) T ss_pred CC----hHHHHHHhcCcc-ccCcccccccCCchHHHHHHhCCCcCcc-------ccc--hhhhhccHHHHHHHHHHHHhh Confidence 00 111111111100 00000 0011111111100000 000 000122333445667777766 Q ss_pred hcCCeeec-CC-------cHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 86 VGNAIAMS-GP-------SSDRLDDFNRR--N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 86 ~g~p~~~~-~~-------~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) -+-|+.+- .. .+..+..++.. | ........+..+.+.+|.||+++..+..|.+.-.+.++|..+-+. T Consensus 67 a~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 146 (432) T protein:vir:10 67 SKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVY 146 (432) T ss_pred ccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 67787751 11 12235555432 3 234566778889999999999999998888766667888888777 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGD 232 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd 232 (513) .|+....... ...+|.... . . . ...+.+..+++++.. ++. +...|.|. T Consensus 147 ~d~~~~~~~~-~~~~y~~~~-~--g-~----~~~~~~~eiih~r~~--------------~~~---------~~~~G~s~ 194 (432) T protein:vir:10 147 IDDVGLLNSK-TKMWYVVNT-G--G-Q----QRVLKPEEILHFKNG--------------ITL---------DGLVGVPT 194 (432) T ss_pred EcCccccccc-ceEEEEEec-C--C-e----EEEEccccEEEecCC--------------CCC---------CCcccccH Confidence 6653211100 111221110 0 0 0 012334444433210 000 11246677 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeec Q lcl|NC_019916. 233 FENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILL 311 (513) Q Consensus 233 ~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~ 311 (513) +..+...++....+..-....+...+.|-.+++-.... ...........+. ....... .++++.+ T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l----------~~e~~~~~~~~~~----~~~~g~~n~~~~~vl 260 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL----------NEDAKKVFRENFE----SMSSGLQNSHRIALM 260 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC----------CHHHHHHHHHHHH----HHhcccccCCcceec Confidence 77666666655554444444444444565555432110 0000000000000 0000000 1122222 Q ss_pred cccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQF 390 (513) Q Consensus 312 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f 390 (513) ..+.+++.+........+....+...+.|+..-++|+...+... ++-|. ++.. ....+ T Consensus 261 ---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e~~----------~~~~~ 319 (432) T protein:vir:10 261 ---------PVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQ----------QQQFY 319 (432) T ss_pred ---------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHH----------HHHHH Confidence 22334444444334445566777888999999999986654322 22222 1111 11122 Q ss_pred HHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCC--CCHHHHH Q lcl|NC_019916. 391 ERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNV--TDADEIV 464 (513) Q Consensus 391 ~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v--~D~~~E~ 464 (513) ...++..++.|...+... ..........+++.+..-+..|..+.++++.++ .|+++.-.+.+.++.- ++-+.-+ T Consensus 320 ~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~ 399 (432) T protein:vir:10 320 TDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEe Confidence 334444444443333221 110001112345555566777899999998887 5788887777776432 2111000 Q ss_pred HHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 465 KMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 465 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) -... +...+..+.. ...+ +++++...+.+..++ T Consensus 400 ~~~n------~~~~~~~~~~---~~k~--~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 VNGN------MLPIDMAGQA---YLKG--GDTNGEVSKEGNEGN 432 (432) T ss_pred eccc------ccchhhcccc---ccCC--CCCCCCCCCCCCCCC Confidence 0000 0000000000 0000 000000000111111 No 145 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=97.89 E-value=1.3e-05 Score=47.45 Aligned_cols=404 Identities=11% Similarity=0.014 Sum_probs=164.8 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP-------RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYS 85 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 85 (513) |. -.+.|.++..-+ .+..+ -...+..+.-+..... ... ...-+.+.-...+|+..+.-+ T Consensus 1 M~----~~~r~~~~~~~~-~r~~~~~~~~~~~~~~~~~~~g~~~~~~-------~v~--~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 1 MK----IVDSVKKFFNFE-KRQTSQVIELNKDDEKLLEWLGISPSTI-------SVK--GKNALKVATVFACIKILSESV 66 (432) T ss_pred CC----hHHHHHHhcCcc-ccCcccccccCCchHHHHHHhCCCcCcc-------ccc--hhhhhccHHHHHHHHHHHHhh Confidence 00 111111111100 00000 0011111111100000 000 000122333445667777766 Q ss_pred hcCCeeec-CC-------cHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 86 VGNAIAMS-GP-------SSDRLDDFNRR--N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 86 ~g~p~~~~-~~-------~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) -+-|+.+- .. .+..+..++.. | ........+..+.+.+|.||+++..+..|.+.-.+.++|..+-+. T Consensus 67 a~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 146 (432) T protein:vir:10 67 SKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVY 146 (432) T ss_pred ccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 67787751 11 12235555432 3 234566778889999999999999998888766667888888777 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGD 232 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd 232 (513) .|+....... ...+|.... . . . ...+.+..+++++.. ++. +...|.|. T Consensus 147 ~d~~~~~~~~-~~~~y~~~~-~--g-~----~~~~~~~eiih~r~~--------------~~~---------~~~~G~s~ 194 (432) T protein:vir:10 147 IDDVGLLNSK-TKMWYVVNT-G--G-Q----QRVLKPEEILHFKNG--------------ITL---------DGLVGVPT 194 (432) T ss_pred EcCccccccc-ceEEEEEec-C--C-e----EEEEccccEEEecCC--------------CCC---------CCcccccH Confidence 6653211100 111221110 0 0 0 012334444433210 000 11246677 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeec Q lcl|NC_019916. 233 FENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILL 311 (513) Q Consensus 233 ~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~ 311 (513) +..+...++....+..-....+...+.|-.+++-.... ...........+. ....... .++++.+ T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l----------~~e~~~~~~~~~~----~~~~g~~n~~~~~vl 260 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL----------NEDAKKVFRENFE----SMSSGLQNSHRIALM 260 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC----------CHHHHHHHHHHHH----HHhcccccCCcceec Confidence 77666666655554444444444444565555432110 0000000000000 0000000 1122222 Q ss_pred cccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQF 390 (513) Q Consensus 312 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f 390 (513) ..+.+++.+........+....+...+.|+..-++|+...+... ++-|. ++.. ....+ T Consensus 261 ---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e~~----------~~~~~ 319 (432) T protein:vir:10 261 ---------PVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQ----------QQQFY 319 (432) T ss_pred ---------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHH----------HHHHH Confidence 22334444444334445566777888999999999986654322 22222 1111 11122 Q ss_pred HHHHHHHHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCC--CCHHHHH Q lcl|NC_019916. 391 ERGLNQRYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNV--TDADEIV 464 (513) Q Consensus 391 ~~~l~~~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v--~D~~~E~ 464 (513) ...++..++.|...+... ..........+++.+..-+..|..+.++++.++ .|+++.-.+.+.++.- ++-+.-+ T Consensus 320 ~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~ 399 (432) T protein:vir:10 320 TDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEe Confidence 334444444443333221 110001112345555566777899999998887 5788887777776432 2111000 Q ss_pred HHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 465 KMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 465 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) -... +...+..+.. ...+ +++++...+.+..++ T Consensus 400 ~~~n------~~~~~~~~~~---~~k~--~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 VNGN------MLPIDMAGQA---YLKG--GDTNGEVSKEGNEGN 432 (432) T ss_pred eccc------ccchhhcccc---ccCC--CCCCCCCCCCCCCCC Confidence 0000 0000000000 0000 000000000111111 No 146 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=97.84 E-value=1.5e-05 Score=46.99 Aligned_cols=451 Identities=10% Similarity=0.095 Sum_probs=155.1 Q ss_pred Ccc----chhhceeccCCcccC-----CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----c---ccccccccCCCC Q lcl|NC_019916. 1 MID----MQQANMNYQEDADKL-----TPTRIAAFIRHHYNNQRPRLEMLYDYYRGQND-----G---ILSPASRRNEKG 63 (513) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~-----~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~-----i---~~~~~~~~~~~~ 63 (513) |.+ +.+--.. ..+.... -.++|...+.+... +-..+.+=-.|++. + +........... T Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~ 75 (576) T protein:vir:96 1 MVTRLADIFKRLRL-GRDYEDIIDTVPIDDGLQANIRNIEE----KSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRS 75 (576) T ss_pred ChhhHHHHHHHHhc-cCccccchhhhhcccChhHHHHHhhh----hhhhhccccCCccchhhcceeeeeecCCCccccCc Confidence 221 1111110 0010000 01122222222110 00001111112111 0 000000000000 Q ss_pred CC-c-c---eee-----cchhHHHHHHHHHHhh-------------cCCeeecCC-----cHH-----HHHHHHHh---- Q lcl|NC_019916. 64 KA-D-H---RAV-----HSFARYIADFQTSYSV-------------GNAIAMSGP-----SSD-----RLDDFNRR---- 106 (513) Q Consensus 64 ~~-~-~---ri~-----~n~~~~ivd~~~~~l~-------------g~p~~~~~~-----~~~-----~l~~~~~~---- 106 (513) .. . . .+. .++...+|+..+.-+. +-++..... ... .+..++.. T Consensus 76 ~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~ 155 (576) T protein:vir:96 76 YMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRD 155 (576) T ss_pred chhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCC Confidence 00 0 0 011 1234445555432221 122322111 111 12222211 Q ss_pred -c----CHHHHHHHHHHHHhhCCeEEEEeeecCCC--ceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccce Q lcl|NC_019916. 107 -N----DIDTLNYELYLDMTVTGRAYEYVYRDPSQ--KGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQ 179 (513) Q Consensus 107 -n----~~~~~~~~~~~~a~~~G~~~~~v~~d~~~--~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~ 179 (513) | .+......+..+.+.+|.||+++..+.++ ++.-.+.++|..+.++.+.... ......+|+.... + T Consensus 156 ~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~-~~~~~~~~~~~~~--~---- 228 (576) T protein:vir:96 156 KDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGK-IIKGGKRFVQVIN--K---- 228 (576) T ss_pred CCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCc-eeeeeeEEEEecC--C---- Confidence 1 23456677888999999999887766554 3333456889888887765431 1111222222110 0 Q ss_pred eEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 180 TKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNE 259 (513) Q Consensus 180 ~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~ 259 (513) .....+....++++..... .-......|.|.++.+...++....+..-..+.+...+. T Consensus 229 --~~~~~~~~~dii~~~~~~~--------------------~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~ 286 (576) T protein:vir:96 229 --KVVASFTSREMAMGIRNPR--------------------TELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGT 286 (576) T ss_pred --ceEEEecccceEEEeecCC--------------------CCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 0111223333332221100 000001246676766666665554444444444444444 Q ss_pred hhhhee--cCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHH Q lcl|NC_019916. 260 AMLVIK--GDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAG 337 (513) Q Consensus 260 ~~l~~~--G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 337 (513) |-.++. |... ... +....++..=...+.....++. .+.....+.++.-++....... T Consensus 287 p~giL~~~~~~~----------ls~----e~~~~lr~~~~~~~~G~~nag~-------~p~vl~~G~~~~~ls~~~~d~q 345 (576) T protein:vir:96 287 TRGILQIKSEQQ----------QSQ----RALENFKREWKSSFSGINGSWQ-------VPVVMADDIKFVNMTPTANDMQ 345 (576) T ss_pred CceEEEeCCCCC----------CCH----HHHHHHHHHHHHHhcccccccc-------ceeecCCCceEEeccCChhhHH Confidence 543333 2100 000 0011111100000110001110 0111223344444444455666 Q ss_pred HHHHHHHHHHHHHHHhCcccccccccccc-cc----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_019916. 338 TELYKKRLAADIHKFSHTPDLTDDNFSGN-SS----GVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKW 412 (513) Q Consensus 338 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~S----g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~ 412 (513) +....+...+.|+..-++|+...+...+. .+ +.++.+. ... ......+..+|..+++.+...+...=- . T Consensus 346 fle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~s--n~e---~~~~~f~~~tL~P~~~~ie~~ln~~Ll-~ 419 (576) T protein:vir:96 346 FEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEA--DPG---KKQQQSQNKGLQPLLRFIEDLINTHII-S 419 (576) T ss_pred HHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccc--cHH---HHHHHHHHHHHHHHHHHHHHHHHhhhc-h Confidence 77888899999999999998655422111 11 1111110 000 111223333444444443333322100 0 Q ss_pred ccccceeeEEeCCCCCcCHHHHHHHHHHH-hcCCCHHHHHHhCCC--CCCHHHHH-----HHHH----H---HHHHHHHH Q lcl|NC_019916. 413 DIDPDEIGFIFRDNLPTDDVAIITALVQA-GAQIPQEYLYQYLPN--VTDADEIV-----KMMD----K---QRKAMLKT 477 (513) Q Consensus 413 ~~~~~~i~i~f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~l~~--v~D~~~E~-----~ri~----~---E~~~~~~~ 477 (513) .+ ...+.+.|.+..+.+.++..+..... +|+++.-.+.+.++. +++-+.-+ ..+. + +.+..... T Consensus 420 ~~-~~~~~~~f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 498 (576) T protein:vir:96 420 EY-SDKYVFQFVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKER 498 (576) T ss_pred hc-cCceEEEeccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCcccccc Confidence 11 13456778777776666665544332 588887777666532 22111000 0000 0 00000000 Q ss_pred hhhhcC--CCCCCCCCC-CCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 478 YDTKGG--LIINGTSGN-DPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 478 ~~~~~~--~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 513 (513) .+.... ....+..+. ...++..++..+.+...++.+ T Consensus 499 ~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~ 537 (576) T protein:vir:96 499 FDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSP 537 (576) T ss_pred ccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCc Confidence 110000 000000000 000011111111111111111 No 147 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=97.82 E-value=1.6e-05 Score=46.80 Aligned_cols=418 Identities=9% Similarity=-0.017 Sum_probs=161.9 Q ss_pred HHHHHHHHHHHHHHH----HHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec---CC Q lcl|NC_019916. 23 IAAFIRHHYNNQRPR----LEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS---GP 95 (513) Q Consensus 23 i~~~i~~~~~~~~~~----~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~---~~ 95 (513) +-.++.......++. -.-....+..--....... .......+..-+.+.-...+|+..++-+-+-|+.+- .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~ 79 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAW-QQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQ 79 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchh-hcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccC Confidence 111111100000000 0000011110000000000 000000000001122233366666666667787752 11 Q ss_pred ------cHHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEE Q lcl|NC_019916. 96 ------SSDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAV 165 (513) Q Consensus 96 ------~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~i 165 (513) .+..+..++.. |. .......+..+++.+|.||+++-.+..|.+.-.+.++|..+-++.++.. ++.+ T Consensus 80 g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g--~~~y-- 155 (454) T protein:vir:93 80 GIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDG--EVFY-- 155 (454) T ss_pred CccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCC--cEEE-- Confidence 11223444433 33 2356667788999999999999888888776566789999888776542 2222 Q ss_pred EEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHH Q lcl|NC_019916. 166 RYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDV 245 (513) Q Consensus 166 r~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~ 245 (513) ++.. ..... ......+..+.+++++... . .+...|.|.+......+..... T Consensus 156 ~~~~--~~~~~----~~~~~~~~~~eViH~k~~~--------------~---------~~~~~G~sp~~~~~~~i~~~~~ 206 (454) T protein:vir:93 156 RITP--DRNCG----ITEAVTVPAREVIHDRFNC--------------F---------FHPLIGLPPVYAAGLAATQGHH 206 (454) T ss_pred EEEe--ccccc----cceeEEecCcceEEeccCC--------------C---------CCCceeccHHHHHHHHHHHHHH Confidence 1111 00000 0011124444454442100 0 0112466666655555554443 Q ss_pred HHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCc Q lcl|NC_019916. 246 AQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSAD 325 (513) Q Consensus 246 ~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (513) +..-....+...+.|-.+++-... . ... ....++..=.........++++.+ ..+.+ T Consensus 207 ~~~~~~~~f~ng~~p~gil~~~~~-l---------~~e----~~~~~~~~~~~~~~g~n~g~~~vl---------~~g~~ 263 (454) T protein:vir:93 207 IQENSTSFFRNGGRPSGVIEIPGS-I---------TEE----NAKKLKSNWDSGYTGENAGKTAIL---------SNGAK 263 (454) T ss_pred HHHHHHHHHhccCCccEEEecCCC-C---------CHH----HHHHHHHHHHHHhcccccCCceec---------cCCce Confidence 333333333333334444432110 0 000 011111100000000011122222 23345 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 326 ANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIE 405 (513) Q Consensus 326 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l 405 (513) ++.++.......+....+...+.|+..-++|+...+...+. +...++.. ....+...+.-+++.+...+ T Consensus 264 ~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~sn~e~~----------~~~f~~~~l~P~~~~ie~~l 332 (454) T protein:vir:93 264 YNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPP-SSDNVEAL----------EQQYYSQCLQTLIESIELLL 332 (454) T ss_pred EEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-cchhHHHH----------HHHHHHHHHHHHHHHHHHHH Confidence 55555444555566777788899999999998665432221 11111111 11112222333322222222 Q ss_pred HhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHH---HHHHHHHHHHHHHh Q lcl|NC_019916. 406 ERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIV---KMMDKQRKAMLKTY 478 (513) Q Consensus 406 ~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~---~ri~~E~~~~~~~~ 478 (513) ...-- .. ....+++.+...+..|..+.++++.++ .|+++.-.+.++++. +++-++=+ ..+--+.....+. T Consensus 333 n~~L~-~~-~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~- 409 (454) T protein:vir:93 333 DEALE-TG-ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDA- 409 (454) T ss_pred HHhhc-CC-CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCc- Confidence 21100 01 112455666677778999999998886 678887777666543 22211000 0000000000000 Q ss_pred hhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 479 DTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ...+ ....+....... +......+.+..+.+.| T Consensus 410 ~~~~-~~~~~~~~~~~~-~~~~~d~~~~~~e~~~d 442 (454) T protein:vir:93 410 REDP-FASSGKTASVPQ-AVAASDGNKAITETEHD 442 (454) T ss_pred ccCC-CCCCccCCCCCC-CCCCCCCCCCccCCccc Confidence 0000 000011111110 11111111222333333 No 148 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.82 E-value=1.7e-05 Score=46.76 Aligned_cols=386 Identities=8% Similarity=-0.027 Sum_probs=160.2 Q ss_pred HHHHHHHHHHH-H---HHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee--cCCc--- Q lcl|NC_019916. 26 FIRHHYNNQRP-R---LEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM--SGPS--- 96 (513) Q Consensus 26 ~i~~~~~~~~~-~---~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~--~~~~--- 96 (513) ++-...-.++. . .....-.+-|..+. .... ...+=+.+.-...+|+..+.-+-+-|+.+ ..+. T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~g~~~~----~~~v----~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~ 72 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWLGINPS----ETYV----NGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKR 72 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHhcCCcC----ccee----chhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeee Confidence 11000000000 0 00000001111000 0000 00001223334456677777677778765 1111 Q ss_pred --HHHHHHHHH--hcC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEe Q lcl|NC_019916. 97 --SDRLDDFNR--RND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHA 169 (513) Q Consensus 97 --~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~ 169 (513) +..+..++. =|. .......+..+.+.+|.||+++..+..|.+.-.+.++|..+-++.++........-+.|.. T Consensus 73 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~ 152 (409) T protein:vir:10 73 VPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLY 152 (409) T ss_pred ccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEE Confidence 122334332 233 3355667888999999999999888888876666788888877776543211111111111 Q ss_pred ecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_019916. 170 VQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSD 249 (513) Q Consensus 170 ~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~ 249 (513) . ... . ....+....+++++... + +...|.|.++.+...++....+... T Consensus 153 ~-~~~---g----~~~~~~~~evih~r~~~---------------~---------d~~~G~s~i~~~~~~i~~~~~~~~~ 200 (409) T protein:vir:10 153 T-DDL---G----QRHKFMSDEILHFKGLT---------------A---------DGLAGLSVIELLNHLIENGKSSETY 200 (409) T ss_pred E-eCC---c----eeEEeccccEEEecCcC---------------C---------CCcccccHHHHHHHHHHHHHHHHHH Confidence 1 000 0 01123444444432110 0 1113667666666666555444444 Q ss_pred HHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCCceeE Q lcl|NC_019916. 250 TANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSADANY 328 (513) Q Consensus 250 ~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 328 (513) ....+...+.|-.+++-.... ...... .++..-........ .++++.+ ..+.+++- T Consensus 201 ~~~~f~ng~~~~gil~~~~~l----------~~e~~~----~~~~~~~~~~~g~~n~~~~~vl---------~~g~~~~~ 257 (409) T protein:vir:10 201 LNNFFKNGLQVKGLVQYAGDL----------NPEAEE----VFKENFERMSSGLKNAHRIAML---------PIGYKFEP 257 (409) T ss_pred HHHHHhccCCCcEEEEcCCCC----------CHHHHH----HHHHHHHHHhccccccCCceec---------CCCceEEE Confidence 444444444555555432110 000000 00000000000001 1112222 22334444 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019916. 329 IHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV 408 (513) Q Consensus 329 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~ 408 (513) +........+....+...+.|+..-++|+...+... ..++..++.. ....+..+|+..++.|..-++.. T Consensus 258 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~e~~----------~~~f~~~~l~P~~~~ie~~ln~k 326 (409) T protein:vir:10 258 ISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLD-RATHSNITEQ----------NREFYIDTLQSILNMYELEINYK 326 (409) T ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCccccHHHH----------HHHHHHHHHHHHHHHHHHHHHHh Confidence 444444455667778888999999999976654221 1122222111 11223334444444443333321 Q ss_pred c-cccc-cccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_019916. 409 N-GKWD-IDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDTKG 482 (513) Q Consensus 409 ~-~~~~-~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~ 482 (513) - .... .....+++.+..-+-.|..+.++++.++ +|+++.-.+.+.++. +++-+ + ...+.. T Consensus 327 L~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD-~-------------~~~~~n 392 (409) T protein:vir:10 327 LFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGD-V-------------LLINGN 392 (409) T ss_pred hcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-e-------------eeeccC Confidence 1 0001 1112345555555567888999988887 578887666666643 22100 0 000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 483 GLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) . .+-+..+++. ..+.++ T Consensus 393 ~------~~~~~~~~~~-----~kgGe~ 409 (409) T protein:vir:10 393 M------IPVKMAGEQY-----SKGGEK 409 (409) T ss_pred c------cchhhccccc-----cccCCC Confidence 0 0000000000 001111 No 149 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=97.81 E-value=1.8e-05 Score=46.61 Aligned_cols=451 Identities=11% Similarity=0.052 Sum_probs=161.1 Q ss_pred Ccc-chhhc-------------eeccCC----------cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019916. 1 MID-MQQAN-------------MNYQED----------ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPA 56 (513) Q Consensus 1 ~~~-~~~~~-------------~~~~~~----------~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~ 56 (513) |-. |-+|. ..|.|. ....+.+.|.+.++...............+..|... ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 76 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKT----KP 76 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccC----cC Confidence 000 00000 011111 011222223333222100000001011111111100 00 Q ss_pred cccCCCC-CC-ccee-ecchhHHHHHHHHHHh-----------hcCCeeec---CC---------cHHHHHHHHHhc--- Q lcl|NC_019916. 57 SRRNEKG-KA-DHRA-VHSFARYIADFQTSYS-----------VGNAIAMS---GP---------SSDRLDDFNRRN--- 107 (513) Q Consensus 57 ~~~~~~~-~~-~~ri-~~n~~~~ivd~~~~~l-----------~g~p~~~~---~~---------~~~~l~~~~~~n--- 107 (513) ..+.... .. ...+ .....+.+++..+..+ -+-|+.+- .+ ....+..++... T Consensus 77 ~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~ 156 (574) T protein:vir:80 77 SIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQF 156 (574) T ss_pred ccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCC Confidence 0000000 00 0001 1223334444433222 13455431 11 012344555321 Q ss_pred ------CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeE Q lcl|NC_019916. 108 ------DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTK 181 (513) Q Consensus 108 ------~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~ 181 (513) .+......+..+.+.+|.+|+.+-.+.+|.+.-.+.++|..+.+..+.... ......+||.... +. T Consensus 157 ~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~-~~~~~~~y~~~~~--g~----- 228 (574) T protein:vir:80 157 RDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGK-LIKNGERFVQVID--NR----- 228 (574) T ss_pred CCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccc-cccCceEEEEEeC--Cc----- Confidence 123456678888999999999888888888766667899998887765331 1111233433221 00 Q ss_pred EEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019916. 182 YEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAM 261 (513) Q Consensus 182 ~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~ 261 (513) ....+....+++++..... +.+ ....|.|.++.+...++....+..-..+.+...+.|- T Consensus 229 -~~~~~~~~eiih~~~~~~~-----------~~~---------~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~ 287 (574) T protein:vir:80 229 -IVAKFNERELAFAVRNPRA-----------DIE---------VGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTR 287 (574) T ss_pred -eEEEEccccEEEEeccCCC-----------Ccc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 1123444455444321100 011 1114777777666666655554444444444444454 Q ss_pred hheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHH Q lcl|NC_019916. 262 LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELY 341 (513) Q Consensus 262 l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 341 (513) .++.-..... .. .+....++..-...+.....++.+ +.....+.++.-++.......+... T Consensus 288 gil~~~~~~~--------ls----~e~~~~lk~~~~~~~~G~~n~g~~-------~vl~~~G~~~~~l~~s~~D~qfle~ 348 (574) T protein:vir:80 288 GILHVKTGQQ--------QS----QQALDIFRREWRSSLAGINGSWQI-------PVVSAEDVKFVNMTPSANDMQFEKW 348 (574) T ss_pred eEEEeCCCCC--------CC----HHHHHHHHHHHHHHhccccccccc-------eeecCCCceEEEccCChhHHHHHHH Confidence 3332110000 00 000011111000000000111111 1111223344444444455556677 Q ss_pred HHHHHHHHHHHhCccccccccccccc-cHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccee Q lcl|NC_019916. 342 KKRLAADIHKFSHTPDLTDDNFSGNS-SGVAMKY-KVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEI 419 (513) Q Consensus 342 ~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~-~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i 419 (513) .+...+.|+..-++|++..+...... .|..... -+..+. ......+..+|+-+++.+...+...-- ..+. ..+ T Consensus 349 ~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E---~~~~~f~~~tL~P~~~~ie~~ln~~Ll-~~~~-~~~ 423 (574) T protein:vir:80 349 LNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSK---EKMQASQNKGLQPLLRFIEDTVNTYIV-AEFG-EKY 423 (574) T ss_pred HHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcC-Cce Confidence 88888999999999986654221111 1100000 000000 111122222333333333332222100 0111 245 Q ss_pred eEEeCCCCCcCHHHHHHHHHH-HhcCCCHHHHHHhCCC--CCCHHHHH-----HHHHH-------HHHHHHHHhhhhcCC Q lcl|NC_019916. 420 GFIFRDNLPTDDVAIITALVQ-AGAQIPQEYLYQYLPN--VTDADEIV-----KMMDK-------QRKAMLKTYDTKGGL 484 (513) Q Consensus 420 ~i~f~~~~p~d~~e~a~~~~k-l~g~iS~et~~~~l~~--v~D~~~E~-----~ri~~-------E~~~~~~~~~~~~~~ 484 (513) .+.|.+.-..+..+.+.++.. .+|+++.-.+.++++. +++-+.=+ ..+.+ +.+......+... T Consensus 424 ~~~f~~~d~~~~~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~-- 501 (574) T protein:vir:80 424 QFQFRGGDLSAQLDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLL-- 501 (574) T ss_pred EEEecccchhhHHHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccc-- Confidence 778887766666665554332 2688888777666532 32111000 00000 0000000000000 Q ss_pred CCCCCCCCCCCCCCCCC--CCCCCCCccCCC Q lcl|NC_019916. 485 IINGTSGNDPEDEGVRG--QQGEPEDERTSD 513 (513) Q Consensus 485 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 513 (513) .+.+.+++.++..+ ....++++...+ T Consensus 502 ---~~~~~~~~~~~~~~p~~~~~d~~~~~~~ 529 (574) T protein:vir:80 502 ---ELSGGDVEQPEPEEPKDSQNDTDVSFQD 529 (574) T ss_pred ---cccCCCCCCCCCCCCCCccccccchhhh Confidence 00011111111111 011111111111 No 150 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=97.76 E-value=2.2e-05 Score=46.15 Aligned_cols=431 Identities=9% Similarity=0.049 Sum_probs=183.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC--C----e- Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN--A----I- 90 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~--p----~- 90 (513) |+...-.+.+...+..-..+.+.+.+|..-.- ...............++..+.....++..++.|++. | + T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~---~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 77 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYL---IDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFF 77 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcc---cCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 44433333333333333455666666643210 001100011122334577777888888888877642 2 2 Q ss_pred eecCCcH-----------HH-----------HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccc Q lcl|NC_019916. 91 AMSGPSS-----------DR-----------LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPME 148 (513) Q Consensus 91 ~~~~~~~-----------~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~ 148 (513) ++...+. .. +...+..++|.....++.++..++|.|. +|.++++ +. +-|.. T Consensus 78 ~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--ly~~~~~---~~--~~pl~ 150 (522) T protein:vir:10 78 KLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNAL--IFMGKDG---LK--TFPLT 150 (522) T ss_pred cccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcee--EEEcCCC---ce--EEEcc Confidence 2222111 11 2233456789999999999999999877 4556654 22 22344 Q ss_pred eEEEecCCCCcceEEEEEEEeecc-------c-c------cccceeEEEEEEEc-----C--CcEEEEEeeccCCccccc Q lcl|NC_019916. 149 CFIIYDRSVNPKPIMAVRYHAVQT-------V-V------DNITQTKYEVETWT-----E--NDYTRYKPIVVAGSVPTL 207 (513) Q Consensus 149 ~~~~~d~~~~~~~~~~ir~~~~~~-------~-~------~~~~~~~~~ve~yt-----~--~~~~~~~~~~~~~~~~~~ 207 (513) -+++--+. .+++...+|.++... . + ....+....+++|+ . +.+.++.. ..+.... T Consensus 151 ~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~-~~~~~~~-- 226 (522) T protein:vir:10 151 RYVINRDG-DGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQE-AFDKIIP-- 226 (522) T ss_pred eEEEeeCC-CCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEc-cCCcccc-- Confidence 44444433 345565555443310 0 0 00001111233333 1 11222211 1111111 Q ss_pred cccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccccccc Q lcl|NC_019916. 208 EVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVD 282 (513) Q Consensus 208 ~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~ 282 (513) ...-..++..+|++.++- +.+|+|-.++..+-+-.+|.+.-......+...+|.+.+.-.+...... T Consensus 227 ~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~-------- 298 (522) T protein:vir:10 227 DSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPAT-------- 298 (522) T ss_pred ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccc-------- Confidence 111123567788877653 4679999999999999999887788888888888886652111000000 Q ss_pred chhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCcccccc Q lcl|NC_019916. 283 PSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKFSHTPDLTD 360 (513) Q Consensus 283 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 360 (513) + .+.+......+..+++..+. ...+.......++.++..|...-..-+ T Consensus 299 ---------l------------------~~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~--- 348 (522) T protein:vir:10 299 ---------I------------------AKAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMN--- 348 (522) T ss_pred ---------c------------------cCCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhcc--- Confidence 0 00011111122233343333 224556667777888877765321111 Q ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcccccccc--c-ceeeEEeCCCCCc Q lcl|NC_019916. 361 DNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ--------RYTVVAHIEERVNGKWDID--P-DEIGFIFRDNLPT 429 (513) Q Consensus 361 ~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~--------~~~li~~~l~~~~~~~~~~--~-~~i~i~f~~~~p~ 429 (513) ..-+...++.-++.. +.+++..+|..+.+ +++-++.++...+.-.... . ....|++..++-+ T Consensus 349 ~~d~~rvTAtEV~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Lar 421 (522) T protein:vir:10 349 VRNAERVTAEEVRLT-------QLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGR 421 (522) T ss_pred CCCCCCCCHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHH Confidence 111234566655443 23344444443333 3333444444433211111 1 1122333333322 Q ss_pred CHHHHHHHHHH----HhcCCCHHHH---------HHhC---CCCC-----CHHHHHHHHHHHHHHHHHHhhhhcCC--CC Q lcl|NC_019916. 430 DDVAIITALVQ----AGAQIPQEYL---------YQYL---PNVT-----DADEIVKMMDKQRKAMLKTYDTKGGL--II 486 (513) Q Consensus 430 d~~e~a~~~~k----l~g~iS~et~---------~~~l---~~v~-----D~~~E~~ri~~E~~~~~~~~~~~~~~--~~ 486 (513) ++.++.+.. ++.++..+.+ +..+ -+|+ -.++|++.+++++++.+......... .. T Consensus 422 --aq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~ 499 (522) T protein:vir:10 422 --GQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMT 499 (522) T ss_pred --HHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 222222211 1122222222 2221 1222 13455555555444433322111111 11 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 487 NGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) .........+.+.-++-..++.| T Consensus 500 ~~~~~~~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 500 GSPLMDPTKNPQLMDEEQPPMEE 522 (522) T ss_pred cccccCccccHHHHHHhCCCCCC Confidence 11111111111111222223333 No 151 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.67 E-value=3.1e-05 Score=45.31 Aligned_cols=384 Identities=11% Similarity=0.040 Sum_probs=156.3 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) ||.+..- ........ ....+...... . .-..+...+.|... . ...+..-+.+.-...+|+. T Consensus 2 ~m~~~~~--~~~~~~~~-~~~~~~~~~~~---~---~~~~~~~~~~~~~g------~----~v~~~~al~~~~v~~~v~~ 62 (392) T protein:vir:74 2 ILPILNF--INQTNDPP-EAGSVQSYFPD---G---NDAQIMESLLGDNN------E----WVSARAALRNSDLFSIILQ 62 (392) T ss_pred cchhhhh--hhcccCcc-ccccccccccc---C---chhhhhhhccCCCC------c----ccchhhhhcchHHHHHHHH Confidence 2222110 00000000 00000000000 0 00000000111000 0 0000011223344456777 Q ss_pred HHHHhhcCCeeecCCcHHHH-HHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCc Q lcl|NC_019916. 81 QTSYSVGNAIAMSGPSSDRL-DDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNP 159 (513) Q Consensus 81 ~~~~l~g~p~~~~~~~~~~l-~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~ 159 (513) .++-+-+-|+++.......| ..=.....-......+..+++.+|.||+++-.+.+|.+.-.+.++|..+-+..+... . T Consensus 63 ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~-~ 141 (392) T protein:vir:74 63 LSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE-N 141 (392) T ss_pred HHHhhccCceeeccchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-c Confidence 77777777887654433322 211111223455666778999999999999889888766566688888877765432 2 Q ss_pred ceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHH Q lcl|NC_019916. 160 KPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSL 239 (513) Q Consensus 160 ~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~l 239 (513) .+. |.....++. ......+..+.+++++... + .....|.|-++.+... T Consensus 142 ~~~-----y~~~~~~~~----~~~~~~~~~~evih~~~~~--------------~---------~~~~~G~s~i~~~~~~ 189 (392) T protein:vir:74 142 GMY-----YNITFDDPK----IEPILQAPQSDLIHMKLLS--------------I---------DGGKTGISPLYSLRRE 189 (392) T ss_pred eEE-----EEEEecCCc----cceeEEEcCccEEEecCCC--------------C---------CCccccccHHHHHHHH Confidence 221 111111110 1111234444444432210 0 0112477777766666 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccc Q lcl|NC_019916. 240 IDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNG 319 (513) Q Consensus 240 iD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (513) ++....+..-....+...+.|-.+++=....... ........... ......++.+.+ T Consensus 190 i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~--------~~~~~~~~~~~-------~~~~n~g~~~vl-------- 246 (392) T protein:vir:74 190 SKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS--------DKDKASRSRSF-------MKRSRSGGPVVL-------- 246 (392) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch--------HHHHHHHHHHH-------hccccCCCeeec-------- Confidence 6555554444444445445454444321100000 00000000000 000111122222 Q ss_pred cccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSS-GVAMKYKVLGTVELASTKRKQFERGLNQRY 398 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S-g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~ 398 (513) ..+.+++-++.......+....+...+.|+..=++|+...+....+.| ..+++. .+...|...+ T Consensus 247 -~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~--------------~~~~~l~p~~ 311 (392) T protein:vir:74 247 -DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISG--------------MYASALNRYL 311 (392) T ss_pred -CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHH--------------HHHHHHHHHH Confidence 223344444444445566777888889999999999766543322222 222221 2233333333 Q ss_pred HHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhC---CCCCCHHHHHHHHHHHHHH Q lcl|NC_019916. 399 TVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYL---PNVTDADEIVKMMDKQRKA 473 (513) Q Consensus 399 ~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l---~~v~D~~~E~~ri~~E~~~ 473 (513) +.+..-+...-. ..+++.+..-+-.|..+.++.+.++ +|+++...+.+++ |+..| |+.+ T Consensus 312 ~~ie~~l~~~l~------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pn---e~r~------- 375 (392) T protein:vir:74 312 RPAISELEYKLS------DHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA------- 375 (392) T ss_pred HHHHHHHHHhcc------chhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcc---ccch------- Confidence 333322222100 0122222333334666777777776 5788887776554 43221 1110 Q ss_pred HHHHhhhhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 474 MLKTYDTKGGLIINGTSGNDPEDEGVR 500 (513) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (513) .+...+.. +. +++++.+ T Consensus 376 -~enl~~~~--------~G-d~~~p~p 392 (392) T protein:vir:74 376 -PENTNKKT--------TG-QSNEPVP 392 (392) T ss_pred -hcCCCCCC--------CC-CCCCCCC Confidence 00111111 11 1122222 No 152 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=97.63 E-value=3.5e-05 Score=45.00 Aligned_cols=391 Identities=10% Similarity=0.043 Sum_probs=174.1 Q ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee-cCCc Q lcl|NC_019916. 21 TRIAAFIRHHYN---NQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM-SGPS 96 (513) Q Consensus 21 ~~i~~~i~~~~~---~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~-~~~~ 96 (513) -.+.+++.+... ....-...+.++|-|.... .... .....-+.......+|+..+.-+-+-|+++ ...+ T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~----v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~ 73 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTA---SGER----VSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTD 73 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccc---cCce----echhhhhccHHHHHHHHHHHHhhhhCceEEEEecC Confidence 122222222110 0011123344555443210 0000 001112334555667888888777888764 2111 Q ss_pred --------HHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEE Q lcl|NC_019916. 97 --------SDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMA 164 (513) Q Consensus 97 --------~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ 164 (513) ....+.++.. |. .......+..+.+.+|.||+++-.+..|.+.-.+.++|..+-++.++... . T Consensus 74 ~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~-~---- 148 (416) T protein:vir:12 74 GGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTG-M---- 148 (416) T ss_pred CccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCc-E---- Confidence 1123333332 32 33566678889999999999998888887665667889888777655431 1 Q ss_pred EEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHH Q lcl|NC_019916. 165 VRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYD 244 (513) Q Consensus 165 ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~ 244 (513) .+|.... .+ . .+ .+.+..+++++... + +...|.|.++.+...++... T Consensus 149 -~~~~~~~-~g---~---~~-~~~~~eiih~~~~~---------------~---------~~~~G~s~i~~~~~~i~~~~ 195 (416) T protein:vir:12 149 -LWYQTVL-NG---K---AI-ELYDYEVLHFKGLS---------------T---------DGIHGKSPIGVVREHIGAQA 195 (416) T ss_pred -EEEEEec-CC---e---EE-EecCccEEEecCcC---------------C---------CCcccccHHHHHHHHHHHHH Confidence 1222211 11 0 01 23444444442110 0 11146677777766666655 Q ss_pred HHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc-hhcceeeccccccccccccC Q lcl|NC_019916. 245 VAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM-RQANMILLKTGMAPNGQQTS 323 (513) Q Consensus 245 ~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 323 (513) .+..-..+.+...+.|-.+++-... ........... .+... ..++++.+ ..+ T Consensus 196 ~~~~~~~~~~~ng~~p~~il~~~~~----------~~~e~~~~~~~--------~~~~~~~~~~~~vl---------~~g 248 (416) T protein:vir:12 196 AATKYNAKLYKNEATPRGILKVPAF----------LDEKPKENVRK--------EWKRVNKVENIAII---------DYG 248 (416) T ss_pred HHHHHHHHHHhcCCCCceEEecCCC----------CCHHHHHHHHH--------HHHHHhcCCCeeec---------CCC Confidence 5555555555555556555543211 00011111111 11111 11222222 233 Q ss_pred CceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 324 ADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSG-VAMKYKVLGTVELASTKRKQFERGLNQRYTVV 401 (513) Q Consensus 324 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg-~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li 401 (513) .+++.++.......+....+...+.|+..-++|+...+... ++-|. +... ...+...|...++.+ T Consensus 249 ~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~-------------~~f~~~~l~P~~~~i 315 (416) T protein:vir:12 249 LEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQS-------------IEYVRNTLQPWIVNF 315 (416) T ss_pred ceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHH-------------HHHHHHHHHHHHHHH Confidence 44444444444445566778888999999999986654332 22222 1111 112233444444444 Q ss_pred HHHHHhcc-ccc-ccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHH Q lcl|NC_019916. 402 AHIEERVN-GKW-DIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAML 475 (513) Q Consensus 402 ~~~l~~~~-~~~-~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~ 475 (513) ..-+...- ... ......+++.+..-+..|..+.++++.++ +|+++.-.+.++++. +++-+.-+.... . T Consensus 316 e~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n------~ 389 (416) T protein:vir:12 316 EQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLN------Y 389 (416) T ss_pred HHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccc------c Confidence 33332211 000 01112355556666788999999998887 578888777766642 332111000000 0 Q ss_pred HHhhhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 476 KTYDTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) -..+...........+...++++.+++ T Consensus 390 ~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 390 VFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred ccccccchhhccccccccCCCCCcCCC Confidence 000000000000000111111111111 No 153 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=97.63 E-value=3.5e-05 Score=45.00 Aligned_cols=402 Identities=11% Similarity=0.005 Sum_probs=161.3 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQ 81 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 81 (513) |-+=.-.+ .........++..+ -+...|..... ..... ...=+.+.-.-..|+.. T Consensus 1 Mg~f~~~~---~r~~~~~~~~~~~~---------------~~~~~~~~~~~---~~~~~----~~~al~~~~v~~cv~~I 55 (416) T protein:vir:45 1 MGIFYKNE---KRDLQYNEDDLQMM---------------VQTLPGFQGTK---LRQYK----DIEAIRHSDIFTAVMMI 55 (416) T ss_pred CCcccccc---cccccCCCcchhHH---------------HHHhccccccC---ccccc----hhhhhcchHHHHHHHHH Confidence 00000000 00000011111111 11111111000 00000 00001112222356777 Q ss_pred HHHhhcCCeeecCCcH----HHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 82 TSYSVGNAIAMSGPSS----DRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 82 ~~~l~g~p~~~~~~~~----~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) ++-+-+-|+++..+.. ..+..++. . |. .......+....+.+|.||+++.++..|.+.-.+.++|..+.+. T Consensus 56 a~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~ 135 (416) T protein:vir:45 56 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 135 (416) T ss_pred HHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 7777777887643322 22333332 1 32 23555677888899999999999988887655567889888887 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGD 232 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd 232 (513) .++.. .+.+. +...+..+ . .....+....+.+++.. ++ +...|.|. T Consensus 136 ~~~~g--~~~~~---~~~~~~~~--~---~~~~~~~~~evihir~~---------------~~---------d~~~G~s~ 181 (416) T protein:vir:45 136 SDARG--RLYYF---HQRIDSNG--N---NIERNVKFEDMLDIKFY---------------SL---------DGINGLSL 181 (416) T ss_pred ECCCc--cEEEE---EEEecCCC--c---eeEEEEccccEEEeccC---------------CC---------CCccccCH Confidence 76542 22221 11111111 0 11123444444443210 00 11146676 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeec Q lcl|NC_019916. 233 FENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILL 311 (513) Q Consensus 233 ~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~ 311 (513) ++.+...++.......-..+.+...+.|-.+++-... .. ... ....++..=...+..... ++++.+ T Consensus 182 i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~~---------~~~---~~~~~~~~~~~~~~g~~nag~~~vl 248 (416) T protein:vir:45 182 LDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LD---------NKK---ARDRAREEFHKSFSGTKQAGKVVVL 248 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-CC---------CHH---HHHHHHHHHHHHhcCccccCceeec Confidence 6666666654444433334444444444444432110 00 000 000010000000111011 122222 Q ss_pred cccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFE 391 (513) Q Consensus 312 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~ 391 (513) ..+.++..++.......+....+..++.|+..-++|+...+...++.|.+..+. .|. T Consensus 249 ---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~--------------~~~ 305 (416) T protein:vir:45 249 ---------DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--------------DYL 305 (416) T ss_pred ---------CCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHH--------------HHH Confidence 222344444434444456667777888999999999765432222222221111 122 Q ss_pred HHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHH Q lcl|NC_019916. 392 RGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMM 467 (513) Q Consensus 392 ~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri 467 (513) .++..+++.|..-+...-. .......+++.+..-+-.|..+.++++.++ .|+++.-.+.+.++. +++.+...-.+ T Consensus 306 ~~l~P~~~~ie~~ln~~l~-~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~ 384 (416) T protein:vir:45 306 STLKPYITCVCAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRV 384 (416) T ss_pred HHHHHHHHHHHHHHhhhcc-ccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEee Confidence 3344444444333332211 111223455555555667888889888876 678888777776633 33332211111 Q ss_pred HHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 468 DKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 468 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) .. ...+-...+. .+....+.......|.++.| T Consensus 385 ~~---------n~~~~~~~~~-~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 385 DL---------NHVNIELVDE-YQMNKSRATDKKLKGGEENE 416 (416) T ss_pred cc---------cccccccccc-cCcccccccccccCCCCCCC Confidence 00 0000000000 00000000001111111111 No 154 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=97.63 E-value=3.5e-05 Score=45.00 Aligned_cols=402 Identities=11% Similarity=0.005 Sum_probs=161.3 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHH Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQ 81 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 81 (513) |-+=.-.+ .........++..+ -+...|..... ..... ...=+.+.-.-..|+.. T Consensus 1 Mg~f~~~~---~r~~~~~~~~~~~~---------------~~~~~~~~~~~---~~~~~----~~~al~~~~v~~cv~~I 55 (416) T protein:vir:81 1 MGIFYKNE---KRDLQYNEDDLQMM---------------VQTLPGFQGTK---LRQYK----DIEAIRHSDIFTAVMMI 55 (416) T ss_pred CCcccccc---cccccCCCcchhHH---------------HHHhccccccC---ccccc----hhhhhcchHHHHHHHHH Confidence 00000000 00000011111111 11111111000 00000 00001112222356777 Q ss_pred HHHhhcCCeeecCCcH----HHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 82 TSYSVGNAIAMSGPSS----DRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 82 ~~~l~g~p~~~~~~~~----~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) ++-+-+-|+++..+.. ..+..++. . |. .......+....+.+|.||+++.++..|.+.-.+.++|..+.+. T Consensus 56 a~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~ 135 (416) T protein:vir:81 56 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 135 (416) T ss_pred HHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 7777777887643322 22333332 1 32 23555677888899999999999988887655567889888887 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGD 232 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd 232 (513) .++.. .+.+. +...+..+ . .....+....+.+++.. ++ +...|.|. T Consensus 136 ~~~~g--~~~~~---~~~~~~~~--~---~~~~~~~~~evihir~~---------------~~---------d~~~G~s~ 181 (416) T protein:vir:81 136 SDARG--RLYYF---HQRIDSNG--N---NIERNVKFEDMLDIKFY---------------SL---------DGINGLSL 181 (416) T ss_pred ECCCc--cEEEE---EEEecCCC--c---eeEEEEccccEEEeccC---------------CC---------CCccccCH Confidence 76542 22221 11111111 0 11123444444443210 00 11146676 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeec Q lcl|NC_019916. 233 FENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILL 311 (513) Q Consensus 233 ~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~ 311 (513) ++.+...++.......-..+.+...+.|-.+++-... .. ... ....++..=...+..... ++++.+ T Consensus 182 i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~~---------~~~---~~~~~~~~~~~~~~g~~nag~~~vl 248 (416) T protein:vir:81 182 LDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LD---------NKK---ARDRAREEFHKSFSGTKQAGKVVVL 248 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-CC---------CHH---HHHHHHHHHHHHhcCccccCceeec Confidence 6666666654444433334444444444444432110 00 000 000010000000111011 122222 Q ss_pred cccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 312 KTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFE 391 (513) Q Consensus 312 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~ 391 (513) ..+.++..++.......+....+..++.|+..-++|+...+...++.|.+..+. .|. T Consensus 249 ---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~--------------~~~ 305 (416) T protein:vir:81 249 ---------DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--------------DYL 305 (416) T ss_pred ---------CCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHH--------------HHH Confidence 222344444434444456667777888999999999765432222222221111 122 Q ss_pred HHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHH Q lcl|NC_019916. 392 RGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMM 467 (513) Q Consensus 392 ~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri 467 (513) .++..+++.|..-+...-. .......+++.+..-+-.|..+.++++.++ .|+++.-.+.+.++. +++.+...-.+ T Consensus 306 ~~l~P~~~~ie~~ln~~l~-~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~ 384 (416) T protein:vir:81 306 STLKPYITCVCAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRV 384 (416) T ss_pred HHHHHHHHHHHHHHhhhcc-ccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEee Confidence 3344444444333332211 111223455555555667888889888876 678888777776633 33332211111 Q ss_pred HHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 468 DKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 468 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) .. ...+-...+. .+....+.......|.++.| T Consensus 385 ~~---------n~~~~~~~~~-~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 385 DL---------NHVNIELVDE-YQMNKSRATDKKLKGGEENE 416 (416) T ss_pred cc---------cccccccccc-cCcccccccccccCCCCCCC Confidence 00 0000000000 00000000001111111111 No 155 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=97.62 E-value=3.6e-05 Score=44.92 Aligned_cols=449 Identities=10% Similarity=0.040 Sum_probs=191.1 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHh---cCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYY---RGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY---~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |.. =+.+.+.+..+.....|.+ +.+.+.+|. .+.-.. .. ...-.+...++..+-+...+++.++.|+ T Consensus 1 m~~--~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~---~~--~~~~~~~~~~~~dst~~~a~~~Las~l~ 73 (559) T protein:vir:95 1 MAE--TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT---SE--VNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) T ss_pred CCh--hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCC---CC--CCcccccccccccchHHHHHHHHHHHHH Confidence 332 2344555555554455544 455555553 222100 00 0111123456777888888888888886 Q ss_pred cC--C----e-eecCCc-----HH-----------HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 87 GN--A----I-AMSGPS-----SD-----------RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 87 g~--p----~-~~~~~~-----~~-----------~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) +- | + ++...+ .. .+...+..++|.....++.++..++|.|.+++-.+..+-.++. . T Consensus 74 ~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~-~ 152 (559) T protein:vir:95 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTM-P 152 (559) T ss_pred HhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEE-E Confidence 52 2 1 232221 11 2334455678999999999999999998766544433322222 2 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccc--------c---------ccceeEEEEEE----EcCCc-----E----- Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVV--------D---------NITQTKYEVET----WTEND-----Y----- 192 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~--------~---------~~~~~~~~ve~----yt~~~-----~----- 192 (513) ++..+.++.-|. .+++...+|.++..... . .......++++ |.... . T Consensus 153 ~~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 230 (559) T protein:vir:95 153 FPIGSYYLANSP--RGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) T ss_pred eecCeEEEeeCC--CCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccc Confidence 444554444443 34566666654332100 0 00000111222 22110 0 Q ss_pred ----EEEEeeccCCccccccccccccCcccceEEec-----CCCCCCcc-hhHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_019916. 193 ----TRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGD-FENVLSLIDLYDVAQSDTANYMTDLNEAML 262 (513) Q Consensus 193 ----~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd-~e~v~~liD~~~~~~S~~~~~~~~~~~~~l 262 (513) +++.. ..++. .. ..+..|..+|++.++ +..+|+|. .+...+-+..+|.+.-..+...+...+|.+ T Consensus 231 pf~s~~~e~-~~~~~-~~---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~ 305 (559) T protein:vir:95 231 PFKSVYYEV-GGDND-KL---LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) T ss_pred eEEEEEEEe-cCCCc-ee---eecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 11111 11111 11 122345567776664 34579994 888999999999998888999999999876 Q ss_pred heecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe-ecCCHHHHHHH Q lcl|NC_019916. 263 VIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH-KEYDSAGTELY 341 (513) Q Consensus 263 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~ 341 (513) .+-+....... ++ ..++...... ......+..+. .+.+...+... T Consensus 306 ~v~~~~~~~~~-----------------~l-----------~pgg~~~~~~------~~~~~~i~p~~~~~~~~~~~~~~ 351 (559) T protein:vir:95 306 VAPTSLKNQRA-----------------SL-----------LPGDITYIDQ------ITGQDGFRPAYLVNPSTADLVAD 351 (559) T ss_pred eccccccccce-----------------ee-----------eccceeeeCC------CCCcccceeecccccchHHHHHH Confidence 65332110000 00 0011111100 01111222221 22345555566 Q ss_pred HHHHHHHHHHHhCcccc--c-cccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccccc----c Q lcl|NC_019916. 342 KKRLAADIHKFSHTPDL--T-DDNFSGNSSGVAMKYKVLGTVEL-ASTKRKQFERGLNQRYTVVAHIEERVNGKW----D 413 (513) Q Consensus 342 ~~~l~~~i~~~s~~p~~--~-~~~~~~n~Sg~Ai~~~~~~l~~k-~~~~~~~f~~~l~~~~~li~~~l~~~~~~~----~ 413 (513) ++.++..|-..-.. +. . ....+...++.-++.....+... -....+.-.+.+.-+++-++.++...+.-. . T Consensus 352 i~~~~~rI~~af~~-d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~ 430 (559) T protein:vir:95 352 IQDTRQIINSAYFV-DLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDV 430 (559) T ss_pred HHHHHHHHHHHhhh-hhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc Confidence 77777766443222 11 1 11112445666554432222211 122222233334444444455554433211 2 Q ss_pred cccceeeEEeCCCCCcCH-HHH-------HHHHHHHhcC-------CCHHHHHHhC---CCCC----CHHHHHHHHHHHH Q lcl|NC_019916. 414 IDPDEIGFIFRDNLPTDD-VAI-------ITALVQAGAQ-------IPQEYLYQYL---PNVT----DADEIVKMMDKQR 471 (513) Q Consensus 414 ~~~~~i~i~f~~~~p~d~-~e~-------a~~~~kl~g~-------iS~et~~~~l---~~v~----D~~~E~~ri~~E~ 471 (513) .....++|++..++-+-. .+. ++.+..++++ +....++..+ -+|+ -.++|++.+++++ T Consensus 431 l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr 510 (559) T protein:vir:95 431 MEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQR 510 (559) T ss_pred ccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHH Confidence 233567777765554311 111 1222222232 2333333222 1222 1356677666655 Q ss_pred HHHHHHhhhhcCCCCCCCC-----CCCCCC----CCCCCCCCCCCCccC Q lcl|NC_019916. 472 KAMLKTYDTKGGLIINGTS-----GNDPED----EGVRGQQGEPEDERT 511 (513) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~-----~~~~~~----~~~~~~~~~~~~~~~ 511 (513) ++.++.++.........+. +....+ .+..+.-+..+.+.. T Consensus 511 ~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 511 AQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred HHHHHHHHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccccCC Confidence 5433322111100000000 000000 000000111111111 No 156 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=97.54 E-value=4.8e-05 Score=44.28 Aligned_cols=391 Identities=10% Similarity=0.051 Sum_probs=158.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC-Cc-----H Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG-PS-----S 97 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~-~~-----~ 97 (513) ..++++.+...... +... .+.|-.... ..............=+.+.....+|+..++-+-+-|+.+-. ++ . T Consensus 1 Mgl~~~~f~~~~~~-~~~~-~~~~~~~~~-~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~ 77 (409) T protein:vir:84 1 MSLFTRIFSGPSEE-RTLT-KISGIPSPA-EDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIPV 77 (409) T ss_pred CchhhhhhcCCCcc-cccc-ccccccccc-chhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccccc Confidence 11222111110000 0000 000100000 00000000000111122344566788887777777886521 11 1 Q ss_pred HHHHHHHH-h-c---CHHHHHHHHHHHHhhCCeEEEEe-eecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeec Q lcl|NC_019916. 98 DRLDDFNR-R-N---DIDTLNYELYLDMTVTGRAYEYV-YRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQ 171 (513) Q Consensus 98 ~~l~~~~~-~-n---~~~~~~~~~~~~a~~~G~~~~~v-~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~ 171 (513) ..+-+++. . | ........+..+.+.+|.||+++ +.+..|.+.-.+.++|..+.+......... .....|. T Consensus 78 ~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~--~~~~~~~-- 153 (409) T protein:vir:84 78 SPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGD--WIEPVYR-- 153 (409) T ss_pred chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcce--EEEEEec-- Confidence 12333332 1 2 23466667888999999999866 466777665556688888766543322111 1111111 Q ss_pred ccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 172 TVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTA 251 (513) Q Consensus 172 ~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~ 251 (513) .. . ..+..+.+++++... +. +...|.|.++.+...++....+..-.. T Consensus 154 -~~---g------~~~~~~dvih~~~~~--------------~~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~ 200 (409) T protein:vir:84 154 -ID---G------KVVPNHRIMHIKRYP--------------VA---------GCALGMSPIEKAASAIGLGLAAERYGL 200 (409) T ss_pred -CC---c------eEEchhhEEEecCCC--------------CC---------cccccccHHHHHHHHHHHHHHHHHHHH Confidence 00 0 123444444432110 00 112467777666665555444443444 Q ss_pred HHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCCceeEEe Q lcl|NC_019916. 252 NYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSADANYIH 330 (513) Q Consensus 252 ~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~ 330 (513) +.+...+.|-.+++..... .... ...++..- ...... ++.+.+ .++++|.. T Consensus 201 ~~f~ng~~p~gil~~~~~l----------~~e~----~~~~~~~~---~~~~~n~g~~~vl-----------~~g~~~~~ 252 (409) T protein:vir:84 201 RWFRDSANPSGILSSDADL----------TPDQ----VKQTQKQW---IQSHHNRRLPAVM-----------SAGIKWQS 252 (409) T ss_pred HHHhcCCCccEEEecCCCC----------CHHH----HHHHHHHH---HHHhccCCCeeec-----------CCCceEEE Confidence 4444444555555432110 0000 11111100 011111 112222 23444544 Q ss_pred ecC--CHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 331 KEY--DSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEER 407 (513) Q Consensus 331 ~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~ 407 (513) ... ....+....+...+.|+..-++|+.-.+... ++.++..++-...... ..++...++.|...+.. T Consensus 253 ~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~----------~~~l~P~~~~ie~~l~~ 322 (409) T protein:vir:84 253 VSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFV----------RHTLLPWLRCIEQALDT 322 (409) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHH----------HHHHHHHHHHHHHHHHH Confidence 443 3344556677788899999999986554322 2222222322211111 12222222222222221 Q ss_pred cccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCC Q lcl|NC_019916. 408 VNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLI 485 (513) Q Consensus 408 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~ 485 (513) .- .....+++.+..-+-.|..+.++++.++ +|+++.-.+.+.++.-.- .+-. ....+..... T Consensus 323 ~L----~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~--~ggD----------~~~~~~n~~~ 386 (409) T protein:vir:84 323 FL----PRGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPI--PEGD----------IHLQPMNFVP 386 (409) T ss_pred hc----cCCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcc----------eeeecccccc Confidence 10 1123456666777778999999998886 578887777776643211 0000 0001111111 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 486 INGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) .+...+.++.++ +..+++.+.++ T Consensus 387 ~~~~~~~~~~~~--~~~~~~~~gn~ 409 (409) T protein:vir:84 387 LGYVPPEEPAQE--PQPNSATEGNK 409 (409) T ss_pred cccCCccccCcC--CCCCCccCCCC Confidence 111111111100 01111111111 No 157 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=97.54 E-value=4.8e-05 Score=44.26 Aligned_cols=407 Identities=13% Similarity=0.058 Sum_probs=181.9 Q ss_pred Cccchhh------ceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchh Q lcl|NC_019916. 1 MIDMQQA------NMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFA 74 (513) Q Consensus 1 ~~~~~~~------~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~ 74 (513) ++..+-+ .....+....+||..+..+++..-.-...++..|.+..+-. .... T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~----------------------D~~i 74 (528) T protein:vir:10 17 LRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEER----------------------DAHL 74 (528) T ss_pred ccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhh----------------------ChHH Confidence 2222211 12222445578888888887765433344444443332111 2344 Q ss_pred HHHHHHHHHHhhcCCeeecCC-----cHHH----HHHHHHh-cCHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeEE-E Q lcl|NC_019916. 75 RYIADFQTSYSVGNAIAMSGP-----SSDR----LDDFNRR-NDIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEVS-V 142 (513) Q Consensus 75 ~~ivd~~~~~l~g~p~~~~~~-----~~~~----l~~~~~~-n~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~~-~ 142 (513) .-.+.+...-+.+.++++... .+.. +++++.. .+|+..... ..+|.-+|.+. +++|.-.+|...+. + T Consensus 75 ~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~~~~g~~~~~~~ 153 (528) T protein:vir:10 75 FAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIELDWSLQGREWLPQAF 153 (528) T ss_pred HHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEEEEEeecCCceeEEEe Confidence 556677777788888887532 1222 4444543 246655544 45577889885 56665444433221 1 Q ss_pred EEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEE Q lcl|NC_019916. 143 KLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIE 222 (513) Q Consensus 143 ~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 222 (513) ..-|...|.+ ++.. .+..-++ . +... -+.++ +++.+=.++ T Consensus 154 ~~r~~~~f~~-~~~~--~~~l~~~-----~--~~~~----g~~l~--------------------------~~k~iv~~~ 193 (528) T protein:vir:10 154 DHRPQSWFQL-NPDD--QDELRLR-----D--NSIA----GEVLQ--------------------------PFGWIMHKP 193 (528) T ss_pred eeecccceee-ccCC--CcEEecc-----C--CCCC----ceeec--------------------------CCCeEEEee Confidence 1112222211 1111 0100000 0 0000 00010 111111111 Q ss_pred e--cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhh Q lcl|NC_019916. 223 Y--RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQL 300 (513) Q Consensus 223 ~--~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 300 (513) - ..+..|.|.+..+-...---+..+.+.+..++.|+.|+++.+=..+... .+.. .| ...+ T Consensus 194 ~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~-----------~ek~---~L----~~al 255 (528) T protein:vir:10 194 RSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPD-----------EEKV---TL----LRAV 255 (528) T ss_pred cCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCH-----------HHHH---HH----HHHH Confidence 0 1233567777776666666677788899999999999988763211110 0000 00 1112 Q ss_pred hcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHH-HHHHHH Q lcl|NC_019916. 301 EAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAM-KYKVLG 378 (513) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai-~~~~~~ 378 (513) ..+..+....+ ..+..++|++.. .....++..++.+.+.|...--.-.++.+...|..+.-|+ +....- T Consensus 256 ~~i~~~~~~ii---------P~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v 326 (528) T protein:vir:10 256 TGLGHAAAGII---------PESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEV 326 (528) T ss_pred HHHhhCcEEEe---------cCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHH Confidence 33333333333 245678888854 5667789999999998877643333322211111111121 111111 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccc-cceeeEEeCCCCCcCHHHHHHHHHHHh--cC-CCHHHHHHh Q lcl|NC_019916. 379 TVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDID-PDEIGFIFRDNLPTDDVAIITALVQAG--AQ-IPQEYLYQY 453 (513) Q Consensus 379 l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~-~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-iS~et~~~~ 453 (513) ....++.-.+.....+. ++++-++.+ +.....+ .....+.|....+.|..+.++++.++. |+ +|.+.+.+. T Consensus 327 ~~di~~aDa~~i~~tln~~li~~l~~~----N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~ 402 (528) T protein:vir:10 327 RHDLLAADARQLAATLSRDLLWPLLVL----NRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQ 402 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHH Confidence 12222333333444443 344444432 2222212 234578899999999999999999884 55 899988888 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCC--CCCCccCCC Q lcl|NC_019916. 454 LPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQG--EPEDERTSD 513 (513) Q Consensus 454 l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 513 (513) ++. ..++.. +.+.. .... ....+.....+.......+.. ...+...-| T Consensus 403 ~gi-p~p~~~-e~~~~---------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 452 (528) T protein:vir:10 403 LGI-PLPANG-EAVLG---------DQAG-AGIAQLSRRPGPRIAALAQVIGPRYRDQEALD 452 (528) T ss_pred hCC-CCCCCC-ccccc---------CCCc-ccccccCcccccccccccccccccccccchHH Confidence 863 322110 00000 0000 000000000000000000000 011111111 No 158 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.49 E-value=5.7e-05 Score=43.83 Aligned_cols=381 Identities=12% Similarity=0.042 Sum_probs=153.1 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhcCCCc--cccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHH Q lcl|NC_019916. 22 RIAAFIRHHYNNQ-RPRLEMLYDYYRGQND--GILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSD 98 (513) Q Consensus 22 ~i~~~i~~~~~~~-~~~~~~~~~YY~G~~~--i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~ 98 (513) .+..+++.....+ .+.......++-...+ +................-+.++-...+|+..++-+-+-|+++...... T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~ 80 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQ 80 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchhh Confidence 2222221110000 0000000001000000 000000000000000000123344556777777777778876544433 Q ss_pred HHH-HHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccccc Q lcl|NC_019916. 99 RLD-DFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNI 177 (513) Q Consensus 99 ~l~-~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~ 177 (513) .|. .=............+..+.+.+|.||+++..+..|.+.-.+.++|..+-+..+... ..+. |........ T Consensus 81 ~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~-~~~~-----y~~~~~~~~- 153 (392) T protein:vir:10 81 GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE-NGMY-----YNITFDDPK- 153 (392) T ss_pred hHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceEE-----EEEEecCcc- Confidence 221 11111123455567788999999999999889888765556678888777665432 1211 111111110 Q ss_pred ceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 178 TQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDL 257 (513) Q Consensus 178 ~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~ 257 (513) ......+..+.+++++... +. ....|.|-++.+...++....+..-....+... T Consensus 154 ---~~~~~~~~~~eiih~~~~~--------------~~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng 207 (392) T protein:vir:10 154 ---IEPILQAPQSDLIHMKLLS--------------ID---------GGKTGISPLYSLRRESKIQRASDRLTISSLNSS 207 (392) T ss_pred ---cceeEEEccccEEEecCCC--------------CC---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 0111233444444432210 00 112466777666666654444443344444444 Q ss_pred hhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc-chhcceeeccccccccccccCCceeEEeecCCHH Q lcl|NC_019916. 258 NEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA-MRQANMILLKTGMAPNGQQTSADANYIHKEYDSA 336 (513) Q Consensus 258 ~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 336 (513) +.|-.+++=....... ........ ..+.. ...++++.+ ..+.+++-+....... T Consensus 208 ~~p~gil~~~~~~~~~--------~~~~~~~~--------~~~~~~~~~g~~~vl---------~~g~~~~~l~~~~~d~ 262 (392) T protein:vir:10 208 LNVPGVLTVKGGGLLS--------DKDKASRS--------RSFMKRSRSGGPVVL---------DDLEEFTALEIKSNVA 262 (392) T ss_pred CCCceEEEeCCCCCch--------HHHHHHHH--------HHHhccccCCCeeec---------CCCceEEEccCChhHH Confidence 4444333210000000 00000000 00100 011122222 2233444444444455 Q ss_pred HHHHHHHHHHHHHHHHhCccccccccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_019916. 337 GTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG-VAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDID 415 (513) Q Consensus 337 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~ 415 (513) .+....+...+.|+..=++|+...+..+.+.|. .+.+ ..+...|...++.+..-+...-. T Consensus 263 ~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~--------------~f~~~~l~P~~~~ie~~l~~~L~----- 323 (392) T protein:vir:10 263 QLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQIS--------------GMYASALNRYLRPAISELEYKLS----- 323 (392) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhcc----- Confidence 667778888899999999997665433222222 1221 12333344444433333322100 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCC Q lcl|NC_019916. 416 PDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYL---PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTS 490 (513) Q Consensus 416 ~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l---~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 490 (513) ..+++......-.|..+.+..+.++ +|+++...+.+.+ ++..| |+.+ .+...+.. T Consensus 324 -~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~--------~e~l~~~~-------- 383 (392) T protein:vir:10 324 -DHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA--------PENTNKKT-------- 383 (392) T ss_pred -ccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch--------hcCCCCCC-------- Confidence 0112222222234556677777775 5788887665543 54432 2211 00111111 Q ss_pred CCCCCCCCCC Q lcl|NC_019916. 491 GNDPEDEGVR 500 (513) Q Consensus 491 ~~~~~~~~~~ 500 (513) + .+++++.+ T Consensus 384 ~-Gd~~~p~p 392 (392) T protein:vir:10 384 T-GQSNEPVP 392 (392) T ss_pred C-CCCCCCCC Confidence 0 11122222 No 159 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.49 E-value=5.7e-05 Score=43.83 Aligned_cols=381 Identities=12% Similarity=0.042 Sum_probs=153.1 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhcCCCc--cccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHH Q lcl|NC_019916. 22 RIAAFIRHHYNNQ-RPRLEMLYDYYRGQND--GILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSD 98 (513) Q Consensus 22 ~i~~~i~~~~~~~-~~~~~~~~~YY~G~~~--i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~ 98 (513) .+..+++.....+ .+.......++-...+ +................-+.++-...+|+..++-+-+-|+++...... T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~ 80 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQ 80 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchhh Confidence 2222221110000 0000000001000000 000000000000000000123344556777777777778876544433 Q ss_pred HHH-HHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccccc Q lcl|NC_019916. 99 RLD-DFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNI 177 (513) Q Consensus 99 ~l~-~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~ 177 (513) .|. .=............+..+.+.+|.||+++..+..|.+.-.+.++|..+-+..+... ..+. |........ T Consensus 81 ~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~-~~~~-----y~~~~~~~~- 153 (392) T protein:vir:39 81 GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE-NGMY-----YNITFDDPK- 153 (392) T ss_pred hHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceEE-----EEEEecCcc- Confidence 221 11111123455567788999999999999889888765556678888777665432 1211 111111110 Q ss_pred ceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 178 TQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDL 257 (513) Q Consensus 178 ~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~ 257 (513) ......+..+.+++++... +. ....|.|-++.+...++....+..-....+... T Consensus 154 ---~~~~~~~~~~eiih~~~~~--------------~~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng 207 (392) T protein:vir:39 154 ---IEPILQAPQSDLIHMKLLS--------------ID---------GGKTGISPLYSLRRESKIQRASDRLTISSLNSS 207 (392) T ss_pred ---cceeEEEccccEEEecCCC--------------CC---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 0111233444444432210 00 112466777666666654444443344444444 Q ss_pred hhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc-chhcceeeccccccccccccCCceeEEeecCCHH Q lcl|NC_019916. 258 NEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA-MRQANMILLKTGMAPNGQQTSADANYIHKEYDSA 336 (513) Q Consensus 258 ~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 336 (513) +.|-.+++=....... ........ ..+.. ...++++.+ ..+.+++-+....... T Consensus 208 ~~p~gil~~~~~~~~~--------~~~~~~~~--------~~~~~~~~~g~~~vl---------~~g~~~~~l~~~~~d~ 262 (392) T protein:vir:39 208 LNVPGVLTVKGGGLLS--------DKDKASRS--------RSFMKRSRSGGPVVL---------DDLEEFTALEIKSNVA 262 (392) T ss_pred CCCceEEEeCCCCCch--------HHHHHHHH--------HHHhccccCCCeeec---------CCCceEEEccCChhHH Confidence 4444333210000000 00000000 00100 011122222 2233444444444455 Q ss_pred HHHHHHHHHHHHHHHHhCccccccccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_019916. 337 GTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG-VAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDID 415 (513) Q Consensus 337 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~ 415 (513) .+....+...+.|+..=++|+...+..+.+.|. .+.+ ..+...|...++.+..-+...-. T Consensus 263 ~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~--------------~f~~~~l~P~~~~ie~~l~~~L~----- 323 (392) T protein:vir:39 263 QLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQIS--------------GMYASALNRYLRPAISELEYKLS----- 323 (392) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhcc----- Confidence 667778888899999999997665433222222 1221 12333344444433333322100 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCC Q lcl|NC_019916. 416 PDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYL---PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTS 490 (513) Q Consensus 416 ~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l---~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 490 (513) ..+++......-.|..+.+..+.++ +|+++...+.+.+ ++..| |+.+ .+...+.. T Consensus 324 -~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~--------~e~l~~~~-------- 383 (392) T protein:vir:39 324 -DHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA--------PENTNKKT-------- 383 (392) T ss_pred -ccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch--------hcCCCCCC-------- Confidence 0112222222234556677777775 5788887665543 54432 2211 00111111 Q ss_pred CCCCCCCCCC Q lcl|NC_019916. 491 GNDPEDEGVR 500 (513) Q Consensus 491 ~~~~~~~~~~ 500 (513) + .+++++.+ T Consensus 384 ~-Gd~~~p~p 392 (392) T protein:vir:39 384 T-GQSNEPVP 392 (392) T ss_pred C-CCCCCCCC Confidence 0 11122222 No 160 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=97.46 E-value=6.2e-05 Score=43.65 Aligned_cols=438 Identities=11% Similarity=0.017 Sum_probs=189.4 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHh---cCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYY---RGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY---~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |.... ..+.+.+-.+.....|.+ +.+.+.+|. .|.- ..... ....+...++..+-....+++.++.|+ T Consensus 1 M~~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~---~~~~~--~~~~~~~~~~~dst~~~a~~~LAa~L~ 74 (555) T protein:vir:10 1 MAEQT-ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRF---FVQDR--NRGEKRHNNILDNTGTRALRVLAAGMM 74 (555) T ss_pred CCCcc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccc---cCCCC--CcchhcccccccccHHHHHHHHHHHHH Confidence 44222 333444444444444433 455555553 2211 01111 111223456777888888888888886 Q ss_pred cC--Ce-----eecCCc-----H-----------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 87 GN--AI-----AMSGPS-----S-----------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 87 g~--p~-----~~~~~~-----~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) +- |+ ++...+ . ..+...+..++|.....++.++..++|.|.+++-.+.++..++. . T Consensus 75 ~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~-~ 153 (555) T protein:vir:10 75 AGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHH-S 153 (555) T ss_pred HhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEE-E Confidence 53 21 222211 1 12334455678999999999999999998876655544332222 2 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccc-------c----------ccceeEEEEEE----EcCCc----------- Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVV-------D----------NITQTKYEVET----WTEND----------- 191 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~-------~----------~~~~~~~~ve~----yt~~~----------- 191 (513) ++..+.++.-|. ..++...+|.++..... . .......++++ |.... T Consensus 154 ~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 231 (555) T protein:vir:10 154 LTAGEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNM 231 (555) T ss_pred eecceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcccc Confidence 444444444443 34566666654332100 0 00001112332 22111 Q ss_pred ---EEEEEeeccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_019916. 192 ---YTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLV 263 (513) Q Consensus 192 ---~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~ 263 (513) .+++.... ++.. . ..+..|..+|++.++ .+.+|+|-.++..+-+-.+|.+.-......+...+|.+. T Consensus 232 p~~s~~~~~~~-d~~~-v---l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 232 AWKSVYFEPGA-DETR-T---LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ceEEEEEEecc-CCcc-c---cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 01111111 1110 0 112345668877764 345799999999999999998777778888888887765 Q ss_pred eecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHH Q lcl|NC_019916. 264 IKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKK 343 (513) Q Consensus 264 ~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 343 (513) +-....... ..+.. ++... ...+...++-.-.+.+..+.......++ T Consensus 307 v~~~~~~~~-----------------~~~~p-----------gg~~~-----v~~g~~~d~~~~~~~~~~d~~~~~~~i~ 353 (555) T protein:vir:10 307 LPVSAKNQD-----------------ISTVP-----------GGLSY-----VDAAAPNGGIRTAFEVNLDLSHLLADIV 353 (555) T ss_pred ecccccccc-----------------ceecc-----------ccccc-----cccCCCCcceecccccccchHHHHHHHH Confidence 432110000 00000 00000 0011111111122223345566677788 Q ss_pred HHHHHHHHHhCccc----cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHhcccc Q lcl|NC_019916. 344 RLAADIHKFSHTPD----LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGL--------NQRYTVVAHIEERVNGK 411 (513) Q Consensus 344 ~l~~~i~~~s~~p~----~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l--------~~~~~li~~~l~~~~~~ 411 (513) .++..|-..- .-+ +... -+...++.-++.. +.+++..+|..+ .-+++-++.++...+.- T Consensus 354 ~~~~rI~~af-~~dlf~~l~~~-~~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~l 424 (555) T protein:vir:10 354 DVRERIKASF-YADLFLMLANG-TNPQMTATEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANIL 424 (555) T ss_pred HHHHHHHHHh-hcchhhhccCC-CCCcccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 8887774432 222 1111 1234566555433 233333344333 33344444555443321 Q ss_pred c----ccccceeeEEeCCCCCcCHH-H-------HHHHHHHHhcC-------CCHHHHH----HhCCCCC----CHHHHH Q lcl|NC_019916. 412 W----DIDPDEIGFIFRDNLPTDDV-A-------IITALVQAGAQ-------IPQEYLY----QYLPNVT----DADEIV 464 (513) Q Consensus 412 ~----~~~~~~i~i~f~~~~p~d~~-e-------~a~~~~kl~g~-------iS~et~~----~~l~~v~----D~~~E~ 464 (513) . ......|+|++..++-+... + .++.+..++++ +....++ ..++ |+ -.++|+ T Consensus 425 P~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~G-vp~~~irs~eev 503 (555) T protein:vir:10 425 PPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLG-IDPELIVPGNQV 503 (555) T ss_pred CCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhC-CCccccCCHHHH Confidence 1 22334566776665543211 1 11222222332 2222222 3332 32 135667 Q ss_pred HHHHHHHHHHHHHhhh--hcCCCC---CCCCCCCCCCC----CCCCCCCCCC Q lcl|NC_019916. 465 KMMDKQRKAMLKTYDT--KGGLII---NGTSGNDPEDE----GVRGQQGEPE 507 (513) Q Consensus 465 ~ri~~E~~~~~~~~~~--~~~~~~---~~~~~~~~~~~----~~~~~~~~~~ 507 (513) +++.+++++.+..... +..... ..-.+.+-..+ +.-...++=+ T Consensus 504 ~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 504 ALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhccCC Confidence 7666655443322111 111100 00000000000 0000011101 No 161 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=97.46 E-value=6.2e-05 Score=43.65 Aligned_cols=438 Identities=11% Similarity=0.017 Sum_probs=189.4 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHh---cCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYY---RGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY---~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |.... ..+.+.+-.+.....|.+ +.+.+.+|. .|.- ..... ....+...++..+-....+++.++.|+ T Consensus 1 M~~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~---~~~~~--~~~~~~~~~~~dst~~~a~~~LAa~L~ 74 (555) T protein:vir:98 1 MAEQT-ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRF---FVQDR--NRGEKRHNNILDNTGTRALRVLAAGMM 74 (555) T ss_pred CCCcc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccc---cCCCC--CcchhcccccccccHHHHHHHHHHHHH Confidence 44222 333444444444444433 455555553 2211 01111 111223456777888888888888886 Q ss_pred cC--Ce-----eecCCc-----H-----------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 87 GN--AI-----AMSGPS-----S-----------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 87 g~--p~-----~~~~~~-----~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) +- |+ ++...+ . ..+...+..++|.....++.++..++|.|.+++-.+.++..++. . T Consensus 75 ~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~-~ 153 (555) T protein:vir:98 75 AGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHH-S 153 (555) T ss_pred HhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEE-E Confidence 53 21 222211 1 12334455678999999999999999998876655544332222 2 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccc-------c----------ccceeEEEEEE----EcCCc----------- Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVV-------D----------NITQTKYEVET----WTEND----------- 191 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~-------~----------~~~~~~~~ve~----yt~~~----------- 191 (513) ++..+.++.-|. ..++...+|.++..... . .......++++ |.... T Consensus 154 ~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 231 (555) T protein:vir:98 154 LTAGEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNM 231 (555) T ss_pred eecceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcccc Confidence 444444444443 34566666654332100 0 00001112332 22111 Q ss_pred ---EEEEEeeccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_019916. 192 ---YTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLV 263 (513) Q Consensus 192 ---~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~ 263 (513) .+++.... ++.. . ..+..|..+|++.++ .+.+|+|-.++..+-+-.+|.+.-......+...+|.+. T Consensus 232 p~~s~~~~~~~-d~~~-v---l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:98 232 AWKSVYFEPGA-DETR-T---LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ceEEEEEEecc-CCcc-c---cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 01111111 1110 0 112345668877764 345799999999999999998777778888888887765 Q ss_pred eecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHH Q lcl|NC_019916. 264 IKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKK 343 (513) Q Consensus 264 ~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 343 (513) +-....... ..+.. ++... ...+...++-.-.+.+..+.......++ T Consensus 307 v~~~~~~~~-----------------~~~~p-----------gg~~~-----v~~g~~~d~~~~~~~~~~d~~~~~~~i~ 353 (555) T protein:vir:98 307 LPVSAKNQD-----------------ISTVP-----------GGLSY-----VDAAAPNGGIRTAFEVNLDLSHLLADIV 353 (555) T ss_pred ecccccccc-----------------ceecc-----------ccccc-----cccCCCCcceecccccccchHHHHHHHH Confidence 432110000 00000 00000 0011111111122223345566677788 Q ss_pred HHHHHHHHHhCccc----cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHhcccc Q lcl|NC_019916. 344 RLAADIHKFSHTPD----LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGL--------NQRYTVVAHIEERVNGK 411 (513) Q Consensus 344 ~l~~~i~~~s~~p~----~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l--------~~~~~li~~~l~~~~~~ 411 (513) .++..|-..- .-+ +... -+...++.-++.. +.+++..+|..+ .-+++-++.++...+.- T Consensus 354 ~~~~rI~~af-~~dlf~~l~~~-~~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~l 424 (555) T protein:vir:98 354 DVRERIKASF-YADLFLMLANG-TNPQMTATEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANIL 424 (555) T ss_pred HHHHHHHHHh-hcchhhhccCC-CCCcccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 8887774432 222 1111 1234566555433 233333344333 33344444555443321 Q ss_pred c----ccccceeeEEeCCCCCcCHH-H-------HHHHHHHHhcC-------CCHHHHH----HhCCCCC----CHHHHH Q lcl|NC_019916. 412 W----DIDPDEIGFIFRDNLPTDDV-A-------IITALVQAGAQ-------IPQEYLY----QYLPNVT----DADEIV 464 (513) Q Consensus 412 ~----~~~~~~i~i~f~~~~p~d~~-e-------~a~~~~kl~g~-------iS~et~~----~~l~~v~----D~~~E~ 464 (513) . ......|+|++..++-+... + .++.+..++++ +....++ ..++ |+ -.++|+ T Consensus 425 P~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~G-vp~~~irs~eev 503 (555) T protein:vir:98 425 PPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLG-IDPELIVPGNQV 503 (555) T ss_pred CCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhC-CCccccCCHHHH Confidence 1 22334566776665543211 1 11222222332 2222222 3332 32 135667 Q ss_pred HHHHHHHHHHHHHhhh--hcCCCC---CCCCCCCCCCC----CCCCCCCCCC Q lcl|NC_019916. 465 KMMDKQRKAMLKTYDT--KGGLII---NGTSGNDPEDE----GVRGQQGEPE 507 (513) Q Consensus 465 ~ri~~E~~~~~~~~~~--~~~~~~---~~~~~~~~~~~----~~~~~~~~~~ 507 (513) +++.+++++.+..... +..... ..-.+.+-..+ +.-...++=+ T Consensus 504 ~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 504 ALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhccCC Confidence 7666655443322111 111100 00000000000 0000011101 No 162 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=97.46 E-value=6.2e-05 Score=43.65 Aligned_cols=438 Identities=11% Similarity=0.017 Sum_probs=189.4 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHh---cCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYY---RGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY---~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |.... ..+.+.+-.+.....|.+ +.+.+.+|. .|.- ..... ....+...++..+-....+++.++.|+ T Consensus 1 M~~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~---~~~~~--~~~~~~~~~~~dst~~~a~~~LAa~L~ 74 (555) T protein:vir:10 1 MAEQT-ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRF---FVQDR--NRGEKRHNNILDNTGTRALRVLAAGMM 74 (555) T ss_pred CCCcc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccc---cCCCC--CcchhcccccccccHHHHHHHHHHHHH Confidence 44222 333444444444444433 455555553 2211 01111 111223456777888888888888886 Q ss_pred cC--Ce-----eecCCc-----H-----------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 87 GN--AI-----AMSGPS-----S-----------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 87 g~--p~-----~~~~~~-----~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) +- |+ ++...+ . ..+...+..++|.....++.++..++|.|.+++-.+.++..++. . T Consensus 75 ~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~-~ 153 (555) T protein:vir:10 75 AGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHH-S 153 (555) T ss_pred HhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEE-E Confidence 53 21 222211 1 12334455678999999999999999998876655544332222 2 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccc-------c----------ccceeEEEEEE----EcCCc----------- Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVV-------D----------NITQTKYEVET----WTEND----------- 191 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~-------~----------~~~~~~~~ve~----yt~~~----------- 191 (513) ++..+.++.-|. ..++...+|.++..... . .......++++ |.... T Consensus 154 ~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 231 (555) T protein:vir:10 154 LTAGEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNM 231 (555) T ss_pred eecceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcccc Confidence 444444444443 34566666654332100 0 00001112332 22111 Q ss_pred ---EEEEEeeccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_019916. 192 ---YTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLV 263 (513) Q Consensus 192 ---~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~ 263 (513) .+++.... ++.. . ..+..|..+|++.++ .+.+|+|-.++..+-+-.+|.+.-......+...+|.+. T Consensus 232 p~~s~~~~~~~-d~~~-v---l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 232 AWKSVYFEPGA-DETR-T---LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ceEEEEEEecc-CCcc-c---cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 01111111 1110 0 112345668877764 345799999999999999998777778888888887765 Q ss_pred eecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHH Q lcl|NC_019916. 264 IKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKK 343 (513) Q Consensus 264 ~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 343 (513) +-....... ..+.. ++... ...+...++-.-.+.+..+.......++ T Consensus 307 v~~~~~~~~-----------------~~~~p-----------gg~~~-----v~~g~~~d~~~~~~~~~~d~~~~~~~i~ 353 (555) T protein:vir:10 307 LPVSAKNQD-----------------ISTVP-----------GGLSY-----VDAAAPNGGIRTAFEVNLDLSHLLADIV 353 (555) T ss_pred ecccccccc-----------------ceecc-----------ccccc-----cccCCCCcceecccccccchHHHHHHHH Confidence 432110000 00000 00000 0011111111122223345566677788 Q ss_pred HHHHHHHHHhCccc----cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHhcccc Q lcl|NC_019916. 344 RLAADIHKFSHTPD----LTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGL--------NQRYTVVAHIEERVNGK 411 (513) Q Consensus 344 ~l~~~i~~~s~~p~----~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l--------~~~~~li~~~l~~~~~~ 411 (513) .++..|-..- .-+ +... -+...++.-++.. +.+++..+|..+ .-+++-++.++...+.- T Consensus 354 ~~~~rI~~af-~~dlf~~l~~~-~~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~l 424 (555) T protein:vir:10 354 DVRERIKASF-YADLFLMLANG-TNPQMTATEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANIL 424 (555) T ss_pred HHHHHHHHHh-hcchhhhccCC-CCCcccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 8887774432 222 1111 1234566555433 233333344333 33344444555443321 Q ss_pred c----ccccceeeEEeCCCCCcCHH-H-------HHHHHHHHhcC-------CCHHHHH----HhCCCCC----CHHHHH Q lcl|NC_019916. 412 W----DIDPDEIGFIFRDNLPTDDV-A-------IITALVQAGAQ-------IPQEYLY----QYLPNVT----DADEIV 464 (513) Q Consensus 412 ~----~~~~~~i~i~f~~~~p~d~~-e-------~a~~~~kl~g~-------iS~et~~----~~l~~v~----D~~~E~ 464 (513) . ......|+|++..++-+... + .++.+..++++ +....++ ..++ |+ -.++|+ T Consensus 425 P~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~G-vp~~~irs~eev 503 (555) T protein:vir:10 425 PPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLG-IDPELIVPGNQV 503 (555) T ss_pred CCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhC-CCccccCCHHHH Confidence 1 22334566776665543211 1 11222222332 2222222 3332 32 135667 Q ss_pred HHHHHHHHHHHHHhhh--hcCCCC---CCCCCCCCCCC----CCCCCCCCCC Q lcl|NC_019916. 465 KMMDKQRKAMLKTYDT--KGGLII---NGTSGNDPEDE----GVRGQQGEPE 507 (513) Q Consensus 465 ~ri~~E~~~~~~~~~~--~~~~~~---~~~~~~~~~~~----~~~~~~~~~~ 507 (513) +++.+++++.+..... +..... ..-.+.+-..+ +.-...++=+ T Consensus 504 ~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 504 ALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhccCC Confidence 7666655443322111 111100 00000000000 0000011101 No 163 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.46 E-value=6.2e-05 Score=43.64 Aligned_cols=415 Identities=8% Similarity=0.016 Sum_probs=161.4 Q ss_pred HHHHH--HH--HHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec---CC-- Q lcl|NC_019916. 25 AFIRH--HY--NNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS---GP-- 95 (513) Q Consensus 25 ~~i~~--~~--~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~---~~-- 95 (513) -+|.- -. .....+.-.+.+-|-+.... ..+...... .....-..++....+|+..+.-+-+-|+.+- .+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-g~~~~~~~~-~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~ 78 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAV-GMQLERQFS-LYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) T ss_pred CcccCceeeccchhhhhhhhhhhccccccee-ceecccccc-hhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCcc Confidence 00000 00 00001111122222221100 000000000 0000001234556677777777777787751 11 Q ss_pred ---cHHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEE Q lcl|NC_019916. 96 ---SSDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYH 168 (513) Q Consensus 96 ---~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~ 168 (513) .+..+..++.. |. .......+..+.+.+|.+|+++-.+..|.+.-.+.++|..+.+..+... .... . +| T Consensus 79 ~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~-~~~~--y-~~ 154 (518) T protein:vir:78 79 TEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-GRYE--Y-YF 154 (518) T ss_pred ccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCC-CEEE--E-EE Confidence 12234445543 32 2345567788889999999999988888766566788888888776532 1111 1 11 Q ss_pred eecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHH Q lcl|NC_019916. 169 AVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQS 248 (513) Q Consensus 169 ~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S 248 (513) . ..+... .....+..+.+++++... +.| ...|.|-+..+...++....+.. T Consensus 155 ~--~~~~~~----~~~~~~~~~eIiHir~~~--------------~dg---------~~~G~Spi~~~~~~i~~~~aa~~ 205 (518) T protein:vir:78 155 Q--AGAGVG----TQLVSFADDEVVPIRFFN--------------PDG---------LERGLSLMESLKSTIFSEDSSRN 205 (518) T ss_pred E--ecCCcc----ceeEEecCCcEEEecCCC--------------CCc---------ccccccHHHHHHHHHHHHHHHHH Confidence 1 111111 111123444444432110 000 01356666555555544444433 Q ss_pred HHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc-chhcceeeccccccccccccCCcee Q lcl|NC_019916. 249 DTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA-MRQANMILLKTGMAPNGQQTSADAN 327 (513) Q Consensus 249 ~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (513) ...+.+...+.|-.+++..... .... ...++..=...+.. ...++++.++ .+.++. T Consensus 206 ~~~~~f~Ng~~p~gvl~~~~~l----------s~e~----~~~~k~~~~~~~~G~~nag~~~vL~---------~G~~~~ 262 (518) T protein:vir:78 206 ATAAMWKNAGRPNLVLRHEKRL----------SPEA----QQRLREQFDRAHAGSSNTGKTMVVE---------EGMEPI 262 (518) T ss_pred HHHHHHhcCCCccEEEecCCCC----------CHHH----HHHHHHHHHHHhcCcccCCceeEcC---------CCceEE Confidence 4444444444555555432110 0000 00110000000000 0112223322 233444 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 328 YIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEER 407 (513) Q Consensus 328 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~ 407 (513) .++.......+....+...+.|+..-++|+...+...+ .+...++.. ....+..++.-.++.|..-+.. T Consensus 263 ~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~-st~sn~e~~----------~~~f~~~tL~P~~~~ie~eln~ 331 (518) T protein:vir:78 263 PLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR-ATFSNISAQ----------MRAFYRDTMAIPIARIQSAMDK 331 (518) T ss_pred eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCC-CCchhHHHH----------HHHHHHHHHHHHHHHHHHHHHH Confidence 44433334445666777788999999999765432221 111111111 1112222333333333222221 Q ss_pred cccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCC--CCCCHHHH-------HHHHHHHHHHHHH Q lcl|NC_019916. 408 VNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLP--NVTDADEI-------VKMMDKQRKAMLK 476 (513) Q Consensus 408 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~--~v~D~~~E-------~~ri~~E~~~~~~ 476 (513) .-.........+++....-+..|..+.++++.++ +|+++.-.+.++++ -++++... +..+..- . T Consensus 332 ~L~~~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~-----~ 406 (518) T protein:vir:78 332 YVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGAT-----P 406 (518) T ss_pred hhcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccc-----c Confidence 1000000112344444566678899999998886 57888877776654 33332111 0011000 0 Q ss_pred HhhhhcCCCCCCCCCCC-C---CCCCCCCCCC--------CCCCccCCC Q lcl|NC_019916. 477 TYDTKGGLIINGTSGND-P---EDEGVRGQQG--------EPEDERTSD 513 (513) Q Consensus 477 ~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~--------~~~~~~~~~ 513 (513) .....++..+..+.+.+ . .++..+++.. .++++...| T Consensus 407 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) T protein:vir:78 407 DGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTE 455 (518) T ss_pred ccccCCCCCCCCCCCCcccccccccCccccCCCCCcccccccccccccc Confidence 00000000000000000 0 0000000000 011111111 No 164 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=97.44 E-value=6.8e-05 Score=43.44 Aligned_cols=332 Identities=11% Similarity=0.059 Sum_probs=136.6 Q ss_pred hhcCCeeecC---CcHHHHHHHHH-h-c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCC Q lcl|NC_019916. 85 SVGNAIAMSG---PSSDRLDDFNR-R-N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRS 156 (513) Q Consensus 85 l~g~p~~~~~---~~~~~l~~~~~-~-n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~ 156 (513) +-.-|+.+.. ..+..+.+++. . | .-......+...++.+|.||+++-.+..|.+.-.+.++|..+-+..++. T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 2233444311 11222333332 1 3 2334456677888999999999988888876555667888777766543 Q ss_pred CCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHH Q lcl|NC_019916. 157 VNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENV 236 (513) Q Consensus 157 ~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v 236 (513) . ..+ +|......+ . ...|.+..+++++... +. +...|.|.++.+ T Consensus 81 ~-~~~-----~y~~~~~~g----~---~~~~~~~eiih~r~~~--------------~~---------~~~~G~s~~~~~ 124 (348) T protein:vir:93 81 S-REL-----YYSIHAATG----N---KLIVHNMDMLHFKHIV--------------AS---------NMVQGISPIDVL 124 (348) T ss_pred C-cEE-----EEEEEcCCC----e---EEEEccccEEEecCCC--------------CC---------CceeeccHHHHH Confidence 2 111 111111110 0 1124444444432210 00 111355655555 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch--hcceeecccc Q lcl|NC_019916. 237 LSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR--QANMILLKTG 314 (513) Q Consensus 237 ~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~ 314 (513) ...++..+.+... .+..+..+-.++.-.. ....... ...+.. .+...- .++++. T Consensus 125 ~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~---------~~l~~e~----~~~~~~----~~~~~~~n~~~~~v---- 180 (348) T protein:vir:93 125 KNTTDFDNAVRTF---NLTEMQKPDSFMLKYG---------SNVSTEK----RQQVLE----DFKQYYEENGGILF---- 180 (348) T ss_pred HHHHHHHHHHHHH---HHHhcCCCceeEEecC---------CCCCHHH----HHHHHH----HHHHHhhcCCCeee---- Confidence 5544433322111 1222222211111000 0000111 011110 011100 111222 Q ss_pred ccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 315 MAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGL 394 (513) Q Consensus 315 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l 394 (513) ...+.++..++.+.....+....+...+.|+..-++|+...+.. +..+...++.... ..+...+ T Consensus 181 -----l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~-~~~~~~~~e~~~~----------~~~~~~l 244 (348) T protein:vir:93 181 -----QEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNAR-SNTNFAKNEELNR----------FYLQHTL 244 (348) T ss_pred -----cCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCCcccHHHHHH----------HHHHHHH Confidence 12333444444444444566677788899999999997655422 2222222222111 1122233 Q ss_pred HHHHHHHHHHHHhcc-cccc-cccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCC--CCHHHHHHHHH Q lcl|NC_019916. 395 NQRYTVVAHIEERVN-GKWD-IDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNV--TDADEIVKMMD 468 (513) Q Consensus 395 ~~~~~li~~~l~~~~-~~~~-~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v--~D~~~E~~ri~ 468 (513) .-+++.+...+...= ...+ .....+++.+..-+-.|..+.++++.++ +|+++.-.+.+.++.- ++-+.=+ + T Consensus 245 ~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~--~- 321 (348) T protein:vir:93 245 LPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL--I- 321 (348) T ss_pred HHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcCeEe--e- Confidence 433333333332210 0000 1122345555566667888899988887 5788887777776531 1100000 0 Q ss_pred HHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 469 KQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 469 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) .....+. +.....+....+++++.+++ T Consensus 322 ------~~n~~~~-~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 322 ------SGDLYPI-DTPLELRKSLKGGDKNVNES 348 (348) T ss_pred ------ccccccc-ccchhhcccccCCCCCcCCC Confidence 0000000 00000111111111111111 No 165 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.36 E-value=8.6e-05 Score=42.86 Aligned_cols=418 Identities=9% Similarity=0.017 Sum_probs=160.8 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM 92 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~ 92 (513) |-.. ..+. ++. -....+.-.+.+.|-+.... ......... .....-..++....+|+..+.-+-+-|+.+ T Consensus 1 ~~~~--~~~~----~~~--p~~~e~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~a~~~~~V~acV~~IA~~iA~lpl~l 70 (518) T protein:vir:10 1 MLLA--NGQT----LSA--PAMAELSPQMQDSYYYAPAV-GMQLERQFS-LYGGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) T ss_pred Cccc--Ccee----ecC--chhhhhhhhhhccccccccc-ceecccccc-hhhHHHhhhHHHHHHHHHHHHhhccCceEE Confidence 0000 0000 000 00000011111112111100 000000000 000000123455667777777777777764 Q ss_pred ---cCCc-----HHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcc Q lcl|NC_019916. 93 ---SGPS-----SDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPK 160 (513) Q Consensus 93 ---~~~~-----~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~ 160 (513) +.+. +..+..++.. |. .......+..+.+.+|.||+++-.+.+|.+.-.+.++|..+.+..+... .. T Consensus 71 ~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~-~~ 149 (518) T protein:vir:10 71 MFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-GR 149 (518) T ss_pred EEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCC-CE Confidence 1111 2234445543 32 2345566778899999999999988888765556788888888776532 11 Q ss_pred eEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHH Q lcl|NC_019916. 161 PIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLI 240 (513) Q Consensus 161 ~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~li 240 (513) +. |......+.. ...-.+....+++++... +.| ...|.|.+..+...+ T Consensus 150 ~~-----y~~~~~~~~~----~~~~~~~~~eViHir~~s--------------~dg---------~~~G~spi~~a~~~i 197 (518) T protein:vir:10 150 YE-----YYFQAGAGVG----TQLVSFADDEVVPIRFFN--------------PDG---------LERGLSLMESLKSTI 197 (518) T ss_pred EE-----EEEEecCCcc----ceEEEecCCcEEEecCCC--------------CCc---------ccccccHHHHHHHHH Confidence 11 1111111111 111233444554432210 000 013566665555544 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc-chhcceeeccccccccc Q lcl|NC_019916. 241 DLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA-MRQANMILLKTGMAPNG 319 (513) Q Consensus 241 D~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 319 (513) .....+.....+.+...+.|-.+++..... ... ....++..=...+.. ...++++.+ T Consensus 198 ~~~~a~~~~~~~~f~ng~~p~gil~~~~~l----------s~e----~~~~~k~~~~~~~~G~~nag~v~vL-------- 255 (518) T protein:vir:10 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRL----------SEA----AQQRLREQFDRAHSGSSNTGKTMVV-------- 255 (518) T ss_pred HHHHHHHHHHHHHHhcCCCccEEEecCCCC----------CHH----HHHHHHHHHHHHhcCccccCcceEc-------- Confidence 444443333344444444454444432110 000 001111100000100 011122332 Q ss_pred cccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRY 398 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~ 398 (513) ..+.++..++.......+....+...+.|+..-++|+...+... ++-|. ++... ...+..++.-.+ T Consensus 256 -~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn--~eq~~----------~~f~~~tL~P~l 322 (518) T protein:vir:10 256 -EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSN--ISAQM----------RAFYRDTMAIPI 322 (518) T ss_pred -CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchh--HHHHH----------HHHHHHHHHHHH Confidence 22334444443333444666677788899999999976654222 21122 11111 112222333333 Q ss_pred HHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHH-------HHHH Q lcl|NC_019916. 399 TVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEI-------VKMM 467 (513) Q Consensus 399 ~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E-------~~ri 467 (513) +.|..-+...-.........+++....-+..|..+.++++.++ +|+++.-.+.++++. ++++... +..+ T Consensus 323 ~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl 402 (518) T protein:vir:10 323 ARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPL 402 (518) T ss_pred HHHHHHHHHhhcccccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceec Confidence 3333222221100001112344444566678899999988876 578888777776643 3322111 0111 Q ss_pred HHHHHHHHHHhhhhcCCCCCCCCCCC-CC---CCCCCCC--------CCCCCCccCCC Q lcl|NC_019916. 468 DKQRKAMLKTYDTKGGLIINGTSGND-PE---DEGVRGQ--------QGEPEDERTSD 513 (513) Q Consensus 468 ~~E~~~~~~~~~~~~~~~~~~~~~~~-~~---~~~~~~~--------~~~~~~~~~~~ 513 (513) ..- ......++..+..+.+.+ .. ++..+++ .+.+++....| T Consensus 403 ~~~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) T protein:vir:10 403 GAT-----PDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTE 455 (518) T ss_pred ccc-----cccccCCCCCCCCCCCCccccccccccccccCCCCCcccccccccccccc Confidence 000 000000000000000000 00 0000000 00111111111 No 166 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=97.34 E-value=9.1e-05 Score=42.75 Aligned_cols=392 Identities=11% Similarity=0.017 Sum_probs=162.5 Q ss_pred HHHHHHHHHH----HHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec---CCc Q lcl|NC_019916. 24 AAFIRHHYNN----QRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS---GPS 96 (513) Q Consensus 24 ~~~i~~~~~~----~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~---~~~ 96 (513) ..+++..+.+ .......+.+.+.+..+.. ..... .+..-+...-...+|+..+.-+-+-|+++- .+. T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~--~g~~v----~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~ 74 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTY--TGKQI----SSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSL 74 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCcccc--CCcee----chhhhhccHHHHHHHHHHHHHhccCceEEEEecCCc Confidence 1111110000 0000111222222111000 00000 000112234455677777777777787652 111 Q ss_pred -----HHHHHHHHH--hc---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEE Q lcl|NC_019916. 97 -----SDRLDDFNR--RN---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVR 166 (513) Q Consensus 97 -----~~~l~~~~~--~n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir 166 (513) +..+..++. =| ........+..+++.+|.||+++..+ +|.+.-.+.++|..+.+.+++.. .+.+ T Consensus 75 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~--~~~y--- 148 (414) T protein:vir:44 75 KQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSW--EPVY--- 148 (414) T ss_pred eeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCC--cEEE--- Confidence 122334332 13 34456667888999999999888765 56654456688988888777543 2221 Q ss_pred EEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHH Q lcl|NC_019916. 167 YHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVA 246 (513) Q Consensus 167 ~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~ 246 (513) ......+ ....+.++.+++++... + +...|.|-+..+...++....+ T Consensus 149 --~~~~~~g-------~~~~~~~~evih~~~~~---------------~---------d~~~G~s~i~~~~~~i~~~~~~ 195 (414) T protein:vir:44 149 --QVTFPDG-------STDVLSQEDIWHVRTLT---------------L---------DGLVGLNPIAYAREAISLAAAT 195 (414) T ss_pred --EEEecCc-------eEEEEccccEEEecCCC---------------C---------CCcccccHHHHHHHHHHHHHHH Confidence 1111111 01234455555443110 0 1114667666666656554444 Q ss_pred HHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCCc Q lcl|NC_019916. 247 QSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSAD 325 (513) Q Consensus 247 ~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 325 (513) ..-..+.+...+.|-.+++-.... .. +....++..=........ .++.+.+ ..+.+ T Consensus 196 ~~~~~~~f~ng~~p~gil~~~~~l-----------~~---e~~~~~~~~~~~~~~g~~n~~~~~vl---------~~g~~ 252 (414) T protein:vir:44 196 EEHGARLFSNGAVTSGVLRTEQTL-----------SD---QAYERLKKDFEERHTGLGNAHRPMIL---------EMGLD 252 (414) T ss_pred HHHHHHHHhccCCCceEEEeCCCC-----------CH---HHHHHHHHHHHHHhcCccccCcceec---------CCCce Confidence 444444444444455444432110 00 001111110000010000 0112222 22233 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 326 ANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 326 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) ++.++.......+....+...+.|+..-++|+...+... ++-|. ++-+ ....+..+++..++.+... T Consensus 253 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n--~e~~----------~~~~~~~~l~P~~~~ie~~ 320 (414) T protein:vir:44 253 WKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNN--IEEL----------GLGFINYSLVPYLTRIEQR 320 (414) T ss_pred EEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHH----------HHHHHHHHHHHHHHHHHHH Confidence 444433333444556677778899999999976544321 22121 1111 1122233444444444333 Q ss_pred HHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019916. 405 EERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTK 481 (513) Q Consensus 405 l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~ 481 (513) +...- .........+++.+...+..|..+.++++.++ +|+++.-.+.+.++.-.-+. - . ....+. T Consensus 321 ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g--g-------D---~~~~~~ 388 (414) T protein:vir:44 321 INTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPG--G-------D---VYLTPM 388 (414) T ss_pred HHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--c-------c---eecccc Confidence 33211 11111122344444566667889999998887 57888877777765311000 0 0 000000 Q ss_pred cCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 482 GGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) . .. ..+.++ .+...+++...+++.++ T Consensus 389 n--~~--~~~~~~-~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 389 N--MT--TKPSDG-SKAGKQKDNANADETTS 414 (414) T ss_pred c--cc--ccCCcc-ccCCCCCCCCCCCCCCC Confidence 0 00 001111 11111122222222223 No 167 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.30 E-value=0.0001 Score=42.50 Aligned_cols=399 Identities=10% Similarity=0.026 Sum_probs=157.3 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee-c-- Q lcl|NC_019916. 17 KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM-S-- 93 (513) Q Consensus 17 ~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~-~-- 93 (513) .+-+.++.+- .+-... -.-+..+.....+.. ... .....+..-+.+.-...+|+..+.-+-+-|+.+ . T Consensus 1 m~~~~~~~~~-~~~~s~-~~~w~~~~~~~~~~~----~~~---g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~ 71 (421) T protein:vir:10 1 MFIPQMFEGK-KRSVSG-GGFWEAMLGGVRSSH----SKA---GVMITPETALALSAVRACVTLLAESVAQLPVELYRRD 71 (421) T ss_pred CCCcchhccc-ccccCc-chhhHHHhhhhccCc----ccC---CceechHHhhccHHHHHHHHHHHHhhccCceEEEEEc Confidence 1111111000 000000 000111111111110 000 000000111223344557777777777778774 1 Q ss_pred CCc------HHHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceE Q lcl|NC_019916. 94 GPS------SDRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPI 162 (513) Q Consensus 94 ~~~------~~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~ 162 (513) .+. +..+..++. . |. .......+..+.+.+|.||+++-.+.+|.+.-.+.++|..+.+..++.. T Consensus 72 ~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g----- 146 (421) T protein:vir:10 72 KNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDG----- 146 (421) T ss_pred CCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCc----- Confidence 111 122334332 2 32 3345566788999999999999888888776666778887776655431 Q ss_pred EEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHH Q lcl|NC_019916. 163 MAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDL 242 (513) Q Consensus 163 ~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~ 242 (513) ..+|..... .. .+..+.+++++.. + .+...|.|-++.+...++. T Consensus 147 --~~~y~~~~~----g~------~~~~~eiih~~~~--------------------~----~d~~~G~spi~~~~~~i~~ 190 (421) T protein:vir:10 147 --MPYYEIPEI----GE------TLPMRMMHHVKVF--------------------S----LDGYIGSSPIQTNADVLGL 190 (421) T ss_pred --eEEEEEcCC----Cc------EEchhhEEEecCc--------------------C----CCCcccccHHHHHHHHHHH Confidence 122322110 00 1222233322110 0 0112366666655555544 Q ss_pred HHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc-hhcceeeccccccccccc Q lcl|NC_019916. 243 YDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM-RQANMILLKTGMAPNGQQ 321 (513) Q Consensus 243 ~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 321 (513) ...+..-..+.+...+.|-.+++-...... ....+....+...-....... ..++++.+ . T Consensus 191 ~~~~~~~~~~~f~ng~~~~gil~~~~~~~~----------~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl---------~ 251 (421) T protein:vir:10 191 NLAVEEHASAVFRRGATMSGVIERPKEAPA----------IKSQEKIDQLLAKWTDRYSGINNMFSVALL---------Q 251 (421) T ss_pred HHHHHHHHHHHHhcCCCccEEEEecCccCc----------cCCHHHHHHHHHHHHHHhcCccccCcceec---------C Confidence 333333333333433444444432110000 000000000000000000000 01122222 2 Q ss_pred cCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 322 TSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTV 400 (513) Q Consensus 322 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~l 400 (513) .+.++.-++.......+....+...+.|+..-++|+...+... ++-|. ++- .....+...|...++. T Consensus 252 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~----------~~~~f~~~tl~P~~~~ 319 (421) T protein:vir:10 252 EGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNN--IEH----------QGLQFVMYTLLAWLKR 319 (421) T ss_pred CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCcccc--HHH----------HHHHHHHHHHHHHHHH Confidence 2233333433334445566777788899999999976543222 11121 111 1112223344444444 Q ss_pred HHHHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHH Q lcl|NC_019916. 401 VAHIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAML 475 (513) Q Consensus 401 i~~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~ 475 (513) +...+...- .........+++.+..-+..|..+.++++.++ +|+++.-.+.+.++. +++-+ T Consensus 320 ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD-------------- 385 (421) T protein:vir:10 320 HEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGD-------------- 385 (421) T ss_pred HHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-------------- Confidence 433333211 11111112344444555667889999988886 678888777777643 22111 Q ss_pred HHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 476 KTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ....+..........+.++ ...+.++.+.+.-.+| T Consensus 386 ~~~~~~n~~~~~~~~~~~~---~~~~~~~~e~d~~~~~ 420 (421) T protein:vir:10 386 KYLTPLNMVDSAQIIPGDK---KPTAQQMAEIDTILSR 420 (421) T ss_pred eeeeccccccccccccCCC---CcccccCccccccccc Confidence 0011111111111111111 1111111112222222 No 168 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=97.30 E-value=0.0001 Score=42.49 Aligned_cols=424 Identities=9% Similarity=0.020 Sum_probs=154.0 Q ss_pred CccchhhceeccCC-------cccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee--ec Q lcl|NC_019916. 1 MIDMQQANMNYQED-------ADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA--VH 71 (513) Q Consensus 1 ~~~~~~~~~~~~~~-------~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri--~~ 71 (513) -|-|=.|-..-+.. ..+...-.+.++- ......-+ |.++... ++. ......++ .. T Consensus 33 ~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g----------~~~~~~~~-g~~~~~e-pp~----d~~~l~~l~~~n 96 (648) T protein:vir:79 33 SMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIG----------LAIMDGGG-GGRDFEE-PEF----DFNEITSAYNTE 96 (648) T ss_pred ccccCCCccccCCCCcccccccccchhHHHHHhH----------HHHHhhcC-Ccccccc-CCc----CHHHHHHHHhcC Confidence 11111111111111 1111111111100 00000111 3333211 100 00001111 24 Q ss_pred chhHHHHHHHHHHhhcCCeeecCCcHHHH-----HHHH-Hhc---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEE Q lcl|NC_019916. 72 SFARYIADFQTSYSVGNAIAMSGPSSDRL-----DDFN-RRN---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSV 142 (513) Q Consensus 72 n~~~~ivd~~~~~l~g~p~~~~~~~~~~l-----~~~~-~~n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~ 142 (513) ++.+..|+..+.-+.+.|+.+..+++... ..++ .-| ........+..+.+.+|.||+.+-.+.+|...... T Consensus 97 p~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l 176 (648) T protein:vir:79 97 GYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGM 176 (648) T ss_pred hHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhh Confidence 66777888888888888887755443221 1111 222 34456677888999999999998888777432111 Q ss_pred ---------------EEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccc Q lcl|NC_019916. 143 ---------------KLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTL 207 (513) Q Consensus 143 ---------------~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~ 207 (513) .++|..+.+..++. ..+..|.....++.. T Consensus 177 ~~~~~~~~~~v~~l~pl~p~~v~v~~d~~---------------------------------g~~~~Y~y~~~g~~~--- 220 (648) T protein:vir:79 177 NVMGVGDSMPVAGYFPLNLASMKVKRDKF---------------------------------GMIKGWQQEQEGQDK--- 220 (648) T ss_pred hhhhhccccceeeeEeecCceeEEEEcCC---------------------------------CceeeeEEEecCCce--- Confidence 11222222211110 000111111111000 Q ss_pred cccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccccccc Q lcl|NC_019916. 208 EVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVD 282 (513) Q Consensus 208 ~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~ 282 (513) ...+..=.|++|+. ...|.|.+..+...|+....+.......+...+.|-.+++-.... .. . T Consensus 221 ----~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~-----~~----~ 287 (648) T protein:vir:79 221 ----PQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQ-----EG----F 287 (648) T ss_pred ----eEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCc-----cc----h Confidence 00011112444432 225778777766666555554444445555555555444311000 00 0 Q ss_pred chhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeec--C--CHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_019916. 283 PSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE--Y--DSAGTELYKKRLAADIHKFSHTPDL 358 (513) Q Consensus 283 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~--~--~~~~~~~~~~~l~~~i~~~s~~p~~ 358 (513) ...... ...+.... .++. +..+ ....+.+..+ . ....+....+...+.|+..-++|++ T Consensus 288 e~~k~~--------~e~~~~~~-~~~~-i~gg--------~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~ 349 (648) T protein:vir:79 288 GAEEGE--------VDLVRGEV-ENMD-VEGG--------MVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSEL 349 (648) T ss_pred HHHHHH--------HHHHHHhc-cccc-cccc--------ccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHh Confidence 000000 00011100 1111 1111 1111111111 1 1223566677888999999999987 Q ss_pred cccccc-cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHH Q lcl|NC_019916. 359 TDDNFS-GN-SSGVAMKYKVLGTVELASTKRKQFERGLNQR-YTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAII 435 (513) Q Consensus 359 ~~~~~~-~n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~-~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a 435 (513) ..+... ++ ..+.+....+... +.-.+..+...+... ++.++ +-.........+ ..+++.|+.....|....+ T Consensus 350 lLG~~~~ss~stae~~~~~~~~~---i~~l~~~i~~~le~~~~~~ll-~e~~l~~~l~~d-~~ieF~~~~Llr~D~~~~a 424 (648) T protein:vir:79 350 MMGRGGTASRSTGDNLSSDFKDR---IKALQKVMATFINEFMVKEIL-MEGGFDPVLNPD-DKVEFRFNEIDMDSKIKLE 424 (648) T ss_pred HcccCCCccchHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-hhhhcccccccc-ceEEEeecccchhhHHHHH Confidence 654322 11 2233332222211 111111122222111 11110 000011111111 2467788887788888888 Q ss_pred HHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCC-CCCCCCCCCCCCcc Q lcl|NC_019916. 436 TALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPED-EGVRGQQGEPEDER 510 (513) Q Consensus 436 ~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 510 (513) +.+.++ +|++|.-.+.++++. +++..... .+....-..................+..+.+ +....+.+.++..+ T Consensus 425 ~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~ 503 (648) T protein:vir:79 425 NQAVFLYEHNAISEDEMRELIGRDPVDDGEGRA-KMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPT 503 (648) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcc-ccccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCC Confidence 888775 689999888877643 33211110 0100000000000000000000000000000 00000001111100 Q ss_pred CCC Q lcl|NC_019916. 511 TSD 513 (513) Q Consensus 511 ~~~ 513 (513) ++. T Consensus 504 ~~~ 506 (648) T protein:vir:79 504 NQH 506 (648) T ss_pred CCC Confidence 000 No 169 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.24 E-value=0.00012 Score=42.11 Aligned_cols=403 Identities=13% Similarity=0.058 Sum_probs=182.7 Q ss_pred Cccchhhc------eeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchh Q lcl|NC_019916. 1 MIDMQQAN------MNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFA 74 (513) Q Consensus 1 ~~~~~~~~------~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~ 74 (513) ++..+-+- ....+....+||..+..+|+..-.-...++..|.+..+-+ .... T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~----------------------D~~i 74 (526) T protein:vir:99 17 LREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEER----------------------DAHL 74 (526) T ss_pred ccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhh----------------------ChHH Confidence 21111111 1122344578888888887765333333333333322110 2334 Q ss_pred HHHHHHHHHHhhcCCeeecCC-----cH----HHHHHHHHhc-CHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeEE-E Q lcl|NC_019916. 75 RYIADFQTSYSVGNAIAMSGP-----SS----DRLDDFNRRN-DIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEVS-V 142 (513) Q Consensus 75 ~~ivd~~~~~l~g~p~~~~~~-----~~----~~l~~~~~~n-~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~~-~ 142 (513) .-.+.+...-+.+.++++... .+ +.+++++... +|......+. +|.-+|.+. +++|.-.+|...+. + T Consensus 75 ~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l 153 (526) T protein:vir:99 75 FAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAF 153 (526) T ss_pred HHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEe Confidence 445666667777888877432 11 2355555442 5777666665 688899885 56665444433221 1 Q ss_pred EEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccccccc-ccCcccceE Q lcl|NC_019916. 143 KLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEH-SAQFGFPMI 221 (513) Q Consensus 143 ~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~vPvv 221 (513) ..-|...|. |+.... .. +|+ +. .+. .+.+ .+++.|-.+ T Consensus 154 ~~r~~~~f~-~~~~~~--~~--l~~----------------------------~~--~~~------~g~~l~~~k~i~~~ 192 (526) T protein:vir:99 154 HHRPQSWFQ-LNPEDQ--NE--LRL----------------------------RD--NSP------AGEALQPFGWIIHR 192 (526) T ss_pred eeeccccee-eccCCC--cE--EEe----------------------------cC--CCC------CceeecCCCeEEEe Confidence 111222222 121111 00 000 00 000 0000 111111111 Q ss_pred Eec--CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 222 EYR--NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 222 ~~~--n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) +-. .+..|.|.+..+-...=-=+..+.+.+..++.|+.|+++.+=..+... .+. ..-... T Consensus 193 ~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~-----------~ek-------~~L~~a 254 (526) T protein:vir:99 193 PRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTAD-----------EEK-------ATLLRA 254 (526) T ss_pred ecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCH-----------HHH-------HHHHHH Confidence 111 233566777766665555566788899999999999988763211110 000 000111 Q ss_pred hhcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHH-HHHHH Q lcl|NC_019916. 300 LEAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAM-KYKVL 377 (513) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai-~~~~~ 377 (513) +..+..+....+ ..+..++|++.. .....++..++.+.+.|...--.-.++.+...|+.+.-|+ +.... T Consensus 255 v~~i~~d~~~ii---------P~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~ 325 (526) T protein:vir:99 255 VTGLGHAAAGII---------PETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNE 325 (526) T ss_pred HHHHhhCcEEEe---------cCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHH Confidence 233333333333 345678888853 5567789999999999977633223332211122222222 11111 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccc-cceeeEEeCCCCCcCHHHHHHHHHHHh--cC-CCHHHHHH Q lcl|NC_019916. 378 GTVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDID-PDEIGFIFRDNLPTDDVAIITALVQAG--AQ-IPQEYLYQ 452 (513) Q Consensus 378 ~l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~-~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-iS~et~~~ 452 (513) -....++.-.+.....+. ++++.++.+ +...... .....++|....+.|..+.++.+.++. |+ +|.+.+.+ T Consensus 326 v~~di~~aDa~~i~~tln~~Li~~l~~~----N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e 401 (526) T protein:vir:99 326 VRHDLLASDARQLAATLSRDLLWPLLVL----NRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYD 401 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHH Confidence 122222233334444453 355544443 2221111 234578899999999999999999885 55 89999999 Q ss_pred hCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 453 YLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 453 ~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .++. ..+... +.-.. +..... .+....+...........+. +.+++ T Consensus 402 ~~Gi-p~~~~~--------e~~l~---~~~~~~--~~~~~~~~~~~~~~~~~~~~-~~~~~ 447 (526) T protein:vir:99 402 KLGI-PQPAKN--------EPVLR---SAAQPA--ILSRQHGQRVAALATIVGPR-YGDQQ 447 (526) T ss_pred HhCC-CCCCCc--------ccccC---CCCCCc--cccccccccccccccccccc-Ccchh Confidence 8864 322210 00000 000000 00000111000000011111 11111 No 170 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.20 E-value=0.00013 Score=41.83 Aligned_cols=414 Identities=11% Similarity=-0.006 Sum_probs=148.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhc----C---CCccccccccccCCCCCCcce----e-ecchhHHHHHHHHHHh Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYR----G---QNDGILSPASRRNEKGKADHR----A-VHSFARYIADFQTSYS 85 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~----G---~~~i~~~~~~~~~~~~~~~~r----i-~~n~~~~ivd~~~~~l 85 (513) |+.+ .++...|... ..+....++.|. | +.+.......+.. ...... . ....++.||+..+.-. T Consensus 1 ~~~~--~~~~~~~~~~-~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~--~~~~~~l~~~Yr~~~ia~~iVd~~~d~~ 75 (449) T protein:vir:10 1 MTDK--LTLAVNHALN-DARMARARMGLMVPTMGLDNKRHSAWCEYGFPE--LVTYENLYSLYRRGGIAHGAVEKLVGKC 75 (449) T ss_pred Cchh--hHHHHhhhcc-hhHHHHHHHHHHHHHhcCCcccchhhhhcCCcc--cCCHHHHHHHHhcCchhHHHHHhhhhhh Confidence 3433 2222222211 112222222211 1 1111111110000 000000 1 1345667888887755 Q ss_pred hcCCee-ecCCcH------HHHHHHHH---hcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecC Q lcl|NC_019916. 86 VGNAIA-MSGPSS------DRLDDFNR---RNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDR 155 (513) Q Consensus 86 ~g~p~~-~~~~~~------~~l~~~~~---~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~ 155 (513) .-+.+. +.+.+. ..++.-++ .+++.....++.+.+..+|.|++++..++ ++.. ..++.+. T Consensus 76 ~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d-~~~l-~~Pl~~~-------- 145 (449) T protein:vir:10 76 WQTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIRD-EKDW-NLPATKG-------- 145 (449) T ss_pred hhcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEecC-CCCC-CcccccC-------- Confidence 333222 222211 12233232 13455667788888888999988776643 3321 1112211 Q ss_pred CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhH Q lcl|NC_019916. 156 SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFEN 235 (513) Q Consensus 156 ~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~ 235 (513) ..+....-+|...............-..+-++ .|++.....++.. ....-|+-..+.++..+ ..|.|.++. T Consensus 146 ---~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~-~y~v~~~~~g~~~---~~~~iH~SRl~~~~~~~--~~g~~~L~~ 216 (449) T protein:vir:10 146 ---RGLQKVSVSWAGSLKVAEWDTGINSKTYGQPK-LWKYTERLPNGSS---RRVDIHPDRVFILGDYS--EDAIGFLEP 216 (449) T ss_pred ---cceeeEEeeccccCChhhhhcCCCCCCCCCce-EEEEeeeccCCCc---cceeeccceeEeecCCC--CCChhHHHH Confidence 11111111221110000000000000011111 1111111111110 01112333323332221 124455554 Q ss_pred HHHHHHHHHHHHHH-----HHHHHHHhhhh---hhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcc Q lcl|NC_019916. 236 VLSLIDLYDVAQSD-----TANYMTDLNEA---MLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQAN 307 (513) Q Consensus 236 v~~liD~~~~~~S~-----~~~~~~~~~~~---~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 307 (513) +-.-+-.++++.-. +.+......-. ..-++|...... .........+ ...+..+..+. T Consensus 217 ~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~----------~~~e~~~~~~----~~~~~~~~~~~ 282 (449) T protein:vir:10 217 AYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYG----------VSIDELQDKF----NEVAGEINRGN 282 (449) T ss_pred HHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhh----------CCchHHHHHH----HHHHHHHhccc Confidence 32211111221101 11111111100 001111110000 0000000000 00111111110 Q ss_pred eeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccc-c-cccc-cccHHHHHHHHHHHHHHHH Q lcl|NC_019916. 308 MILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTD-D-NFSG-NSSGVAMKYKVLGTVELAS 384 (513) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~-~~~~-n~Sg~Ai~~~~~~l~~k~~ 384 (513) ...... .+-+|-+.+.+.+.....++...+.++..+++|-.-+ + ..+| |.+++ ++. |... +. T Consensus 283 --------~~~~i~--~~~d~~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D-~~n-yyd~---i~ 347 (449) T protein:vir:10 283 --------DVLMTT--QGATVTPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTED-QKY-FNAR---CQ 347 (449) T ss_pred --------hheeec--CCcceEEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccchh-HHH-HHHH---HH Confidence 000111 2224556667788888899999999999999995432 2 2223 23333 443 2323 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHh--CCCCCCHHH Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQY--LPNVTDADE 462 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~--l~~v~D~~~ 462 (513) .++..++..|++++.+++.. ... +.+ .++.|+|+|-...+.+|.|++..+.+...+. ++.. .+-++ + . T Consensus 348 ~~Q~~l~p~le~l~~~l~~s--~~g---~~~-~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~--~~~ag~~~~~~-~-~ 417 (449) T protein:vir:10 348 SRRVDLSFEIEDFCDKLIEL--KII---DAV-AKKAVIWDDLNEQTGTEKLTNAKTMGEINQT--MLGSGDNPAFS-R-E 417 (449) T ss_pred HHHHhhhHHHHHHHHHHHHh--hcC---CCC-CceeEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHHccccCCcC-H-H Confidence 34445889999988876532 111 222 2689999999999999999988775432221 1111 11111 1 2 Q ss_pred HHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 463 IVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 463 E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) |+. +.. ..++. .+....++..++.+++.+... T Consensus 418 EiR-------~~~-~~~~~--------~~~~~~~e~~de~~~~~d~~a 449 (449) T protein:vir:10 418 EIR-------TAA-GYDND--------DEEPLGEEDGDEEDKATDSAA 449 (449) T ss_pred HHH-------HHh-cccCC--------CCCCCCCCCCccccccCCcCC Confidence 221 100 01110 000000111111111111111 No 171 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=97.11 E-value=0.00017 Score=41.29 Aligned_cols=439 Identities=13% Similarity=0.104 Sum_probs=178.2 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCC---CCCC-ccee-ecchhH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNE---KGKA-DHRA-VHSFAR 75 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~---~~~~-~~ri-~~n~~~ 75 (513) ++.+.-|.-.-+.+.. .+.++..-.-..--++-..-..|--+...+..-.-..... .+.+ --.| .++-.+ T Consensus 51 ~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr 125 (694) T protein:vir:10 51 LNALDAAPVAEPSPSL-----RLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYR 125 (694) T ss_pred chhhcccccCCCCcch-----hhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccchh Confidence 3333323222222211 1111111000000010011111211111000000000000 0000 0001 123334 Q ss_pred HHHHHHHHHhhcCCeeec-------------------C----CcHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeee Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMS-------------------G----PSSDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYR 132 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~-------------------~----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~ 132 (513) .++.+.+..+.-+-+... . +..+.|..-++.-++...+.++.+.+-.||.+..++-. T Consensus 126 ~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I 205 (694) T protein:vir:10 126 AMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKI 205 (694) T ss_pred hHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEe Confidence 445555554432221111 0 11234666666778888899999999999999877765 Q ss_pred cCCCce---eE--------------EEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEE Q lcl|NC_019916. 133 DPSQKG---EV--------------SVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRY 195 (513) Q Consensus 133 d~~~~~---~~--------------~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~ 195 (513) +.++.. ++ ...++|..+.|-.-+. ..|+. .+...-..|+| - ...++.- T Consensus 206 ~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~--~dP~s----------pdfgkP~~y~V--~-G~~IH~S 270 (694) T protein:vir:10 206 KGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS--INPVA----------DDFYKPSTWWM--I-GTEVHAT 270 (694) T ss_pred ecCccccccccccccccccCcceeeeEeecccccccchhhh--ccchh----------hccCCCceEEE--e-ceEEeee Confidence 443311 11 0011222222210000 00000 00001111111 0 0111111 Q ss_pred EeeccCCccccccccccccCcccceEEe---cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccc Q lcl|NC_019916. 196 KPIVVAGSVPTLEVAEHSAQFGFPMIEY---RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLF 272 (513) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~---~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~ 272 (513) +. +.+...|+-.. .++..|.|..+.+.+-+++.+++.-..+.-+..+....+. +++..... T Consensus 271 RL---------------~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~dla~~L~ 334 (694) T protein:vir:10 271 RL---------------HTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQALM 334 (694) T ss_pred eE---------------EEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHH-HHHHHhhc Confidence 00 01111111110 1233577888888888888887776666555433333221 11110000 Q ss_pred ccccccccccchhhhhhhccccccchhhhcchhcc-eeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 273 DDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQAN-MILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHK 351 (513) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 351 (513) .... ..+.. ....+..++.+. ++.+ ++.+=+|.+++.+.+.+...+.+..+.|+. T Consensus 335 ---------~g~~----~~l~~-R~eli~~~Rsn~G~~ll----------Dk~~Eefeq~stslSGLddVi~qf~q~VAg 390 (694) T protein:vir:10 335 ---------PGAN----VDLSM-RAELINRYRDNRNILFL----------DKATEEFFQFNTPLSGLDALQAQAQEQMSA 390 (694) T ss_pred ---------ChhH----HHHHH-HHHHHHHhcCccceEEE----------ecCCcceEEEecccCCHHHHHHHHHHHHHh Confidence 0000 00000 111122222222 2222 223447788889999999999999999999 Q ss_pred HhCccccccccc--cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCC Q lcl|NC_019916. 352 FSHTPDLTDDNF--SG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLP 428 (513) Q Consensus 352 ~s~~p~~~~~~~--~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p 428 (513) .+++|-.-+... +| |.||++=..-|...+ ....+..+...+++++.+|.. ...+. .+. ++.++|++--. T Consensus 391 aa~IPltkLfGqSPkGlNATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~r--S~~G~---idp-~i~~~fnPL~q 462 (694) T protein:vir:10 391 VSHIPLIKLLGITPTGLNASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQL--SLFGA---VDP-SIKWQWNALRE 462 (694) T ss_pred hhcCchhhhhccCcccccccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH--HhcCC---CCC-cceEEeCCCCC Confidence 999996543322 23 688987544444444 245578899999998887642 22222 222 57899999999 Q ss_pred cCHHHHHHHHHHH---------hcCCCHHHHHHhC------CCCC--CHHHHHHHHHHHHHHHHHHhhhh-cCCCCCCCC Q lcl|NC_019916. 429 TDDVAIITALVQA---------GAQIPQEYLYQYL------PNVT--DADEIVKMMDKQRKAMLKTYDTK-GGLIINGTS 490 (513) Q Consensus 429 ~d~~e~a~~~~kl---------~g~iS~et~~~~l------~~v~--D~~~E~~ri~~E~~~~~~~~~~~-~~~~~~~~~ 490 (513) .++.|.|+.-.|- .|+|+...+..+| ++.. |...+=-..... .++....+ .+....+ T Consensus 463 mtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~---~~~~~~~~~~~~~~~~-- 537 (694) T protein:vir:10 463 LDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADD---DIDGVLTYVQRLAEGG-- 537 (694) T ss_pred cCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccc---hhhhhHhhhcCccccc-- Confidence 9999999875542 4666655555553 2211 100000000000 00000000 0001111 Q ss_pred CCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 491 GNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.+...++.++...|.+=..-+ T Consensus 538 -~~~~~~~~~~g~~~~~~v~~~~ 559 (694) T protein:vir:10 538 -DTGAPGGARAGATAPPTVANVN 559 (694) T ss_pred -ccCCCCcccccccCCCcccccc Confidence 1111111222222222111111 No 172 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.06 E-value=0.00019 Score=41.00 Aligned_cols=371 Identities=12% Similarity=0.044 Sum_probs=149.7 Q ss_pred HHHHHHHHHHHH----HHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHHH Q lcl|NC_019916. 24 AAFIRHHYNNQR----PRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDR 99 (513) Q Consensus 24 ~~~i~~~~~~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~ 99 (513) ..++++....+. .....+...+-+- . ..........-+.++-...+|+..+.-+-+-|++........ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~-------~-~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 72 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLAS-------L-KGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKLQG 72 (382) T ss_pred CccccccccCCcccccccccchhhhcccc-------c-cCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchhhh Confidence 111111100000 0000000001000 0 000000000011233445567777777777788765444332 Q ss_pred H-HHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccc Q lcl|NC_019916. 100 L-DDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNIT 178 (513) Q Consensus 100 l-~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~ 178 (513) | ..=............+..+++.+|.||+++-.+..|.+.-.+.++|..+-+..++.. ..+ +|........ T Consensus 73 L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~-~~~-----~y~~~~~~~~-- 144 (382) T protein:vir:48 73 IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNK-DGI-----YYNITFDDPR-- 144 (382) T ss_pred hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-CeE-----EEEEEecCcc-- Confidence 2 111111233566667888999999999999888888765555678888877665432 111 1211111000 Q ss_pred eeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019916. 179 QTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLN 258 (513) Q Consensus 179 ~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~ 258 (513) ......+....+++++.. ++. ..-.|.|-+..+...++....+..-..+.+...+ T Consensus 145 --~~~~~~~~~~evih~~~~--------------~~~---------~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~ 199 (382) T protein:vir:48 145 --IPPKQHVPQNDVLHFRLL--------------SVD---------GGMTSVSPLMALSRELDIQKASGNLTINSLKNAL 199 (382) T ss_pred --ccceeEEcCccEEEecCC--------------CCC---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 000112333444433210 001 1124677777777777666555555556566666 Q ss_pred hhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHH Q lcl|NC_019916. 259 EAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGT 338 (513) Q Consensus 259 ~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 338 (513) .|-.+++-...... +.. ..+........ ...++++.+ ..+.++.-+........+ T Consensus 200 ~p~~il~~~~~~~~-----------e~~---~~~~~~~~~~~--~n~g~~~vl---------~~g~~~~~l~~~~~d~q~ 254 (382) T protein:vir:48 200 NANGILKIKGGGLL-----------DFK---TKLSRSRQAMK--QMQGGPLVL---------DDLEDFTPLEIKSNVSQL 254 (382) T ss_pred CCceEEEeCCCCCh-----------HHH---HHHHHHHHhhc--cCCCCeeEc---------CCCceEEEccCChhHHHH Confidence 66555543211110 000 00000000000 001222222 223344444444444556 Q ss_pred HHHHHHHHHHHHHHhCcccccccccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Q lcl|NC_019916. 339 ELYKKRLAADIHKFSHTPDLTDDNFSGNSS-GVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPD 417 (513) Q Consensus 339 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S-g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~ 417 (513) ....+...+.|+..-++|+...+..+.+.+ ..+.+ ..+...++.+++.+..-+...-. ..... T Consensus 255 ~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~~--------------~~~~~~l~p~~~~i~~~l~~~l~-~~~~~- 318 (382) T protein:vir:48 255 LKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMSS--------------DLYSKAVSRYLRPFLSELSQKLS-CDVDA- 318 (382) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhc-Chhhh- Confidence 677888889999999999766543222222 12221 22233333333333322222110 00110 Q ss_pred eeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCC Q lcl|NC_019916. 418 EIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYL---PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGN 492 (513) Q Consensus 418 ~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l---~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 492 (513) ++...+ + .+.......+.++ +|++++-.+++.+ ++.++...+. +. .. ... T Consensus 319 ~~~~~~-~---~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~--------------~~----~~---~~~ 373 (382) T protein:vir:48 319 DIFPAV-D---PTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNG--------------EN----PN---STL 373 (382) T ss_pred hhhhhh-c---cchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhh--------------hc----CC---CCC Confidence 111111 1 1222334444444 5677776665543 3333211000 00 00 000 Q ss_pred CCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 493 DPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~~~ 513 (513) .|++++. +| T Consensus 374 ~GGd~~~------------~~ 382 (382) T protein:vir:48 374 KGGEEDG------------QD 382 (382) T ss_pred CCCCCCC------------CC Confidence 1111111 11 No 173 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=97.06 E-value=0.00019 Score=41.00 Aligned_cols=436 Identities=10% Similarity=0.025 Sum_probs=182.6 Q ss_pred CCc---ccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDA---DKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~---~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |.. +.+..+.+.+..+...++|.+ +.+.+.+|..-. ...... ..-.+...++..+-....+++.++.|+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~LAa~L~ 75 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS---VFPSAT--ADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhc---ccCCCC--CcchhhccccccchHHHHHHHHHHHHH Confidence 332 344577777777776666544 455555563331 111111 111223456777778888888888886 Q ss_pred cC--C----e-eecCCcH------------HHHH-----------HHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCC Q lcl|NC_019916. 87 GN--A----I-AMSGPSS------------DRLD-----------DFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQ 136 (513) Q Consensus 87 g~--p----~-~~~~~~~------------~~l~-----------~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~ 136 (513) +- | + ++...+. ..++ ..+..++|.....++.++..++|.|.+++..++.. T Consensus 76 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~ 155 (532) T protein:vir:99 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) T ss_pred HhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccc Confidence 52 2 2 2222110 1122 33445789999999999999999998776554322 Q ss_pred c-eeEEEEEcccceEEEecCCCCcceEEEEEEEeecc--c---------cc-ccceeEEEEEEEc-----CCc--EEEEE Q lcl|NC_019916. 137 K-GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQT--V---------VD-NITQTKYEVETWT-----END--YTRYK 196 (513) Q Consensus 137 ~-~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~--~---------~~-~~~~~~~~ve~yt-----~~~--~~~~~ 196 (513) . ....+..-|..-+++--+. ..++...+|..+... . .. ...+....+++|+ ++. ...|. T Consensus 156 ~~~~~~f~~~pl~~y~v~~d~-~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~ 234 (532) T protein:vir:99 156 EGQSNAPKLYKLHNFVVERDA-YDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQ 234 (532) T ss_pred cCcccceEEEEcCeEEEeeCC-CCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCCCCeeEEEE Confidence 1 1112223444445554443 345555555332210 0 00 0011122344443 221 11121 Q ss_pred eeccCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccc Q lcl|NC_019916. 197 PIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTL 271 (513) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~ 271 (513) ... +..... ..-..++..+|++.++- ..+|+|-.++..+-+-.+|.+.-......+....|.+.+.-.+... T Consensus 235 ~~~-g~~~~~--~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~ 311 (532) T protein:vir:99 235 EID-GEIVAG--TEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQ 311 (532) T ss_pred eec-Cceecc--cccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccc Confidence 111 111111 11122356788877653 4579999999999998888877777777777777764432100000 Q ss_pred cccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHH Q lcl|NC_019916. 272 FDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADI 349 (513) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i 349 (513) . . ......+|... .+...+++.+. ...+.......++.++..| T Consensus 312 ~----------------------------~-----~~~~~~~g~~v--~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI 356 (532) T protein:vir:99 312 I----------------------------R-----RVAKANTGDFV--AGRKQDVEVFQLEKYNDFQVAKATADDIEKRL 356 (532) T ss_pred h----------------------------h-----hhccCCCccee--cCCcccceeeecccccchhHHHHHHHHHHHHH Confidence 0 0 00000011111 11122333332 2235556667777777766 Q ss_pred HHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhcccccccc--ccee Q lcl|NC_019916. 350 HKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLN--------QRYTVVAHIEERVNGKWDID--PDEI 419 (513) Q Consensus 350 ~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~--------~~~~li~~~l~~~~~~~~~~--~~~i 419 (513) ...-. .+.....-+...++.-++.. +.++...+|..+. -+++.++.++...+.-.... ...+ T Consensus 357 ~~af~-~~~~~~~d~~r~TAtEV~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~ 428 (532) T protein:vir:99 357 SYAFM-LNSAVQRGGDRVTAEEIRYV-------AGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEP 428 (532) T ss_pred HHHHh-hhhcccCCCCcccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhccc Confidence 43211 11110011233455544332 3444444444333 33344445554433221111 1112 Q ss_pred e-EEeCCCCCcCHHHHHHHH----HHHhcC-------CCHHHHH----HhCCC----CCCHHHHHHHHHHHHHHHHHHh- Q lcl|NC_019916. 420 G-FIFRDNLPTDDVAIITAL----VQAGAQ-------IPQEYLY----QYLPN----VTDADEIVKMMDKQRKAMLKTY- 478 (513) Q Consensus 420 ~-i~f~~~~p~d~~e~a~~~----~kl~g~-------iS~et~~----~~l~~----v~D~~~E~~ri~~E~~~~~~~~- 478 (513) . +++-.++ ..++.++.+ ..++.+ +....++ ..++- +--.++|.+.+.++++.++... T Consensus 429 ~iv~~is~L--araq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~ 506 (532) T protein:vir:99 429 AIATGLEAL--GRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVT 506 (532) T ss_pred ceeecchHH--HHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHH Confidence 2 2221111 112222221 112222 2222232 22321 1123555555554433222111 Q ss_pred --hhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 479 --DTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 479 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) ...+.. .. +. ....... ....+.+ T Consensus 507 a~~~~~~~-~~-~~----~~~~~~~-~~~~~~~ 532 (532) T protein:vir:99 507 AGQQMGAA-GG-QA----AAAMMQQ-QAGMPTQ 532 (532) T ss_pred HHHHHHHH-HH-Hh----cchhHHh-hcCCCCC Confidence 111100 00 00 0000111 1111111 No 174 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=97.01 E-value=0.00021 Score=40.72 Aligned_cols=434 Identities=11% Similarity=0.056 Sum_probs=183.6 Q ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC--C--- Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQ---RPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN--A--- 89 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~---~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~--p--- 89 (513) |. ....+..+...++| ..+.+.+.+|..-. ...... ........++..+.....++..++.|++- | T Consensus 1 mk-~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~---~~~~~~--~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 74 (542) T protein:vir:78 1 MK-GLAQARYSAMRADREDFLDMARRCAALTLPY---LLTEDG--HASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQT 74 (542) T ss_pred Ch-hHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---cCCCCC--CcccccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 11 11112222223333 44566666664321 111110 01112233566677788888888887652 2 Q ss_pred -e-eecCCc----------H--------------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 90 -I-AMSGPS----------S--------------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 90 -~-~~~~~~----------~--------------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) + ++...+ . ..+...+..++|.....++.++..++|.|. +|.+++. +. T Consensus 75 ~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~~~~~---~~-- 147 (542) T protein:vir:78 75 SFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVL--VFAGKKT---LK-- 147 (542) T ss_pred ccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEE--EEecCCC---ce-- Confidence 2 222211 1 112334456789999999999999999985 4556542 22 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccc---------------------cccceeEEE-------EEEEcC-----C Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVV---------------------DNITQTKYE-------VETWTE-----N 190 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~---------------------~~~~~~~~~-------ve~yt~-----~ 190 (513) +-|..-+.+--+. ..++...+|.++..... .....++.+ +++|+. . T Consensus 148 ~~pl~~y~v~~d~-~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~ 226 (542) T protein:vir:78 148 VYPLDRYVIERDG-DGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDG 226 (542) T ss_pred EEecceeEEeeCC-CCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCC Confidence 2234445444443 34555556555432110 000111111 111111 0 Q ss_pred cEEEEEeeccCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhee Q lcl|NC_019916. 191 DYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIK 265 (513) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~ 265 (513) .+.+|.... +.... ....+.++..+|++.++- +.+|+|-.++..+-+-.+|.+.-......+...+|.+.+. T Consensus 227 ~~s~~~e~~-g~~v~--~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~ 303 (542) T protein:vir:78 227 QHRWHQECD-GKEIK--GSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVS 303 (542) T ss_pred eEEEEEEec-ccccc--ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Confidence 111111111 11100 011234667788777653 4679999999999999999998888888888888886542 Q ss_pred cCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEE--eecCCHHHHHHHHH Q lcl|NC_019916. 266 GDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYI--HKEYDSAGTELYKK 343 (513) Q Consensus 266 G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~ 343 (513) -.+.... . .......+.... +..++++.+ ....+.......++ T Consensus 304 ~~g~~~~----------------------------~-----~~~~~~~g~iv~--g~~~~v~~~~~~~~~~~~~~~~~i~ 348 (542) T protein:vir:78 304 PSATTKP----------------------------Q-----SLARAGTGAIIQ--GRAEDVSVVQANKGADFRTVQEMIR 348 (542) T ss_pred cccccch----------------------------h-----hcccCCCceeec--CCccceeeeecccccchhHHHHHHH Confidence 1100000 0 000001111111 122333333 23345666777788 Q ss_pred HHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhcccccccc Q lcl|NC_019916. 344 RLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR--------YTVVAHIEERVNGKWDID 415 (513) Q Consensus 344 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~--------~~li~~~l~~~~~~~~~~ 415 (513) .++..|...-..-+ .- -+...++.-++. ++.++...+|..+.++ ++-++.++...+.-.... T Consensus 349 ~~~~rI~~aFl~~~--~~-d~~rvTAtEV~~-------r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p 418 (542) T protein:vir:78 349 DLSQRISDAFLILN--VR-QSERTTATEVRE-------VQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLP 418 (542) T ss_pred HHHHHHHHHhcccc--cC-CcccccHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc Confidence 88877744311101 00 112334443333 3455555566555443 333444454444333333 Q ss_pred cceeeEEeCCCCCcC-HHHHHHHH----HHHhcCCCHHH---------HHHhC---CCCC-----CHHHHHHHHHHHHHH Q lcl|NC_019916. 416 PDEIGFIFRDNLPTD-DVAIITAL----VQAGAQIPQEY---------LYQYL---PNVT-----DADEIVKMMDKQRKA 473 (513) Q Consensus 416 ~~~i~i~f~~~~p~d-~~e~a~~~----~kl~g~iS~et---------~~~~l---~~v~-----D~~~E~~ri~~E~~~ 473 (513) ..-+++++.-++..- ..+.++.+ ..++.++..+. ++..+ -+|+ ..++|++++.++.++ T Consensus 419 ~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~ 498 (542) T protein:vir:78 419 KGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQ 498 (542) T ss_pred hhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHH Confidence 334666666555321 11111111 11111222222 22221 1232 224555555444332 Q ss_pred HHHHh--hhhcCCCCCCCC-CCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 474 MLKTY--DTKGGLIINGTS-GNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 474 ~~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .+... ....+....... +.......++++...++-+.-+| T Consensus 499 ~~~~~al~~~a~~~a~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 541 (542) T protein:vir:78 499 QQMTASLMGQAGQLAKSPIGEKMMQQINAPGQEAPAGPQTGED 541 (542) T ss_pred HHHHHHHHHhhhhccccccccchhhhcCCCCcCCCCCCccccc Confidence 22211 111111111111 11111112222111111111122 No 175 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=96.99 E-value=0.00022 Score=40.62 Aligned_cols=377 Identities=13% Similarity=0.059 Sum_probs=143.7 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHhcC-CCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHHHH- Q lcl|NC_019916. 24 AAFIRHHYNN-QRPRLEMLYDYYRG-QNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDRL- 100 (513) Q Consensus 24 ~~~i~~~~~~-~~~~~~~~~~YY~G-~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l- 100 (513) ..+.+..... ..+.. ........ ....+.. +. .........-+...-...+|+..+.-+-+-|+++.......| T Consensus 1 Mglf~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~-~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~~~l~ 77 (384) T protein:vir:49 1 MPIFNITNLATESPPS-NQDSFFDITDPEFLDA-LN-GSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQLQGIV 77 (384) T ss_pred CccccccccCcccccc-cchhhccccchhhccc-cc-CCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchhhhhh Confidence 1121110000 00000 00000000 0000000 00 000000000012233445677777777778888654433322 Q ss_pred HHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccccccee Q lcl|NC_019916. 101 DDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQT 180 (513) Q Consensus 101 ~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~ 180 (513) ..-............+..+++.+|.||+++-.+..|.+.-.+.++|..+-+..++.. ..+ +|.....+.... T Consensus 78 ~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~-~~~-----~y~~~~~~~~~~-- 149 (384) T protein:vir:49 78 DNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQ-NGL-----YYNITFDDPRIP-- 149 (384) T ss_pred hccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceE-----EEEEEecCcccc-- Confidence 111112234566667888999999999999988888765566688888877665432 111 111111110000 Q ss_pred EEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019916. 181 KYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEA 260 (513) Q Consensus 181 ~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~ 260 (513) ....+..+.+++++.. ++.+ .-.|.|-+..+...++....+..-..+.+...+.| T Consensus 150 --~~~~~~~~eVih~~~~--------------~~~~---------~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~ 204 (384) T protein:vir:49 150 --PKQHVPQGDILHFRLL--------------SVDG---------GLTSVSPLMALGRELNIQKASDKLTLNALKNALNA 204 (384) T ss_pred --ceeEecCccEEEecCC--------------CCCC---------ceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Confidence 0112344444443221 0011 12466766666666655444444444555544555 Q ss_pred hhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHH Q lcl|NC_019916. 261 MLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTEL 340 (513) Q Consensus 261 ~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 340 (513) -.+++-......+ +... .. ..........++++.+ ..+.++.-+........+.. T Consensus 205 ~~il~~~~~~~~~----------~~~~---~~---~~~~~~~~n~~~~~vl---------~~g~~~~~l~~~~~d~q~~e 259 (384) T protein:vir:49 205 NGILKIKGGGLLD----------FKTK---QS---RSRQAMKQMQGGPLVL---------DDLEDFTPLEIKSNVAQLLS 259 (384) T ss_pred ceEEEeCCCCChH----------HHHH---HH---HHHHhcccCCccceec---------CCCceEEEccCChhhHHHHH Confidence 5444322111100 0000 00 0000000111222222 22333333433344455567 Q ss_pred HHHHHHHHHHHHhCccccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccee Q lcl|NC_019916. 341 YKKRLAADIHKFSHTPDLTDDNFSG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEI 419 (513) Q Consensus 341 ~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i 419 (513) ..+.+.+.|+..-++|+...+...+ ..++..++..+...+..+- .-+...+.+. +.. ....+ + T Consensus 260 ~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l---~pi~~~i~~~-------l~~---~l~~~---~ 323 (384) T protein:vir:49 260 QADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFL---RPFVSELSKK-------LSC---EVDAD---I 323 (384) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHH---HHHHHHHHHH-------hch---hhhhh---h Confidence 7788889999999999766543322 2344444433332221110 1111111111 100 00000 0 Q ss_pred eEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCC Q lcl|NC_019916. 420 GFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYL---PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDP 494 (513) Q Consensus 420 ~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l---~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 494 (513) .. ..+..+......++.+.+ +|+.++-.+.+.+ |+.+ -|+.++ ....+..+ +.+++.- T Consensus 324 ~~-~~~~~~~~~~~~~~~l~~-~~~~t~~e~~~~l~~~g~~~---ne~r~~--------~~~~p~~g----Gd~~~~~ 384 (384) T protein:vir:49 324 LP-AVDPTGSNYIGLINSMVK-TGTLAQNQGLYVLQQAEILP---KDLPEG--------ETDSTLKG----GETNEQY 384 (384) T ss_pred hh-hhhccchHHHHHHHHHhh-cCcccHHHHHHHHhhCCCCC---hhHHHH--------cCCCCCCC----CCCCCCC Confidence 00 001111112222222222 4566665555443 4333 122111 11111111 1111111 No 176 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=96.93 E-value=0.00025 Score=40.32 Aligned_cols=400 Identities=12% Similarity=0.019 Sum_probs=179.8 Q ss_pred Cccchhh------ceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchh Q lcl|NC_019916. 1 MIDMQQA------NMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFA 74 (513) Q Consensus 1 ~~~~~~~------~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~ 74 (513) ++..+-+ .....+....|||..+..+++..-.-...++-.| |+ ++.. ..... T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L---~e---dm~e----------------~D~~i 74 (526) T protein:vir:79 17 LREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAEL---FM---DMEE----------------RDAHL 74 (526) T ss_pred cchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHH---HH---HHHh----------------hChHH Confidence 1111211 2223344567888888877766433222222222 21 1100 12444 Q ss_pred HHHHHHHHHHhhcCCeeecCC-----cHH----HHHHHHHhc-CHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeEE-E Q lcl|NC_019916. 75 RYIADFQTSYSVGNAIAMSGP-----SSD----RLDDFNRRN-DIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEVS-V 142 (513) Q Consensus 75 ~~ivd~~~~~l~g~p~~~~~~-----~~~----~l~~~~~~n-~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~~-~ 142 (513) .-++.+....+.+.++++... .+. .+++++... +|......+. +|.-+|.+. +++|.-.+|...+. + T Consensus 75 ~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-dA~~~G~s~~Ei~w~~~~g~~~~~~l 153 (526) T protein:vir:79 75 FAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAF 153 (526) T ss_pred HHHHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHH-hhhhhcceeEEEEEeecCCceeEEEe Confidence 556777777788888887532 112 355555442 5777665554 488889885 56665444432221 1 Q ss_pred EEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEE Q lcl|NC_019916. 143 KLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIE 222 (513) Q Consensus 143 ~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 222 (513) ..-|...|. |++... .. +|+ .. +... -+.+. +++.|-.++ T Consensus 154 ~~r~~~~F~-~~~~~~--~~--l~~-~~----~~~~----g~~l~--------------------------~~k~iv~~~ 193 (526) T protein:vir:79 154 HHRPQSWFQ-LNPEDQ--NE--LRL-RD----NSPA----GEALQ--------------------------PFGWIIHRP 193 (526) T ss_pred eeecccceE-eccCCC--cE--EEe-cC----CCCC----ceeec--------------------------CCceEEEee Confidence 111222222 222111 00 000 00 0000 00010 111111111 Q ss_pred ec--CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhh Q lcl|NC_019916. 223 YR--NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQL 300 (513) Q Consensus 223 ~~--n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 300 (513) -. .+..|.|.+..+-...=-=+..+.+.+..++.|+.|+++.+=..+.... +. ..-...+ T Consensus 194 ~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~-----------ek-------~~L~~av 255 (526) T protein:vir:79 194 RARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADE-----------EK-------ATLLRAV 255 (526) T ss_pred cCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHH-----------HH-------HHHHHHH Confidence 00 2334667776665555555567888899999999999887632111100 00 0001112 Q ss_pred hcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCccccccc----cccccccHH-HHHH Q lcl|NC_019916. 301 EAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDD----NFSGNSSGV-AMKY 374 (513) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~~~~n~Sg~-Ai~~ 374 (513) ..+..+.+..+ ..+..++|++.. .....++..++.+.+.|...--.-.++.+ +.++...|. .-+. T Consensus 256 ~~i~~da~~ii---------P~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v 326 (526) T protein:vir:79 256 TGLGHAAAGII---------PETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEV 326 (526) T ss_pred HHHhcCcEEEe---------cCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHH Confidence 23333333333 345678888853 55677899999999999776332223322 112222221 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccc-cceeeEEeCCCCCcCHHHHHHHHHHHh--cC-CCHHH Q lcl|NC_019916. 375 KVLGTVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDID-PDEIGFIFRDNLPTDDVAIITALVQAG--AQ-IPQEY 449 (513) Q Consensus 375 ~~~~l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~-~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-iS~et 449 (513) ... .++.-.+.....+. ++++-++.+ +.....+ .....++|....+.|..+.++.+.++. |+ +|.+. T Consensus 327 ~~d----i~~aDa~~i~~tln~~Li~~l~~~----N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~ 398 (526) T protein:vir:79 327 RHD----ILASDARQLAATLSRDLLWPLLVL----NRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAW 398 (526) T ss_pred HHH----HHHHHHHHHHHHHHHHHHHHHHHh----CCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHH Confidence 222 22222333444442 344444432 2221111 124578899999999999999999885 55 89999 Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 450 LYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 450 ~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.+.++. +.++.. +.+... ...+ .......+..... ..........++| T Consensus 399 i~e~~gi-p~~~~~-e~~l~~------~~~~------~~~~~~~~~~~~~-~~~~~~~~~~~~~ 447 (526) T protein:vir:79 399 VYDKLGI-PQPAKN-EPVLRP------AAQP------AILSRQHGQRVAA-LATIVGPRYGDQQ 447 (526) T ss_pred HHHHhCC-CCCCCc-hhhccc------cCCc------ccccccccccccc-ccccccccCchhh Confidence 9999864 432211 111100 0000 0000000000000 0000011111111 No 177 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=96.91 E-value=0.00026 Score=40.22 Aligned_cols=196 Identities=15% Similarity=0.115 Sum_probs=86.2 Q ss_pred eEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchh Q lcl|NC_019916. 220 MIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQ 299 (513) Q Consensus 220 vv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 299 (513) |. ....+. .+++.=+..+.. .... T Consensus 1 V~-------k~~~l~---~~~~~~~~~~~~----------------------------------------------r~~~ 24 (201) T protein:vir:10 1 MW-------KAKGLA---DLCDDSDGAARL----------------------------------------------RLAQ 24 (201) T ss_pred Cc-------cchHHH---HHhcCChHHHHH----------------------------------------------HHHH Confidence 00 000000 000000000000 0000 Q ss_pred hhcchh-cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccc-c-cccc-cccHHH-HHH Q lcl|NC_019916. 300 LEAMRQ-ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTD-D-NFSG-NSSGVA-MKY 374 (513) Q Consensus 300 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~-~~~~-n~Sg~A-i~~ 374 (513) +...+. .+++.+ .+.+-+|-+.+.+.+++...+....+.|+..+++|-.-+ + ..+| |.||.. ++. T Consensus 25 ~~~~~~~~~~~~l----------d~~~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~n 94 (201) T protein:vir:10 25 VDNNSGVGQAIGI----------DADSEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALET 94 (201) T ss_pred HHHhhhhhhhhee----------ecCCcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHH Confidence 000000 000100 112235667788889999999999999999999995432 2 2223 457775 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHHHhC Q lcl|NC_019916. 375 KVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLYQYL 454 (513) Q Consensus 375 ~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l 454 (513) -|..+. ...+..+++.++++++++.. ..++.++|+|-...+..+.|+...+.+...+. +-.. T Consensus 95 yyd~i~---~~Qe~~l~p~le~l~~~~~~------------~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~---~~~~ 156 (201) T protein:vir:10 95 FYGYVD---RKRKAELLPLLEFLLPFIVT------------EQEWSVEFNPLSQVSDKDKSEILEKNVNSVAA---LIAA 156 (201) T ss_pred HHHHHH---HHHHHHHHHHHHHHHHhhcC------------CCCceEeeCCCCCCCHHHHHHHHHHHHHHHHH---HHHc Confidence 333333 33346788888887765321 12578999999999999999988775322211 1112 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 455 PNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 455 ~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) + +-++++..+++.+. ...+... +.....+. +. .....|.+.-+++ T Consensus 157 g-~i~~~e~r~~L~~~--------~~~~~~~-~~~~~~~~--~~--~e~~dp~~~~~~~ 201 (201) T protein:vir:10 157 G-IIDADEARDTLRAI--------STEVKIG-EGSIQTEV--VI--NESEDPLDVSANN 201 (201) T ss_pred C-CCCHHHHHHHHHhc--------CCcCCCC-CCCCCccc--cc--cccCCCCCCCCCC Confidence 2 22333333333322 1111111 11000010 00 1111111111111 No 178 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=96.86 E-value=0.00029 Score=39.96 Aligned_cols=444 Identities=10% Similarity=0.049 Sum_probs=192.1 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHh---cCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhh Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYY---RGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSV 86 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY---~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 86 (513) |. +-..+.|.+..+.....|.+ +.+.+.+|. .+.-. .... ....+...++..+-+...+++.++.|+ T Consensus 1 m~--~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~---~~~~--~~~~~~~~~~~dst~~~a~~~Las~l~ 73 (556) T protein:vir:73 1 MA--ETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFL---TSDV--NRDDRRNTKIVDPTGSMAQRILSSGMM 73 (556) T ss_pred CC--hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcC---CCCC--CcchhhcCccccchHHHHHHHHHHHHH Confidence 33 22345555555555555544 455555553 22110 0000 011123446777888888888888886 Q ss_pred cC--C-----eeecCCc----------------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 87 GN--A-----IAMSGPS----------------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 87 g~--p-----~~~~~~~----------------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) +- | +++...+ +..+...+..++|.....++.++..++|.|.+++-.+..+-.++. . T Consensus 74 ~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~-~ 152 (556) T protein:vir:73 74 SGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTM-P 152 (556) T ss_pred HhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEE-E Confidence 52 2 1232221 112334455678999999999999999999876654443322222 2 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeeccc--------cc---------ccceeEEEEEE----EcCCcE---------- Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTV--------VD---------NITQTKYEVET----WTENDY---------- 192 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~--------~~---------~~~~~~~~ve~----yt~~~~---------- 192 (513) ++..+.++--|. .+++...+|.++.... +. .......++++ |..... T Consensus 153 ~~l~~~~~~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~ 230 (556) T protein:vir:73 153 FPIGSYYLANSP--RGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNK 230 (556) T ss_pred eecceeEEeeCC--CCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccc Confidence 344444444443 3456666665443210 00 00010112222 221110 Q ss_pred ----EEEEeeccCCccccccccccccCcccceEEec-----CCCCCCcc-hhHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_019916. 193 ----TRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGD-FENVLSLIDLYDVAQSDTANYMTDLNEAML 262 (513) Q Consensus 193 ----~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd-~e~v~~liD~~~~~~S~~~~~~~~~~~~~l 262 (513) +++.....+.. . ..+..|..+|++.++ ++.+|+|. .++..+-+..+|.+.-..+...+...+|.+ T Consensus 231 p~~s~~~~~~~~~~~--v---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (556) T protein:vir:73 231 PYRSVYFESGGDSDK--L---LRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPM 305 (556) T ss_pred eEEEEEEEecCCCce--e---cccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 11111101100 0 122345667877664 34579994 999999999999988888999998888877 Q ss_pred heecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccc-ccc-ccccCCceeEEe-ecCCHHHHH Q lcl|NC_019916. 263 VIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGM-APN-GQQTSADANYIH-KEYDSAGTE 339 (513) Q Consensus 263 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~l~-~~~~~~~~~ 339 (513) .+-...... .+.+.+++ ... ..+...+++.+. ...+..... T Consensus 306 ~v~~~~~~~------------------------------------~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~ 349 (556) T protein:vir:73 306 VAPTSLKNQ------------------------------------RVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLL 349 (556) T ss_pred ecccccccc------------------------------------ceeeccCccccccCCCCccceeeeccccccHHHHH Confidence 653321100 00001111 000 011122344432 223455566 Q ss_pred HHHHHHHHHHHHHhCccccc---cccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--- Q lcl|NC_019916. 340 LYKKRLAADIHKFSHTPDLT---DDNFSGNSSGVAMKYKVLGTVEL-ASTKRKQFERGLNQRYTVVAHIEERVNGKW--- 412 (513) Q Consensus 340 ~~~~~l~~~i~~~s~~p~~~---~~~~~~n~Sg~Ai~~~~~~l~~k-~~~~~~~f~~~l~~~~~li~~~l~~~~~~~--- 412 (513) ..++.++..|-.. ...++. ...-+.+.++.-++..-..+... -....+.-.+.+.-+++-++.++...+.-. T Consensus 350 ~~i~~~~~rI~~a-f~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P 428 (556) T protein:vir:73 350 ADIQDTRQTINSA-YFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPP 428 (556) T ss_pred HHHHHHHHHHHHH-hhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc Confidence 6677777777433 222211 11112345665554432222211 122222233334444444455554433211 Q ss_pred -ccccceeeEEeCCCCCcCH-HHH-------HHHHHHHhcC-------CCHHHHHHhC---CCCC----CHHHHHHHHHH Q lcl|NC_019916. 413 -DIDPDEIGFIFRDNLPTDD-VAI-------ITALVQAGAQ-------IPQEYLYQYL---PNVT----DADEIVKMMDK 469 (513) Q Consensus 413 -~~~~~~i~i~f~~~~p~d~-~e~-------a~~~~kl~g~-------iS~et~~~~l---~~v~----D~~~E~~ri~~ 469 (513) ......|+|++..++-... ... ++.+..++++ +....++..+ -+|+ -.++|.+.+.+ T Consensus 429 ~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq 508 (556) T protein:vir:73 429 DVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIRE 508 (556) T ss_pred hhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHH Confidence 2234457777765554321 111 1122222232 2233333222 1222 13566666655 Q ss_pred HHHHHHHHhhhh------cC---CCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 470 QRKAMLKTYDTK------GG---LIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 470 E~~~~~~~~~~~------~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) ++++.+..+... .+ ...+...+....-...-.+-|.|.. T Consensus 509 ~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 509 ERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhhcCCCC Confidence 544333322111 00 0110000000000000001111211 No 179 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=96.77 E-value=0.00035 Score=39.51 Aligned_cols=393 Identities=9% Similarity=0.020 Sum_probs=160.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee-c--CC-- Q lcl|NC_019916. 21 TRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM-S--GP-- 95 (513) Q Consensus 21 ~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~-~--~~-- 95 (513) --|.+++.. +...+.-.......|-.. . .........+..-+...-...+|+..+.-+-+-|+.+ . .+ T Consensus 1 m~~~~~~~~---~~~~~~~~~~~~~~~~~~-~---~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~ 73 (419) T protein:vir:57 1 MFIPQFWKG---RPSENRVNWQVVPGGMRS-S---SSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGG 73 (419) T ss_pred Ccchhhhcc---CCcccccccccccccccc-c---cccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCc Confidence 111111111 000000000000011000 0 0000000001112233445667777777777778775 1 11 Q ss_pred ----cHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEE Q lcl|NC_019916. 96 ----SSDRLDDFNRR--N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVR 166 (513) Q Consensus 96 ----~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir 166 (513) .+..+..++.. | ........+..+.+.+|.||+++..+..|.+.-.+.++|..+.+..+... .. T Consensus 74 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g-------~~ 146 (419) T protein:vir:57 74 REIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDG-------MP 146 (419) T ss_pred eeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCc-------eE Confidence 12234554432 3 23455667888999999999999888888766566788888777655421 12 Q ss_pred EEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHH Q lcl|NC_019916. 167 YHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVA 246 (513) Q Consensus 167 ~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~ 246 (513) +|..... . .++..+.+++++.. ++ +...|.|.++.+...++....+ T Consensus 147 ~y~~~~~----~------~~~~~~~vih~r~~---------------~~---------d~~~G~s~i~~~~~~i~~~~~~ 192 (419) T protein:vir:57 147 YYDIPSI----G------EILPMRMVHHIKSF---------------SL---------DGYIGTSPIQTNPDVLGLGIAV 192 (419) T ss_pred EEEEcCC----c------eEEchhhEEEecCc---------------CC---------CCcccccHHHHHHHHHHHHHHH Confidence 3332110 0 12333444433210 00 1124667776666666544443 Q ss_pred HHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCCc Q lcl|NC_019916. 247 QSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSAD 325 (513) Q Consensus 247 ~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 325 (513) ..-....+...+.|-.+++-...... .... +....++..=...+.... .++++.+ ..+.+ T Consensus 193 ~~~~~~~f~ng~~p~gil~~~~~~~~------~~~~----e~~~~~~~~~~~~~~g~~nag~~~vl---------~~g~~ 253 (419) T protein:vir:57 193 EQHAAQVFARGTTMSGVIERPFEAKA------IASQ----AAVDAILAKWTERYGGVRNAFSVGML---------QEGMT 253 (419) T ss_pred HHHHHHHHHccCCccEEEEecCcCCc------ccCH----HHHHHHHHHHHHHhccccccccceec---------CCCce Confidence 33333333333444444432110000 0000 000000000000000000 0122222 22334 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 326 ANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 326 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) ++-++.......+....+...+.|+..-++|+...+... ++-|+ ++-. ....+...|...++.+..- T Consensus 254 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~~----------~~~f~~~~l~P~~~~ie~~ 321 (419) T protein:vir:57 254 YKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNN--IEHQ----------GLQYVIYTMLAILKRHESA 321 (419) T ss_pred EEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccc--HHHH----------HHHHHHHHHHHHHHHHHHH Confidence 444443344455667778888999999999976544221 22122 1111 1122233444444444333 Q ss_pred HHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019916. 405 EERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTK 481 (513) Q Consensus 405 l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~ 481 (513) +...- .........+++.+...+..|..++++++.++ +|+++.-.+.++++.-.-+. -. ....++ T Consensus 322 l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--gD----------~~~~~~ 389 (419) T protein:vir:57 322 MMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPG--GD----------KYLTPL 389 (419) T ss_pred HHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cC----------eeeecc Confidence 33211 11111122344445566667899999988886 57888877777765421000 00 000010 Q ss_pred cCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 482 GGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .. ...... .+.+ ...|+...+.+ T Consensus 390 n~--~~~~~~-~~~~------~~~~~~~~~~~ 412 (419) T protein:vir:57 390 NM--VDSKAL-TGIG------KATPQQLKDIE 412 (419) T ss_pred cc--cccccc-cccc------CCCcccCcchh Confidence 00 000000 0000 00011111111 No 180 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=96.61 E-value=0.00047 Score=38.83 Aligned_cols=432 Identities=12% Similarity=0.110 Sum_probs=178.8 Q ss_pred Cccchhhcee-----------ccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee Q lcl|NC_019916. 1 MIDMQQANMN-----------YQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA 69 (513) Q Consensus 1 ~~~~~~~~~~-----------~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri 69 (513) +-.|--|-.+ +..+....++..-..+ +.-++-....++.| .+|.|..-+-+.- --.| T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l-~~~~~~~F~Gy~~----------la~l 119 (695) T protein:vir:36 52 LNALDAAPVVEPSPSLRLARQFEVDVSNYTPRERRAA-SYALDFNGTSMDAL-SFVTSSGFPGFPT----------LVLL 119 (695) T ss_pred ccccccccccCCCcccccceeceecccccCccccchh-hhhhcccccccccc-hhhhccCcchHHH----------HHHH Confidence 1111111111 1122222222222221 11111111112222 2233211000000 0001 Q ss_pred -ecchhHHHHHHHHHHhhcCCeeec-------------------C----CcHHHHHHHHHhcCHHHHHHHHHHHHhhCCe Q lcl|NC_019916. 70 -VHSFARYIADFQTSYSVGNAIAMS-------------------G----PSSDRLDDFNRRNDIDTLNYELYLDMTVTGR 125 (513) Q Consensus 70 -~~n~~~~ivd~~~~~l~g~p~~~~-------------------~----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~ 125 (513) .++-.+.++.+.+..+.-+-+... . +..+.|..-++.-++...+.++.+.+-.||. T Consensus 120 aQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGG 199 (695) T protein:vir:36 120 AQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGR 199 (695) T ss_pred hhccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 023333444444444432211111 1 1123566667777888899999999999999 Q ss_pred EEEEeeecCCCce---eE--------------EEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEc Q lcl|NC_019916. 126 AYEYVYRDPSQKG---EV--------------SVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT 188 (513) Q Consensus 126 ~~~~v~~d~~~~~---~~--------------~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt 188 (513) +..++-.+.++.. ++ ...++|..+.|-.-+. ..|+. .+...-..|+| - T Consensus 200 a~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~--~dP~s----------pdfgkP~~y~V--~- 264 (695) T protein:vir:36 200 AHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS--INPVA----------DDFYKPSTWWM--I- 264 (695) T ss_pred eEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhh--ccchh----------hccCCCceEEE--e- Confidence 9877766543321 11 0012222222210000 00000 00001111111 0 Q ss_pred CCcEEEEEeeccCCccccccccccccCcccceEEe---cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhee Q lcl|NC_019916. 189 ENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEY---RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIK 265 (513) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~---~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~ 265 (513) ...++.-+. +.+...|+-.. .++..|.|..+.+.+-+++.+++.-..+.-+..+....+. + T Consensus 265 G~kIH~SRL---------------~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~ 328 (695) T protein:vir:36 265 GTEVHATRL---------------HTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-M 328 (695) T ss_pred ceEEeeeeE---------------EEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhHHHHH-H Confidence 011111100 01111111110 1233577888888888888887776666555433332221 1 Q ss_pred cCcccccccccccccccchhhhhhhccccccchhhhcchhcc-eeeccccccccccccCCceeEEeecCCHHHHHHHHHH Q lcl|NC_019916. 266 GDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQAN-MILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKR 344 (513) Q Consensus 266 G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 344 (513) ++..... .... ..+.. ....+..++.+. ++.+ ++.+=+|.+++.+.+.+...+.+ T Consensus 329 dla~aL~---------~g~~----~~l~~-R~eli~~~Rsn~G~~ll----------Dk~~Eefeq~stslSGLddVi~q 384 (695) T protein:vir:36 329 DLAQALM---------PGAN----VDLSM-RAELINRYRDNRNILFL----------DKATEEFFQFNTPLSGLDALQAQ 384 (695) T ss_pred HHHHhhc---------ChhH----HHHHH-HHHHHHHhcCccceEEE----------ecCCcceEEEecccCCHHHHHHH Confidence 1110000 0000 00000 111122222222 2222 22344778888999999999999 Q ss_pred HHHHHHHHhCccccccccc--cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeE Q lcl|NC_019916. 345 LAADIHKFSHTPDLTDDNF--SG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGF 421 (513) Q Consensus 345 l~~~i~~~s~~p~~~~~~~--~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i 421 (513) ..+.|+..+++|-.-+-.. +| |.||+.=..-|...+ ....+..+...+++++.+|.. ...+. .+. ++.+ T Consensus 385 f~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~r--S~~G~---idp-di~~ 456 (695) T protein:vir:36 385 AQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQL--SLFGA---VDP-SIKW 456 (695) T ss_pred HHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH--HhcCC---CCC-cceE Confidence 9999999999996543322 23 688987544444444 245578899999998887642 22222 222 5789 Q ss_pred EeCCCCCcCHHHHHHHHHHH---------hcCCCHHHHHHhC------CCCC--CHHHHHHHHHHHHHHHHHHhhhh-cC Q lcl|NC_019916. 422 IFRDNLPTDDVAIITALVQA---------GAQIPQEYLYQYL------PNVT--DADEIVKMMDKQRKAMLKTYDTK-GG 483 (513) Q Consensus 422 ~f~~~~p~d~~e~a~~~~kl---------~g~iS~et~~~~l------~~v~--D~~~E~~ri~~E~~~~~~~~~~~-~~ 483 (513) +|++--..++.|.|+.-.|- .|+|+...+..+| ++.. |...+=-..... .++....+ .+ T Consensus 457 ~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~---~~~~~~~~~~~ 533 (695) T protein:vir:36 457 QWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADD---DIDGVLTYVQR 533 (695) T ss_pred EeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccc---hhhhhHhhhcC Confidence 99999999999999875542 3566655555553 2211 100000000000 00000000 00 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 484 LIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) .... ++.+...++.++...|.+=..-+ T Consensus 534 ~~~~---~~~~~~~~~~~g~~~~~~v~~~~ 560 (695) T protein:vir:36 534 LAEG---GDTGAPGGARAGATAPPTVANVN 560 (695) T ss_pred cccc---cccCCCCcccccccCCCcccccc Confidence 0111 11111111222222222111111 No 181 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=96.58 E-value=0.00049 Score=38.72 Aligned_cols=439 Identities=9% Similarity=0.010 Sum_probs=187.6 Q ss_pred CCcccCCHHHHHHHH----HHHHHHHHH---HHHHHHHH---hcCCCccccccccccCCCCCCcceeecchhHHHHHHHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFI----RHHYNNQRP---RLEMLYDY---YRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQT 82 (513) Q Consensus 13 ~~~~~~~~~~i~~~i----~~~~~~~~~---~~~~~~~Y---Y~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~ 82 (513) |..+ + +.+.+-| +.....|.+ +.+.+.+| |.|.-....... ...-.+.+.++..+-....++..+ T Consensus 1 m~~d--~-~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~--~~~~~~~~~~~~dstg~~a~~~LA 75 (549) T protein:vir:10 1 MTND--D-AKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPD--SEKGRERSQKMFDSTAPLALRNFV 75 (549) T ss_pred CCcc--h-HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCC--CCcccccccccccchHHHHHHHHH Confidence 3221 2 2222222 232333333 44455555 222211101111 111122345677788888899988 Q ss_pred HHhhcC--Ce-----eecCCcH-----HH-----------HHHHH--HhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCc Q lcl|NC_019916. 83 SYSVGN--AI-----AMSGPSS-----DR-----------LDDFN--RRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQK 137 (513) Q Consensus 83 ~~l~g~--p~-----~~~~~~~-----~~-----------l~~~~--~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~ 137 (513) +.|++. |+ ++...++ .. +...+ ...+|.....++.++..++|.|.+++-.+..+. T Consensus 76 s~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~ 155 (549) T protein:vir:10 76 AAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKG 155 (549) T ss_pred HHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCe Confidence 888653 21 2222221 11 11211 246788888999999999999887765544332 Q ss_pred eeEEEEEcccceEEEecCCCCcceEEEEEEEeecccc--------c--------ccceeEEEEEEEcC----Cc------ Q lcl|NC_019916. 138 GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV--------D--------NITQTKYEVETWTE----ND------ 191 (513) Q Consensus 138 ~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~--------~--------~~~~~~~~ve~yt~----~~------ 191 (513) +.+..-|..-+.+--|. .+++...+|.++..... . ........+++|+. .. T Consensus 156 --~~f~~~pl~~~~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~ 232 (549) T protein:vir:10 156 --IVYRNVPMQRLWFAENN-SGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKL 232 (549) T ss_pred --eEEEEEEcCeEEEeeCC-CCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCcccc Confidence 33333455555554443 34556666554331100 0 00011123343321 00 Q ss_pred --------EEEEEeeccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019916. 192 --------YTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLN 258 (513) Q Consensus 192 --------~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~ 258 (513) .+++. .+++. ...+..|..+|++.++ ++.+|+|-.++..+-+..+|.+.-......+... T Consensus 233 ~~~~~pf~sv~~e--~~~~~-----il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~ 305 (549) T protein:vir:10 233 DGRNMQFASYWLD--EGRDR-----IVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLV 305 (549) T ss_pred ccccCceEEEEEE--ecCCE-----eeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11111 11111 1112345567877664 3467999999999999999999888888999898 Q ss_pred hhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHH Q lcl|NC_019916. 259 EAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGT 338 (513) Q Consensus 259 ~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 338 (513) +|.+.+--.+..... ++. .++.. ....+.+.+..+..+....+.... T Consensus 306 ~p~~~v~~~g~~~~~-----------------~l~-----------pgg~~-----~~~~~~~~~~~~~pl~~~~~~~~~ 352 (549) T protein:vir:10 306 DPPLLANEDGVLDGF-----------------DLR-----------SGALN-----WGGLNDKGEEMVKPLLTGKQAQIG 352 (549) T ss_pred cCceeeccccccccc-----------------eec-----------cCCcc-----ccccCCCCccceeeeccccchhHH Confidence 888765321100000 000 00000 000111223345555555565666 Q ss_pred HHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHhccc Q lcl|NC_019916. 339 ELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGL--------NQRYTVVAHIEERVNG 410 (513) Q Consensus 339 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l--------~~~~~li~~~l~~~~~ 410 (513) ...++.++..|-..-....+....-+...++.-++.. +.+++..+|..+ .-+++-++.++...+. T Consensus 353 ~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~ 425 (549) T protein:vir:10 353 IEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQR-------AQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQ 425 (549) T ss_pred HHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 7777777776644322111111111233455544433 233333333333 2333334444444332 Q ss_pred cccc------ccceeeEEeCCCCCcCH-HHH-------HHHHHHHhcC-------CCHHHHHHhC---CCCC-C---HHH Q lcl|NC_019916. 411 KWDI------DPDEIGFIFRDNLPTDD-VAI-------ITALVQAGAQ-------IPQEYLYQYL---PNVT-D---ADE 462 (513) Q Consensus 411 ~~~~------~~~~i~i~f~~~~p~d~-~e~-------a~~~~kl~g~-------iS~et~~~~l---~~v~-D---~~~ 462 (513) -..+ ....++|++..++-+.. .+. ++.+..++++ +....++..+ -+|+ . .++ T Consensus 426 lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~e 505 (549) T protein:vir:10 426 LPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDE 505 (549) T ss_pred CCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHH Confidence 1111 22345666654444311 111 1222222222 2222333222 1222 1 356 Q ss_pred HHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 463 IVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEP 506 (513) Q Consensus 463 E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (513) |++.+.+++++.+...+.........+...+..+.....+..-. T Consensus 506 ev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 506 ELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDAQTAAQTARV 549 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCcccCC Confidence 66666655444333222111111111111111111111111111 No 182 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=432 Identities=12% Similarity=0.104 Sum_probs=175.7 Q ss_pred Cccchhhceecc-----------CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee Q lcl|NC_019916. 1 MIDMQQANMNYQ-----------EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA 69 (513) Q Consensus 1 ~~~~~~~~~~~~-----------~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri 69 (513) +-.|--|-..-+ .+....++..=..+ +.-++-....++.| .+|.|..-+-+. .--.| T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l-~~~~~~~F~Gy~----------~la~l 119 (695) T protein:vir:78 52 LNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAA-SYALDFNGTSMDAL-SFVTSSGFPGFP----------TLVLL 119 (695) T ss_pred ccccccccccCCCcccccceeceeccccCCccccchh-hhhhcccccccccc-hhhhccCcchHH----------HHHHH Confidence 111111111111 11111111111100 00011001111111 122221100000 00001 Q ss_pred -ecchhHHHHHHHHHHhhcCCeeec-------------------C----CcHHHHHHHHHhcCHHHHHHHHHHHHhhCCe Q lcl|NC_019916. 70 -VHSFARYIADFQTSYSVGNAIAMS-------------------G----PSSDRLDDFNRRNDIDTLNYELYLDMTVTGR 125 (513) Q Consensus 70 -~~n~~~~ivd~~~~~l~g~p~~~~-------------------~----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~ 125 (513) .++-.+.++.+.+..+.-+-+... . +..+.|..-++.-++...+.++.+.+-.||. T Consensus 120 aQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGG 199 (695) T protein:vir:78 120 AQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGR 199 (695) T ss_pred hhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 023333444454444432211111 0 1123466666677888899999999999999 Q ss_pred EEEEeeecCCCce---eE--------------EEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEc Q lcl|NC_019916. 126 AYEYVYRDPSQKG---EV--------------SVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT 188 (513) Q Consensus 126 ~~~~v~~d~~~~~---~~--------------~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt 188 (513) +..++-.+.++.. ++ ...++|..+.|-.-+. ..|+. .+...-..|+| - T Consensus 200 a~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~--~dP~s----------pdfgkP~~y~V--~- 264 (695) T protein:vir:78 200 AHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS--INPVA----------DDFYKPSTWWM--I- 264 (695) T ss_pred eEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhh--ccchh----------hccCCCceEEE--e- Confidence 9877766543321 11 0112222222210000 00000 00001111111 0 Q ss_pred CCcEEEEEeeccCCccccccccccccCcccceEEe---cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhee Q lcl|NC_019916. 189 ENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEY---RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIK 265 (513) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~---~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~ 265 (513) ...++.-+. +.+...|+-.. .++..|.|..+.+.+-+++.+++.-..+.-+..+....+. + T Consensus 265 G~kIH~SRL---------------~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~ 328 (695) T protein:vir:78 265 GTEVHATRL---------------HTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-M 328 (695) T ss_pred ceEEeeeeE---------------EEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHH-H Confidence 011111100 01111111110 1233577888888888888887776666655433333221 1 Q ss_pred cCcccccccccccccccchhhhhhhccccccchhhhcchhcc-eeeccccccccccccCCceeEEeecCCHHHHHHHHHH Q lcl|NC_019916. 266 GDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQAN-MILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKR 344 (513) Q Consensus 266 G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 344 (513) ++..... .... ..+.. ....+..++.+. ++.+ ++.+=+|.+++.+.+.+...+.+ T Consensus 329 dla~~L~---------~g~~----~~l~~-R~eli~~~Rsn~G~~ll----------Dk~~Eefeq~stslSGLddVi~q 384 (695) T protein:vir:78 329 DLAQALM---------PGAN----VDLSM-RAELINRYRDNRNILFL----------DKATEEFFQFNTPLSGLDALQAQ 384 (695) T ss_pred HHHHhhc---------ChhH----HHHHH-HHHHHHHhcCccceEEE----------ecCCcceEEEecccCCHHHHHHH Confidence 1110000 0000 00000 111122222222 2222 22344778888999999999999 Q ss_pred HHHHHHHHhCccccccccc--cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeE Q lcl|NC_019916. 345 LAADIHKFSHTPDLTDDNF--SG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGF 421 (513) Q Consensus 345 l~~~i~~~s~~p~~~~~~~--~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i 421 (513) ..+.|+..+++|-.-+-.. +| |.||+.=..-|...+ ....+..+...+++++.+|.. ...+. .+. ++.+ T Consensus 385 f~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~r--S~~G~---idp-di~~ 456 (695) T protein:vir:78 385 AQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQL--SLFGA---VDP-SIKW 456 (695) T ss_pred HHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH--HhcCC---CCC-cceE Confidence 9999999999996543322 23 688987544444444 245578899999998887642 22222 222 5789 Q ss_pred EeCCCCCcCHHHHHHHHHHH---------hcCCCHHHHHHhC------CCCC--CHHHHHHHHHHHHHHHHHHhhhh-cC Q lcl|NC_019916. 422 IFRDNLPTDDVAIITALVQA---------GAQIPQEYLYQYL------PNVT--DADEIVKMMDKQRKAMLKTYDTK-GG 483 (513) Q Consensus 422 ~f~~~~p~d~~e~a~~~~kl---------~g~iS~et~~~~l------~~v~--D~~~E~~ri~~E~~~~~~~~~~~-~~ 483 (513) +|+|--..++.|.|+.-.|- .|+|+...+..+| ++.. |...+=-..... .++....+ .+ T Consensus 457 ~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~---~~~~~~~~~~~ 533 (695) T protein:vir:78 457 QWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADD---DIDGVLTYVQR 533 (695) T ss_pred EeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccc---hhhhhHhhhcC Confidence 99999999999999875542 3566655555553 2211 100000000000 00000000 00 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 484 LIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ....+ +.+...+..++...|.+=..-+ T Consensus 534 ~~~~~---~~~~~~~~~~g~~~~~~~~~~~ 560 (695) T protein:vir:78 534 LAEGG---DTGAPGGARAGATAPPTVANVN 560 (695) T ss_pred ccccc---ccCCCCCCCCCCCCCCceeeee Confidence 00000 0111111222222221111111 No 183 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=96.56 E-value=0.00051 Score=38.62 Aligned_cols=392 Identities=11% Similarity=0.052 Sum_probs=153.6 Q ss_pred HHHHHH--HHHHHHHH-HHHHHHH-hcCCCccccccccccCCCC-CCcceeecchhHHHHHHHHHHhhcCCeeecC---C Q lcl|NC_019916. 24 AAFIRH--HYNNQRPR-LEMLYDY-YRGQNDGILSPASRRNEKG-KADHRAVHSFARYIADFQTSYSVGNAIAMSG---P 95 (513) Q Consensus 24 ~~~i~~--~~~~~~~~-~~~~~~Y-Y~G~~~i~~~~~~~~~~~~-~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~---~ 95 (513) ..++.+ ......+. ...+... ..+-.+ +..+....... -...-+........|+..+.-+-.-|+.+-. . T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~ 78 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSKLYD--FSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 78 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhccccccccc--ccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeecccc Confidence 222221 00000111 1111110 111111 01110000000 0111123344555677777767777887521 1 Q ss_pred cHHHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEee Q lcl|NC_019916. 96 SSDRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAV 170 (513) Q Consensus 96 ~~~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~ 170 (513) .+..+..++. . |. -......+..+++.+|.||+++..+..|.+.-.+.++|..+-+..++.. .. + +|.. T Consensus 79 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~-~----~y~~ 152 (412) T protein:vir:26 79 VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RE-L----YYSI 152 (412) T ss_pred ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cE-E----EEEE Confidence 2223444442 2 43 2344566888999999999999988888765556678888877776543 11 1 1221 Q ss_pred cccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_019916. 171 QTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDT 250 (513) Q Consensus 171 ~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~ 250 (513) ....+ ....+.++.+.+++.. +.. +.-.|.|.++-+...++..+.+... T Consensus 153 ~~~~g-------~~~~~~~~evih~~~~--------------~~~---------~~~~G~s~i~~~~~~i~~~~a~~~~- 201 (412) T protein:vir:26 153 HAATG-------NKLIVHNMDMLHFKHI--------------VAS---------NMVQGISPIDVLKNTTDFDNAVRTF- 201 (412) T ss_pred EcCCc-------eEEEEccccEEEeCCC--------------CCC---------CCcccccHHHHHHHHHHHHHHHHHH- Confidence 11110 0113455555554311 011 1124666666555544433222111 Q ss_pred HHHHHHhhhhh-hheecCcccccccccccccccchhhhhhhccccccchhhhcc--hhcceeeccccccccccccCCcee Q lcl|NC_019916. 251 ANYMTDLNEAM-LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM--RQANMILLKTGMAPNGQQTSADAN 327 (513) Q Consensus 251 ~~~~~~~~~~~-l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 327 (513) .+..+..+- .+++.. ....+ +....+.. .++.. ..++++.+ ..+.++. T Consensus 202 --~~~~~~~~~~~i~~~~----------~~l~~----e~~~~~~~----~~~~~~~~~g~~~vl---------~~g~~~~ 252 (412) T protein:vir:26 202 --NLTEMQKPDSFMLKYG----------SNVGK----EKRQQVLE----DFKQYYEENGGILFQ---------EPGVEIE 252 (412) T ss_pred --HHHhcCCCCceEEecC----------CCCCH----HHHHHHHH----HHHHHhhcCCCeeec---------CCCceEE Confidence 111111111 111110 00000 00111110 11110 01112222 2233344 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 328 YIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEER 407 (513) Q Consensus 328 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~ 407 (513) .++.......+....+..++.|+..-++|+.-.+... +.+...++... ...+...+...++.|..-+.. T Consensus 253 ~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~-~~~~sn~e~~~----------~~f~~~~l~P~~~~ie~~ln~ 321 (412) T protein:vir:26 253 PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARS-NTNFAKNEELN----------RFYLQHTLLPIVKQYEEEFNR 321 (412) T ss_pred EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHh Confidence 4433333445566667778899999999976554321 11111111111 112222344444444333322 Q ss_pred cc-cccc-cccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcC Q lcl|NC_019916. 408 VN-GKWD-IDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGG 483 (513) Q Consensus 408 ~~-~~~~-~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~ 483 (513) .- ...+ .....+++.+..-+..|..+.++++.++ +|+++.-.+.+.++.-.-+ ...++.- .....+. + T Consensus 322 kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~--ggD~~~~-----~~n~~~~-~ 393 (412) T protein:vir:26 322 KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE--GGDKPLI-----SGDLYPI-D 393 (412) T ss_pred hcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCeeee-----ccccccc-c Confidence 10 0000 1112344444565667899999998887 5788887777776432100 0000000 0000000 0 Q ss_pred CCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 484 LIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~ 502 (513) .....+....|++++.+++ T Consensus 394 ~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 394 TPLELRKSLKGGDKNVNES 412 (412) T ss_pred cchhhcccccCCCCCcCCC Confidence 0000011111111111111 No 184 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=96.40 E-value=0.00066 Score=38.01 Aligned_cols=396 Identities=11% Similarity=0.024 Sum_probs=152.3 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCcceeecchhHHHHHHHHHHhhcCCee Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KADHRAVHSFARYIADFQTSYSVGNAIA 91 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~ri~~n~~~~ivd~~~~~l~g~p~~ 91 (513) |-+.++....-..+++.-.... -.+-.+. ..+....... ....-+.+......|+..++-+-.-|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~--~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~ 68 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQS----------TSKLYDF--SPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLK 68 (409) T ss_pred CCccchhhhhhhhhhhhhhccc----------ccccccc--ccccCccccccchhhhhccHHHHHHHHHHHHhhhhCcee Confidence 2222221111111111110000 0011110 0000000000 0011123344555677777766677887 Q ss_pred ecCC---cHHHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEE Q lcl|NC_019916. 92 MSGP---SSDRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIM 163 (513) Q Consensus 92 ~~~~---~~~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~ 163 (513) +-.. .+..+..++. . |. -......+..+++.+|.||+++..+..|.+.-.+.++|..+-+..++.. ..+ T Consensus 69 ~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~-~~~-- 145 (409) T protein:vir:93 69 MYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-REL-- 145 (409) T ss_pred EeeccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cEE-- Confidence 5221 1233444442 2 42 3344567788999999999999888888765556678888777665432 111 Q ss_pred EEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHH Q lcl|NC_019916. 164 AVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLY 243 (513) Q Consensus 164 ~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~ 243 (513) +|......+ . . ..+.++.+.+++.. ++.. .-.|.|.++.+...++.. T Consensus 146 ---~y~~~~~~g--~----~-~~~~~~eVih~r~~--------------~~~~---------~~~G~s~i~~~~~~i~~~ 192 (409) T protein:vir:93 146 ---YYSIHAATG--N----K-LIVHNMDMLHFKHI--------------VASN---------MVQGISPIDVLKNTTDFD 192 (409) T ss_pred ---EEEEEcCCc--e----E-EEEccccEEEeCCC--------------CCCC---------ccccccHHHHHHHHHHHH Confidence 122111110 0 1 12344444443210 0111 114666666555555433 Q ss_pred HHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccC Q lcl|NC_019916. 244 DVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTS 323 (513) Q Consensus 244 ~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (513) +.+... .+..+..+--.+.-. .... ..+....+...-..... ..++++.+ ..+ T Consensus 193 ~~~~~~---~~~~~~~~~~~i~~~----------~~~l---~~e~~~~~~~~~~~~~~--~~g~~~vl---------~~g 245 (409) T protein:vir:93 193 NAVRTF---NLTEMQKPDSFMLKY----------GSNV---GKEKRQQVLEDFKQYYE--ENGGILFQ---------EPG 245 (409) T ss_pred HHHHHH---HHHhcCCCCceEEec----------CCCC---CHHHHHHHHHHHHHHhh--cCCCeeec---------CCC Confidence 222111 122222211111000 0000 00111111100000000 01122222 223 Q ss_pred CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 324 ADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAH 403 (513) Q Consensus 324 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~ 403 (513) .++..++.......+....+...+.|+..-++|+...+... +.+...++... ...+...|..+++.|.. T Consensus 246 ~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~-~~~~sn~e~~~----------~~f~~~~l~P~~~~ie~ 314 (409) T protein:vir:93 246 VEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARS-NTNFAKNEELN----------RFYLQHTLLPIVKQYEE 314 (409) T ss_pred ceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CCCcccHHHHH----------HHHHHHHHHHHHHHHHH Confidence 34444433333445566667778899999999976654322 21211121111 11222334444444433 Q ss_pred HHHhc-cccccc-ccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019916. 404 IEERV-NGKWDI-DPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYD 479 (513) Q Consensus 404 ~l~~~-~~~~~~-~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~ 479 (513) -+... -..... ....+++.+..-+-.|..+.++++.++ +|+++.-.+.+.++.-.-+ ..+...- ..... T Consensus 315 ~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~--ggD~~~~-----~~n~~ 387 (409) T protein:vir:93 315 EFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE--GGDKPLI-----SGDLY 387 (409) T ss_pred HHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCeeee-----ccccc Confidence 33221 111011 112233333455556888889988886 5788877776666432100 0000000 00000 Q ss_pred hhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 480 TKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) +.. .....+....+++.+.+++ T Consensus 388 ~~~-~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 388 PID-TPLELRKSLKGGDKNVNES 409 (409) T ss_pred ccc-cchhhcccccCCCCCcCCC Confidence 000 0011111111111111121 No 185 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=96.28 E-value=0.00078 Score=37.61 Aligned_cols=394 Identities=9% Similarity=-0.014 Sum_probs=162.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC---Cc- Q lcl|NC_019916. 21 TRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG---PS- 96 (513) Q Consensus 21 ~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~---~~- 96 (513) -.+..+..++..........+...+-+..... . ....-...=+.+......|+..+.-+-+-|+++-. +. T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~---~---g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~ 74 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTY---T---GKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLK 74 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccc---c---CceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc Confidence 11111111110011111112222222211100 0 00000011122344455677777777777876421 11 Q ss_pred ----HHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEE Q lcl|NC_019916. 97 ----SDRLDDFNRR--N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRY 167 (513) Q Consensus 97 ----~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~ 167 (513) +..+..++.. | ........+..+.+.+|.||+++..+ .|.+.-.+.++|..+.+..+... .+.+ T Consensus 75 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~--~~~y---- 147 (413) T protein:vir:48 75 TRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQW--QPVY---- 147 (413) T ss_pred eeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCc--eEEE---- Confidence 1234444432 2 23456667888999999999888765 45554455678888877776432 2221 Q ss_pred EeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHH Q lcl|NC_019916. 168 HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQ 247 (513) Q Consensus 168 ~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~ 247 (513) ......+ ....+..+.+++++... + +...|.|-++.+...++....+. T Consensus 148 -~~~~~~g-------~~~~~~~~evih~~~~~---------------~---------d~~~G~s~i~~~~~~i~~~~~~~ 195 (413) T protein:vir:48 148 -QVTFPDG-------SVDVLTQDEIWHVRTLT---------------L---------DGLVGLNPIAYAREAISLAAATE 195 (413) T ss_pred -EEEecCc-------eEEEEccccEEEecCcC---------------C---------CCcccccHHHHHHHHHHHHHHHH Confidence 1111111 11234555555543210 0 11246676666666666555444 Q ss_pred HHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc-chhcceeeccccccccccccCCce Q lcl|NC_019916. 248 SDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA-MRQANMILLKTGMAPNGQQTSADA 326 (513) Q Consensus 248 S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 326 (513) .-..+.+...+.|-.+++...... ..........+. ..... ...++++.+ ..+.++ T Consensus 196 ~~~~~~~~ng~~p~gil~~~~~~~----------~e~~~~~~~~~~----~~~~g~~n~g~~~vl---------~~g~~~ 252 (413) T protein:vir:48 196 EHGARLFGNGAVTSGVLRTEQKLT----------PDAYERLKKDFE----ERHTGLGNAHRPMIL---------EMGLDW 252 (413) T ss_pred HHHHHHHhccCCcceEEEeCCCCC----------HHHHHHHHHHHH----HHhcCccccCcceec---------CCCceE Confidence 444444444444555544321100 000011100000 00000 001112222 223344 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 327 NYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GN-SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 327 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) +-+........+....+.....|+..-++|+...+... ++ .+..... ...+...+.-+++.+..- T Consensus 253 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~-------------~~f~~~~i~P~~~~ie~~ 319 (413) T protein:vir:48 253 KSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG-------------LGFINYSLVPYLTRIEQR 319 (413) T ss_pred EeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH-------------HHHHHHHHHHHHHHHHHH Confidence 44443334445567778888999999999986554321 11 1111111 111222333333333333 Q ss_pred HHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019916. 405 EERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTK 481 (513) Q Consensus 405 l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~ 481 (513) +...- .........+++.+...+-.|..+.++++.++ +|+++.-.+.++++.-.- .... ....+. T Consensus 320 l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~--~ggD----------~~~~~~ 387 (413) T protein:vir:48 320 INTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPR--PGGD----------VYLTPM 387 (413) T ss_pred HHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcc----------eeeccc Confidence 32210 01111122344555555566888899998886 578887766666643110 0000 000000 Q ss_pred cCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 482 GGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) . -.......++..+..++.++++.++ T Consensus 388 n-----~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 388 N-----MTTSPSAGDDNGKKKESGDADKTAS 413 (413) T ss_pred c-----ccccccccccCCCCCCCCCccccCC Confidence 0 0000111111111111112222222 No 186 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=96.26 E-value=0.00082 Score=37.51 Aligned_cols=391 Identities=11% Similarity=0.022 Sum_probs=154.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--------ccccc-----c-ccCCCCC---CcceeecchhHHHHHH Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDG--------ILSPA-----S-RRNEKGK---ADHRAVHSFARYIADF 80 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i--------~~~~~-----~-~~~~~~~---~~~ri~~n~~~~ivd~ 80 (513) +-++...-++. +++..+....+. ..... . .....+. +..=+.+.-....|+. T Consensus 1 ~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~ 69 (432) T protein:vir:10 1 MPDEKKLGLLG-----------QLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKL 69 (432) T ss_pred CCCCcccchhh-----------hhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHH Confidence 22222222222 112222111000 00000 0 0000000 0111223444457777 Q ss_pred HHHHhhcCCeeec---CCc-----HHHHHHHH-Hh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 81 QTSYSVGNAIAMS---GPS-----SDRLDDFN-RR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 81 ~~~~l~g~p~~~~---~~~-----~~~l~~~~-~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) .++-+-+-|+.+- .+. +..+..++ .. |. .......+..+++.+|.||+++..+ +|.+.-.+.++|. T Consensus 70 Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~ 148 (432) T protein:vir:10 70 VSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLAND 148 (432) T ss_pred HHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCC Confidence 7777777787641 111 12233333 22 33 2345566788999999999888775 4665545568888 Q ss_pred ceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE 227 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 227 (513) .+.++.+... .+. |.....++ ....+..+.+++++.. .+ +.. T Consensus 149 ~v~v~~~~~g--~~~-----y~~~~~~g-------~~~~~~~~~iih~~~~---------------~~---------dg~ 190 (432) T protein:vir:10 149 RLTITTDTKG--NTA-----YRYRRTDG-------QMIDIPKQQIWKIMGY---------------SL---------DGE 190 (432) T ss_pred ceEEEEcCCC--cEE-----EEEEecCc-------eEEEEcCccEEEecCC---------------CC---------CCc Confidence 8888776532 221 11111111 0112344444433210 00 111 Q ss_pred CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-c Q lcl|NC_019916. 228 YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-A 306 (513) Q Consensus 228 ~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~ 306 (513) .|.|-++.+...++.......-..+.+...+.|-.+++.... ............ +..... + T Consensus 191 ~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~----------l~~e~~~~~~~~--------~~~~~nag 252 (432) T protein:vir:10 191 NGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF----------LTDDQYDSFAKK--------VSGSVEAG 252 (432) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCC----------CCHHHHHHHHHH--------HhhhhhCC Confidence 356655554444443332222222333333334444442111 011111111111 111111 2 Q ss_pred ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc--ccHHHHHHHHHHHHHHHH Q lcl|NC_019916. 307 NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN--SSGVAMKYKVLGTVELAS 384 (513) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n--~Sg~Ai~~~~~~l~~k~~ 384 (513) +++.+ ..+.+++.++.......+....+.....|+..-++|+...+....+ ..|..++-... T Consensus 253 ~~~vl---------~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~------- 316 (432) T protein:vir:10 253 RAPLL---------EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL------- 316 (432) T ss_pred Cceec---------CCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHH------- Confidence 22332 2233444444444445566677888899999999998765433221 12233322111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh-cccccccccceeeEEe--CCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--C Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEER-VNGKWDIDPDEIGFIF--RDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--V 457 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~-~~~~~~~~~~~i~i~f--~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v 457 (513) ..+...+...++.+..-+.. .-.... .....+.| ..-+-.|..+.++++.++ +|+++.-.+.++++. + T Consensus 317 ---~f~~~tl~P~~~~ie~~ln~kL~~~~~--~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi 391 (432) T protein:vir:10 317 ---GFLSMTLSPWLRRIEQSIALNLLSPAE--RRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKL 391 (432) T ss_pred ---HHHHHHHHHHHHHHHHHHHhhhcCccc--cCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Confidence 11222333333333222221 111111 11234455 455567888999988886 578888777777643 2 Q ss_pred CCHHHHHHHHHHHHHHHHH-HhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 458 TDADEIVKMMDKQRKAMLK-TYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 458 ~D~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) ++-. .+ +- +. ...++... +...+.++......+.+++-+. T Consensus 392 ~g~~-~~--~~------~~~~~~pl~~~------~~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 392 GGNA-AV--LT------VQSAMVPLDSI------GLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCCc-ce--Ee------ecCcccchhhh------cccCCCCCCCCCCCcccccccC Confidence 2100 00 00 00 00000000 0000000000000001111111 No 187 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=96.17 E-value=0.00092 Score=37.22 Aligned_cols=390 Identities=11% Similarity=0.071 Sum_probs=151.5 Q ss_pred CCHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCcceeecchhHHHHHHHHHHhhcCCeeecC- Q lcl|NC_019916. 18 LTPTRIAAFIRHH-YNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KADHRAVHSFARYIADFQTSYSVGNAIAMSG- 94 (513) Q Consensus 18 ~~~~~i~~~i~~~-~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~- 94 (513) |.++.|..-|... +.... ..-+.+-.+ +..+....... -...-+.+.-....|+..++-+-.-|+++-. T Consensus 1 ~~~~~~~~~~k~~~~~~~~------~~~~~~~~~--~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~ 72 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWI------DQSASKLYD--FSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYED 72 (409) T ss_pred CcccccchhhhhHHhhhhh------cCCcccccc--cccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeec Confidence 2222222211111 00000 000111111 00000000000 0001122344455667776666677877521 Q ss_pred --CcHHHHHHHHH--hcC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEE Q lcl|NC_019916. 95 --PSSDRLDDFNR--RND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRY 167 (513) Q Consensus 95 --~~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~ 167 (513) ..+..+..++. -|. -......+..+++.+|.||+++..+..|.+.-.+.++|..+-+..++.. .. + + T Consensus 73 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~-~----~ 146 (409) T protein:vir:94 73 YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RE-L----Y 146 (409) T ss_pred ccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cE-E----E Confidence 12223444442 243 2344556788999999999999888888765556678888877766532 11 1 1 Q ss_pred EeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHH Q lcl|NC_019916. 168 HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQ 247 (513) Q Consensus 168 ~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~ 247 (513) |......+ . .+ .+..+.+.+++.. ++. +.-.|.|.+..+...++..+.+. T Consensus 147 y~~~~~~g---~---~~-~~~~~dvih~r~~--------------~~~---------~~~~G~s~l~~~~~~i~~~~~~~ 196 (409) T protein:vir:94 147 YSIHAATG---N---KL-IVHNMDMLHFKHI--------------VAS---------NMVQGISPIDVLKNTTDFDNAVR 196 (409) T ss_pred EEEEcCCc---e---EE-EEccccEEEecCC--------------CCC---------CccccccHHHHHHHHHHHHHHHH Confidence 22111110 0 01 2334444443210 011 11146666665555554333221 Q ss_pred HHHHHHHHHhhhhh-hheecCcccccccccccccccchhhhhhhccccccchhhhcc--hhcceeeccccccccccccCC Q lcl|NC_019916. 248 SDTANYMTDLNEAM-LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM--RQANMILLKTGMAPNGQQTSA 324 (513) Q Consensus 248 S~~~~~~~~~~~~~-l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 324 (513) .- .+..++.+- .+++.. ....... ...++. .++.. ..++++.+ ..+. T Consensus 197 ~~---~~~~~~~~~~~i~~~~----------~~l~~e~----~~~~~~----~~~~~~~~~g~~~vl---------~~g~ 246 (409) T protein:vir:94 197 TF---NLTEMQKPDSFMLKYG----------SNVGKEK----RQQVLE----DFKQYYEENGGILFQ---------EPGV 246 (409) T ss_pred HH---HHHhcCCCCeeEEecC----------CCCCHHH----HHHHHH----HHHHHhhcCCCeeec---------CCCc Confidence 11 122222211 111100 0000000 011100 11110 01112222 2233 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 325 DANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 325 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) ++..++.......+....+...+.|+..-++|+...+.. ++.+...++-... ..+...+..+++.|..- T Consensus 247 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~~~~sn~e~~~~----------~f~~~~l~P~~~~ie~~ 315 (409) T protein:vir:94 247 EIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNAR-SNTNFAKNEELNR----------FYLQHTLLPIVKQYEEE 315 (409) T ss_pred eEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCCcccHHHHHH----------HHHHHHHHHHHHHHHHH Confidence 444444333344556667777889999999997655422 2222222221111 11222344443333332 Q ss_pred HHhcc-ccccc-ccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 405 EERVN-GKWDI-DPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTY 478 (513) Q Consensus 405 l~~~~-~~~~~-~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~ 478 (513) +...- ..... ....+++....-+-.|..+.++++.++ +|+++.-.+.+.++. +++-+.=+- .... T Consensus 316 ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~---------~~n~ 386 (409) T protein:vir:94 316 FNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLI---------SGDL 386 (409) T ss_pred HHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEee---------cccc Confidence 22210 11011 112233333455567888899988887 678877666666543 221000000 0000 Q ss_pred hhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 479 DTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) .+.. .....+....|+++..+++ T Consensus 387 ~~~~-~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 387 YPID-TPLELRKSLKGGDKNVNES 409 (409) T ss_pred cccc-cchhhcccccCCCCCcCCC Confidence 0000 0111111111222222221 No 188 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=96.15 E-value=0.00094 Score=37.17 Aligned_cols=376 Identities=9% Similarity=0.028 Sum_probs=145.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCC-CcceeecchhHHHHHHHHHHhhcCCeeecCCcHHH-HH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGK-ADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDR-LD 101 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~-~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~-l~ 101 (513) ..+.+.........-.....+..-......... ...... ...-+.++-...+|+..+.-+-+-|+++....... +. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~ 78 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTL--NGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQLQGIID 78 (386) T ss_pred Ccccccccccccccccccccccccccchhcccc--cCCceechhhhhcchHHHHHHHHHHHhhccCceeeccchhHHHhh Confidence 111111100000000000000000000000000 000000 00001223344566666666666777754332222 11 Q ss_pred HHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeE Q lcl|NC_019916. 102 DFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTK 181 (513) Q Consensus 102 ~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~ 181 (513) .-............+..+.+.+|.||+.+-.+.+|.+.-.+.++|..+-+..+... .. + +|........ . T Consensus 79 ~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~-~~-~----~y~~~~~~~~----~ 148 (386) T protein:vir:48 79 NPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNK-DG-I----YYNITFDDPR----I 148 (386) T ss_pred cCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCC-ce-E----EEEEEecCcc----c Confidence 11112233455667888999999999998888888765555678888777665432 11 1 1211110000 0 Q ss_pred EEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019916. 182 YEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAM 261 (513) Q Consensus 182 ~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~ 261 (513) .....+..+.+++++.. ++.+ .-.|.|.++.+...+.....+..-....+...+.|- T Consensus 149 ~~~~~~~~~evih~~~~--------------~~~~---------~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~ 205 (386) T protein:vir:48 149 PPKQHVPQGDVLHFKLL--------------SVDG---------GLTSVSPLMALSRELNIQKASDKLTLNSLKNALNAN 205 (386) T ss_pred cceeEecCccEEEecCC--------------CCCC---------ceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcc Confidence 01123344444443211 0011 114667666665555544444444444444444455 Q ss_pred hheecCcccccccccccccccchhhhhhhccccccchhhhcch--hcceeeccccccccccccCCceeEEeecCCHHHHH Q lcl|NC_019916. 262 LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR--QANMILLKTGMAPNGQQTSADANYIHKEYDSAGTE 339 (513) Q Consensus 262 l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 339 (513) .+++-..... .+ ....+... +..+. .++.+.+ ..+.+++-++.......+. T Consensus 206 ~ii~~~~~~~-----------~e---~~~~~~~~----~~~~~~n~g~~~vl---------~~g~~~~~l~~~~~d~q~~ 258 (386) T protein:vir:48 206 GILKIKGGGL-----------LD---FKTKLSRS----RQAMKQMQGGPLVL---------DDLEEFTPLEIKSNVSQLL 258 (386) T ss_pred eEEEeCCCCC-----------HH---HHHHHHHH----HHHhhcCCCCceec---------CCCceEEEcCCChhHHHHH Confidence 5554322111 00 01111110 00000 1112222 2233444444333444566 Q ss_pred HHHHHHHHHHHHHhCccccccccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Q lcl|NC_019916. 340 LYKKRLAADIHKFSHTPDLTDDNFSGNS--SGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPD 417 (513) Q Consensus 340 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~--Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~ 417 (513) ...+...+.|+..-++|+.-.+..+.+. ....+.+. ...|..+++.+..-++..=. . T Consensus 259 e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~~~~---------------~~~l~P~~~~ie~~l~~~l~-~----- 317 (386) T protein:vir:48 259 KQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSLDLY---------------NKAVSRYLRPFLSELSQKLS-C----- 317 (386) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHH---------------HHHHHHHHHHHHHHHHHhhc-c----- Confidence 7788888999999999976554222221 12222222 22233333222222211100 0 Q ss_pred eeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCC--CCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCC Q lcl|NC_019916. 418 EIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLP--NVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGND 493 (513) Q Consensus 418 ~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~--~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 493 (513) .+++.+...+-.+....+..+.++ +|+++.-.+.+.++ .+.. .|+... +. .+ ..+.. T Consensus 318 ~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~~~~~~-----------~~-----~~-~~~~~ 378 (386) T protein:vir:48 318 DVDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILP--KELPEG-----------EN-----PN-KTTLK 378 (386) T ss_pred hhhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCC--ccchhh-----------cC-----CC-CCccC Confidence 111222222233445556666665 67888877766553 2221 111100 00 00 00011 Q ss_pred CCCCCCCC Q lcl|NC_019916. 494 PEDEGVRG 501 (513) Q Consensus 494 ~~~~~~~~ 501 (513) +++++.++ T Consensus 379 gGd~~~~~ 386 (386) T protein:vir:48 379 GGEINGED 386 (386) T ss_pred CCCCCCCC Confidence 11111101 No 189 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=96.06 E-value=0.0011 Score=36.91 Aligned_cols=449 Identities=13% Similarity=0.095 Sum_probs=162.7 Q ss_pred CccchhhceeccCCcccC---------CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc-ccCCCCCCc---- Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKL---------TPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS-RRNEKGKAD---- 66 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---------~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~-~~~~~~~~~---- 66 (513) |.-|+.-.-.+.+...++ ..+.|.+.|.--. ...++-+.|- ++....+. +.......+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~-~~~~~~~~g~~~~~~~~~~~~~ 72 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGR-------ASARDTVDGI-DIADGNVAGQYSVASISDVLST 72 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhh-------hhhhcccccc-ccccCCcccccccCccccccCH Confidence 333333222233332222 2222222222110 1112223331 11111110 000000000 Q ss_pred ---cee--ecchhHHHHHHHHHHh-------------hcCCeeecC-C----c-----HHHHHHHHHh--cCH------- Q lcl|NC_019916. 67 ---HRA--VHSFARYIADFQTSYS-------------VGNAIAMSG-P----S-----SDRLDDFNRR--NDI------- 109 (513) Q Consensus 67 ---~ri--~~n~~~~ivd~~~~~l-------------~g~p~~~~~-~----~-----~~~l~~~~~~--n~~------- 109 (513) .+. .....+.+|++.+... .+-|+++.. + . ...+..++.. |.+ T Consensus 73 ~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~ 152 (535) T protein:vir:10 73 KKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTF 152 (535) T ss_pred HHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHH Confidence 010 1122334444333221 233554321 1 1 1224445432 332 Q ss_pred HHHHHHHHHHHhhCC-eEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEc Q lcl|NC_019916. 110 DTLNYELYLDMTVTG-RAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT 188 (513) Q Consensus 110 ~~~~~~~~~~a~~~G-~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt 188 (513) ......+..+++.+| .+|+++..+..|.+.-.+.++|..+.+..+........ +||.... + . . ...+. T Consensus 153 ~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~---~~~~~~~--~--~-~---~~~~~ 221 (535) T protein:vir:10 153 PRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPR---KFEQFVS--E--T-K---SVKFS 221 (535) T ss_pred HHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCce---EEEEEec--C--c-e---eEEEC Confidence 234555667777775 57988888888877666778999888877754322211 1222111 0 0 0 11234 Q ss_pred CCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCc Q lcl|NC_019916. 189 ENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDI 268 (513) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~ 268 (513) .+.+++++..... ... ....|.|.++.+...++....+..-..+.+...+.|-.+++-.. T Consensus 222 ~~eiih~~~~~~~-----------~~~---------~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~ 281 (535) T protein:vir:10 222 ERNLTFINYWNLS-----------DTD---------RRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQ 281 (535) T ss_pred cccEEEEeccCCC-----------Ccc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecC Confidence 4444444321100 000 01136677766666665555444444444444444543433211 Q ss_pred ccccccccccccccchhhhhhhccccccchhhhcchhcceeecc-ccccccccccCCceeEEeec--CCHHHHHHHHHHH Q lcl|NC_019916. 269 DTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLK-TGMAPNGQQTSADANYIHKE--YDSAGTELYKKRL 345 (513) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l 345 (513) ... ..+..+....++.......-... .+..+.. .+.+++|.... .....+....+.. T Consensus 282 ~~~------------------~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl--~~~g~~~~~l~~~~~D~qfle~~~~~ 341 (535) T protein:vir:10 282 DGD------------------AQANQMMLAGIRRQWTSQGSGLGGAWKIPIL--AAKDAKFVNMTQNSRDMEFDKFLNFM 341 (535) T ss_pred CCC------------------cccCHHHHHHHHHHHHHHhcCcccccccccc--cCCCceEEecCCChhHHHHHHHHHHH Confidence 000 00001111111111110000000 0000111 12344554444 3445566667788 Q ss_pred HHHHHHHhCcccccccccc-c---cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeE Q lcl|NC_019916. 346 AADIHKFSHTPDLTDDNFS-G---NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGF 421 (513) Q Consensus 346 ~~~i~~~s~~p~~~~~~~~-~---n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i 421 (513) .+.|+..-++|++..+-.. + |.++.....--..+. .........+|...++.+...++..-- ...+ ..+.+ T Consensus 342 ~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E---~~~~~~~~~~L~P~l~~ie~~ln~~Ll-~~~~-~~~~f 416 (535) T protein:vir:10 342 IYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAK---AKLESSKDKGLTPLLSFIEQVINDKIM-RYVD-TDYRF 416 (535) T ss_pred HHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHH---HHHHHHHHHHHHHHHHHHHHHHhhhcc-cccC-CeEEE Confidence 8899999999986654321 1 111111100001111 111222233444444444443332111 1112 25678 Q ss_pred EeCCCCCcCHHHHHHHHHH-HhcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHH-----HHHHHh-hh---hcCCCCCC- Q lcl|NC_019916. 422 IFRDNLPTDDVAIITALVQ-AGAQIPQEYLYQYLPN--VTDADEIVKMMDKQRK-----AMLKTY-DT---KGGLIING- 488 (513) Q Consensus 422 ~f~~~~p~d~~e~a~~~~k-l~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~-----~~~~~~-~~---~~~~~~~~- 488 (513) .|+.....+.++.+++... .+|+++.-.+.++++. +++-+.-+-.+..+.- ...+.. ++ .+...... T Consensus 417 ~f~~l~~~d~~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 496 (535) T protein:vir:10 417 SFTLGDAQDKLQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERE 496 (535) T ss_pred EeccccccCHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccc Confidence 8888888888777766543 3567787777766532 3211110000000000 000000 00 00000000 Q ss_pred CCC-------CCCCCCCCCCCCCCC---CC-ccCCC Q lcl|NC_019916. 489 TSG-------NDPEDEGVRGQQGEP---ED-ERTSD 513 (513) Q Consensus 489 ~~~-------~~~~~~~~~~~~~~~---~~-~~~~~ 513 (513) +.+ .+.+.+.......+| ++ +..+| T Consensus 497 ~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 532 (535) T protein:vir:10 497 RQERIQHSKDYEKGKDDPKSPLPKPSESDDVSNNED 532 (535) T ss_pred cCcccccccccccCCCCCCCCCCcCCCCCccccccc Confidence 000 000000000000000 00 00011 No 190 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=96.06 E-value=0.0011 Score=36.90 Aligned_cols=394 Identities=12% Similarity=0.059 Sum_probs=150.1 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCcceeecchhHHHHHHHHHHhhcCCee Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KADHRAVHSFARYIADFQTSYSVGNAIA 91 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~ri~~n~~~~ivd~~~~~l~g~p~~ 91 (513) |-+++ ...++++.+ +..-. ..-..+-++ +..+....... -.+.-+...-....|+..+.-+-.-|+. T Consensus 1 ~~~~~-~~~~~k~~~---~~~~~------~~~~~~~~~--~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~ 68 (409) T protein:vir:96 1 MAKEN-IVTRIKKKL---IDNWI------DQSASKLYD--FSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLK 68 (409) T ss_pred Ccccc-chhhhhhHH---hhhhh------ccccccccc--cccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceE Confidence 22111 111111110 00000 000011111 00110000000 0011122344455666666666667776 Q ss_pred ecCC---cHHHHHHHHH--hcC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEE Q lcl|NC_019916. 92 MSGP---SSDRLDDFNR--RND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIM 163 (513) Q Consensus 92 ~~~~---~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~ 163 (513) +-.. .+..+..++. -|. -......+..+++.+|.||+++-.+..|.+.-.+.++|..+-++.++.. .. + T Consensus 69 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~-~- 145 (409) T protein:vir:96 69 MYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RE-L- 145 (409) T ss_pred EeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cE-E- Confidence 5221 1223444442 232 2344567888999999999999888888766566678888777766532 11 1 Q ss_pred EEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHH Q lcl|NC_019916. 164 AVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLY 243 (513) Q Consensus 164 ~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~ 243 (513) +|......+ . ...+.++.+++++.. ++. +.-.|.|.++.+...++.. T Consensus 146 ---~y~~~~~~g---~----~~~~~~~evih~r~~--------------~~~---------~~~~G~s~l~~~~~~i~~~ 192 (409) T protein:vir:96 146 ---YYSIHAATG---N----KLIVHNMDMLHFKHI--------------VAS---------NMVQGISPIDVLKNTTDFD 192 (409) T ss_pred ---EEEEEcCCc---e----EEEEccccEEEeCCC--------------CCC---------CccccccHHHHHHHHHHHH Confidence 121111110 0 112344444443210 001 1124666665555554433 Q ss_pred HHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccC Q lcl|NC_019916. 244 DVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTS 323 (513) Q Consensus 244 ~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (513) +.+.. . .+..++.+--++.-.. ...... ....+...-..... ..++++.+ ..+ T Consensus 193 ~~~~~-~--~~~~~~~~~~~i~~~~---------~~l~~e----~~~~~~~~~~~~~~--n~g~~~vl---------~~g 245 (409) T protein:vir:96 193 NAVRT-F--NLTEMQKPDSFMLKYG---------SNVSTE----KRQQVLEDFKQYYE--ENGGILFQ---------EPG 245 (409) T ss_pred HHHHH-H--HHHhcCCCceeEEecC---------CCCCHH----HHHHHHHHHHHHhh--cCCCeeec---------CCC Confidence 22211 1 1111122111110000 000000 01111100000000 01112222 233 Q ss_pred CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 324 ADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAH 403 (513) Q Consensus 324 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~ 403 (513) .++..++.......+....+...+.|+..-++|+.-.+... +.+...++-. ....+...+..+++.+.. T Consensus 246 ~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~s~~e~~----------~~~f~~~~l~P~~~~ie~ 314 (409) T protein:vir:96 246 VEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARS-NTNFAKNEEL----------NRFYLQHTLLPIVKQYEE 314 (409) T ss_pred ceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CCCcccHHHH----------HHHHHHHHHHHHHHHHHH Confidence 34444444334445566677778899999999976554321 1111111111 112222334444333333 Q ss_pred HHHhcc-cccc-cccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 404 IEERVN-GKWD-IDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 404 ~l~~~~-~~~~-~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~ 477 (513) -+...- .... .....+++....-+-.|..+.++++.++ +|+++.-.+.+.++. +++-+.=+- ... T Consensus 315 ~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~---------~~n 385 (409) T protein:vir:96 315 EFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLI---------SGD 385 (409) T ss_pred HHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeee---------ccc Confidence 332210 0000 1112233333455556888899998887 578887777766643 221100000 000 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) ..+.. .....+....++++..+++ T Consensus 386 ~~~~~-~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 386 LYPID-TPLELRKSLKGGDKNVNES 409 (409) T ss_pred ccccc-cchhhcccccCCCCCcCCC Confidence 00000 0000111111111111111 No 191 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.05 E-value=0.0011 Score=36.85 Aligned_cols=409 Identities=10% Similarity=-0.003 Sum_probs=143.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCC--- Q lcl|NC_019916. 19 TPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGP--- 95 (513) Q Consensus 19 ~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~--- 95 (513) -.+.|.+++.+...........+-++ .|..- ...+..-. ..-...-+..+....+|+..++-+.+-|+.+-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~--~~~~~~~~-~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~ 76 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIKY-IGQTF--TKYDNNGK-TYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDT 76 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHHh-hcccc--CCCccchh-hhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCC Confidence 33444444433221111112222222 22210 00000000 0000011223455567777777777777764211 Q ss_pred ---------------------------------cHHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCC--- Q lcl|NC_019916. 96 ---------------------------------SSDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPS--- 135 (513) Q Consensus 96 ---------------------------------~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~--- 135 (513) .......++.. |. .......+..+.+.+|.||+++..+.. T Consensus 77 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~ 156 (460) T protein:vir:10 77 KAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGIN 156 (460) T ss_pred ccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCcc Confidence 01112223322 22 234455677899999999998887643 Q ss_pred -CceeEEEEEcccceEEEecCCCCcc-eEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccc Q lcl|NC_019916. 136 -QKGEVSVKLDPMECFIIYDRSVNPK-PIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHS 213 (513) Q Consensus 136 -~~~~~~~~~~p~~~~~~~d~~~~~~-~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (513) |.+.-.+.++|..+-+..++..... ....+++|.... + .....+.++.+++++...... T Consensus 157 ~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~--~------g~~~~~~~~evih~r~~~~~~----------- 217 (460) T protein:vir:10 157 AGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQ--G------DQFIEFNEDEVIHTKYANPNF----------- 217 (460) T ss_pred CceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEec--C------ceeEEecccceEEEecCCCCc----------- Confidence 3333345577777776655433111 011111121110 0 001123444444432211000 Q ss_pred cCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccc Q lcl|NC_019916. 214 AQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLA 293 (513) Q Consensus 214 ~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~ 293 (513) .+. .....|.|.++.+...++....+..-..+.+...+.|-.+++-... ..... ...++ T Consensus 218 ~~~-------~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~----------l~~e~----~~~~~ 276 (460) T protein:vir:10 218 DLQ-------GSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTG----------LTQPQ----ADSLK 276 (460) T ss_pred ccc-------cCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCC----------CCHHH----HHHHH Confidence 000 0012456666665555554443333333333333333332221110 00000 01111 Q ss_pred cccchhhhc-chhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc-ccHHH Q lcl|NC_019916. 294 DEKMAQLEA-MRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN-SSGVA 371 (513) Q Consensus 294 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~A 371 (513) ..-...... ...++++.+ ..+.+++-++.......+....+...+.|+..=++|+...+...++ .++.. T Consensus 277 ~~~~~~~~g~~n~g~~~vl---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn 347 (460) T protein:vir:10 277 QRLTEMDKSPDRLSQIAGA---------SGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGN 347 (460) T ss_pred HHHHHHhcCccccCCceec---------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCcccc Confidence 000000000 001122222 2233344444333445566777888899999999997655432221 12222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHH Q lcl|NC_019916. 372 MKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEY 449 (513) Q Consensus 372 i~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et 449 (513) ++.... ..+..+|...++.+..-+...--..........+.|+........+...+..++ +|+++.-. T Consensus 348 ~e~~~~----------~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~NE 417 (460) T protein:vir:10 348 LEEERK----------RVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAMASWLNTIPVTPNE 417 (460) T ss_pred HHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCHHH Confidence 221111 112223333333333222221000000111223444322211121212222222 57777766 Q ss_pred HHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 450 LYQYLPN--VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 450 ~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) +.+.++. +++.- .. ....+...... + ..+++..+..+++.+ T Consensus 418 ~R~~~g~~pi~~~~--gD----------~~~~~~n~~~~------~----~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 418 IRIAMKYETLNQDG--MD----------IVFMPSNKVRI------D----DVSNNLIDSAFNQNQ 460 (460) T ss_pred HHHHhCCCCCCCCC--CC----------eeeecccccch------h----hcccccCCCcccCCC Confidence 6666532 21100 00 00000000000 0 000000011111111 No 192 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=95.90 E-value=0.0013 Score=36.45 Aligned_cols=440 Identities=12% Similarity=0.082 Sum_probs=177.0 Q ss_pred Cccchhhceecc-----------CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee Q lcl|NC_019916. 1 MIDMQQANMNYQ-----------EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA 69 (513) Q Consensus 1 ~~~~~~~~~~~~-----------~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri 69 (513) +-.|--|-..-+ .+....++..=..+ +.-++-....++.| .+|.|..-+-+. .--.| T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l-~~~~~~~F~Gy~----------~la~l 119 (698) T protein:vir:10 52 LNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAA-SYALDFNGTSMDAL-SFVTSSGFPGFP----------TLVLL 119 (698) T ss_pred ccccccccccCCCccccccccceeccccCCccccchh-hhhhcccccccccc-hhhhccCcchHH----------HHHHH Confidence 111111111111 11111111111000 00001001111111 122221100000 00001 Q ss_pred -ecchhHHHHHHHHHHhhcCCeeec-------------------C----CcHHHHHHHHHhcCHHHHHHHHHHHHhhCCe Q lcl|NC_019916. 70 -VHSFARYIADFQTSYSVGNAIAMS-------------------G----PSSDRLDDFNRRNDIDTLNYELYLDMTVTGR 125 (513) Q Consensus 70 -~~n~~~~ivd~~~~~l~g~p~~~~-------------------~----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~ 125 (513) .++-.+.++.+.+..+.-+-+... . +..+.|..-++.-++.....++.+.+-.||. T Consensus 120 aQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGG 199 (698) T protein:vir:10 120 AQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGR 199 (698) T ss_pred hhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 023333444454444432211111 0 1123466666677788889999999999999 Q ss_pred EEEEeeecCCCceeEEEEE--cccc-------eEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEE Q lcl|NC_019916. 126 AYEYVYRDPSQKGEVSVKL--DPME-------CFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYK 196 (513) Q Consensus 126 ~~~~v~~d~~~~~~~~~~~--~p~~-------~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~ 196 (513) +..++-.+.++.. ...++ +|.. .+.+.| ++|..... ........=.+|-++.++. T Consensus 200 a~~~i~I~gdd~~-l~~PL~~~~~~I~kGslKGL~ViD-----------p~~vtP~~--~n~~dP~spdfgkP~~y~V-- 263 (698) T protein:vir:10 200 AHPYFKIKGDDQI-MDTPLVPRPYTVPKGSFQGLRVVE-----------PYWVTPNN--YNSINPVADDFYKPSTWWM-- 263 (698) T ss_pred eEEEEEeecCccc-cccccccccccccCccceeeeeec-----------ccccccch--hhhccchhhccCCCceEEE-- Confidence 9866655433311 00001 1211 111111 11111100 0000000001111111111 Q ss_pred eeccCCcccccccccccc-----Ccc--cceEE-ecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCc Q lcl|NC_019916. 197 PIVVAGSVPTLEVAEHSA-----QFG--FPMIE-YRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDI 268 (513) Q Consensus 197 ~~~~~~~~~~~~~~~~~~-----~g~--vPvv~-~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~ 268 (513) ...+-|. +.. +|-.. -.++-.|.|..+.+.+-+++++++.-..+..+..+....+. +++. T Consensus 264 -----------~G~~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~-~dla 331 (698) T protein:vir:10 264 -----------IGSEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLA 331 (698) T ss_pred -----------ecceecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHH-HHHH Confidence 0000111 111 11110 01233578888888888888888777766655444333321 1211 Q ss_pred ccccccccccccccchhhhhhhccccccchhhhcchhcc-eeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHH Q lcl|NC_019916. 269 DTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQAN-MILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAA 347 (513) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 347 (513) ..... . + ...+.. ....+..++.+. ++.+ ++.+=+|.+++.+.+.+...+.+..+ T Consensus 332 ~aL~~---------g--~--~~~l~~-R~eli~~~Rsn~G~~ll----------Dk~~Eefeq~st~lSGLddVi~qf~q 387 (698) T protein:vir:10 332 QALTP---------G--A--NVDLSM-RAELINRYRDNRNILFL----------DKATEEFFQFNTPLSGLDALQAQAQE 387 (698) T ss_pred HhcCC---------h--h--hHHHHH-HHHHHHHhcCccceEEE----------ecCCcceEEEecCcCCHHHHHHHHHH Confidence 11000 0 0 000000 111122222222 2222 22345778888999999999999999 Q ss_pred HHHHHhCccccccccc--cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeC Q lcl|NC_019916. 348 DIHKFSHTPDLTDDNF--SG-NSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFR 424 (513) Q Consensus 348 ~i~~~s~~p~~~~~~~--~~-n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~ 424 (513) +|+..+++|-.-+-.. +| |.||++=..-|...+ ....+..+...+++++.+|.. ...+. .+. +|.++|+ T Consensus 388 ~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~r--S~~G~---idp-~i~~~fn 459 (698) T protein:vir:10 388 QMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQL--SLFGA---VDP-SIKWQWN 459 (698) T ss_pred HHHhhhcCchhhhhccCCcccCccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH--HhcCC---CCC-cceEEeC Confidence 9999999996543322 23 688987444444444 245578899999998887642 22222 222 5889999 Q ss_pred CCCCcCHHHHHHHHHHH---------hcCCCHHHHHHhC------CCC--CCHHHHH-----HHHHHHHHHHHHHhhhhc Q lcl|NC_019916. 425 DNLPTDDVAIITALVQA---------GAQIPQEYLYQYL------PNV--TDADEIV-----KMMDKQRKAMLKTYDTKG 482 (513) Q Consensus 425 ~~~p~d~~e~a~~~~kl---------~g~iS~et~~~~l------~~v--~D~~~E~-----~ri~~E~~~~~~~~~~~~ 482 (513) +--..++.|.|++-.|- .|+|+...+..+| +|. .|++.+- ..++.+... .......+ T Consensus 460 PL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~-~~~~~~~~ 538 (698) T protein:vir:10 460 ALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTY-VQRMAEGG 538 (698) T ss_pred CCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhh-hcCCcCCC Confidence 99999999999875542 4666665554444 221 1211110 000100000 00000001 Q ss_pred CCCCCC--CCCCCCCCCCCCCCCCC-C--CCccCCC Q lcl|NC_019916. 483 GLIING--TSGNDPEDEGVRGQQGE-P--EDERTSD 513 (513) Q Consensus 483 ~~~~~~--~~~~~~~~~~~~~~~~~-~--~~~~~~~ 513 (513) ..+... .....|..+++...+.. . ..+.... T Consensus 539 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 574 (698) T protein:vir:10 539 DTGAPTAPGGARAGATAPPAAANVNANANPREAGAQ 574 (698) T ss_pred CcccccccccccCCCCCCcccccccCCCCccccCcc Confidence 111000 00011111111111111 0 1111111 No 193 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=95.72 E-value=0.0016 Score=35.96 Aligned_cols=402 Identities=10% Similarity=0.004 Sum_probs=152.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc-----cc-ccCCCCCC---cceeecchhHHHHHHHHHHhhcC Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSP-----AS-RRNEKGKA---DHRAVHSFARYIADFQTSYSVGN 88 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~-----~~-~~~~~~~~---~~ri~~n~~~~ivd~~~~~l~g~ 88 (513) +-++.+.-++......-.+.-- ....|........ .. .....+.. ..=+.+.-....|+..++-+-+- T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~l 77 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDP---VDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAM 77 (432) T ss_pred CCCcccCchhhhhHhhcCCccc---cccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccC Confidence 3333333332221111000000 0000000000000 00 00000000 00111223334666666666667 Q ss_pred Ceeec---CCc-----HHHHHHHH-Hh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecC Q lcl|NC_019916. 89 AIAMS---GPS-----SDRLDDFN-RR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDR 155 (513) Q Consensus 89 p~~~~---~~~-----~~~l~~~~-~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~ 155 (513) |+.+- .+. +..+..++ .. |. .......+..+.+.+|.||+++..+ +|++.-.+.++|..+-+..+. T Consensus 78 p~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~ 156 (432) T protein:vir:97 78 PLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDT 156 (432) T ss_pred ceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcC Confidence 87641 111 12233343 22 33 2355566788999999999888776 466554556888888887765 Q ss_pred CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhH Q lcl|NC_019916. 156 SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFEN 235 (513) Q Consensus 156 ~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~ 235 (513) .. .+. |.....++ ....+..+.+++++.. ++ +...|.|-++. T Consensus 157 ~g--~~~-----y~~~~~~g-------~~~~~~~~~iih~r~~--------------------~~----dg~~G~spi~~ 198 (432) T protein:vir:97 157 KG--NTA-----YRYRRTDG-------QMIDIPRQQIWKIMGY--------------------SL----DGENGLSAIRY 198 (432) T ss_pred CC--cEE-----EEEEecCc-------eEEEEccccEEEecCc--------------------CC----CCcccccHHHH Confidence 32 221 11111111 0112344444443211 00 11135565555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc-hhcceeecccc Q lcl|NC_019916. 236 VLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM-RQANMILLKTG 314 (513) Q Consensus 236 v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~ 314 (513) +...++....+.....+.+...+.|-.+++-... ..... ...++. .+... ..++++.+ T Consensus 199 ~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~----------l~~e~----~~~~~~----~~~~~~nag~~~vl--- 257 (432) T protein:vir:97 199 GAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF----------LTDDQ----YDSFSK----KVSGSVEAGRAPLL--- 257 (432) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceeEecCCC----------CCHHH----HHHHHH----HHhhhhcCCCceec--- Confidence 4444433332222222333333333333332110 00000 111110 11111 11222222 Q ss_pred ccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccc--cHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 315 MAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNS--SGVAMKYKVLGTVELASTKRKQFER 392 (513) Q Consensus 315 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~--Sg~Ai~~~~~~l~~k~~~~~~~f~~ 392 (513) ..+.+++.++.......+....+...+.|+..-++|+...+....+. .|..++.... ..+.. T Consensus 258 ------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~----------~f~~~ 321 (432) T protein:vir:97 258 ------EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL----------GFLTM 321 (432) T ss_pred ------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHH----------HHHHH Confidence 22334444444444455566788888999999999986654322211 1222322211 11222 Q ss_pred HHHHHHHHHHHHHHh-cccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHH Q lcl|NC_019916. 393 GLNQRYTVVAHIEER-VNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMM 467 (513) Q Consensus 393 ~l~~~~~li~~~l~~-~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri 467 (513) +|...++.+..-+.. .-.........+++.+..-+-.|..+.++++.++ +|+++.-.+.++++. +++ ...+ + T Consensus 322 tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g-~~~~--~ 398 (432) T protein:vir:97 322 TLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG-NAAV--L 398 (432) T ss_pred HHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC-Ccce--E Confidence 333333333222221 1111111112344444555667899999998887 578888777666543 221 0000 0 Q ss_pred HHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 468 DKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 468 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) -.. ....+.... ..+. +.++. ++.+..+.+..+- T Consensus 399 ~~~-----~~~~pl~~~--~~~~----~~~~~-~~~~~~~~~~~~~ 432 (432) T protein:vir:97 399 TVQ-----SAMVPLDSI--GLQA----SPEPA-SGLGNQQQDKVSK 432 (432) T ss_pred eec-----ccccchhhh--cccC----CCCCC-CCCCCcccccccC Confidence 000 000000000 0000 00000 0001010000011 No 194 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=95.66 E-value=0.0017 Score=35.80 Aligned_cols=426 Identities=8% Similarity=0.041 Sum_probs=181.6 Q ss_pred CccchhhceeccCCc-ccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDA-DKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ 76 (513) |-+ .|+. ..++.+.+.+..+.....+.+ +.+.+.+|..- .+..+. .......++..+-... T Consensus 1 ~~~--------~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP---~~~~~~----~~~~~~~~~~dstg~~ 65 (516) T protein:vir:10 1 MKQ--------STDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLP---YLMNDK----GDNETSQNGWQGVGAQ 65 (516) T ss_pred CCc--------hhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcc---cccCCC----CCcccccccccchHHH Confidence 211 1221 134456666666665555544 44555555433 111111 1112223466677777 Q ss_pred HHHHHHHHhhcC--Ce-----eecCCc------------HH-----------HHHHHHHhcCHHHHHHHHHHHHhhCCeE Q lcl|NC_019916. 77 IADFQTSYSVGN--AI-----AMSGPS------------SD-----------RLDDFNRRNDIDTLNYELYLDMTVTGRA 126 (513) Q Consensus 77 ivd~~~~~l~g~--p~-----~~~~~~------------~~-----------~l~~~~~~n~~~~~~~~~~~~a~~~G~~ 126 (513) .++..++-|++. |+ ++...+ .. .+...+..++|.....++.++..++|.| T Consensus 66 a~~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a 145 (516) T protein:vir:10 66 ATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSC 145 (516) T ss_pred HHHHHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeE Confidence 888888777642 21 222211 11 1223445678999999999999999987 Q ss_pred EEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccc--------------cccceeEEEEEEEc---- Q lcl|NC_019916. 127 YEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV--------------DNITQTKYEVETWT---- 188 (513) Q Consensus 127 ~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~--------------~~~~~~~~~ve~yt---- 188 (513) . +|.++++.. . .-|..-+++--|.. +++...+|..+..... ....+....+++|| T Consensus 146 ~--l~~d~~~~~--~--~~pl~~y~v~~d~~-G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~ 218 (516) T protein:vir:10 146 M--LYKPSKGAI--S--AIPMHHYVVNRDTN-GDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKY 218 (516) T ss_pred e--EEecCCCCe--E--EEEcCeEEEeeCCC-CCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEe Confidence 4 566766532 2 23444455554443 3444444432211000 00001111233332 Q ss_pred -CCcEEEEEeeccCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_019916. 189 -ENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAML 262 (513) Q Consensus 189 -~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l 262 (513) ++..+.+.....+ .... .. -..++..+|++.++- +.+|+|-.++..+-+-.+|.+.-.+.........|.+ T Consensus 219 ~~~~~~~~~~~~d~-~~~~-~~-s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~ 295 (516) T protein:vir:10 219 LGEGFWELKQSADD-IPVG-KV-SKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKY 295 (516) T ss_pred cCCCceEEEEeeCc-eeec-cc-cccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 2332222222111 1111 11 123456788877653 4679998888888888888777667766666666655 Q ss_pred heecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHH Q lcl|NC_019916. 263 VIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTEL 340 (513) Q Consensus 263 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~ 340 (513) .+--.+... ... ......+.. ..+...++..+. +..+...... T Consensus 296 lv~p~g~~~----------------------------~~~-----l~~~~~g~~--~~g~~~~v~~~q~~~~~d~~~~~~ 340 (516) T protein:vir:10 296 LIRPGAQTD----------------------------VDH-----FVNSGTGEV--VTGVEEDIHIVQLGKYADLTPISA 340 (516) T ss_pred ccCcccccc----------------------------hhh-----hccCCCcee--ecCCcccceeeecCcccchHHHHH Confidence 442100000 000 000000001 112223334433 2235566667 Q ss_pred HHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHhcccccccc Q lcl|NC_019916. 341 YKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAH-I----EERVNGKWDID 415 (513) Q Consensus 341 ~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~-~----l~~~~~~~~~~ 415 (513) .++.++..|...-....+..- -+...++.-++ .++.+++..+|..+.++-.=++. + +.......... T Consensus 341 ~i~~~~~rI~~af~~~~l~~r-d~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~ 412 (516) T protein:vir:10 341 VLEVYTRRIGVVFMMETMTRR-DAERVTAVEIQ-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSD 412 (516) T ss_pred HHHHHHHHHHHHHhhhhhhcc-CCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChh Confidence 777777777443221111100 01234554443 44667777777776654221111 1 11111111111 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHH----------HhcCC-------C----HHHHHHhCCCCC----CHHHHHHHHHHH Q lcl|NC_019916. 416 PDEIGFIFRDNLPTDDVAIITALVQ----------AGAQI-------P----QEYLYQYLPNVT----DADEIVKMMDKQ 470 (513) Q Consensus 416 ~~~i~i~f~~~~p~d~~e~a~~~~k----------l~g~i-------S----~et~~~~l~~v~----D~~~E~~ri~~E 470 (513) -+.+.. ..+-+.+..++-+.. ++++- . .+.+...++ ++ -.++|++.+.++ T Consensus 413 --lv~~~~--v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~g-vp~~~irs~eev~~~r~~ 487 (516) T protein:vir:10 413 --LVDPVI--ITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQIS-AELPFLKSAEEMEQEQEA 487 (516) T ss_pred --hcCcce--ehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhC-CChhccCCHHHHHHHHHH Confidence 112211 112222222222211 11111 1 122333332 21 246677777666 Q ss_pred HHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 471 RKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) +.+.+.........+...+. .-.++-.+. T Consensus 488 ~~~~q~~~~~~~~~~~~~~~---~~~~~~~~~ 516 (516) T protein:vir:10 488 QMQAQQAQMLEEGVAKAVPG---VIQQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHhhhcccc---hhhhhhhcC Confidence 55544433322222221111 111111111 No 195 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=95.60 E-value=0.0018 Score=35.66 Aligned_cols=403 Identities=9% Similarity=0.039 Sum_probs=155.1 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHH------------HHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRL------------EMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~------------~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) |. +-+.+++.......+..+ ..+...+-|... .......+..-+.+.-...+|+. T Consensus 1 ~~------~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-------~~g~~v~~~~al~~~~V~~~i~~ 67 (434) T protein:vir:43 1 MS------KSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRES-------SSGKKVTVDKAMKLSAVWACVRL 67 (434) T ss_pred Cc------cchhhhhhhcccccchhhhcccccccccCchHHHHHHhcCCc-------cCCceechhhhhccHHHHHHHHH Confidence 11 111112221111111110 011111112100 00000000011222333456777 Q ss_pred HHHHhhcCCeee-c--CC------cHHHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcc Q lcl|NC_019916. 81 QTSYSVGNAIAM-S--GP------SSDRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDP 146 (513) Q Consensus 81 ~~~~l~g~p~~~-~--~~------~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p 146 (513) .+.-+-.-|+++ . .+ .+..+..++.. |. -......+..+.+.+|.||+++-.+ +|.+.-.+.++| T Consensus 68 ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p 146 (434) T protein:vir:43 68 ISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLP 146 (434) T ss_pred HHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcC Confidence 777777778775 1 11 12234444422 43 3355667788999999999887655 566544456888 Q ss_pred cceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCC Q lcl|NC_019916. 147 MECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNN 226 (513) Q Consensus 147 ~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~ 226 (513) ..+-+..+... .+ +|+.. ..++ ....+..+.+.+++... + +. T Consensus 147 ~~v~~~~~~~g--~~----~y~~~-~~~g-------~~~~~~~~eVih~~~~~---------------~---------dg 188 (434) T protein:vir:43 147 SRVDLECDENG--RL----KYFYT-TKKG-------ARREIERTNMLHIPAFT---------------L---------DG 188 (434) T ss_pred cceEEEEcCCC--eE----EEEEE-ecCc-------eEEEEccccEEEecCcC---------------C---------CC Confidence 88877776532 11 12111 1111 01133444444432110 0 11 Q ss_pred CCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-h Q lcl|NC_019916. 227 EYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-Q 305 (513) Q Consensus 227 ~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~ 305 (513) ..|.|-++.+...+........-....+...+.|-.+++-... ..+.. ...++.. ........ . T Consensus 189 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~----------l~~e~----~~~~r~~-~~~~~g~~na 253 (434) T protein:vir:43 189 RIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRI----------LQPAQ----REEFREY-VKSVSGAMNS 253 (434) T ss_pred ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCC----------CCHHH----HHHHHHH-HHHhcCcccc Confidence 1355555444444433222222222222323334333332111 00000 1111100 00000001 1 Q ss_pred cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc-cccHHHHHHHHHHHHHHHH Q lcl|NC_019916. 306 ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG-NSSGVAMKYKVLGTVELAS 384 (513) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~ 384 (513) ++.+.+ ..+.++.-++.......+....+...+.|+..-++|+...+...+ +..+..++.... T Consensus 254 g~~~vl---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~------- 317 (434) T protein:vir:43 254 GRSPVL---------EQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQML------- 317 (434) T ss_pred CCcccc---------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHH------- Confidence 112222 222333334333344556677888889999999999765443222 212333322211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHH Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDAD 461 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~ 461 (513) ..+..+|...+..|...+...- .........+++.+..-+..|..+.++++.++ +|+++.-.+.+.++.-.-+. T Consensus 318 ---~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~g 394 (434) T protein:vir:43 318 ---AFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPG 394 (434) T ss_pred ---HHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 1223334444443333332211 11111112345555566677889999998876 57888877776654321100 Q ss_pred HHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 462 EIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 462 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) .+.+.-- ....+.. ...+............+..+.|+=++ T Consensus 395 --gD~~~~~-----~n~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 395 --GDILTVQ-----SNLVPID--QLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred --CCeEeec-----cCccchh--hhhccCCCcchhhhhhccCCCCCCCC Confidence 0000000 0000000 00011111111111111111111111 No 196 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=269 Identities=11% Similarity=0.036 Sum_probs=115.3 Q ss_pred hhcCCeeecC---CcHHHHHHHHH-h-c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCC Q lcl|NC_019916. 85 SVGNAIAMSG---PSSDRLDDFNR-R-N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRS 156 (513) Q Consensus 85 l~g~p~~~~~---~~~~~l~~~~~-~-n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~ 156 (513) +-+-|+.+.. ..+..+..++. . | ........+..+.+.+|.||+.+..+.+|.+.-.+.++|..+.+..++. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 3334443311 11223333332 1 2 2345667788899999999999988888876555667888887776653 Q ss_pred CCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHH Q lcl|NC_019916. 157 VNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENV 236 (513) Q Consensus 157 ~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v 236 (513) . ..+ +|......+ . ...+....+++++.. ++.. ...|.|.+..+ T Consensus 81 ~--~~~----~y~~~~~~g---~----~~~~~~~evih~~~~--------------~~~~---------~~~G~s~~~~~ 124 (278) T protein:vir:78 81 S--REL----YYSIHAATG---N----KLIVHNMDMLHFKHI--------------VASN---------MVQGISPIDVL 124 (278) T ss_pred C--ceE----EEEEEcCCc---e----EEEEccccEEEECCC--------------CCCC---------CeeeccHHHHH Confidence 3 122 122111110 0 112344444443210 1111 11466766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecccccc Q lcl|NC_019916. 237 LSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMA 316 (513) Q Consensus 237 ~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 316 (513) ...++....+... ....++.+-..+.-.... .... ....++..=..... ..++++.+ T Consensus 125 ~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~---------l~~e----~~~~~~~~~~~~~~--~~g~~~vl----- 181 (278) T protein:vir:78 125 KNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSN---------VGKE----KRQQVLEDFKQYYE--ENGGILFQ----- 181 (278) T ss_pred HHHHHHHHHHHHH---HHHHhcCCCcEEEEeCCC---------CCHH----HHHHHHHHHHHHhc--cCCCceec----- Confidence 6666543333221 223333321111110000 0001 11111110000000 11122222 Q ss_pred ccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 317 PNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLN 395 (513) Q Consensus 317 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~ 395 (513) ..+.+++.++.......+....+...+.|+..-++|+...+... ++-|. ++.. ....+...++ T Consensus 182 ----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn--~~~~----------~~~~~~~~l~ 245 (278) T protein:vir:78 182 ----EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAK--NEEL----------NRFYLQHTLL 245 (278) T ss_pred ----CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHH----------HHHHHHHHHH Confidence 23334454544445555667788889999999999976554332 22121 1111 1122333344 Q ss_pred HHHHHHHHHHHhcccccccccceeeEEeCCCCC Q lcl|NC_019916. 396 QRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLP 428 (513) Q Consensus 396 ~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p 428 (513) .+++.+..-++..--....-.....+.|+-+.- T Consensus 246 P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 246 PIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 444444443332111000001123456654333 No 197 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=95.33 E-value=0.0023 Score=35.08 Aligned_cols=432 Identities=11% Similarity=0.037 Sum_probs=180.0 Q ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC--C--- Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQ---RPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN--A--- 89 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~---~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~--p--- 89 (513) |- +.+.+..+...++| ..+.+.+.+|..-. +..... ........++..+-....+++.++.|++- | T Consensus 1 m~-~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~---~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~ 74 (555) T protein:vir:17 1 MK-HSAQAKYMMLRADREDYLDSGRQSARLTLPY---ILTDEG--HVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNT 74 (555) T ss_pred Ch-hHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCC--CcccccccccccccHHHHHHHHHHHHHHhhcCCCC Confidence 11 11223334433444 34455666664331 111111 11112334566777888888888888652 2 Q ss_pred -e-eecCCcH------------H-----------HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEE Q lcl|NC_019916. 90 -I-AMSGPSS------------D-----------RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKL 144 (513) Q Consensus 90 -~-~~~~~~~------------~-----------~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~ 144 (513) + ++...+. . .+...+..++|.....++.++..++|.+. +|.++++ +. + T Consensus 75 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--ly~~~~~---~~--~ 147 (555) T protein:vir:17 75 SFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNAL--LYQGKKN---LK--L 147 (555) T ss_pred cccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEE--EEecCCc---ee--E Confidence 1 2222210 1 12223345789999999999999999876 4556553 22 2 Q ss_pred cccceEEEecCCCCcceEEEEEEEeecccc-----cc----------------------------cceeEEEEEEEcC-- Q lcl|NC_019916. 145 DPMECFIIYDRSVNPKPIMAVRYHAVQTVV-----DN----------------------------ITQTKYEVETWTE-- 189 (513) Q Consensus 145 ~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~-----~~----------------------------~~~~~~~ve~yt~-- 189 (513) -|..-|++--|. ..++...+|.++..... +. ..+....+++||. T Consensus 148 ~pl~~y~v~~d~-~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~ 226 (555) T protein:vir:17 148 YPLDRFVVSRDG-EGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVC 226 (555) T ss_pred EEcCeEEEeeCC-CcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeeccc Confidence 344445554443 34555556544321100 00 0011112334441 Q ss_pred ---CcEEEEEeeccCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019916. 190 ---NDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAM 261 (513) Q Consensus 190 ---~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~ 261 (513) ..++++.... +..... .-.+.++..+|++.++- +.+|+|-.++..+-+..+|.+.-......+...+|. T Consensus 227 ~~~~~~~~~~e~~-~~~v~~--~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp 303 (555) T protein:vir:17 227 RKDGQVKWHQECD-GKVIPG--SNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVV 303 (555) T ss_pred ccCCeeEEEEecC-ceeccc--cccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Confidence 1122222211 111100 11234677789877653 467999999999999999999888888899888888 Q ss_pred hheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHH Q lcl|NC_019916. 262 LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTE 339 (513) Q Consensus 262 l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~ 339 (513) +.+--.+..... .+.... .+.+ ..+...+++.+. ...+..... T Consensus 304 ~lv~~~g~~~~~-----------------~l~~~~---------~g~v---------~~g~~~~v~~~~~~~~~~~~~~~ 348 (555) T protein:vir:17 304 FMVSPSATTKPQ-----------------NLALAA---------NGAI---------IQGRPDDVSVVQANKAADFRTVL 348 (555) T ss_pred eeeccccccCcc-----------------eeecCC---------Ccee---------ecCCcccceeeeccccchhhHHH Confidence 655211100000 000000 0000 011223333333 223445556 Q ss_pred HHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcccc Q lcl|NC_019916. 340 LYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ--------RYTVVAHIEERVNGK 411 (513) Q Consensus 340 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~--------~~~li~~~l~~~~~~ 411 (513) ..++.++..|...-.. .... -+...++.-++. ++.++...+|..+.+ +++-++.++...+.- T Consensus 349 ~~i~~~~~~I~~aFm~--~~~~-d~~r~TAtEV~~-------r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~l 418 (555) T protein:vir:17 349 EMIQKLEQRISDAFLM--LQVR-QSERTTATEVQA-------TVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKL 418 (555) T ss_pred HHHHHHHHHHHHHHhh--cCCC-CcccchHHHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 6677666666432111 0111 123345544433 244445555444433 333344555443332 Q ss_pred cccccceeeEEeCCCCCc-CHHHHHHH----HHHHhcC---------CCHHHH----HHhCCCCC----CHHHHHHHHHH Q lcl|NC_019916. 412 WDIDPDEIGFIFRDNLPT-DDVAIITA----LVQAGAQ---------IPQEYL----YQYLPNVT----DADEIVKMMDK 469 (513) Q Consensus 412 ~~~~~~~i~i~f~~~~p~-d~~e~a~~----~~kl~g~---------iS~et~----~~~l~~v~----D~~~E~~ri~~ 469 (513) .......+.+.+.-.+.. ...+.++. +..++.+ +....+ ...++... ..++|++++++ T Consensus 419 P~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq 498 (555) T protein:vir:17 419 PQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGD 498 (555) T ss_pred CCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHH Confidence 222222233333222211 01111111 1112222 222222 33333211 24566666655 Q ss_pred HHHHHHHHh------hhh-cCCC--------CCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 470 QRKAMLKTY------DTK-GGLI--------INGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 470 E~~~~~~~~------~~~-~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) ++++++... ..+ +... ...+.++.+.+....+... |+.-..- T Consensus 499 ~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~-~~~~~~~ 555 (555) T protein:vir:17 499 QQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQEGAQDAGAAESETSS-AEAQAGA 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchhhhhHHHHHHhhcCC-cccccCC Confidence 444322211 111 1110 1111111111100000000 0000000 No 198 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=95.32 E-value=0.0023 Score=35.04 Aligned_cols=411 Identities=12% Similarity=0.013 Sum_probs=157.6 Q ss_pred chhhceeccCCcccCCHHHHHHHHHHHHHHHHHH--HHHHHHHhcCCCcc--ccccc-----c----ccCCCC---CCcc Q lcl|NC_019916. 4 MQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPR--LEMLYDYYRGQNDG--ILSPA-----S----RRNEKG---KADH 67 (513) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~--~~~~~~YY~G~~~i--~~~~~-----~----~~~~~~---~~~~ 67 (513) |+=+| .|-.=+++ +.+..+| +.-.-=+...+..- ....+ . ...... .... T Consensus 1 ~~~~~----~~~~~~~~----------~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (441) T protein:vir:94 1 MHWYN----TDCYFVDF----------KSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE 66 (441) T ss_pred Ccccc----Cccccccc----------cccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhh Confidence 11111 11000000 0000000 00000000000000 00000 0 000000 0000 Q ss_pred eeecchhHHHHHHHHHHhhcCCeeecCCcH----HHHHHHH-H-hcCH---HHHHHHHHHHHhhCCeEEEEeeecCCCce Q lcl|NC_019916. 68 RAVHSFARYIADFQTSYSVGNAIAMSGPSS----DRLDDFN-R-RNDI---DTLNYELYLDMTVTGRAYEYVYRDPSQKG 138 (513) Q Consensus 68 ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~----~~l~~~~-~-~n~~---~~~~~~~~~~a~~~G~~~~~v~~d~~~~~ 138 (513) =+...-.-..|+..++-+-+-|+++..+.. ..+..++ . -|.. ......+..+.+.+|.||+++..+..|.+ T Consensus 67 al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~ 146 (441) T protein:vir:94 67 AIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEP 146 (441) T ss_pred hhccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 011111223566666666667877633221 2233333 2 2332 34556678889999999999988888876 Q ss_pred eEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCccc Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGF 218 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 218 (513) .-.+.++|..+.+..|+.. ++.+..+. .+..+ . .....+....+++++.. ++. T Consensus 147 ~~L~~i~~~~v~v~~d~~g--~~~~~~~~---~~~~~--~---~~~~~~~~~dvih~k~~---------------~~d-- 199 (441) T protein:vir:94 147 MNLTFRKTSEIELKSDARG--RLYYFHQR---IDSNG--N---NIERNVKFEDMLDIKFY---------------SLD-- 199 (441) T ss_pred EEEEEEcCceeEEEECCCc--cEEEEEEE---eccCC--c---eeEEEEccccEEEeccC---------------CCC-- Confidence 5556789998888887542 22221111 11110 0 11123444454443210 011 Q ss_pred ceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccch Q lcl|NC_019916. 219 PMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMA 298 (513) Q Consensus 219 Pvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 298 (513) ...|.|.++.+...++....+..-..+.+...+.|-.+++=... .. ..+ ....++..=.. T Consensus 200 -------g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~---------~~e---~~e~~r~~~~~ 259 (441) T protein:vir:94 200 -------GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LD---------NKK---ARDRAREEFHK 259 (441) T ss_pred -------CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-CC---------CHH---HHHHHHHHHHH Confidence 11366766655555544333333333334444444444432110 00 000 00011100000 Q ss_pred hhhcchh-cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHH Q lcl|NC_019916. 299 QLEAMRQ-ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVL 377 (513) Q Consensus 299 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 377 (513) .+..... ++++.+ ..+.+++.++.......+....+...+.|+..-++|+...+...++.|...... T Consensus 260 ~~~G~~nag~~~vl---------~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~--- 327 (441) T protein:vir:94 260 SFSGTKQAGKVVVL---------DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--- 327 (441) T ss_pred HhcCccccCcceec---------CCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH--- Confidence 0111011 122222 223344444444444556677788888999999999865542222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCC Q lcl|NC_019916. 378 GTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLP 455 (513) Q Consensus 378 ~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~ 455 (513) .|...+...++.+..-+...-. .......+++.+..-+-.|..+.++++.++ +|+++.-.+.++++ T Consensus 328 -----------~~~~tl~P~~~~ie~eln~kl~-~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~g 395 (441) T protein:vir:94 328 -----------DYLSTLKPYITCVCAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred -----------HHHHHHHHHHHHHHHHHhhhcc-ccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 1222333333333332222111 111122344444555666888888888876 68888877776654 Q ss_pred C--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 456 N--VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 456 ~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) . +++.+..+-.+. ....+....+. .+....+.......|.+++| T Consensus 396 l~Pi~ggd~~~~~~~---------~n~~~~~~~~~-~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 396 LAPIPGGNGSIHRVD---------LNHVNIELVDE-YQMNKSRATDKKLKGGEENE 441 (441) T ss_pred CCCCCCCCcceEeec---------ccccccccccc-cccccccccccccCCCCCCC Confidence 3 333222110000 00000000000 00000000001111111111 No 199 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=95.32 E-value=0.0023 Score=35.04 Aligned_cols=411 Identities=12% Similarity=0.013 Sum_probs=157.6 Q ss_pred chhhceeccCCcccCCHHHHHHHHHHHHHHHHHH--HHHHHHHhcCCCcc--ccccc-----c----ccCCCC---CCcc Q lcl|NC_019916. 4 MQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPR--LEMLYDYYRGQNDG--ILSPA-----S----RRNEKG---KADH 67 (513) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~--~~~~~~YY~G~~~i--~~~~~-----~----~~~~~~---~~~~ 67 (513) |+=+| .|-.=+++ +.+..+| +.-.-=+...+..- ....+ . ...... .... T Consensus 1 ~~~~~----~~~~~~~~----------~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (441) T protein:vir:79 1 MHWYN----TDCYFVDF----------KSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE 66 (441) T ss_pred Ccccc----Cccccccc----------cccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhh Confidence 11111 11000000 0000000 00000000000000 00000 0 000000 0000 Q ss_pred eeecchhHHHHHHHHHHhhcCCeeecCCcH----HHHHHHH-H-hcCH---HHHHHHHHHHHhhCCeEEEEeeecCCCce Q lcl|NC_019916. 68 RAVHSFARYIADFQTSYSVGNAIAMSGPSS----DRLDDFN-R-RNDI---DTLNYELYLDMTVTGRAYEYVYRDPSQKG 138 (513) Q Consensus 68 ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~----~~l~~~~-~-~n~~---~~~~~~~~~~a~~~G~~~~~v~~d~~~~~ 138 (513) =+...-.-..|+..++-+-+-|+++..+.. ..+..++ . -|.. ......+..+.+.+|.||+++..+..|.+ T Consensus 67 al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~ 146 (441) T protein:vir:79 67 AIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEP 146 (441) T ss_pred hhccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 011111223566666666667877633221 2233333 2 2332 34556678889999999999988888876 Q ss_pred eEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCccc Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGF 218 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 218 (513) .-.+.++|..+.+..|+.. ++.+..+. .+..+ . .....+....+++++.. ++. T Consensus 147 ~~L~~i~~~~v~v~~d~~g--~~~~~~~~---~~~~~--~---~~~~~~~~~dvih~k~~---------------~~d-- 199 (441) T protein:vir:79 147 MNLTFRKTSEIELKSDARG--RLYYFHQR---IDSNG--N---NIERNVKFEDMLDIKFY---------------SLD-- 199 (441) T ss_pred EEEEEEcCceeEEEECCCc--cEEEEEEE---eccCC--c---eeEEEEccccEEEeccC---------------CCC-- Confidence 5556789998888887542 22221111 11110 0 11123444454443210 011 Q ss_pred ceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccch Q lcl|NC_019916. 219 PMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMA 298 (513) Q Consensus 219 Pvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 298 (513) ...|.|.++.+...++....+..-..+.+...+.|-.+++=... .. ..+ ....++..=.. T Consensus 200 -------g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~---------~~e---~~e~~r~~~~~ 259 (441) T protein:vir:79 200 -------GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LD---------NKK---ARDRAREEFHK 259 (441) T ss_pred -------CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-CC---------CHH---HHHHHHHHHHH Confidence 11366766655555544333333333334444444444432110 00 000 00011100000 Q ss_pred hhhcchh-cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHH Q lcl|NC_019916. 299 QLEAMRQ-ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVL 377 (513) Q Consensus 299 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 377 (513) .+..... ++++.+ ..+.+++.++.......+....+...+.|+..-++|+...+...++.|...... T Consensus 260 ~~~G~~nag~~~vl---------~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~--- 327 (441) T protein:vir:79 260 SFSGTKQAGKVVVL---------DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--- 327 (441) T ss_pred HhcCccccCcceec---------CCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH--- Confidence 0111011 122222 223344444444444556677788888999999999865542222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCC Q lcl|NC_019916. 378 GTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLP 455 (513) Q Consensus 378 ~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~ 455 (513) .|...+...++.+..-+...-. .......+++.+..-+-.|..+.++++.++ +|+++.-.+.++++ T Consensus 328 -----------~~~~tl~P~~~~ie~eln~kl~-~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~g 395 (441) T protein:vir:79 328 -----------DYLSTLKPYITCVCAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred -----------HHHHHHHHHHHHHHHHHhhhcc-ccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 1222333333333332222111 111122344444555666888888888876 68888877776654 Q ss_pred C--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 456 N--VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 456 ~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) . +++.+..+-.+. ....+....+. .+....+.......|.+++| T Consensus 396 l~Pi~ggd~~~~~~~---------~n~~~~~~~~~-~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 396 LAPIPGGNGSIHRVD---------LNHVNIELVDE-YQMNKSRATDKKLKGGEENE 441 (441) T ss_pred CCCCCCCCcceEeec---------ccccccccccc-cccccccccccccCCCCCCC Confidence 3 333222110000 00000000000 00000000001111111111 No 200 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=95.06 E-value=0.0028 Score=34.54 Aligned_cols=394 Identities=11% Similarity=0.011 Sum_probs=153.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--------cccccccc------cCCCCC---CcceeecchhHHHHHH Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQND--------GILSPASR------RNEKGK---ADHRAVHSFARYIADF 80 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~--------i~~~~~~~------~~~~~~---~~~ri~~n~~~~ivd~ 80 (513) +.++.+..+.++. +..+..... ........ ....+. ...=+.+.-...+|+. T Consensus 1 ~~~~~~mg~f~r~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~ 69 (432) T protein:vir:81 1 MPDEKKLGLFGQL-----------KAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKL 69 (432) T ss_pred CCchhhcchhhhh-----------hhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHH Confidence 3333333333321 111111000 00000000 000000 0001122333446777 Q ss_pred HHHHhhcCCeee-c--CCc-----HHHHHHHHH--hcC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 81 QTSYSVGNAIAM-S--GPS-----SDRLDDFNR--RND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 81 ~~~~l~g~p~~~-~--~~~-----~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) .++-+-+-|+.+ . .+. +..+..++. =|. -......+..+++.+|.||+++..+ +|.+.-.+.++|. T Consensus 70 Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~ 148 (432) T protein:vir:81 70 VSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLAND 148 (432) T ss_pred HHHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCC Confidence 777777778764 1 111 122334442 233 2345566778899999999888765 4555444567888 Q ss_pred ceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE 227 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 227 (513) .+.+..++.. ++.+ .....++ ....+..+.+++++... + +.. T Consensus 149 ~v~v~~~~~g--~~~y-----~~~~~~g-------~~~~~~~~~iih~r~~~---------------~---------dg~ 190 (432) T protein:vir:81 149 RLTITTDPKG--NTAY-----RYRRTDG-------QMIDIPKQQIWKIMGYS---------------L---------DGE 190 (432) T ss_pred ceEEEECCCC--cEEE-----EEEecCc-------eEEEEccccEEEecCCC---------------C---------CCc Confidence 8888776532 2221 1111111 01123344444432110 0 111 Q ss_pred CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-c Q lcl|NC_019916. 228 YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-A 306 (513) Q Consensus 228 ~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~ 306 (513) .|.|-++.+...++.......-..+.+...+.|-.+++-... ............ +..... + T Consensus 191 ~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~----------l~~e~~~~~~~~--------~~~~~nag 252 (432) T protein:vir:81 191 NGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF----------LTDDQYDSFAKK--------VSGSVEAG 252 (432) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCC----------CCHHHHHHHHHH--------HhhhhcCC Confidence 355655555444443333322222222222233333322110 001111111111 111111 2 Q ss_pred ceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc--ccHHHHHHHHHHHHHHHH Q lcl|NC_019916. 307 NMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGN--SSGVAMKYKVLGTVELAS 384 (513) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n--~Sg~Ai~~~~~~l~~k~~ 384 (513) +++.+ ..+.+++.++.......+....+...+.|+..-++|+...+....+ ..|..++-.... T Consensus 253 ~~~vl---------~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~------ 317 (432) T protein:vir:81 253 RAPLL---------EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLG------ 317 (432) T ss_pred Cceec---------CCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHH------ Confidence 23333 2233444444444445566677888899999999998665433221 122333221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc-ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCC Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEERV-NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTD 459 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~~-~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D 459 (513) .+...|...++.+..-+... -.........+++.+..-+..|..+.++++.++ +|+++.-.+.++++. +++ T Consensus 318 ----f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g 393 (432) T protein:vir:81 318 ----FLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred ----HHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 11223333333332222211 111111112334444455667899999998876 578888777777643 221 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 460 ADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 460 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) - ..+ + ..+......+ ..+...+.++..+...+.+++-+. T Consensus 394 ~-~~~--~----------~~~~~~~pl~-~~~~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 394 N-AAV--L----------TVQSAMVPLD-SIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred C-cce--E----------eecCcccchh-hhccCCCCCCCCCCCCcccccccC Confidence 0 000 0 0000000000 000000111110000111111111 No 201 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=94.83 E-value=0.0034 Score=34.13 Aligned_cols=424 Identities=8% Similarity=0.030 Sum_probs=182.3 Q ss_pred eeccCCcc-cCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHH Q lcl|NC_019916. 9 MNYQEDAD-KLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSY 84 (513) Q Consensus 9 ~~~~~~~~-~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~ 84 (513) |.-.++.. +++.+.+.+..+.....|.+ +.+.+.+|..-. .+.... ......++..+-....+++.++- T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~--~~~~~~-----~~~~~~~~~dstg~~a~~~LAa~ 73 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPY--LMNDKG-----DNETSQNGWQGVGAQATNHLANK 73 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhccc--ccCCCC-----CccccCCcccchHHHHHHHHHHH Confidence 33333322 45567777777776666644 455555554431 111111 11222346667777888888877 Q ss_pred hhcC--Ce-----eecCCcH------------HH-----------HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC Q lcl|NC_019916. 85 SVGN--AI-----AMSGPSS------------DR-----------LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP 134 (513) Q Consensus 85 l~g~--p~-----~~~~~~~------------~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~ 134 (513) |++. |+ ++...+. .. +...+..++|.....++.++..++|.+. +|.++ T Consensus 74 l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~d~ 151 (516) T protein:vir:96 74 LAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM--LYKPS 151 (516) T ss_pred HHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe--EEecC Confidence 7652 21 2222110 11 2234455789999999999999999875 56676 Q ss_pred CCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecc-------------------cccccceeEEEEEEEcCCcEEEE Q lcl|NC_019916. 135 SQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQT-------------------VVDNITQTKYEVETWTENDYTRY 195 (513) Q Consensus 135 ~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~-------------------~~~~~~~~~~~ve~yt~~~~~~~ 195 (513) ++. +. .-|..-+++--|.. +++...+|...... ........++..-.+.++..+.+ T Consensus 152 ~~~--~~--~~pl~~y~v~~d~~-G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~ 226 (516) T protein:vir:96 152 KGA--IS--AIPMHHYVVNRDTN-GDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWEL 226 (516) T ss_pred CCC--EE--EEEcCeEEEeeCCC-CCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEE Confidence 653 22 23444455544433 23433333211100 00001111221112333332222 Q ss_pred EeeccCCccccccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccc Q lcl|NC_019916. 196 KPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDT 270 (513) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~ 270 (513) .....+.. . ...-..++..+|++.++- +.+|+|-.++..+-+-.+|.+.-.+....+....|.+.+--.+.. T Consensus 227 ~~~~d~~~-~--~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~ 303 (516) T protein:vir:96 227 KQSADDIP-V--GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQT 303 (516) T ss_pred EEEeCcee-e--ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCccccc Confidence 22111111 1 111123456788877653 467999888888888888877777777777666666543210000 Q ss_pred ccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHH Q lcl|NC_019916. 271 LFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAAD 348 (513) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~ 348 (513) + ... ....+.+.. ..+...+++.+. +..+.......++.++.. T Consensus 304 --------------------~--------~~~-----l~~~~~g~i--~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~r 348 (516) T protein:vir:96 304 --------------------D--------VDH-----FVNSGTGEV--VTGVEEDIHIVQLGKYADLTPISAVLEVYTRR 348 (516) T ss_pred --------------------c--------hhh-----hccCCCcee--ecCCcccceeeecCcccchhHHHHHHHHHHHH Confidence 0 000 000000000 112223334433 223556666777777777 Q ss_pred HHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhcccccccccceeeEEe Q lcl|NC_019916. 349 IHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVA-----HIEERVNGKWDIDPDEIGFIF 423 (513) Q Consensus 349 i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~-----~~l~~~~~~~~~~~~~i~i~f 423 (513) |...-....+..- -+...++.-++ .++.+++..+|..+.++-.=++ ..+.... ..+....+.+.+ T Consensus 349 I~~af~~~~l~~r-~~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~--p~lp~~~v~~~~ 418 (516) T protein:vir:96 349 IGVVFMMETMTRR-DAERVTAVEIQ-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG--ESFTSDLVDPVI 418 (516) T ss_pred HHHHHhhhhhccC-CCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcC--CCCcccccccee Confidence 6443211111110 11234554443 3456667777776666321111 1111111 111112233333 Q ss_pred CCCCCcCHHHHHHHHH----------HHhcC-------CCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHh- Q lcl|NC_019916. 424 RDNLPTDDVAIITALV----------QAGAQ-------IPQEYLYQYL---PNVT----DADEIVKMMDKQRKAMLKTY- 478 (513) Q Consensus 424 ~~~~p~d~~e~a~~~~----------kl~g~-------iS~et~~~~l---~~v~----D~~~E~~ri~~E~~~~~~~~- 478 (513) .. +-+.+..++.+. .++++ +....++..+ -+|+ -.++|++.+.+++.+.+... T Consensus 419 vs--~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~ 496 (516) T protein:vir:96 419 IT--GIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQM 496 (516) T ss_pred ec--hHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHH Confidence 22 222222222211 11211 2222333222 1122 23556665555544433332 Q ss_pred --hhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 479 --DTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 479 --~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) ...+..... .-.++..|. T Consensus 497 ~a~~~~~~~~~------~~~~~~~~~ 516 (516) T protein:vir:96 497 LEEGVAKAVPG------VIQQELKEA 516 (516) T ss_pred HHHHhhhhhhH------HhhcccccC Confidence 222222211 111111111 No 202 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=94.51 E-value=0.0042 Score=33.60 Aligned_cols=469 Identities=10% Similarity=0.048 Sum_probs=187.9 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHH---HHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIR---HHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYI 77 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~---~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~i 77 (513) |-..|.+-+.-+ + +.+ ...+.+.|+ +|.+.-..+.+...+-|.+...- ... ..+. .|+.--= T Consensus 1 m~~~~~~~~~~t-p-e~l-a~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~--~~~-----~~~r-----~nl~~sn 65 (663) T protein:vir:34 1 MNESQPTDFADT-P-QGW-AQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDS--AHD-----AETR-----WNLFSTN 65 (663) T ss_pred CCccccccchhc-c-hhH-HHHHHHHHHHHHhccchHHHHHHHHHHHhhccccC--CCc-----cccc-----cchhhhh Confidence 555566633333 2 223 333333343 24444455677777778775421 111 1111 2322222 Q ss_pred HHHHHHHhhcCCeeec------CCc-------HHHHHHHH------HhcCHHHHHHHHHHHHhhCCeEEEEeeecC---- Q lcl|NC_019916. 78 ADFQTSYSVGNAIAMS------GPS-------SDRLDDFN------RRNDIDTLNYELYLDMTVTGRAYEYVYRDP---- 134 (513) Q Consensus 78 vd~~~~~l~g~p~~~~------~~~-------~~~l~~~~------~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~---- 134 (513) |.+..--+.+.+|..+ ..+ .+.+.+.+ +.++++..+....++++.||+|-+.|-+-. T Consensus 66 i~~i~P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~ 145 (663) T protein:vir:34 66 IQTQMASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEE 145 (663) T ss_pred HHHHhhhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccch Confidence 3333344455554432 212 12344433 346688899999999999999887665411 Q ss_pred ----------C-Cce----eEEEEEcccceEE---------EecCCC----CcceEEEE---------EE---------E Q lcl|NC_019916. 135 ----------S-QKG----EVSVKLDPMECFI---------IYDRSV----NPKPIMAV---------RY---------H 168 (513) Q Consensus 135 ----------~-~~~----~~~~~~~p~~~~~---------~~d~~~----~~~~~~~i---------r~---------~ 168 (513) . +.. ..-+.+-..++++ ++++.. .+++..-+ || + T Consensus 146 ~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a 225 (663) T protein:vir:34 146 VAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFDADGSRNLWA 225 (663) T ss_pred hccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhcCChhhhhhh Confidence 1 110 0000000111111 111110 01111111 00 0 Q ss_pred eecc-cc-c----ccc----eeEEEEEEEcC--CcEEEEEeeccCCccccccccccc----cCcccceEEecCC----CC Q lcl|NC_019916. 169 AVQT-VV-D----NIT----QTKYEVETWTE--NDYTRYKPIVVAGSVPTLEVAEHS----AQFGFPMIEYRNN----EY 228 (513) Q Consensus 169 ~~~~-~~-~----~~~----~~~~~ve~yt~--~~~~~~~~~~~~~~~~~~~~~~~~----~~g~vPvv~~~n~----~~ 228 (513) .... .+ . +.. .....-|||+. .++|++. .+.. .+....++| .|.-||...|++. -. T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~--eg~~--~~L~~~~p~lgl~~ffPcPrpl~~~~~~ds~i 301 (663) T protein:vir:34 226 SVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYV--EGYS--AVLDTQPDPLGLESFFPCPKPLLANWTTDKVV 301 (663) T ss_pred hccCcCCccccCCCCCcchhcCcceeEEEecCCcEEEEEE--cCcc--eecccCCCCCCCCCCCCCcccccceecCCCee Confidence 0000 00 0 000 11222356764 4444442 2221 222223332 2555677666553 24 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcce Q lcl|NC_019916. 229 RQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANM 308 (513) Q Consensus 229 ~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 308 (513) +-++|---+.+++++|.+.- ..|.+.+. ++ +.|.-..... ...+. .+..-.++.+ T Consensus 302 pvpd~~~y~~~~~E~n~~t~-Rin~l~d~---ik-v~gvy~~~~g---------~~i~~-----------~l~~a~~n~l 356 (663) T protein:vir:34 302 PRPDFVLAQDLYKEIDLVST-RITLLERA---IR-VVGVYDKSSG---------LTIGR-----------LLSEAAQNDL 356 (663) T ss_pred cCCcHHHHHHHHHHHHHHHH-HHHHHHhh---hh-hceeeccccc---------hhHHH-----------HHHHhhCCCc Confidence 66888888999999998543 33333322 22 2332110000 00000 0112222233 Q ss_pred eecccccccccccc-CCceeEEeecCC---HHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHH Q lcl|NC_019916. 309 ILLKTGMAPNGQQT-SADANYIHKEYD---SAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELAS 384 (513) Q Consensus 309 ~~~~~~~~~~~~~~-~~~~~~l~~~~~---~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~ 384 (513) +++.........+. .+.+.|+-.+-- ....-..-..++.++|.+|++-++.=+...-+-++.|-+.+-+.+-.++. T Consensus 357 vpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIq 436 (663) T protein:vir:34 357 IPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQ 436 (663) T ss_pred eecchhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHH Confidence 33333322221111 122344433322 22333556778889999999988766554445566666667788888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccc----------cc----------------cccceeeEEeCCCCCcCHHHHHHHH Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEERVNGK----------WD----------------IDPDEIGFIFRDNLPTDDVAIITAL 438 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~~~~~----------~~----------------~~~~~i~i~f~~~~p~d~~e~a~~~ 438 (513) +++....+..+.+.++..+++...-.. .. .....++|.=.-+.-.|.++.-+.+ T Consensus 437 e~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~ 516 (663) T protein:vir:34 437 RLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEK 516 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHH Confidence 999999999999999988887543110 00 1111233333333334444433333 Q ss_pred HH-HhcCCCHHH----HHHhCCCCCCHHHHHHHH-------HHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 439 VQ-AGAQIPQEY----LYQYLPNVTDADEIVKMM-------DKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEP 506 (513) Q Consensus 439 ~k-l~g~iS~et----~~~~l~~v~D~~~E~~ri-------~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (513) ++ +.++-+--+ ++.+.|.....-.|+-.. ...-+..++.+..........+...+.....+...-+.. T Consensus 517 ~E~l~~i~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~ 596 (663) T protein:vir:34 517 MEVLSGIASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQ 596 (663) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHH Confidence 32 222211111 112222211111111100 000000111110000000000000000000000000000 Q ss_pred CCccCCC Q lcl|NC_019916. 507 EDERTSD 513 (513) Q Consensus 507 ~~~~~~~ 513 (513) +-..+.+ T Consensus 597 q~k~q~~ 603 (663) T protein:vir:34 597 AMKGQQE 603 (663) T ss_pred HHHHHHH Confidence 0000000 No 203 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=94.38 E-value=0.0046 Score=33.42 Aligned_cols=346 Identities=11% Similarity=0.056 Sum_probs=132.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCcHHHHHHH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPSSDRLDDF 103 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l~~~ 103 (513) ..+++.+..+.......+.....+.... .......+..-+.+.-....|+..++-+-+.|+. +...+..+ T Consensus 1 M~~~~~f~~r~~~~~~~~~~~~~~~~~~------~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~----~~~~~~~L 70 (359) T protein:vir:10 1 MSILNPFERRSSITPNNYYPFMVQNGSI------VPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI----GNQVFTSV 70 (359) T ss_pred CcccchhhccccCCCCcchhhhhccccc------cCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc----cchHHHHH Confidence 1111111000000000000000000000 0000000000011222334666666666666764 23334444 Q ss_pred HHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccce Q lcl|NC_019916. 104 NRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQ 179 (513) Q Consensus 104 ~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~ 179 (513) +.. |. -......+..+.+.+|.||+++-.+.+|.+.-.+.++|..+.+..++. . ++|.... ..+ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~---~----~~y~~~~-~~~---- 138 (359) T protein:vir:10 71 LNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD---T----LTYEVNQ-FDD---- 138 (359) T ss_pred hhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC---e----EEEEEEe-cCC---- Confidence 443 32 223445677788889999999988888876555567777766655432 1 1121110 010 Q ss_pred eEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 180 TKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNE 259 (513) Q Consensus 180 ~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~ 259 (513) .....+.++.+.+++..... .+++. ...|.|-++.+...++....+..-..+.+...+. T Consensus 139 --~~~~~~~~~evih~~~~~~~----------~~~~d---------g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~ 197 (359) T protein:vir:10 139 --YPSAKYNASEMIHVKIMAYG----------VDTLH---------NLVGHSPLESLTSEIGQQKEANRLSLSTLKGALN 197 (359) T ss_pred --ceEEEEcccceEEeccCCCC----------CCccC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 01123444455544321100 00011 1146676666666555444444333444443333 Q ss_pred hhhheecCcccccccccccccccchhhhhhhccccccchhhhcc---h-hcceeeccccccccccccCCceeEEeecCCH Q lcl|NC_019916. 260 AMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM---R-QANMILLKTGMAPNGQQTSADANYIHKEYDS 335 (513) Q Consensus 260 ~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 335 (513) |-.+++-.... ...+. ...++. .++.. . .++.+.+ ..+.+++.++..... T Consensus 198 ~~gil~~~~~~----------l~~e~---~~~~~~----~~~~~~~~~n~g~~~vl---------~~g~~~~~l~~~~~d 251 (359) T protein:vir:10 198 PTSVVKVPQGT----------LSSEA---KDSIRK----EFEKANGGNNSGRVMVL---------DQSADFSTVSINADV 251 (359) T ss_pred cceEEEeCCCC----------CCHHH---HHHHHH----HHHHHhCccccCCceec---------CCCcceeeecCCHHH Confidence 44343321100 00000 000100 11110 0 0112222 222333434333333 Q ss_pred HHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_019916. 336 AGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDI 414 (513) Q Consensus 336 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~ 414 (513) ..+....+...+.|+..-++|+...+..+ .+.+...++..+......+-. -+...|.+. ......+ T Consensus 252 ~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~---p~~~~l~~~----------l~~~~~~ 318 (359) T protein:vir:10 252 ANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIE---PLISELRIK----------CDSSIGV 318 (359) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHH---HHHHHHHHH----------hhhhhcc Confidence 34556677778899999999987654332 233444444333222111100 011111110 0000111 Q ss_pred ccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhC--CCCC Q lcl|NC_019916. 415 DPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYL--PNVT 458 (513) Q Consensus 415 ~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l--~~v~ 458 (513) +... -+.| |.......+.++ +|+++.-.+.+.+ +-|- T Consensus 319 ~~~~-~~~~------d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 319 DMSP-ITDY------SNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred cchh-hhhc------CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 1100 0111 222223333343 5788876666654 2222 No 204 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=93.99 E-value=0.0057 Score=32.87 Aligned_cols=427 Identities=8% Similarity=0.056 Sum_probs=188.6 Q ss_pred chhhceeccCCcccCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 4 MQQANMNYQEDADKLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) ||.+.+.|- .+.+.+.+..+....++.+ +.+.+.+|..-. ..... .......++..+-....+++ T Consensus 1 ~~~~~~~~~-----~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~--~~~~~-----~~~~~~~~~~dstg~~a~~~ 68 (515) T protein:vir:70 1 MQDTILEYG-----GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY--LMNNK-----GDNETSQNGWQGVGAQATNH 68 (515) T ss_pred Ccchhhhhc-----CCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccc--ccCCC-----CCcccccccccchHHHHHHH Confidence 555555443 4555666666665555544 555566664431 11111 11112224556677777888 Q ss_pred HHHHhhcC--C----e-eecCCc------------HHH-----------HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEe Q lcl|NC_019916. 81 QTSYSVGN--A----I-AMSGPS------------SDR-----------LDDFNRRNDIDTLNYELYLDMTVTGRAYEYV 130 (513) Q Consensus 81 ~~~~l~g~--p----~-~~~~~~------------~~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v 130 (513) .++.|++. | + ++...+ ... +...+..++|.....++.++...+|.+. + T Consensus 69 LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l 146 (515) T protein:vir:70 69 LANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL--L 146 (515) T ss_pred HHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEE--E Confidence 88777642 2 1 222111 111 2223455789999999999999999875 4 Q ss_pred eecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccc--------------cccceeEEEEEEE-----cCCc Q lcl|NC_019916. 131 YRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV--------------DNITQTKYEVETW-----TEND 191 (513) Q Consensus 131 ~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~--------------~~~~~~~~~ve~y-----t~~~ 191 (513) |.++++. +. +-|..-+++--|.. +++...+|.++..... ....+....+++| .++. T Consensus 147 ~~d~~~~--~~--~~pl~~y~v~~d~~-G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~ 221 (515) T protein:vir:70 147 YKPSKGA--MS--AVPMHHYVVNRDTN-GDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEG 221 (515) T ss_pred EEeCCCC--eE--EEEcCeEEEeeCCC-cCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCC Confidence 5566553 22 23444455554443 4455555443221100 0000011123333 3332 Q ss_pred EE-EEEeeccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhee Q lcl|NC_019916. 192 YT-RYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIK 265 (513) Q Consensus 192 ~~-~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~ 265 (513) .+ ++... ++.. +...-..++..+|++.++ ++.+|+|-.++..+-+-.+|.+.-......+...+|.+.+- T Consensus 222 ~~~~~~e~--d~~~--~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~ 297 (515) T protein:vir:70 222 FWKINQSA--DDIP--VGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIR 297 (515) T ss_pred ceEEEEec--Ccee--eccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeC Confidence 22 22221 1111 111112335678887765 34679999999999999998888888877777777776543 Q ss_pred cCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHH Q lcl|NC_019916. 266 GDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKK 343 (513) Q Consensus 266 G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~ 343 (513) -.+... .... .....+. ...+...++..+. +..+.......++ T Consensus 298 ~~g~~~----------------------------~~~l-----~~~~~g~--iv~g~~~~v~~~~~~~~~d~~~~~~~i~ 342 (515) T protein:vir:70 298 PGSQTD----------------------------VDHF-----VNSGTGE--VITGVAEDIHIVQLGKYADLTPISAVLE 342 (515) T ss_pred cccccc----------------------------hhhc-----cccCCce--eecCCcccceeeecCcccchhHHHHHHH Confidence 110000 0000 0000000 0112233344443 2345566677777 Q ss_pred HHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcccccccccceee Q lcl|NC_019916. 344 RLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEE---RVNGKWDIDPDEIG 420 (513) Q Consensus 344 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~---~~~~~~~~~~~~i~ 420 (513) .++..|...-..-.+... -+.+.++.-++ .++.+++..+|..+.++-.-++.=+. ..+-...+....+. T Consensus 343 ~~~~rI~~af~~~~l~~r-d~~rvTAtEV~-------~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~v~ 414 (515) T protein:vir:70 343 VYTRRIGVIFMMETMTRR-DAERVTAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVD 414 (515) T ss_pred HHHHHHHHHHhhhhhhcc-CCccccHHHHH-------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhhcc Confidence 777777442221111100 01234554443 34567777777777664222111110 00111111112233 Q ss_pred EEeCCCCCcCHHHHHHH---HHHH-------hc-------CCCH----HHHHHhCCCCC----CHHHHHHHHHHHHHHHH Q lcl|NC_019916. 421 FIFRDNLPTDDVAIITA---LVQA-------GA-------QIPQ----EYLYQYLPNVT----DADEIVKMMDKQRKAML 475 (513) Q Consensus 421 i~f~~~~p~d~~e~a~~---~~kl-------~g-------~iS~----et~~~~l~~v~----D~~~E~~ri~~E~~~~~ 475 (513) +.+.. +-+.+..++. +... ++ .+.. +.+...+ +++ -.++|++.+.+++++.+ T Consensus 415 ~~~vs--~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~-g~p~~~~rs~eev~~~r~q~~~~~ 491 (515) T protein:vir:70 415 PVIVT--GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQI-SAELPFLKSEEEMQQEMAQQAQAQ 491 (515) T ss_pred cceeh--hHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHh-CCCccccCCHHHHHHHHHHHHHHH Confidence 33322 2222222222 1111 11 1221 2222232 221 24667777766655443 Q ss_pred HHhhhhcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 476 KTYDTKGGLIINGTSGNDPEDEGVRGQ 502 (513) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) .......+. ++...+...+....+ T Consensus 492 ~~~~~~~~~---~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 492 QEAMLNEGV---AKAVPGVIQQEMKEG 515 (515) T ss_pred HHHHHHHhh---hhhcccchhhhhccC Confidence 332211111 111112222222221 No 205 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=93.75 E-value=0.0065 Score=32.57 Aligned_cols=382 Identities=11% Similarity=0.085 Sum_probs=142.2 Q ss_pred HHHhcCCCccccccccc-------cCCC---C--CCcceeecchhHHHHHHHHHHhhcCCeeec-CC-----cHHHHHHH Q lcl|NC_019916. 42 YDYYRGQNDGILSPASR-------RNEK---G--KADHRAVHSFARYIADFQTSYSVGNAIAMS-GP-----SSDRLDDF 103 (513) Q Consensus 42 ~~YY~G~~~i~~~~~~~-------~~~~---~--~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~-~~-----~~~~l~~~ 103 (513) -+.|++........+.. .+.. . ....+++ =.-.+|+..++-+-+-|+++- .. +...+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~--~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~l 78 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNS--DVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYL 78 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccH--HHHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHH Confidence 11122221111100000 0000 0 0011222 222467777777777787752 11 11223334 Q ss_pred HHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCc-eeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccccc Q lcl|NC_019916. 104 NRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQK-GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNI 177 (513) Q Consensus 104 ~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~-~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~ 177 (513) +.. |. .......+..+++.+|.||+++.++..|. +...+.++|..+.+..++.. ++. | ......+. T Consensus 79 L~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~--~~~----y-~~~~~~~~- 150 (417) T protein:vir:38 79 MNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPD--NII----Y-RFTPYNSS- 150 (417) T ss_pred HhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCC--eEE----E-EEEEcCCc- Confidence 322 32 23455667888999999999998886543 43344567777766544322 121 1 11111110 Q ss_pred ceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 178 TQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDL 257 (513) Q Consensus 178 ~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~ 257 (513) ...++..+.+++++... + +.-.|.|.++.+...+.....+..-..+.+... T Consensus 151 -----~~~~~~~~dviH~r~~~---------------~---------d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng 201 (417) T protein:vir:38 151 -----MQKVCGFEDVIHWKFFS---------------Y---------DTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSG 201 (417) T ss_pred -----EEEEecCcceEEecCCC---------------C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 01123333443332100 0 111366666555554443333333333333333 Q ss_pred hhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHH Q lcl|NC_019916. 258 NEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAG 337 (513) Q Consensus 258 ~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 337 (513) +.|-.+++-... .. . +....++..=.........++.+.+ ..+.+++.++....... T Consensus 202 ~~p~~il~~~~~----------l~-~---e~~~~~~~~~~~~~~g~n~g~~~vl---------~~g~~~~~l~~~~~d~q 258 (417) T protein:vir:38 202 LKGSIIKAKESR----------LS-A---EARQKIREDFERAQAGADAGSPIIV---------DATMDYQPLEVDTNVLN 258 (417) T ss_pred CCCcEEEEeCCC----------CC-H---HHHHHHHHHHHHHhcccccCCceec---------cCCceEEEccCCHHHHH Confidence 334433332110 00 0 0011111100000000011122222 12233333333333334 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Q lcl|NC_019916. 338 TELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPD 417 (513) Q Consensus 338 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~ 417 (513) +....+...+.|+..-++|+...+..+.+.|...+ ....+...|+..++.+..-+...--. ..... T Consensus 259 ~le~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e~~-------------~~~~~~~tl~P~~~~ie~~l~~~Ll~-~~~~~ 324 (417) T protein:vir:38 259 LINSNNYSTAQIAKALRVPAYRLAQNSPNQSVKQL-------------ADDYIRNDLPFYFEPITSEFELKLLD-DAQRH 324 (417) T ss_pred HHHHHHhhHHHHHHHhCCCHHHhCCCCcchhHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhhcC-hhhcc Confidence 55566677888999989997665422112221111 11223334444444443333221110 11112 Q ss_pred eeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHH-H------HHHHHHHHHHHHHhhhhcCCCC Q lcl|NC_019916. 418 EIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEI-V------KMMDKQRKAMLKTYDTKGGLII 486 (513) Q Consensus 418 ~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E-~------~ri~~E~~~~~~~~~~~~~~~~ 486 (513) ...+.|.... -+.+..++ +.++ +|+++.-.+.++++. +++.+.. + ..++.. .+.+ .. T Consensus 325 ~~~~~fd~~~-l~~~~~~~-~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~--~~~~--------~~ 392 (417) T protein:vir:38 325 QYCIGFDTKS-VNGLPIAD-VNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQK--EAYQ--------AE 392 (417) T ss_pred cceEEechhh-hhHHHHHH-HHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccc--cccc--------cc Confidence 3456664321 12222233 2232 588888777777643 3332110 0 011100 0000 00 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 487 NGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) ...+...+++++.+.+.+..+++++ T Consensus 393 ~~~~~kgg~~~~~~~~~~~~~~~~~ 417 (417) T protein:vir:38 393 HAAELKGGDTNAKGNQNGSGTNANS 417 (417) T ss_pred cccccCCCCCCCCCCCcCCCCcCCC Confidence 0111111111222222222233333 No 206 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=93.66 E-value=0.0068 Score=32.46 Aligned_cols=403 Identities=12% Similarity=0.037 Sum_probs=181.8 Q ss_pred Cccchhhc---------eeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeec Q lcl|NC_019916. 1 MIDMQQAN---------MNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVH 71 (513) Q Consensus 1 ~~~~~~~~---------~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~ 71 (513) +..+++.- ....+....+|+..+..+++..-.-...++-.| ||+-. ... T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L--~~dm~--------------------~~D 71 (512) T protein:vir:19 14 FDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADL--AFDME--------------------EKD 71 (512) T ss_pred cccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHH--HHHHH--------------------hhC Confidence 22222211 111234467899999888877544344444333 22210 013 Q ss_pred chhHHHHHHHHHHhhcCCeeecCC---c--HH----HHHHHHHhc-CHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeE Q lcl|NC_019916. 72 SFARYIADFQTSYSVGNAIAMSGP---S--SD----RLDDFNRRN-DIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEV 140 (513) Q Consensus 72 n~~~~ivd~~~~~l~g~p~~~~~~---~--~~----~l~~~~~~n-~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~ 140 (513) ....-++.+...-+.+.++++... + +. .+++++..- +|+.....+. +|.-+|.+. +++|.-.+|...+ T Consensus 72 ~hi~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~ 150 (512) T protein:vir:19 72 THLFSELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVP 150 (512) T ss_pred hHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhcceeeeeEeeeeCCceee Confidence 445556777777888888887531 1 12 245555432 5777666664 688899885 5666444443222 Q ss_pred E-EEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccccccc-ccCccc Q lcl|NC_019916. 141 S-VKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEH-SAQFGF 218 (513) Q Consensus 141 ~-~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~v 218 (513) . +..-|...|. |+.....+ +|+ +. ... .+.+ .+++.| T Consensus 151 ~~~~~r~~~~f~-~~~~~~~~----lr~----------------------------~~--~~~------~G~~l~~~k~i 189 (512) T protein:vir:19 151 VALHHRDPALFC-ANPDNLNE----LRL----------------------------RD--ASY------HGLELQPFGWF 189 (512) T ss_pred eeeeeeccccce-eccCCCcE----EEe----------------------------cC--CCC------CceeecCCceE Confidence 1 1111222221 22111101 110 00 000 0000 011111 Q ss_pred ceEE--ecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccccc Q lcl|NC_019916. 219 PMIE--YRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEK 296 (513) Q Consensus 219 Pvv~--~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 296 (513) -.++ -..+..|.|.+..+-...=--+..+.+.+..++.|+.|+++.+=..+....+ .. .- T Consensus 190 ~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~e-----------k~-------~L 251 (512) T protein:vir:19 190 MHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNRE-----------KA-------TL 251 (512) T ss_pred EEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHH-----------HH-------HH Confidence 1111 0123457777777766666667788899999999999988766322111110 00 00 Q ss_pred chhhhcchhcceeeccccccccccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCcccccccc--ccccccHH-HH Q lcl|NC_019916. 297 MAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDN--FSGNSSGV-AM 372 (513) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~~~n~Sg~-Ai 372 (513) ...+..+..+....+ ..+..++|++.. .....++.+++.+.+.|...--.-.++.+. .+++..|. .- T Consensus 252 ~~al~~~~~~a~~ii---------P~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~vh~ 322 (512) T protein:vir:19 252 MQAVMDIGRRAGGII---------PMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGEVHD 322 (512) T ss_pred HHHHHHHhhCcEEEe---------cCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHH Confidence 111222333333333 345677888754 455668999999999888753222222222 12222221 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccc-cceeeEEeCCCCCcCHHHHHHHHHHHh-cC-CCHH Q lcl|NC_019916. 373 KYKVLGTVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDID-PDEIGFIFRDNLPTDDVAIITALVQAG-AQ-IPQE 448 (513) Q Consensus 373 ~~~~~~l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~-~~~i~i~f~~~~p~d~~e~a~~~~kl~-g~-iS~e 448 (513) +. ....++.-.+.+...+. ++++-++.+ +.....+ .....+.|....|.|....++.+.+++ |+ +|.+ T Consensus 323 ev----~~di~~aDa~~i~~tln~~li~~l~~~----N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~ 394 (512) T protein:vir:19 323 EV----RREIRNADVGQLARSINRDLIYPLLAL----NSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVS 394 (512) T ss_pred HH----HHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCCCCCccccceEEecCCChhhHHHHHHHHHHHhcCCCCCHH Confidence 22 22222233333444442 344444332 2221111 123578899999999999999988863 55 8888 Q ss_pred HHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCC-CCccCCC Q lcl|NC_019916. 449 YLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEP-EDERTSD 513 (513) Q Consensus 449 t~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 513 (513) .+.+.++ ++.++.+-.-+. ..+.... ...........+....++..+ ......| T Consensus 395 ~i~e~~G-ip~~~~~e~~~~---------~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 449 (512) T protein:vir:19 395 WIQEKLH-IPQPVGDEAVFT---------IQPVVPD-NGSQKEAALSAEDIPQEDDIDRMGVSPED 449 (512) T ss_pred HHHHHhC-CCCCCCcccccc---------CCCcccc-ccccccccccccCCCchhhHhHHhhhHHH Confidence 8888885 343221100000 0000000 000000000000000000000 0000001 No 207 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=93.39 E-value=0.0077 Score=32.16 Aligned_cols=393 Identities=10% Similarity=0.072 Sum_probs=161.3 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc---------ccCCCC-CCcceeecchhHHHHHHHH Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS---------RRNEKG-KADHRAVHSFARYIADFQT 82 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~---------~~~~~~-~~~~ri~~n~~~~ivd~~~ 82 (513) |..-+.|-+. ..++..+..++..+.|....-..... ...... -+..=+.+.-...+|+..+ T Consensus 1 ~~~~~~~~~~---------~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia 71 (424) T protein:vir:18 1 MEEPKYTIDL---------RTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLIS 71 (424) T ss_pred CCCCcccccc---------CCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHH Confidence 2211111111 11122234445555543211111000 000000 0000012223445677777 Q ss_pred HHhhcCCeee-c--CC-------cHHHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 83 SYSVGNAIAM-S--GP-------SSDRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 83 ~~l~g~p~~~-~--~~-------~~~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) .-+-+-|+.+ . .+ .+..+..++. . |. -......+..+.+.+|.||+++-.+..|.+.-.+.++|. T Consensus 72 ~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~ 151 (424) T protein:vir:18 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) T ss_pred HhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCc Confidence 7777778775 1 11 1222444443 2 32 234556678899999999999988888876656667888 Q ss_pred ceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCC Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNE 227 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 227 (513) .+.+..++. .+. |.... ++ ....|.++.+++++.... +.. T Consensus 152 ~v~v~~~~~---~~~-----y~~~~-~g-------~~~~~~~~eVihir~~~~------------------------dg~ 191 (424) T protein:vir:18 152 NMDVKLVGK---KVV-----YRYQR-DS-------EYADFSQKEIFHLKGFGF------------------------TGL 191 (424) T ss_pred ceEEEEcCC---eEE-----EEEEe-CC-------eEEEeccccEEEecCcCC------------------------CCc Confidence 876654432 111 11110 00 011344445544432100 011 Q ss_pred CCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcc Q lcl|NC_019916. 228 YRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQAN 307 (513) Q Consensus 228 ~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 307 (513) .|.|.++.+...++....+.....+.+...+.|-.+++-..... .. +....++..-.........++ T Consensus 192 ~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l----------~~---e~~~~~~~~~~~~~~~~nag~ 258 (424) T protein:vir:18 192 VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL----------TE---QQRSQVEENFKEIAGGPVKKR 258 (424) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCC----------CH---HHHHHHHHHHHHHhCCcccCC Confidence 34555544444333322222223333333344444443211100 00 000111100000000001112 Q ss_pred eeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc-cccHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 308 MILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG-NSSGVAMKYKVLGTVELASTK 386 (513) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~ 386 (513) .+.+ ..+.+++.++.......+....+..++.|+..-++|+...+...+ +..|..++-..... T Consensus 259 ~~vl---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f------- 322 (424) T protein:vir:18 259 LWIL---------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF------- 322 (424) T ss_pred ceec---------cCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHH------- Confidence 2222 223344444434444556677778888999999999866543332 22233343322222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHH Q lcl|NC_019916. 387 RKQFERGLNQRYTVVAHIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDAD 461 (513) Q Consensus 387 ~~~f~~~l~~~~~li~~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~ 461 (513) +...+...++.+..-++..- .........+++.+..-+..|..+.++.+.++ .|+++.-.+.++++. +++-+ T Consensus 323 ---~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD 399 (424) T protein:vir:18 323 ---LQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGD 399 (424) T ss_pred ---HHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC Confidence 22233333333333222210 11111223355556677778899999998887 678888777666543 11100 Q ss_pred HHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 462 EIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGV 499 (513) Q Consensus 462 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (513) +-+- . ....++...+. .....+.++ T Consensus 400 ~~~~--~-------~n~~~l~~~~~----~~~~~~n~a 424 (424) T protein:vir:18 400 VAMR--Q-------AQYVPITDLGT----NKEPRNNGA 424 (424) T ss_pred eeee--c-------cCccchhhhhc----cCCccccCC Confidence 0000 0 00000000000 000111111 No 208 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=92.81 E-value=0.0099 Score=31.58 Aligned_cols=399 Identities=10% Similarity=-0.018 Sum_probs=153.9 Q ss_pred eeccCCcccCCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHh Q lcl|NC_019916. 9 MNYQEDADKLTPTRIAAFIRHHYNNQ---RPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYS 85 (513) Q Consensus 9 ~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 85 (513) +.|---.-.+=++-.+-+++..+..+ .+....-..++.+.. +... .....+.+-+.+.-...+|+..+.-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~~--~~~~----~~~vs~~~al~~~~v~~cv~~Ia~~i 74 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSLENPSTPITGDAVDTDG--LFRA----DVYVSPETAMKLAAVYSCIYVLSSSL 74 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCCCCCccccchhhhhhhc--cccC----CceechHHhhccHHHHHHHHHHHHHH Confidence 33322222222222222222211110 000000000000000 0000 00000011122233445677777777 Q ss_pred hcCCeeecC--C-c-----HHHHHHHHH--hcC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE Q lcl|NC_019916. 86 VGNAIAMSG--P-S-----SDRLDDFNR--RND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII 152 (513) Q Consensus 86 ~g~p~~~~~--~-~-----~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~ 152 (513) -+-|+++-. + . +..+..++. =|. .......+..+++.+|.||+++-.+..|.+.-.+.++|..+.+. T Consensus 75 A~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~ 154 (424) T protein:vir:45 75 AQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLM 154 (424) T ss_pred hhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEE Confidence 777887521 1 1 122444432 233 22455567889999999999998888887665566777776554 Q ss_pred ecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcc Q lcl|NC_019916. 153 YDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGD 232 (513) Q Consensus 153 ~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd 232 (513) -++ .++. |...... . ...+.++.+++++... .+...|.|. T Consensus 155 ~~~---~~~~-----y~~~~~~----~----~~~~~~~eVih~r~~~------------------------~d~~~G~sp 194 (424) T protein:vir:45 155 NTG---GRYT-----YGLYNEY----G----AFAISPDDMIHIRALG------------------------NNQKMGLSP 194 (424) T ss_pred EcC---CeEE-----EEEEecC----c----eEEECcccEEEecCcC------------------------CCCcccccH Confidence 332 1111 1111000 0 0123344444432100 011235666 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc--hhcceee Q lcl|NC_019916. 233 FENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM--RQANMIL 310 (513) Q Consensus 233 ~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~~~~ 310 (513) ++.....|+....+..-..+.+...+.|-.+++-.... .. +....++..-....... ..++++. T Consensus 195 i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l----------~~----e~~~~~~~~~~~~~~g~~~n~g~~~v 260 (424) T protein:vir:45 195 IMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGL----------NK----ESWGWLKDQWQKASQALRRQENKTML 260 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC----------CH----HHHHHHHHHHHHHhccccccCCceeE Confidence 65555444433333322333333334454444422110 00 00001100000000000 0112222 Q ss_pred ccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 311 LKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQ 389 (513) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~ 389 (513) + ..+.+++-++.......+....+...+.|+..-++|+...+... ++-|+. +. ..... T Consensus 261 l---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~--eq----------~~~~f 319 (424) T protein:vir:45 261 L---------PADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNI--SA----------QAIQF 319 (424) T ss_pred c---------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--HH----------HHHHH Confidence 2 22233333333333344566777888899999999986654322 221221 11 11112 Q ss_pred HHHHHHHHHHHHHHHHHhcc-cccc-cccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHH Q lcl|NC_019916. 390 FERGLNQRYTVVAHIEERVN-GKWD-IDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEI 463 (513) Q Consensus 390 f~~~l~~~~~li~~~l~~~~-~~~~-~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E 463 (513) ....|...++.+..-++..- .... .....+++.+..-+-.|..+.++++.++ +|+++.-.+.+.++. +++-+ T Consensus 320 ~~~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD-- 397 (424) T protein:vir:45 320 VRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLD-- 397 (424) T ss_pred HHHHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-- Confidence 23334444433333332210 0000 0112344444555567888999998887 478887777666643 11100 Q ss_pred HHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 464 VKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 464 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ....+..... ..+...+. ..++ .+++| T Consensus 398 ------------~~~~~~n~~~------~~~~~~~~-~~~~----~~~~~ 424 (424) T protein:vir:45 398 ------------EMLVSVNAAN------PAGDFKPP-KNDE----GKTNE 424 (424) T ss_pred ------------eeeecccccc------cccccCCC-CCCC----CCCCC Confidence 0000100000 00000000 0000 01111 No 209 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=92.62 E-value=0.011 Score=31.40 Aligned_cols=384 Identities=8% Similarity=-0.015 Sum_probs=154.6 Q ss_pred HHHHHHHH---HHHHHH-HHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec--CC-- Q lcl|NC_019916. 24 AAFIRHHY---NNQRPR-LEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS--GP-- 95 (513) Q Consensus 24 ~~~i~~~~---~~~~~~-~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~--~~-- 95 (513) ..+++.+. .....+ ......++.+.... . ........=........+|+..+.-+.+-|+.+- .+ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 73 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDV------S-FLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDG 73 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCccc------C-ccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 12222111 000000 00001111111000 0 0000000012245667788888888888888751 11 Q ss_pred ----cHHHHHHHHHh-c---CHHHHHHHHHHHHhhCCeEEEEee--ecCCCceeEEEEEcccceEEEecCCCCcceEEEE Q lcl|NC_019916. 96 ----SSDRLDDFNRR-N---DIDTLNYELYLDMTVTGRAYEYVY--RDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAV 165 (513) Q Consensus 96 ----~~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~~~~~v~--~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~i 165 (513) .......++.. | ........+..+.+.+|.|+.++. .+..|.+.-.+.++|..+-+..+... . T Consensus 74 ~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~-------~ 146 (406) T protein:vir:95 74 DIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDG-------Y 146 (406) T ss_pred ceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCe-------E Confidence 11223333332 3 234556677788888877765544 44455544445677777766655421 1 Q ss_pred EEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHH Q lcl|NC_019916. 166 RYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDV 245 (513) Q Consensus 166 r~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~ 245 (513) ++. . . ...|....+++++... +++. .-.|.|-++.+...++.... T Consensus 147 ~~~-~---~---------~~~~~~~evih~~~~~-------------~~~~---------~~~G~s~i~~~~~~i~~~~~ 191 (406) T protein:vir:95 147 QVL-Y---G---------GQTFNYDEVLHFIYNP-------------DPER---------PYIGRGYRVVLKDIADNLKQ 191 (406) T ss_pred EEE-e---c---------cEEEchhHEEEeeccC-------------CCCC---------CccccCHHHHHHHHHHHHHH Confidence 110 0 0 0123333333322100 0000 01366766666666655555 Q ss_pred HHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCC Q lcl|NC_019916. 246 AQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSA 324 (513) Q Consensus 246 ~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 324 (513) +..-..+.+...+.|-.+++-... ...... ..++..-...+..... ++.+.+..+ .. T Consensus 192 ~~~~~~~~~~ng~~~~~il~~~~~----------l~~e~~----~~~~~~~~~~~~g~~n~~~~~v~~~~--------~~ 249 (406) T protein:vir:95 192 ATATKKSFMSGKYMPSLIVKVDAA----------TAELSS----EEGRNAVFKKYLQATEAGQPWIIPAE--------LL 249 (406) T ss_pred HHHHHHHHHhccCCcceEEEeCCC----------CCHHHH----HHHHHHHHHHhccccccCCceeecCC--------Cc Confidence 544444444444444444432211 000111 1111110111111111 111111111 11 Q ss_pred ceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 325 DANYIH-KEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAH 403 (513) Q Consensus 325 ~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~ 403 (513) ...-++ .......+....+...+.|+..-++|+.-.+. +.+.+.. + ...+..++..+++.+.. T Consensus 250 ~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~-~~~~~~~-----~----------~~~~~~~l~P~~~~ie~ 313 (406) T protein:vir:95 250 EVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGI-GEFNRDE-----Y----------NNFINSTILPIAKGIEQ 313 (406) T ss_pred cccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CCchHHH-----H----------HHHHHHHHHHHHHHHHH Confidence 111111 12233445567788889999999999755431 1221111 1 12344555555555554 Q ss_pred HHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019916. 404 IEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTK 481 (513) Q Consensus 404 ~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~ 481 (513) -+...--. + ....+++.++.-+..|..+.++.+.++ .|+++...+.++++.-..+ ...++..- ....+. T Consensus 314 ~l~~~l~~-~-~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~--~gd~~~~~-----~n~~~~ 384 (406) T protein:vir:95 314 ELTRKLLI-S-PDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKE--GLSELVIL-----ENYIPL 384 (406) T ss_pred HHHHhcCC-C-CCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Ccceeeec-----cCccch Confidence 44332111 1 112455556666677888889888876 6789888888887542211 11110000 000000 Q ss_pred cCCCCCCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 482 GGLIINGTSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) .. ....+....+++++ .+++++ T Consensus 385 ~~-~~~~~~~k~g~~~~-------~~~~~~ 406 (406) T protein:vir:95 385 DK-IGDQSKLKGGDNSG-------ADGQTD 406 (406) T ss_pred hh-cccccccCCCCCCC-------CCCCCC Confidence 00 00000011111110 111111 No 210 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=92.58 E-value=0.011 Score=31.36 Aligned_cols=395 Identities=10% Similarity=0.009 Sum_probs=155.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec---CCc- Q lcl|NC_019916. 21 TRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS---GPS- 96 (513) Q Consensus 21 ~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~---~~~- 96 (513) -...+..............-+..-.-|-... ......-...=+.+.-...+|+..+.-+-+-|+.+- .+. T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s------~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~ 74 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRS------DSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDR 74 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCc------cCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcc Confidence 0000000000000000000000001111000 000000001112234455577777777777787642 111 Q ss_pred ----HHHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEE Q lcl|NC_019916. 97 ----SDRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRY 167 (513) Q Consensus 97 ----~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~ 167 (513) +..+..++.. |. .......+..+.+.+|.+|+++..+.+|.+.-.+.++|..+-+..+... .+. T Consensus 75 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~--~~~----- 147 (419) T protein:vir:14 75 KPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDL--KPV----- 147 (419) T ss_pred ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc--eEE----- Confidence 1234444432 32 3345556788999999999999888888766566788888777665432 111 Q ss_pred EeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHH Q lcl|NC_019916. 168 HAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQ 247 (513) Q Consensus 168 ~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~ 247 (513) |...... .+..+.+++++. .+ .+.-.|.|.++.+...++....+. T Consensus 148 y~~~~~~-----------~~~~~~i~h~~~--------------------~~----~dg~~G~s~i~~~~~~i~~~~~~~ 192 (419) T protein:vir:14 148 YRVRGSD-----------PMPQRLVHHVRW--------------------MS----INGYTGLSPVLLHANAIGHAQAIQ 192 (419) T ss_pred EEEccCc-----------ccchhheeEecC--------------------cC----CCCcccccHHHHHHHHHHHHHHHH Confidence 1111000 011112221110 00 011246676666666655444444 Q ss_pred HHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCCce Q lcl|NC_019916. 248 SDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSADA 326 (513) Q Consensus 248 S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 326 (513) .-..+.+...+.|-.+++-...... ....+....++..=...+..... ++++.++ .+.++ T Consensus 193 ~~~~~~f~ng~~p~gil~~~~~~~~----------~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~---------~g~~~ 253 (419) T protein:vir:14 193 QYAGKSFMNGTALSGVIERPKDAPA----------LKDQASVDRITDGWNAKFGGSGNAKKVALLQ---------EGMTF 253 (419) T ss_pred HHHHHHHhccCCccEEEEecCCCCc----------ccCHHHHHHHHHHHHHHhcCccccCCceecC---------CCceE Confidence 3334444444445444442111000 00000011111000000000001 1222221 22333 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 327 NYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIE 405 (513) Q Consensus 327 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l 405 (513) ..+........+....+...+.|+..-++|+.-.+... ++-|+ ++... ...+...|.-.++.+..-+ T Consensus 254 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~--~E~~~----------~~f~~~~L~P~~~~ie~~l 321 (419) T protein:vir:14 254 RPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSN--IEHQS----------LQFVIYTLLPWVKRHEQAK 321 (419) T ss_pred EEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc--HHHHH----------HHHHHHHHHHHHHHHHHHH Confidence 33333333334556667778899999999976544221 22222 22111 1222333444433333333 Q ss_pred Hhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_019916. 406 ERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKG 482 (513) Q Consensus 406 ~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~ 482 (513) ...- .........+++.+..-+-.|..+.++++.++ .|+++.-.+.++++.-.-+. -+ ....+.. T Consensus 322 ~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--GD----------~~~~~~n 389 (419) T protein:vir:14 322 TRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKG--GD----------IYLSPMN 389 (419) T ss_pred hhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cC----------eeeeccc Confidence 2211 11111122344444555667888899988887 67888777766654311000 00 0000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 483 GLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ..... .. ++.+.+..++....+++ T Consensus 390 ~~~~~-----~~--~~~~~~~~~~~~~~~~e 413 (419) T protein:vir:14 390 MVDAS-----KP--QQLPVGKSEPTKAAIDE 413 (419) T ss_pred ccccc-----cc--ccccCCCCCCccccccc Confidence 00000 00 00111111222222222 No 211 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=92.54 E-value=0.011 Score=31.33 Aligned_cols=393 Identities=11% Similarity=0.030 Sum_probs=153.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec---C Q lcl|NC_019916. 18 LTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS---G 94 (513) Q Consensus 18 ~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~---~ 94 (513) |-.. +..........+.-.-+-....|-.. ... ....-+..-+.+.-...+|+..+.-+-+-|+.+- . T Consensus 1 m~~~---~~~~~~~~~~~~~~~~~~~~~~g~~~---s~~---~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~ 71 (419) T protein:vir:80 1 MFFS---RQLLSNLGQTQPGSGGWVSALLGSAR---SEA---GQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSG 71 (419) T ss_pred CCcc---cccccccCcCCCCcchhhHHhhcccc---ccc---CcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 0000 00000000000000000011111100 000 0000011112344455577887777777888751 1 Q ss_pred Cc-----HHHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEE Q lcl|NC_019916. 95 PS-----SDRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMA 164 (513) Q Consensus 95 ~~-----~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ 164 (513) +. +..+..++.. |. .......+..+.+.+|.||+++..+.+|.+.-.+.++|..+-+..+... .+ T Consensus 72 ~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~--~~--- 146 (419) T protein:vir:80 72 DDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDL--KP--- 146 (419) T ss_pred CCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc--eE--- Confidence 11 1224444432 32 3355566778999999999999888888766566788888776655432 11 Q ss_pred EEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHH Q lcl|NC_019916. 165 VRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYD 244 (513) Q Consensus 165 ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~ 244 (513) +|..... ..+..+.+++++. .| .+.-.|.|.++.+...++... T Consensus 147 --~y~~~~~-----------~~~~~~~i~h~~~--------------------~~----~d~~~G~s~i~~~~~~i~~~~ 189 (419) T protein:vir:80 147 --MYRVAGA-----------DPLPQRLVHHVRW--------------------MS----INGYTGLSPVLLHANAIGHAQ 189 (419) T ss_pred --EEEEcCc-----------cccchhheEEecC--------------------CC----CCCcccccHHHHHHHHHHHHH Confidence 1111100 0111222221110 00 011246666665555554433 Q ss_pred HHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccC Q lcl|NC_019916. 245 VAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTS 323 (513) Q Consensus 245 ~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 323 (513) .+..-..+.+...+.|-.+++-...... . ...+....++..=...+.... .++++.++ .+ T Consensus 190 ~~~~~~~~~f~ng~~~~gil~~~~~~~~-------~---~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~---------~g 250 (419) T protein:vir:80 190 AIQQYAGKSFMNGTALSGVIERPTDAPA-------L---KDQASVDRITDGWNAKFGGSGNAKKVALLQ---------EG 250 (419) T ss_pred HHHHHHHHHHhcCCCccEEEEecCCCCc-------c---cCHHHHHHHHHHHHHHhcCccccCCceecC---------CC Confidence 3333333333333444444432110000 0 000001111110000000101 12223322 23 Q ss_pred CceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 324 ADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVA 402 (513) Q Consensus 324 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~ 402 (513) .+++-++.......+....+...+.|+..-++|+...+... ++-|+ ++... ...+...+.-.++.+. T Consensus 251 ~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n--~e~~~----------~~f~~~~l~P~~~~ie 318 (419) T protein:vir:80 251 MKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSN--IEHQS----------LQFVIYTLLPWVKRHE 318 (419) T ss_pred ceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc--HHHHH----------HHHHHHHHHHHHHHHH Confidence 33444443334445667777888999999999976544221 22122 11110 1122223333333333 Q ss_pred HHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 403 HIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 403 ~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~ 477 (513) ..+...- .........+++.+...+..|..+.++.+.++ +|+++.-.+.+.++. +++-+ + . T Consensus 319 ~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD-~-------------~ 384 (419) T protein:vir:80 319 QAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD-I-------------Y 384 (419) T ss_pred HHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-e-------------e Confidence 3322210 00011112344444556667889999988886 678887777666543 11100 0 0 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ..+........+.+ ...+...+++...+. T Consensus 385 ~~~~n~~~~~~~~~-------~~~~~~~~~~~~~~~ 413 (419) T protein:vir:80 385 LSPMNMVDASKPQP-------IPMGKTEPTKAALDE 413 (419) T ss_pred eecccccccccccc-------ccCCCCCchhhhHHH Confidence 00100000000000 011111111111111 No 212 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=92.17 E-value=0.013 Score=31.01 Aligned_cols=418 Identities=12% Similarity=0.027 Sum_probs=157.0 Q ss_pred chhhce-eccCCcc-cCCHHHHHHHHHHHH--HHHH---H--HHHHHHHHhcCCCccccccccccCCCCCCcceeecchh Q lcl|NC_019916. 4 MQQANM-NYQEDAD-KLTPTRIAAFIRHHY--NNQR---P--RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFA 74 (513) Q Consensus 4 ~~~~~~-~~~~~~~-~~~~~~i~~~i~~~~--~~~~---~--~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~ 74 (513) |+=+|- -|-.|-. .=+..--..++..+. .++. + ....+-+...|-... ... ...+..=+.+.-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~---~~~----~~~~~~al~~~~V 73 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGT---KLR----QYKDIEAIRHSDI 73 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhccccc---Ccc----ccchhhhhccHHH Confidence 111111 0000000 000000000000000 0000 0 000000000000000 000 0000000112222 Q ss_pred HHHHHHHHHHhhcCCeeecCCcH----HHHHHHH-Hh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEc Q lcl|NC_019916. 75 RYIADFQTSYSVGNAIAMSGPSS----DRLDDFN-RR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLD 145 (513) Q Consensus 75 ~~ivd~~~~~l~g~p~~~~~~~~----~~l~~~~-~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~ 145 (513) ..+|+..++-+-+-|+++..+.. ..+..++ .. |. .......+..+++.+|.||+++..+.+|.+.-.+.++ T Consensus 74 ~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 153 (441) T protein:vir:98 74 FTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 153 (441) T ss_pred HHHHHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEc Confidence 33567666666677887643221 1233333 22 43 2345566788889999999999888888766566789 Q ss_pred ccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecC Q lcl|NC_019916. 146 PMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN 225 (513) Q Consensus 146 p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 225 (513) |..+.+..++.. ++.+..+. .. ..+ . .....+.+..+++++.. ++ + T Consensus 154 ~~~v~v~~~~~g--~~~~~~~~--~~-~~~--~---~~~~~~~~~dviHir~~---------------~~---------d 199 (441) T protein:vir:98 154 TSEIELKLDARG--RLYYFHQR--ID-SNG--N---NIERNVKFEDMLDIKFY---------------SL---------D 199 (441) T ss_pred CceeEEEECCCC--cEEEEEEE--ec-cCc--c---eeeEEEccccEEEeccC---------------CC---------C Confidence 998888876532 33322111 10 000 0 01123444444443210 00 0 Q ss_pred CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch- Q lcl|NC_019916. 226 NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR- 304 (513) Q Consensus 226 ~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~- 304 (513) .-.|.|-++.+...++....+..-....+...+.|-.+++=... . ...+ ....++..=........ T Consensus 200 g~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~-~---------~~~e---~~~~~~~~~~~~~~G~~n 266 (441) T protein:vir:98 200 GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-L---------DNKK---ARDRAREEFHKSFSGTKQ 266 (441) T ss_pred CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-C---------CCHH---HHHHHHHHHHHHhcCccc Confidence 11355666655555544433333333333333334433321100 0 0000 00001000000011000 Q ss_pred hcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHH Q lcl|NC_019916. 305 QANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELAS 384 (513) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~ 384 (513) .++++.+ ..+.+++.++.......+....+...+.|+..-++|+...+...++.|-+.... T Consensus 267 ag~~~vl---------~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~---------- 327 (441) T protein:vir:98 267 AGKVVVL---------DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL---------- 327 (441) T ss_pred cCcceec---------CCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH---------- Confidence 1122222 223344555444445556667777888999999999766542222223221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCH Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDA 460 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~ 460 (513) .|...|...++.+..-+...-- .......+++....-+-.|..+.++++.++ +|+++.-.+.++++. +++. T Consensus 328 ----~y~~tl~P~~~~ie~~ln~~L~-~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gG 402 (441) T protein:vir:98 328 ----DYLSTLKPYITCVCAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGG 402 (441) T ss_pred ----HHHHHHHHHHHHHHHHHHhhcc-ccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 1112233333322222221110 011122344444455667888899988876 678888777766533 3332 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019916. 461 DEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDE 509 (513) Q Consensus 461 ~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (513) +..+-.+.. ...+....+. -+....+.......|.++++ T Consensus 403 d~~~~~~~~---------n~~~~~~~~~-~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 403 NGSIHRVDL---------NHVNIELVDE-YQMNKSRATDKKLKGGEENE 441 (441) T ss_pred CcceEeecc---------cccccccccc-cccccccccccccCCCCCCC Confidence 211100000 0000000000 00000000001111111111 No 213 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=91.55 E-value=0.015 Score=30.53 Aligned_cols=390 Identities=10% Similarity=0.057 Sum_probs=136.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc-----HH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS-----SD 98 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~-----~~ 98 (513) ..+.+.........-.-+..+.-|.-.. ..-..+=+.+.-...+|+..+.-+-.-|+.....+ +. T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~ 70 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQ----------KYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNGDIIHDE 70 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCc----------ccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCccccccc Confidence 1111100000000000011111121100 00000001112222356666655555677643221 12 Q ss_pred HHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecC-CCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecc Q lcl|NC_019916. 99 RLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDP-SQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQT 172 (513) Q Consensus 99 ~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~-~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~ 172 (513) .+..++.. |. .......+..+++.+|.||+++.++. .|.+.-.+.++|..+.+..++.. ++. |.... T Consensus 71 ~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~--~~~-----y~~~~ 143 (406) T protein:vir:97 71 DINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNH--EIV-----YTFTD 143 (406) T ss_pred hHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCc--eEE-----EEEEe Confidence 34444432 33 23555668888899999999988874 45554455678888776655432 221 11111 Q ss_pred cccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 173 VVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTAN 252 (513) Q Consensus 173 ~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~ 252 (513) ..+ .. ...+.+..+++++.. .+ +.-.|.|.++.+...++....+..-... T Consensus 144 ~~~--~~----~~~~~~~evih~r~~---------------~~---------dg~~G~spi~~~~~~i~~~~a~~~~~~~ 193 (406) T protein:vir:97 144 MLT--AK----QVKCFAHDVIHWKFF---------------SH---------DTILGRSPLLSLGDEIDLQTGGINTLIK 193 (406) T ss_pred cCC--ce----EEEEccccEEEecCC---------------CC---------CCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 110 00 012333444433210 00 0113566655544444432222222222 Q ss_pred HHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeec Q lcl|NC_019916. 253 YMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE 332 (513) Q Consensus 253 ~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 332 (513) .+...+.|-.++.-. . .. ..+....++..=.........++.+.+ ..+.++..++.. T Consensus 194 ~f~ng~~~~~i~~~~-~----------~l---~~e~~~~~~~~~~~~~~g~n~g~~~vl---------~~g~~~~~l~~~ 250 (406) T protein:vir:97 194 FFKDGFSSGILTMKG-A----------QL---SGDARQRARQEFEKMREGSVGGSPLVF---------DSTMEYTPLEID 250 (406) T ss_pred HHhccCCCceEEecC-C----------CC---CHHHHHHHHHHHHHHhcccccCceeec---------CCCceEEEccCC Confidence 222222222111110 0 00 011111111110000000001122222 223333334333 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_019916. 333 YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKW 412 (513) Q Consensus 333 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~ 412 (513) .+...+....+...+.|+..-++|+...+.. +.-|..+ -. ....+...|...++.|..-+...-- . T Consensus 251 ~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~-~~~~~~e--~~----------~~~f~~~~l~P~~~~ie~~l~~kll-~ 316 (406) T protein:vir:97 251 TNVLQLITSNNFSTAQIAKALRVPSYKLGVN-SPNQSVA--QL----------MEDYVTNDLPFYFDAITSELGLKTL-N 316 (406) T ss_pred HHHHHHHHHHHhhHHHHHHHhCCCHHHcCCC-CCcchHH--HH----------HHHHHHHHHHHHHHHHHHHHhhhhc-C Confidence 3333344556666788888889998765421 1112111 11 1112223333333333332222100 0 Q ss_pred ccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHhhhhcCCCCCC Q lcl|NC_019916. 413 DIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNV--TDADEIVKMMDKQRKAMLKTYDTKGGLIING 488 (513) Q Consensus 413 ~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v--~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 488 (513) ........+.|.- -.+....++++.++ .|+++.-.+.+.++.- +++. ..+ ...+......+. T Consensus 317 ~~~~~~~~i~fd~--~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~--gD~----------~~~~~n~~~~~~ 382 (406) T protein:vir:97 317 DKDRRLYHIEFDT--RSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPN--MDR----------YQSSLNYVFLDK 382 (406) T ss_pred hhhccceeEEEec--CccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--CCe----------EeeccCccchhc Confidence 1111123445531 12334445555554 5788887777666432 1111 000 000000000000 Q ss_pred -CCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_019916. 489 -TSGNDPEDEGVRGQQGEPEDERT 511 (513) Q Consensus 489 -~~~~~~~~~~~~~~~~~~~~~~~ 511 (513) +++.+......+++++..++.++ T Consensus 383 ~~~~~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 383 KEEYQDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred ccccccccccccCCCCCCCCCCCC Confidence 00111111111111111111111 No 214 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=91.37 E-value=0.016 Score=30.40 Aligned_cols=375 Identities=11% Similarity=0.020 Sum_probs=149.1 Q ss_pred CCcccCCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCC Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQ---RPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNA 89 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p 89 (513) |. =++.... .+ ...+ .+.-........|.... .. .....-+.+.-...+|+..++-+-+-| T Consensus 1 Mg--~~~~~~~----~k-~~~~~~~~~~~~~~~~~~~~~~~~-----~~----v~~~~~l~~~~v~~~i~~ia~~ia~~~ 64 (383) T protein:vir:10 1 MG--LLTPKNF----SK-RNAKNMVYPSNPAFFTTTVGGMQL-----SY----VSALSALQNTNVYSVINRIASDVSSAH 64 (383) T ss_pred CC--ccccccc----cc-ccccccccccchhhhhhhccCccc-----cc----cchhHhhcchHHHHHHHHHHHhhccCc Confidence 11 0111000 00 0000 00000011111110000 00 001111223344556677777666778 Q ss_pred eeecCCcHH-HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEE Q lcl|NC_019916. 90 IAMSGPSSD-RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYH 168 (513) Q Consensus 90 ~~~~~~~~~-~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~ 168 (513) +++...... .+..-+...........+..+.+.+|.||+++..+. .-.+.++|..+.+..+.. ...+ ++ T Consensus 65 ~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~----~~~~p~~~~~v~~~~~~~---~~~~---~~ 134 (383) T protein:vir:10 65 FKTENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN----LEHIPNSDVQINYLPGNM---GIVY---TV 134 (383) T ss_pred eeecccchhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc----eeEeecCcceEEEEEcCC---ceEE---EE Confidence 876543322 222111112344556678888889999998875432 112234444443333221 1111 11 Q ss_pred eecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHH Q lcl|NC_019916. 169 AVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQS 248 (513) Q Consensus 169 ~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S 248 (513) .. . .+ . ....|.++.+++++..+.. .+ +...|.|.++.+...++....+.. T Consensus 135 ~~-~-~~-~-----~~~~~~~~evih~r~~~~~------------~~---------~~~~G~s~l~~~~~~i~~~~~~~~ 185 (383) T protein:vir:10 135 LE-S-ND-R-----PKMVLRQDQMLHFRLMPDP------------QY---------RYLIGRSPLESLQNALNLDDKASK 185 (383) T ss_pred EE-c-CC-c-----eEEEEcccceEEeccCCCC------------cc---------cccccccHHHHHHHHHHHHHHHHH Confidence 10 0 00 0 0112334444433211000 00 112477777777777776666555 Q ss_pred HHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeE Q lcl|NC_019916. 249 DTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANY 328 (513) Q Consensus 249 ~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (513) -....+...+.|-.+++-..... ..+ ....++..-.........++++.+ ..+.+++. T Consensus 186 ~~~~~f~ng~~~~~il~~~~~~~----------~~e---~~~~~~~~~~~~~~~~n~~~~~vl---------~~g~~~~~ 243 (383) T protein:vir:10 186 SNMSAMENQINPAGKLTISNYLS----------DGK---DLESAREEFEKANTGDNSGRLMVL---------PDGFDYTQ 243 (383) T ss_pred HHHHHHhccCCcceEEEeCCCCC----------CHH---HHHHHHHHHHHHhCccccCCcccc---------CCCceEEe Confidence 55555555555544443211100 000 000010000000000001112222 22334444 Q ss_pred EeecCCHHH-HHHHHHHHHHHHHHHhCccccccccc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 329 IHKEYDSAG-TELYKKRLAADIHKFSHTPDLTDDNF-SGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEE 406 (513) Q Consensus 329 l~~~~~~~~-~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~ 406 (513) +..+..... +....+...+.|+..-++|+...+.. .++.++..++. ....|...++..++.|..-+. T Consensus 244 l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq-----------~~~~~~~~l~P~~~~ie~~l~ 312 (383) T protein:vir:10 244 LEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQ-----------IKATYLANLNSYVNPIVDELR 312 (383) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHH-----------HHHHHHHHHHHHHHHHHHHHH Confidence 443333333 34667777899999999998654321 12222221211 111222334444444433332 Q ss_pred hcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCC Q lcl|NC_019916. 407 RVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGL 484 (513) Q Consensus 407 ~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~ 484 (513) ..- ....+++.+...+..|..+.++++.++ +|+++.-.+.+.++.-.-+..++ + . T Consensus 313 ~~l-----~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~-----------~-------~ 369 (383) T protein:vir:10 313 LKM-----NAPDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNL-----------P-------E 369 (383) T ss_pred Hhh-----CCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcc-----------c-------c Confidence 211 112466777888888999999998887 57888877776664311000000 0 0 Q ss_pred CCCCCCCCCCCCCC Q lcl|NC_019916. 485 IINGTSGNDPEDEG 498 (513) Q Consensus 485 ~~~~~~~~~~~~~~ 498 (513) .....++..+++++ T Consensus 370 ~~~~~~~~~gGd~e 383 (383) T protein:vir:10 370 FKPLTNETKGGDDK 383 (383) T ss_pred cCCCcccCCCCCCC Confidence 00000111111111 No 215 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=91.29 E-value=0.017 Score=30.35 Aligned_cols=390 Identities=9% Similarity=-0.008 Sum_probs=154.2 Q ss_pred Cccchhh-ceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCccee-ecchhHHH Q lcl|NC_019916. 1 MIDMQQA-NMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KADHRA-VHSFARYI 77 (513) Q Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~ri-~~n~~~~i 77 (513) |-+.+|- |+. +.+........+.......+..... ........... ....++ .......+ T Consensus 4 ~~~~~~~~~m~---------------~F~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~c 66 (413) T protein:vir:96 4 VSEIRKDKNLK---------------FFNNKRSPTEESKAKDEIPKAPQVV--MTLPNFFKELISDGYTKLSDSPEVRMA 66 (413) T ss_pred cchhhhhhcCC---------------ccccCCCcchhhhhhcccccccccc--ccchhhHhhhccchhHHHhhchHHHHH Confidence 1111110 000 0000000000000000000000000 00000000000 000111 24566677 Q ss_pred HHHHHHHhhcCCeeec---CCc----HHHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCce-eEEEEE Q lcl|NC_019916. 78 ADFQTSYSVGNAIAMS---GPS----SDRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKG-EVSVKL 144 (513) Q Consensus 78 vd~~~~~l~g~p~~~~---~~~----~~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~-~~~~~~ 144 (513) |+..+.-+-+-|+.+- .+. +..+..++. . |. .......+..+.+.+|.||+++..+..|.. .-.+.+ T Consensus 67 I~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l 146 (413) T protein:vir:96 67 VDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPI 146 (413) T ss_pred HHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEe Confidence 8888777777788751 111 122333332 2 32 345666788899999999999988877643 223457 Q ss_pred cccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEec Q lcl|NC_019916. 145 DPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR 224 (513) Q Consensus 145 ~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 224 (513) +|..+.+.+++.. .. |.... .+ . .+.+..+++++.. +++.+ T Consensus 147 ~~~~v~~~~~~~~---~~-----y~~~~-~~---~------~~~~~evih~k~~-------------~~~~~-------- 187 (413) T protein:vir:96 147 SPYKVTFNVSDDD---LD-----YSITF-DN---K------EYDPSTLLHFVLN-------------PSIER-------- 187 (413) T ss_pred cCceeEEEEcCCe---EE-----EEEee-cC---c------EEchhhEEEEecc-------------CCCCC-------- Confidence 8887776665321 11 11100 00 0 1122233322110 00000 Q ss_pred CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch Q lcl|NC_019916. 225 NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR 304 (513) Q Consensus 225 n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 304 (513) .-.|.|-++.+...+.....+.....+.+...+.|-.+++..... . +.... .+...-...+.... T Consensus 188 -~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l-~---------~e~~~----~~~~~~~~~~~g~~ 252 (413) T protein:vir:96 188 -PFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDS-D---------ELSDE----EGRENFEEMYLKRK 252 (413) T ss_pred -ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC-C---------HHHHH----HHHHHHHHHhcCcc Confidence 013666666555555444444334444444445555555432110 0 00000 11110000011101 Q ss_pred -hcceeeccccccccccccCCceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHH Q lcl|NC_019916. 305 -QANMILLKTGMAPNGQQTSADANYIH-KEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVEL 382 (513) Q Consensus 305 -~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k 382 (513) .++.+.+..++ ..+.-+. .......+....+...+.|+..-++|+.-.+. +.+.+..++ T Consensus 253 n~g~~~vl~~~~--------~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~-~~~~~~~~~---------- 313 (413) T protein:vir:96 253 EAGKPWIIPEGM--------VNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGV-GTYNKDEFN---------- 313 (413) T ss_pred ccCceeeecCCc--------ccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CcchHHHHH---------- Confidence 11222222111 1111111 12234445567778888999999999765432 111121111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCH Q lcl|NC_019916. 383 ASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDA 460 (513) Q Consensus 383 ~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~ 460 (513) ..+...+...++.|...+...-- .+...+++.+..-+..|..+.++++.++ +|+++.-.+.++++.-..+ T Consensus 314 -----~~~~~~l~P~~~~ie~~ln~~ll---~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ 385 (413) T protein:vir:96 314 -----NFINTKIMSIAQVIQQTYNKLIV---EEDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDA 385 (413) T ss_pred -----HHHHHHHHHHHHHHHHHHHHhhC---CCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 12333455555554444433211 1123455666677778899999998887 6788887777777542211 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 461 DEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPE 507 (513) Q Consensus 461 ~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (513) ... ....+.... +.+..++..+. .++++ T Consensus 386 --~gd----------~~~~~~n~~------~~~~~~~~~~~-~~~dt 413 (413) T protein:vir:96 386 --EMD----------DLLVLENYL------QQKDLVNQKKL-IQDET 413 (413) T ss_pred --Ccc----------eeeeccccc------chhhcccccCC-CCCCC Confidence 000 000000000 00000000000 11111 No 216 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=91.18 E-value=0.017 Score=30.28 Aligned_cols=415 Identities=9% Similarity=0.062 Sum_probs=177.3 Q ss_pred CCcc-cCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC Q lcl|NC_019916. 13 EDAD-KLTPTRIAAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN 88 (513) Q Consensus 13 ~~~~-~~~~~~i~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~ 88 (513) ||.. -.+.+.+.+..+....++.+ +.+.+.+|..-. +... ........++..+-....+++.++.|++- T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~---~~~~----~~~~~~~~~~~dstg~~a~~~LAa~l~~~ 73 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPY---LMAD----VNDDLSSQNAWQDDGASATNFLSNKLSQV 73 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccc---cccC----CCCCccccccccchHHHHHHHHHHHHHHh Confidence 4433 22455666666665555554 455555554331 1111 11122334566777788888888877652 Q ss_pred --Ce-----eecCCcH------------H-----------HHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCce Q lcl|NC_019916. 89 --AI-----AMSGPSS------------D-----------RLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKG 138 (513) Q Consensus 89 --p~-----~~~~~~~------------~-----------~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~ 138 (513) |+ ++...+. . .+...+..++|.....++.++...+|.|. +|.++.+ . T Consensus 74 ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--ly~~~~~-~ 150 (517) T protein:vir:10 74 LFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVM--MYHPDKT-S 150 (517) T ss_pred hcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEE--EEEeCCC-C Confidence 22 2222110 1 12234455789999999999999999875 4555433 3 Q ss_pred eEEEEEcccceEEEecCCCCcceEEEEEEEeecccc-----c---------ccceeEEEEEEEc-----CCcE-EEEEee Q lcl|NC_019916. 139 EVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVV-----D---------NITQTKYEVETWT-----ENDY-TRYKPI 198 (513) Q Consensus 139 ~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~-----~---------~~~~~~~~ve~yt-----~~~~-~~~~~~ 198 (513) .+. .-|..-+.+--|.. +++...+|..+..... + ...+....+++|| .+.. .+|... T Consensus 151 ~~~--~~pl~~y~v~~d~~-G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~ 227 (517) T protein:vir:10 151 PIQ--AVPLHHYCVRRDNN-GTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSA 227 (517) T ss_pred cEE--EEEcCeEEEeeCCC-cCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEe Confidence 333 23445555554443 3444444432221000 0 0001112233333 2222 222221 Q ss_pred ccCCccccccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhee--cCcccc Q lcl|NC_019916. 199 VVAGSVPTLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIK--GDIDTL 271 (513) Q Consensus 199 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~--G~~~~~ 271 (513) ++... ...-..++..+|++.++ ++.+|+|-.++..+-+-.+|.+.-.+.........|.+.+- |..... T Consensus 228 --d~~~~--~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~ 303 (517) T protein:vir:10 228 --DDVPV--GKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDIN 303 (517) T ss_pred --Cceee--ccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchh Confidence 11111 11112235678887765 34679998888888888888776666666666665554432 110000 Q ss_pred cccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHH Q lcl|NC_019916. 272 FDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADI 349 (513) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i 349 (513) .+.. + ..+ ....+..+++..+. ...+.......++.++..| T Consensus 304 -------------------~l~~-----------~-----~~g--~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI 346 (517) T protein:vir:10 304 -------------------QFVE-----------G-----GSG--AVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRI 346 (517) T ss_pred -------------------hccC-----------C-----Ccc--ccccCCcccceeeecccccchhHHHHHHHHHHHHH Confidence 0000 0 000 01112223334333 2234555666677777666 Q ss_pred HHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcccccccccceeeE Q lcl|NC_019916. 350 HKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ--------RYTVVAHIEERVNGKWDIDPDEIGF 421 (513) Q Consensus 350 ~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~--------~~~li~~~l~~~~~~~~~~~~~i~i 421 (513) ...-..-.+..- -+...++.-++ .++.+++..+|..+.+ +++.++..+....- ...+.+ T Consensus 347 ~~af~~~~l~~~-~~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~-----~~~v~~ 413 (517) T protein:vir:10 347 GRVFMMEAMTRR-DAERVTAYEIQ-------RDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILT-----SKNVSP 413 (517) T ss_pred HHHHhhhhhhcc-CCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcC-----CCCccc Confidence 543221111110 01234444443 3456666677776555 22222222221111 112334 Q ss_pred EeCCCCCcCHHHHHH---HHHHH-------hcC-------CCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 422 IFRDNLPTDDVAIIT---ALVQA-------GAQ-------IPQEYLYQYL---PNVT----DADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 422 ~f~~~~p~d~~e~a~---~~~kl-------~g~-------iS~et~~~~l---~~v~----D~~~E~~ri~~E~~~~~~~ 477 (513) ...-++ +.+...+ .+... +.+ +-...++..+ -+|+ -.++|+++..+++.+.+.. T Consensus 414 ~~~s~l--a~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~ 491 (517) T protein:vir:10 414 TILTGI--EALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEAT 491 (517) T ss_pred eeeccH--HHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHH Confidence 332222 2222222 22211 110 1122222221 1222 1245555554443332222 Q ss_pred hh-------hhcCCCCCCCCCCCCCC Q lcl|NC_019916. 478 YD-------TKGGLIINGTSGNDPED 496 (513) Q Consensus 478 ~~-------~~~~~~~~~~~~~~~~~ 496 (513) .+ .++.....++.+.++.. T Consensus 492 ~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 492 KYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 11 11222222222222222 No 217 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=91.16 E-value=0.017 Score=30.26 Aligned_cols=458 Identities=12% Similarity=0.068 Sum_probs=166.7 Q ss_pred Cccchh---hceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHH Q lcl|NC_019916. 1 MIDMQQ---ANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYI 77 (513) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~i 77 (513) |++-+. |-.+--.+.. ......+ .-..++.+ -.++|.++. ++. ++-. + ..-....-.+++.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~----~~~~~~~~~-~~~-p~~~-~-~~L~~~~e~~~~~~~~ 67 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLG--GEADLAK---SPNSTQIP----DHRIQSHNV-GVN-PPYN-P-DRLAAFLELNETLATG 67 (651) T ss_pred CCCccceeeeeEEEeeccc--ccccccc---cccccccc----hhhhcccCC-CCC-CCCC-H-HHHHHHHhcChHHHHH Confidence 877662 2111111110 0111110 00011111 112333332 221 1110 0 0001111236899999 Q ss_pred HHHHHHHhhcCCeeecC------Cc-H----HHHHHHHHh---------------cCHHHHHHHHHHHHhhCCeEEEEee Q lcl|NC_019916. 78 ADFQTSYSVGNAIAMSG------PS-S----DRLDDFNRR---------------NDIDTLNYELYLDMTVTGRAYEYVY 131 (513) Q Consensus 78 vd~~~~~l~g~p~~~~~------~~-~----~~l~~~~~~---------------n~~~~~~~~~~~~a~~~G~~~~~v~ 131 (513) |+..+..+.|-++.+.. ++ . +.++.+|.. .........+..+...+|.+|+-+. T Consensus 68 i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiI 147 (651) T protein:vir:99 68 IRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEML 147 (651) T ss_pred HHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhh Confidence 99999999999876521 11 1 234444432 1233455566678888898888777 Q ss_pred ecCCCceeEEEEEcccceEEEecCCCC-cceEEE---------------EEEEe---------ecccccccce------- Q lcl|NC_019916. 132 RDPSQKGEVSVKLDPMECFIIYDRSVN-PKPIMA---------------VRYHA---------VQTVVDNITQ------- 179 (513) Q Consensus 132 ~d~~~~~~~~~~~~p~~~~~~~d~~~~-~~~~~~---------------ir~~~---------~~~~~~~~~~------- 179 (513) .+..+++...+.++|..+ -+...... ..+... .+++. ....+..... T Consensus 148 rn~~g~pv~L~~lp~~~~-Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~ 226 (651) T protein:vir:99 148 TDIEGRPVGLAYVPARTV-RVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDES 226 (651) T ss_pred hcCccchhhhhhcChhhe-eeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccC Confidence 665554322222233211 11110000 000000 00000 0000000000 Q ss_pred -eEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccc---eEEecCC-----CCCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_019916. 180 -TKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFP---MIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDT 250 (513) Q Consensus 180 -~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~ 250 (513) ....+..+.+..... ......................+| |++|++. ..|.|.+..+...++....+..-. T Consensus 227 ~~~v~~~~~~d~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~ 305 (651) T protein:vir:99 227 GDEPTIRYREDEESER-EPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYN 305 (651) T ss_pred CcceeEEeccCcceee-eeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 000000011100000 000000000000000000111233 5666643 246676666555554444433333 Q ss_pred HHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe Q lcl|NC_019916. 251 ANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH 330 (513) Q Consensus 251 ~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 330 (513) .+.+...+.|-.+++-..... . .+....++..-..... ..++.+.+..++.......+.+++|.. T Consensus 306 ~~~f~NG~~p~gil~~~~~~l---------s----~e~~~~lr~~~~~~~~--nagk~~vL~~~~~~~~~~~~~g~~~~p 370 (651) T protein:vir:99 306 RDFFDNDTIPRMVIKVTGGEL---------S----EESKRDLRQMLNGLRE--ESHRAVVLEVEKFQSQLDEDVEIELEP 370 (651) T ss_pred HHHHhccCCCceEEEecCCCC---------C----HHHHHHHHHHHHHHhc--cCCceEEeecccccccccccCCceEEE Confidence 344444344444443211000 0 0001111110000000 123444444444444444556677766 Q ss_pred ecC---CHHHHHHHHHHHHHHHHHHhCcccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 331 KEY---DSAGTELYKKRLAADIHKFSHTPDLTDDNFS-GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEE 406 (513) Q Consensus 331 ~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~ 406 (513) ... ....+....+.....|+..-++|+...+... ++-|. ++... ...+...|+.+++.+...++ T Consensus 371 ls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn--~E~~~----------~~f~~~tL~P~~~~ie~eln 438 (651) T protein:vir:99 371 MGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSN--SDQQD----------KDFALEVIQPEQHTFAEWLY 438 (651) T ss_pred cCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCccc--HHHHH----------HHHHHHHHHHHHHHHHHHHH Confidence 543 2445667778888999999999976554221 12111 11111 11122233333333333332 Q ss_pred hc--ccccccccceeeEEeC--CCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 407 RV--NGKWDIDPDEIGFIFR--DNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTY 478 (513) Q Consensus 407 ~~--~~~~~~~~~~i~i~f~--~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~ 478 (513) .. ..........+.+.|+ .-+-.|...+++.+.++ +|+++.-.+.++++. +++...-. ..... T Consensus 439 ~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~---------~l~~~ 509 (651) T protein:vir:99 439 QIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEM---------TLSEF 509 (651) T ss_pred HhhcCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccc---------ccccc Confidence 21 1111111123555664 34446788888888775 688988787777643 33211000 00000 Q ss_pred hhhcCCCCCCCCCCCCCC-CCCCCCCCCCCCccCCC Q lcl|NC_019916. 479 DTKGGLIINGTSGNDPED-EGVRGQQGEPEDERTSD 513 (513) Q Consensus 479 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 513 (513) +.. ..+. ...+.+..+ ...+++ .+...++.+ T Consensus 510 ~~~-~~g~-~~~gge~~~~~~~~~~--~~~~~~e~~ 541 (651) T protein:vir:99 510 EAE-VAGD-VAGGGETEAVHEPPEE--NKIGEREWD 541 (651) T ss_pred ccc-cccc-cccCCCCcccccCccc--cccccchhh Confidence 000 0000 000000000 000000 001111111 No 218 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=90.99 E-value=0.018 Score=30.14 Aligned_cols=395 Identities=11% Similarity=0.034 Sum_probs=168.8 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc-c-ccCCCCCCcce-eecchhHHHHHHHHHHhhcCC Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPA-S-RRNEKGKADHR-AVHSFARYIADFQTSYSVGNA 89 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~-~-~~~~~~~~~~r-i~~n~~~~ivd~~~~~l~g~p 89 (513) .....++.+. . ....-.+....|.-|-.. ..... . +.......... .......-++.+....+.+.+ T Consensus 1 v~~~~l~~e~-----a----t~~~~~d~~~~~~~~l~~-~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~ 70 (488) T protein:vir:99 1 MEKPALGREI-----A----TSGDGRDITRPFISGLQV-PNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSRE 70 (488) T ss_pred CCccchhHHH-----H----HHHhhhhhhccccCCCCC-CChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCC Confidence 1111111110 0 000001111222222110 00000 0 00000000011 124566778888888889999 Q ss_pred eeecCCcH--------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeEE-EEEcccceEEEecCCCCc Q lcl|NC_019916. 90 IAMSGPSS--------DRLDDFNRRNDIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEVS-VKLDPMECFIIYDRSVNP 159 (513) Q Consensus 90 ~~~~~~~~--------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~~-~~~~p~~~~~~~d~~~~~ 159 (513) +.+....+ +.++++++.-+|......+. ++.-+|.+. +++|.-.+|...+. +..-|...| .||... T Consensus 71 w~i~p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f-~~d~~~-- 146 (488) T protein:vir:99 71 WKVEAGGDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRF-RYDQDG-- 146 (488) T ss_pred ceEEcCCCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccce-eecCCC-- Confidence 99853321 34666666667888777775 688899885 56665444433221 111111111 122111 Q ss_pred ceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEe--cCCCCCCcchhHHH Q lcl|NC_019916. 160 KPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEY--RNNEYRQGDFENVL 237 (513) Q Consensus 160 ~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~--~n~~~~~sd~e~v~ 237 (513) .+ + +.+.. ... . ....+.+++.+-.++. ..+..|.|.+..+- T Consensus 147 ~l----~-------------------~~~~~---------~~~--~--g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~ 190 (488) T protein:vir:99 147 GL----R-------------------LLTPN---------NMF--E--GEPCPAPYFWHFSTGADNDDEPYGLGLAHWLY 190 (488) T ss_pred ce----E-------------------EeccC---------CCC--C--ccccccCceEEEEeecCCCCCcccchHHHHHH Confidence 00 0 00100 000 0 0000111121111111 12345778888776 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccc Q lcl|NC_019916. 238 SLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAP 317 (513) Q Consensus 238 ~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (513) ...=--+..+.+.+..++.|+.|+++.+-...... ..+... -...+..+..+....+ T Consensus 191 w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~----------~~ek~~-------l~~av~~~~~~~~~vi------ 247 (488) T protein:vir:99 191 WPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTAT----------PEDKAK-------LLAALHAIQTDSAIIM------ 247 (488) T ss_pred HHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCC----------HHHHHH-------HHHHHHHHhcCcEEEe------ Confidence 66666667788899999999999987763211000 000000 0011222222333333 Q ss_pred cccccCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCccccccccc-cccccHH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKE-YDSAGTELYKKRLAADIHKFSHTPDLTDDNF-SGNSSGV-AMKYKVLGTVELASTKRKQFERGL 394 (513) Q Consensus 318 ~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~-Ai~~~~~~l~~k~~~~~~~f~~~l 394 (513) ..+..+++++.. .+...++..++.+.+.|...--.-.++.+.. ++...|. .-+. ....++.-.+.+...+ T Consensus 248 ---P~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~vh~~v----~~d~~~aDa~~i~~tl 320 (488) T protein:vir:99 248 ---PAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDDLQADV----RLDLVKADADLICESF 320 (488) T ss_pred ---cCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH----HHHHHHHHHHHHHHHH Confidence 345678888754 4556689999999998876532222222221 2222222 2222 2222233334444455 Q ss_pred H-HHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh---cC-CCHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_019916. 395 N-QRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG---AQ-IPQEYLYQYLPNVTDADEIVKMMDK 469 (513) Q Consensus 395 ~-~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~---g~-iS~et~~~~l~~v~D~~~E~~ri~~ 469 (513) . ++++.++.+ +.. .. .-..+.|....+.|..+.++++.++. |+ ++.+.+.+.++. +.++.+ T Consensus 321 n~~li~~l~~~----N~~-~~--~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gi-p~~~~~------ 386 (488) T protein:vir:99 321 NLGPARWLTEW----NFP-GA--QPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGV-EVESTQ------ 386 (488) T ss_pred HHHHHHHHHHh----CcC-Cc--CCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCC-CCcccc------ Confidence 3 344444433 211 11 12357788889999999999988873 55 788877787753 321110 Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 470 QRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 470 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ++ ... +.............+.. ..-.++-..+ T Consensus 387 --~~---~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 418 (488) T protein:vir:99 387 --AE---ATA--PTPSTEFAEGDQPSDPA-----AAMAPQLAEA 418 (488) T ss_pred --cc---ccc--CCCcccCCCCCCCCCch-----HHHHHHHHHH Confidence 00 000 00000000000000000 0000000000 No 219 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=89.20 E-value=0.028 Score=29.12 Aligned_cols=368 Identities=11% Similarity=0.023 Sum_probs=141.7 Q ss_pred HHHHHHHH---HHHHHH----HHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc Q lcl|NC_019916. 24 AAFIRHHY---NNQRPR----LEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS 96 (513) Q Consensus 24 ~~~i~~~~---~~~~~~----~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~ 96 (513) .-+++.+. ...... ......++.|.-. ........-+...-...+|+..++-+-+-|+++.... T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~~~ 71 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQ---------LSYVSALSALQNTNVYSVINRIASDVASAHFKTENTA 71 (385) T ss_pred CccccchhcccccccccccccchhhhhhhccccC---------ccccCHHHhhccHHHHHHHHHHHHHHhhCceeeeccc Confidence 11111100 000000 0000011111000 0000000112233445577777777777788864332 Q ss_pred HHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecc Q lcl|NC_019916. 97 SDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQT 172 (513) Q Consensus 97 ~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~ 172 (513) . ..++.. |. .......+..+.+.+|.||+++..+.. .+ +.++|..+.+..+... . ++++... T Consensus 72 ~---~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~~---~~-~p~~~~~v~~~~~~~~---~---~~~~~~~- 137 (385) T protein:vir:10 72 T---LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQNL---EH-IPNSDVQINYLPGNMG---I---VYTVLES- 137 (385) T ss_pred h---hhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCce---eE-eecCCceEEEEEcCCc---e---EEEEEEc- Confidence 2 223322 32 334555677888899999988865421 11 2344444443333211 1 1111110 Q ss_pred cccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 173 VVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTAN 252 (513) Q Consensus 173 ~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~ 252 (513) .+ . ....+..+.+++++..... .+ +...|.|.+..+...++....+..-..+ T Consensus 138 -~~--~----~~~~~~~~eiihik~~~~~------------~~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 189 (385) T protein:vir:10 138 -ND--R----PQMVLRQDQMLHFRLMPDP------------QY---------RYLIGRSPLESLQNALNLDDKASKSNMS 189 (385) T ss_pred -CC--c----eEEEEccccEEEeccCCCC------------cc---------cccccccHHHHHHHHHHHHHHHHHHHHH Confidence 00 0 0112344444443221100 00 1124677777777766655555444444 Q ss_pred HHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEeec Q lcl|NC_019916. 253 YMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKE 332 (513) Q Consensus 253 ~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 332 (513) .+...+.|-.+++-.... . ..+. ...++..-.........++++.+ ..+.+++.++.. T Consensus 190 ~~~ng~~~~gil~~~~~~-~---------~~e~---~~~~~~~~~~~~~~~n~~~~~vl---------~~g~~~~~l~~~ 247 (385) T protein:vir:10 190 AMENQINPAGKLTISNYL-S---------DGKD---LESAREEFEKANTGDNSGRLMVL---------PDGFDYTQLEMK 247 (385) T ss_pred HHhccCCcceEEEeCCCC-C---------CHHH---HHHHHHHHHHHhCccccCCcccc---------CCCceEEecCCC Confidence 444444454444332110 0 0000 00110000000000001112222 222333334333 Q ss_pred CCHHH-HHHHHHHHHHHHHHHhCccccccccc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019916. 333 YDSAG-TELYKKRLAADIHKFSHTPDLTDDNF-SGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNG 410 (513) Q Consensus 333 ~~~~~-~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~ 410 (513) ..... +....+...+.|+..-++|+...+.. .++.++..++.... .|...+...++.+..-+...- T Consensus 248 ~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~-----------~~~~~l~P~~~~ie~~l~~~l- 315 (385) T protein:vir:10 248 TDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKA-----------TYLANLNSYVNPIVDELRLKM- 315 (385) T ss_pred hhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHH-----------HHHHHHHHHHHHHHHHHHHhh- Confidence 22333 23566777889999999997654321 23322222221111 111123333332222222110 Q ss_pred ccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCC Q lcl|NC_019916. 411 KWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDTKGGLII 486 (513) Q Consensus 411 ~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~ 486 (513) -...+++.+..-+..|..+.++++.++ .|+++.-.+.+.++. +.+ ..+ +..... . T Consensus 316 ----~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~--~~~--------------~~~~~~-~ 374 (385) T protein:vir:10 316 ----NAPDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLP--DNL--------------PEFKPL-T 374 (385) T ss_pred ----CCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCC--CCC--------------ccccCc-c Confidence 012356666777778999999998887 578877666555432 111 000 000000 0 Q ss_pred CCCCCCCCCCC Q lcl|NC_019916. 487 NGTSGNDPEDE 497 (513) Q Consensus 487 ~~~~~~~~~~~ 497 (513) +...+.+..++ T Consensus 375 ~~~~~g~~~dn 385 (385) T protein:vir:10 375 TQVKGGDEGDN 385 (385) T ss_pred cccCCCCCCCC Confidence 00000000000 No 220 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=89.09 E-value=0.028 Score=29.07 Aligned_cols=453 Identities=13% Similarity=0.073 Sum_probs=159.8 Q ss_pred chhhceeccCCcccCCHHHHHHHHHH-H-------------------------HHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019916. 4 MQQANMNYQEDADKLTPTRIAAFIRH-H-------------------------YNNQRPRLEMLYDYYRGQNDGILSPAS 57 (513) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~i~~~i~~-~-------------------------~~~~~~~~~~~~~YY~G~~~i~~~~~~ 57 (513) |-+..|.+.....+..++-. +.+.. . -.....+|+.+-.+++-+ T Consensus 1 ~~~~lfg~~i~~~~~~~~~~-s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd--------- 70 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGP-SFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECD--------- 70 (537) T ss_pred CccccccceeecccccccCC-cccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchh--------- Confidence 22222222222111110000 00000 0 001111222222222221 Q ss_pred ccCCCCCCcceeecchhHHHHHHHH-HHhhcCCeeecCCc-----------HHHHHHHHHhcCHHHHHHHHHHHHhhCCe Q lcl|NC_019916. 58 RRNEKGKADHRAVHSFARYIADFQT-SYSVGNAIAMSGPS-----------SDRLDDFNRRNDIDTLNYELYLDMTVTGR 125 (513) Q Consensus 58 ~~~~~~~~~~ri~~n~~~~ivd~~~-~~l~g~p~~~~~~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~ 125 (513) +-...||+..+ .-....||.+.-++ -++.+.+++--+|+....+..|.+.+.|+ T Consensus 71 --------------~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgR 136 (537) T protein:vir:10 71 --------------SAVDDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGR 136 (537) T ss_pred --------------hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeE Confidence 22222333222 22234555553322 13455667777899999999999999999 Q ss_pred EEEEeeecCC----CceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeE--EEEEEEcCCcEEEEEeec Q lcl|NC_019916. 126 AYEYVYRDPS----QKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTK--YEVETWTENDYTRYKPIV 199 (513) Q Consensus 126 ~~~~v~~d~~----~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~--~~ve~yt~~~~~~~~~~~ 199 (513) .|.+...|.+ |-..+. .++|+.+..|.--.. +....++.... +..... ....+|++...+. .+ T Consensus 137 i~fhKiid~k~pk~GI~ELr-~lDPr~i~~vR~i~~--~~~~~~~~~~~-----~~~v~~~~~eyf~ynp~g~~~---~~ 205 (537) T protein:vir:10 137 LFFHKVIDPKKPRQGLVELR-YVDPRKIRKVTEYEA--KRPEALRTQDL-----NQQLTQQSASYFLYNPKGLKN---ST 205 (537) T ss_pred EEEEEEEeCCCccccceeee-eeCCccceeeEeecc--cCCccceEEec-----ceeeeecccceeeeccccccc---cC Confidence 9999998754 322232 368877654432110 00011111000 000000 0112344433221 00 Q ss_pred cCCccccccccccccCcccc--eEEec-------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccc Q lcl|NC_019916. 200 VAGSVPTLEVAEHSAQFGFP--MIEYR-------NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDT 270 (513) Q Consensus 200 ~~~~~~~~~~~~~~~~g~vP--vv~~~-------n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~ 270 (513) +.--+|| .|.|+ |.....|-++..+....-+ +++-|.+....-...|-+=+.-.... T Consensus 206 -------------~~~vkI~~dAI~y~hSGl~d~n~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVG 271 (537) T protein:vir:10 206 -------------NQGMKIAPDSIAYCHSGIQDLNKNMVLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYIDVG 271 (537) T ss_pred -------------CCceeccHhheeeecccceeCCCCeeeeeehhhhHHHHhh-HHHHhhHHHHhhhccccceEEEEecC Confidence 0001122 11111 2222334444322221111 11222222222222222111111100 Q ss_pred ccccccccccccchhhhh-hhccccccchhhhcchhcce----eeccccccccccccCCceeEEeecCCHHHHHHHHHHH Q lcl|NC_019916. 271 LFDDSTLLQMVDPSDADA-MKKLADEKMAQLEAMRQANM----ILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRL 345 (513) Q Consensus 271 ~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 345 (513) ..-.....+....-...+ ++..-+.....++.++.-.. ++|+.- ..+.+-.+..|---.|+.-. .-++-. T Consensus 272 nLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRR----eGgrgTEItTLpGgqnlgem-~DV~YF 346 (537) T protein:vir:10 272 NLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR----EGGRGTEISTLPGGQNLGEL-EDVKYF 346 (537) T ss_pred CCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhccccc----CCCcccceeeccccCCcChH-HHHHHH Confidence 000000000000000000 00000000000111110000 011100 00111222222222222222 234445 Q ss_pred HHHHHHHhCccccccccccc-cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----cee Q lcl|NC_019916. 346 AADIHKFSHTPDLTDDNFSG-NS-SGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDP----DEI 419 (513) Q Consensus 346 ~~~i~~~s~~p~~~~~~~~~-n~-Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~----~~i 419 (513) ++-+|..-++|-.-.+.-++ |. -|..|-.-+....+.+.+.+..|..-|..+++.=+-+ ++.....++ ..| T Consensus 347 ~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLil---Kgiit~eeW~~i~~~I 423 (537) T protein:vir:10 347 QKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLIL---KGICSIEEWEEMKEHI 423 (537) T ss_pred HHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh---ccCCCHHHHHHHhhcc Confidence 56666666787433221111 11 2223555555566677888888888888887753322 222222233 457 Q ss_pred eEEeCCCCCcCHHHHHH-------HHHHHh---c-CCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHH---HHhhhhcC Q lcl|NC_019916. 420 GFIFRDNLPTDDVAIIT-------ALVQAG---A-QIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAML---KTYDTKGG 483 (513) Q Consensus 420 ~i~f~~~~p~d~~e~a~-------~~~kl~---g-~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~---~~~~~~~~ 483 (513) .+.|...-.-.+...++ ++..+. | .+|.+++++.+=--+| .+++-+.|++|..+-. +.....++ T Consensus 424 ~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~ 503 (537) T protein:vir:10 424 QFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAME 503 (537) T ss_pred eEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccc Confidence 78886555544444333 334443 3 3799999988544443 4555566666554310 00011111 Q ss_pred CCC--CCCCCCCCCCCCCCCCCC-CCCCccCCC Q lcl|NC_019916. 484 LII--NGTSGNDPEDEGVRGQQG-EPEDERTSD 513 (513) Q Consensus 484 ~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 513 (513) .+. +.+-+..+++++.+...+ .|++.+--+ T Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:10 504 MGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGE 536 (537) T ss_pred cCCCCcccCCCCCCCcccCCccCCCCCCccCCC Confidence 110 001111111111111111 122222222 No 221 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=88.84 E-value=0.03 Score=28.94 Aligned_cols=318 Identities=8% Similarity=-0.080 Sum_probs=116.8 Q ss_pred EEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccce Q lcl|NC_019916. 141 SVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPM 220 (513) Q Consensus 141 ~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 220 (513) .++ .+|...........+. |.. ..+..+.....+.....++.....+. +..+-+..+ + T Consensus 1 v~E-------ivw~~~~g~~~~~~l~-~r~-------~~~~~~f~~~~~~~l~~~~~~~~~g~-----~~~~lp~~k--f 58 (355) T protein:vir:78 1 MFE-------QVYRIENGRARLGKLA-WRP-------PRTISRFDVAPDGGLVAIEQWGVFGK-----ATVRIPVDR--L 58 (355) T ss_pred CeE-------EEEEeeCCeEEEeeee-ecC-------ccceeeeeeccCCceeEEEecCCCCC-----CcceeccCC--E Confidence 001 1332211111111111 000 00011111111222222221110000 111111112 2 Q ss_pred EEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccc Q lcl|NC_019916. 221 IEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADE 295 (513) Q Consensus 221 v~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 295 (513) |.++ .+..|.|.+..+-..---=+..+.+.+..++.|..|+.+.+|..+....+......- ......+.. T Consensus 59 i~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~-----~~~~~~~~~ 133 (355) T protein:vir:78 59 VVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAE-----QWLNDQKEE 133 (355) T ss_pred EEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHH-----HHHHHHHHH Confidence 3332 234677888776666555567778888899999888888888654322111100000 000000000 Q ss_pred cchhhhcc--hhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccc---ccccc-cH Q lcl|NC_019916. 296 KMAQLEAM--RQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDN---FSGNS-SG 369 (513) Q Consensus 296 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~~~n~-Sg 369 (513) -...+... ...... ....+.+++++........+...++...+.|...--...+..+. .++.. +. T Consensus 134 l~~~~~~i~~g~~a~~---------iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~ 204 (355) T protein:vir:78 134 GLQLAKEFRAGEAAGG---------YIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGD 204 (355) T ss_pred HHHHHHHhhCCcceeE---------eecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHH Confidence 00000000 000111 12345678888777666667788999999887764443333321 11212 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh--cC-C Q lcl|NC_019916. 370 VAMKYKVLGTVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG--AQ-I 445 (513) Q Consensus 370 ~Ai~~~~~~l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-i 445 (513) +..+....-+..-+ +.+...+. ++++-++.+ +-.. ......+.|.. .+.+....++.+.++. |+ + T Consensus 205 vh~~v~~~~~~aD~----~~i~~~ln~~li~~l~~l----N~~~--~~~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~~ 273 (355) T protein:vir:78 205 TFASFFTGSLNAVM----KHIADVTQQHVVEDLVDQ----NWGP--EEPAPRLVPAQ-LGKEQPVTAEAIRALVECGAFT 273 (355) T ss_pred HHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHh----cCCC--CCCCCEEEecC-cChhHHHHHHHHHHHHhCCCcc Confidence 22333333233233 33344442 344444432 2111 11223566754 4556667788888773 44 5 Q ss_pred CHH----HHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCC-CCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 446 PQE----YLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTS-GNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 446 S~e----t~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.+ .+.+.++ +..+...-+ .........+........ +.....+........++..+..+ T Consensus 274 ~~~~~~~~~~e~~g-ip~p~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~~ 338 (355) T protein:vir:78 274 ADPELEKDLRARYG-LPAPAERDD-------GADAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRRGP 338 (355) T ss_pred ccHHHHHHHHHHhC-CCCCCCCCc-------ccCCccccccccccccccCCccccccccccCCCCCChhhhHH Confidence 543 3445554 332211000 000000000000000000 00001111111111111111111 No 222 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=88.74 E-value=0.03 Score=28.90 Aligned_cols=398 Identities=12% Similarity=0.041 Sum_probs=168.7 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHH-hcCCCccccccccccCCCCC---Ccce-eecchhH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDY-YRGQNDGILSPASRRNEKGK---ADHR-AVHSFAR 75 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~Y-Y~G~~~i~~~~~~~~~~~~~---~~~r-i~~n~~~ 75 (513) |++ -.|-.+..=+......+-+........ +.+..+ +.|-.+ .+.......+. .... ....... T Consensus 1 ~~~-----~i~~~~g~~~~~~~~~~~~~~~ia~~~---~~~~~~~~~~~~p---~~~~il~~~~~~~~~y~~m~~D~~i~ 69 (491) T protein:vir:79 1 MSK-----GLWVSPTEFVKFGEPDKSLSSQIATRA---RSIDFFALGMYLP---NPDPVLKALGKDIRVYRELRADAHVG 69 (491) T ss_pred CCC-----eeeCCCCCcccccccchhHHHHHhhhc---cccccccccccCc---chhHHHhhccCCHHHHHHHhhChHHH Confidence 544 233333221111111111111111111 111111 122111 00000000000 0001 1356677 Q ss_pred HHHHHHHHHhhcCCeeecCC--cH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeEE-EEEcccc Q lcl|NC_019916. 76 YIADFQTSYSVGNAIAMSGP--SS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEVS-VKLDPME 148 (513) Q Consensus 76 ~ivd~~~~~l~g~p~~~~~~--~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~~-~~~~p~~ 148 (513) -.+.+...-+.+.++++... ++ +.+.++++.-+|+.....+ .+|.-+|.+. +++|...+|...+. +..-|.. T Consensus 70 s~l~~Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~ 148 (491) T protein:vir:79 70 GCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPAD 148 (491) T ss_pred HHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeeccc Confidence 77888888888999998542 22 4567777777788877766 4688899886 56665444433221 1111222 Q ss_pred eEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccccccc-ccCcccceEEec--C Q lcl|NC_019916. 149 CFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEH-SAQFGFPMIEYR--N 225 (513) Q Consensus 149 ~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~vPvv~~~--n 225 (513) .|. ||... .+ +++..... ..+.+ .+++.|-..+-. . T Consensus 149 ~f~-~d~~~--~l--------------------------------~l~~~~~~------~~g~~lp~~k~i~~~~~~~~g 187 (491) T protein:vir:79 149 WFV-YDPEN--QL--------------------------------RFRSKEHW------VQGEELPARKFLVPRQEATYL 187 (491) T ss_pred cee-eccCC--ce--------------------------------EEeecCCC------CCceeecCCCeEEEEecCCCC Confidence 221 22111 11 11110000 00001 112222111111 2 Q ss_pred CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh Q lcl|NC_019916. 226 NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ 305 (513) Q Consensus 226 ~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 305 (513) +..|.|.+..+-...---+..+.+.+..++.|+.|+++.+=..+.... +.. .+ ...+..+.. T Consensus 188 ~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~-----------ek~---~l----~~al~~~~~ 249 (491) T protein:vir:79 188 NPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDA-----------ETN---LL----LDRLEDMVQ 249 (491) T ss_pred CcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHH-----------HHH---HH----HHHHHHHhc Confidence 345778888777766666777888999999999999877632111100 000 00 111222333 Q ss_pred cceeeccccccccccccCCceeEEeec---CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHH-HHHHHHHHHHH Q lcl|NC_019916. 306 ANMILLKTGMAPNGQQTSADANYIHKE---YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGV-AMKYKVLGTVE 381 (513) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~---~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~-Ai~~~~~~l~~ 381 (513) +..+.+ ..+.++++++.. .+...++..++.+.+.|...--.=.++.+..++...|. .-+.. .. T Consensus 250 ~a~~vi---------P~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~vh~~v~----~~ 316 (491) T protein:vir:79 250 DAVAVI---------PDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQAGLEVT----DD 316 (491) T ss_pred CeEEEe---------cCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCcccchhhHHHHHHHH----HH Confidence 333333 345778888643 23456888888888888765322122223333333232 22222 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCH-HHHHHHHHHHh--cC-CCHHHHHHhCCCC Q lcl|NC_019916. 382 LASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDD-VAIITALVQAG--AQ-IPQEYLYQYLPNV 457 (513) Q Consensus 382 k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~-~e~a~~~~kl~--g~-iS~et~~~~l~~v 457 (513) .++.-.+.....+.++++-++.+ +.. +.....+.| ..+.+. ...++.+.++. |+ +|.+.+.+.++ + T Consensus 317 i~~~D~~~i~~tln~li~~l~~~----N~~---~~~~p~f~~--~e~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~G-i 386 (491) T protein:vir:79 317 IRDGDKAIVVEAMNMLIRWICDL----NFD---GAARPVFDM--WEQEQVDEIQAGRDEKLTRAGARFTPAYFKRAYN-L 386 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHh----cCC---CCCcceEee--cCcCchhHHHHHHHHHHHhCCCccCHHHHHHHhC-C Confidence 22333344555666555554443 221 111233444 334443 45678887774 55 88888888885 3 Q ss_pred CCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 458 TDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 458 ~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.++.+-+ ..+. ..+..... .........+....| T Consensus 387 p~~~~~e~--------~~~~------~~~~~~~~-------~~~~~~~~~~~~~~d 421 (491) T protein:vir:79 387 QDGDLDER--------PLPV------SAVDAVGA-------ASFAEFEAPDQDALD 421 (491) T ss_pred CCCCCCcc--------ccCc------Cccccccc-------ccccccCCCCCcchH Confidence 43221100 0000 00000000 000000000111111 No 223 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=88.53 E-value=0.032 Score=28.80 Aligned_cols=397 Identities=12% Similarity=0.038 Sum_probs=172.4 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCC-----ccccccccccCCCCCCcce-eecch Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYY-RGQN-----DGILSPASRRNEKGKADHR-AVHSF 73 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY-~G~~-----~i~~~~~~~~~~~~~~~~r-i~~n~ 73 (513) |++ -.|-.+..=+......+-+.+....+ ...-++. .|-. +|+.. .......... ..... T Consensus 1 m~~-----~i~~~~g~p~~~~~~~~~~~~~ia~~----~~~~~~~~~~~~~~~~~~iLr~----~~~~~~~y~~m~~D~~ 67 (491) T protein:vir:10 1 MSK-----GLWVSPTEFVTFGEPDKSLSSQIATR----ARSIDFFALGMYLPNPDPVLKA----LGKDIRVYRELRADAH 67 (491) T ss_pred CCC-----ceeCCCCCccCcccCChHHHHHHHhh----hcccccccccCCccchHHHHHh----cCCCHHHHHHHhhChH Confidence 444 33433322222111111111110000 0111110 0110 01100 0000000000 13566 Q ss_pred hHHHHHHHHHHhhcCCeeecCC--cH---HHHHHHHHhcCHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeEE-EEEcc Q lcl|NC_019916. 74 ARYIADFQTSYSVGNAIAMSGP--SS---DRLDDFNRRNDIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEVS-VKLDP 146 (513) Q Consensus 74 ~~~ivd~~~~~l~g~p~~~~~~--~~---~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~~-~~~~p 146 (513) ..-++++...-+.+.++++... ++ +.+.++++.-+|+.....+. +|.-+|.+. +++|...+|...+. +..-| T Consensus 68 i~s~l~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~ 146 (491) T protein:vir:10 68 VGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKP 146 (491) T ss_pred HHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeec Confidence 7778888888888999988542 22 35667777778888887775 788899885 56675444433221 11122 Q ss_pred cceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccccccc-ccCcccceEEe-- Q lcl|NC_019916. 147 MECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEH-SAQFGFPMIEY-- 223 (513) Q Consensus 147 ~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~vPvv~~-- 223 (513) ...| .|+... .+ +++...... .+.+ .+++.|-.++- T Consensus 147 ~~~f-~~d~~~--~l--------------------------------~~~~~~~~~------~g~~l~~~k~i~~~~~~~ 185 (491) T protein:vir:10 147 ADWF-VYDPEN--QL--------------------------------RFRSKDHWM------QGEELPARKFLVPRQEAT 185 (491) T ss_pred ccce-eeccCC--ce--------------------------------EEecCCCCC------CcceecCCCEEEEEecCC Confidence 2222 122211 11 111000000 0000 11111111110 Q ss_pred cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc Q lcl|NC_019916. 224 RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM 303 (513) Q Consensus 224 ~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 303 (513) ..+..|.|.+..+-...---+..+.+.+...+.|+.|+++.+=..+.... +.. .-...+..+ T Consensus 186 ~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~-----------ek~-------~l~~al~~~ 247 (491) T protein:vir:10 186 YLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDG-----------EKN-------LLLDCLEDM 247 (491) T ss_pred CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCHH-----------HHH-------HHHHHHHHH Confidence 02345778888887777777888899999999999999887642211110 000 001112333 Q ss_pred hhcceeeccccccccccccCCceeEEeec---CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHH-HHHHHHHHH Q lcl|NC_019916. 304 RQANMILLKTGMAPNGQQTSADANYIHKE---YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGV-AMKYKVLGT 379 (513) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~---~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~-Ai~~~~~~l 379 (513) ..+..+.+ ..+.++++++.. .+...++..++.+.+.|...--.=.++.+..++...|. .-+.. T Consensus 248 ~~~a~~vi---------P~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~vh~~v~---- 314 (491) T protein:vir:10 248 VQDAVAVV---------PDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQAGLEVT---- 314 (491) T ss_pred hcCcEEEe---------cCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcccchhHHHHHHHHH---- Confidence 33333333 345778888754 23456888898888888765322222223323332222 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHh--cC-CCHHHHHHhCCC Q lcl|NC_019916. 380 VELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG--AQ-IPQEYLYQYLPN 456 (513) Q Consensus 380 ~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-iS~et~~~~l~~ 456 (513) ...++.-.+.....+.++++-++.+ ..+. .+ ...+.|... .......++.+.++. |+ ++.+.+.+.++ T Consensus 315 ~di~~~D~~~i~~tln~li~~l~~~---N~~~--~~--~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~~~i~e~~G- 385 (491) T protein:vir:10 315 DDIRDGDKAVVSEAMNMLIRWICDL---NFDG--AD--RPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTPAYFKRAYN- 385 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---cCCC--CC--cceEEecCc-CchhHHHHHHHHHHHhCCCcCCHHHHHHHhC- Confidence 1222222344555566555544433 2221 11 245566543 233467788888874 55 88888888885 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 457 VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 457 v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ++.+..+.. ..+.. .....+..... + ...+ +....| T Consensus 386 ip~~~~~~~--------------~~~~~-~~~~~~~~~~~----~-~~~~-~~~~~d 421 (491) T protein:vir:10 386 LQDGDLDER--------------PLPVS-AVDTVGAASFA----E-FEAP-DQDALD 421 (491) T ss_pred CCCCCcCcc--------------ccccC-CCCCccccccc----c-cCCC-CCCchH Confidence 343221100 00000 00000000000 0 0000 000111 No 224 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=88.03 E-value=0.035 Score=28.57 Aligned_cols=393 Identities=9% Similarity=0.074 Sum_probs=164.0 Q ss_pred ccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc---------ccCCCC-CCcceeec Q lcl|NC_019916. 2 IDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPAS---------RRNEKG-KADHRAVH 71 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~---------~~~~~~-~~~~ri~~ 71 (513) |..-|+-+..+.+.. + +..++..+.|.......... ...... .+..=+.+ T Consensus 1 ~~~~~~~~~~~~~~g-----~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~ 60 (424) T protein:vir:18 1 MEEPKYTIDLRTNNG-----W---------------WARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQI 60 (424) T ss_pred CCCCcceEeecCCCc-----h---------------HHHHHhhhcccccccccccccccccccccccccccccHHHhhcc Confidence 333333333333322 2 22233333332111000000 000000 00001222 Q ss_pred chhHHHHHHHHHHhhcCCeee-c--CCc-------HHHHHHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCC Q lcl|NC_019916. 72 SFARYIADFQTSYSVGNAIAM-S--GPS-------SDRLDDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQ 136 (513) Q Consensus 72 n~~~~ivd~~~~~l~g~p~~~-~--~~~-------~~~l~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~ 136 (513) .-...+|+..++-+-+-|+.+ . .+. +..+..++. . |. .......+..+.+.+|.||+++-.+.+| T Consensus 61 ~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 140 (424) T protein:vir:18 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) T ss_pred HHHHHHHHHHHHhhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 334456777777777778775 1 111 222444443 2 32 3345566788999999999999888888 Q ss_pred ceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCc Q lcl|NC_019916. 137 KGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQF 216 (513) Q Consensus 137 ~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g 216 (513) .+.-.+.++|..+.+..++. .+. |.... ++ . ...|.++.+++++... + T Consensus 141 ~~~~L~pl~~~~V~v~~~~~---~~~-----y~~~~-~g---~----~~~~~~~eIih~r~~~---------------~- 188 (424) T protein:vir:18 141 DVISLLPLQSANMDVKLVGK---KVV-----YRYQR-DS---E----YADFSQKEIFHLKGFG---------------F- 188 (424) T ss_pred cEEEEEEecCcceEEEEcCC---eEE-----EEEEe-CC---e----EEEeccccEEEecCcC---------------C- Confidence 87656677888877655432 111 11111 00 0 1134455555443110 0 Q ss_pred ccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcccccc Q lcl|NC_019916. 217 GFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEK 296 (513) Q Consensus 217 ~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 296 (513) +...|.|-++.+...++....+..-..+.+...+.|-.+++-...... . +....++..- T Consensus 189 --------dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~----------~---e~~~~~~~~~ 247 (424) T protein:vir:18 189 --------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT----------E---QQRSQVEENF 247 (424) T ss_pred --------CCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCC----------H---HHHHHHHHHH Confidence 111355655555444443333333333334444445444432111000 0 0000111000 Q ss_pred chhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccc-cHHHHHHH Q lcl|NC_019916. 297 MAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNS-SGVAMKYK 375 (513) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~ 375 (513) .........++.+.+ ..+.+++.++.......+....+...+.|+..-++|+...+...++. .+..++-. T Consensus 248 ~~~~~g~nag~~~vl---------~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~ 318 (424) T protein:vir:18 248 KEIAGGPVKKRLWIL---------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ 318 (424) T ss_pred HHHhCCcccCCceec---------cCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHH Confidence 000000011122222 22334444444444555667778888999999999986654333222 22333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHH Q lcl|NC_019916. 376 VLGTVELASTKRKQFERGLNQRYTVVAHIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQ 452 (513) Q Consensus 376 ~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~ 452 (513) .... +...|...++.+..-+...- .........+++.+..-+..|..+.++++.++ +|+++.-.+.+ T Consensus 319 ~~~f----------~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~ 388 (424) T protein:vir:18 319 NLGF----------LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRR 388 (424) T ss_pred HHHH----------HHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 2211 22233333333333332211 11111122355555666778899999998887 67888766666 Q ss_pred hCCC--CCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 453 YLPN--VTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGV 499 (513) Q Consensus 453 ~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (513) .++. +++-++-+ ......+..... +..+..++++ T Consensus 389 ~~gl~pi~gGD~~~---------~~~n~~~l~~~~----~~~~p~~~ga 424 (424) T protein:vir:18 389 TDNLPPLPGGDVAM---------RQSQYVPITDLG----TNKEPRNNGA 424 (424) T ss_pred HhCCCCCCCcCeee---------eccCccchHhhh----ccCCCccCCC Confidence 6532 22100000 000000000000 0011111111 No 225 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=87.96 E-value=0.035 Score=28.54 Aligned_cols=407 Identities=9% Similarity=0.013 Sum_probs=163.5 Q ss_pred HHHHHHHHHHH--H---HHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC--Ce----- Q lcl|NC_019916. 23 IAAFIRHHYNN--Q---RPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN--AI----- 90 (513) Q Consensus 23 i~~~i~~~~~~--~---~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~--p~----- 90 (513) +++.+.+.... + ..+.+.+.+|..-. +..... ........++..+-....+++.++.|++- |+ T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~---~~~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc---cCCCCC--CccccccCCCccchHHHHHHHHHHHHHhhhcCCCCccc Confidence 22222222221 2 23344455554331 111111 01112223455677777888888777642 22 Q ss_pred eecCCcH-----------------------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 91 AMSGPSS-----------------------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 91 ~~~~~~~-----------------------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) ++...+. ..+...+..++|.....++.++...+|.+ ++|.++++.. +. .-|. T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a--~l~~~~~~~~-~~--~~pl 150 (510) T protein:vir:63 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNA--LLYRDSDAAT-VV--AWSL 150 (510) T ss_pred ccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeE--EEEEcCCCcE-EE--EEEc Confidence 2221110 11333445678999999999999999986 4566776542 32 2244 Q ss_pred ceEEEecCCCCcceEEEEEEEeeccc------------ccccceeEEEEEEEc-----CCcE----EEEEeeccCCcccc Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQTV------------VDNITQTKYEVETWT-----ENDY----TRYKPIVVAGSVPT 206 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~~------------~~~~~~~~~~ve~yt-----~~~~----~~~~~~~~~~~~~~ 206 (513) .-+++--|.. +++...+|.+..... ..........+++|+ ++.. ..|... .+... . T Consensus 151 ~~y~v~~d~~-G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~-dg~~~-~ 227 (510) T protein:vir:63 151 RSYAVRRDAT-GRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEI-DGVRV-G 227 (510) T ss_pred ceeEEeeCCC-cCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEe-cCcee-c Confidence 4455554433 344444444332110 000111111233333 1211 111111 11111 1 Q ss_pred ccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccc Q lcl|NC_019916. 207 LEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMV 281 (513) Q Consensus 207 ~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~ 281 (513) ..-..++..+|++.++- +.+|+|-.++..+-+-.+|.+.-...........|.+.+.-.+.. T Consensus 228 --~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~----------- 294 (510) T protein:vir:63 228 --KEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGA----------- 294 (510) T ss_pred --cccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCccccc----------- Confidence 11123456688877753 457999888888888888887666666666556655332210000 Q ss_pred cchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_019916. 282 DPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKFSHTPDLT 359 (513) Q Consensus 282 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 359 (513) + ..... ....+... .+...+++.+. +..+.......++.++..|...-. -++. T Consensus 295 ---------~--------~~~~~-----~~~~g~~v--~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~-~~l~ 349 (510) T protein:vir:63 295 ---------V--------VDDYQ-----DAEMGDYV--PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGAN 349 (510) T ss_pred ---------c--------hhhhc-----cCCCceee--cCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHH-hhcc Confidence 0 00000 00000011 11122333332 223455556667777666644311 1111 Q ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCH Q lcl|NC_019916. 360 DDNFSGNSSGVAMKYKVLGTVELASTKRKQFERG--------LNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDD 431 (513) Q Consensus 360 ~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~--------l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~ 431 (513) ..-+...++.-++.. +.+++..+|.. +.-+++.++.++... +........+.... ....+. T Consensus 350 -~~~~~rvTAtEV~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-gl~p~p~~~~~~~~--v~~is~ 418 (510) T protein:vir:63 350 -QRDAERVTAEEVRIT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAI--ETGLPA 418 (510) T ss_pred -cCCCCCcCHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-cCCCCCchhcccce--ecchhH Confidence 111233466544433 23333344443 333444455555432 22111111222111 112222 Q ss_pred HHHHHHHHH----------Hhc---C---CCHHHHH----HhCCCC-C----CHHHHHHHHHHHHHH----HHHHh---h Q lcl|NC_019916. 432 VAIITALVQ----------AGA---Q---IPQEYLY----QYLPNV-T----DADEIVKMMDKQRKA----MLKTY---D 479 (513) Q Consensus 432 ~e~a~~~~k----------l~g---~---iS~et~~----~~l~~v-~----D~~~E~~ri~~E~~~----~~~~~---~ 479 (513) ...++-+.+ +.+ + +....++ ..++ | + -.++|++.+.+++.+ +++.. . T Consensus 419 Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~G-v~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~ 497 (510) T protein:vir:63 419 LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLL 497 (510) T ss_pred HHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhC-CChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222111 111 1 2222222 3333 3 1 134555554433221 11111 1 Q ss_pred hhcCCCCCCCCCC Q lcl|NC_019916. 480 TKGGLIINGTSGN 492 (513) Q Consensus 480 ~~~~~~~~~~~~~ 492 (513) ...+...+...+- T Consensus 498 ~~a~~~~~~~~g~ 510 (510) T protein:vir:63 498 EGASDMTNALAGV 510 (510) T ss_pred HHHHhhcccccCC Confidence 1111111111111 No 226 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=81.52 E-value=0.085 Score=26.45 Aligned_cols=402 Identities=10% Similarity=-0.020 Sum_probs=140.8 Q ss_pred HHHHHHHHHHHH----HHHHH-----------------------HHHHhcCCCccccccccccCCCCCCc---ceeecch Q lcl|NC_019916. 24 AAFIRHHYNNQR----PRLEM-----------------------LYDYYRGQNDGILSPASRRNEKGKAD---HRAVHSF 73 (513) Q Consensus 24 ~~~i~~~~~~~~----~~~~~-----------------------~~~YY~G~~~i~~~~~~~~~~~~~~~---~ri~~n~ 73 (513) ..+|+..+.... ..... +..+..|.-.. .....+... .-+.+.. T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~------~~~~~g~~v~~~~a~~~~~ 74 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTE------LAPDTFVGLATQAYQANGP 74 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhcccccc------ccCccccccchhhhhccHH Confidence 222222211110 00100 11111111000 000001111 1123456 Q ss_pred hHHHHHHHHHHhhcCCeeecCCc--------HHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCc---- Q lcl|NC_019916. 74 ARYIADFQTSYSVGNAIAMSGPS--------SDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQK---- 137 (513) Q Consensus 74 ~~~ivd~~~~~l~g~p~~~~~~~--------~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~---- 137 (513) ...+|+..+.-+-+-|+.+.... +..+..++.. |. .......+..+++.+|.||+++..++.+. T Consensus 75 v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~ 154 (466) T protein:vir:81 75 VFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPD 154 (466) T ss_pred HHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccc Confidence 66788888888878888763211 1223344433 32 33455677889999999999998776543 Q ss_pred ----eeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccc Q lcl|NC_019916. 138 ----GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHS 213 (513) Q Consensus 138 ----~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (513) +.-.+.++|..+.+..+........+ .|.... . ........+..+.+++++.. .+ T Consensus 155 ~~g~~~~l~~l~~~~v~~~~~~~~~~~~~y---~~~~~~-~----~~~~~~~~~~~~dviHir~~-------------~~ 213 (466) T protein:vir:81 155 WVDVVVEERMVRGGRGELGGGQLGWRKVGY---LYTEGG-R----QSGNESVGFLAEDVVHFAPI-------------PD 213 (466) T ss_pred cCcceeEEEEecCcceEEEEcCCCceEEEE---EEEecC-c----ccccceeeeccccEEEEcCC-------------CC Confidence 22233455555555554322111111 111100 0 00000112233333332110 00 Q ss_pred cCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccc Q lcl|NC_019916. 214 AQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLA 293 (513) Q Consensus 214 ~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~ 293 (513) ++. ...|.|-+..+...++....+..-....+...+.|-.+++-.... .. +....++ T Consensus 214 ~~d---------~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l-----------~~---e~~~~~~ 270 (466) T protein:vir:81 214 PLA---------SYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMA-----------DP---AAVKKWA 270 (466) T ss_pred ccc---------ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCC-----------CH---HHHHHHH Confidence 011 114666666655555544444333344444444454444321100 00 0111111 Q ss_pred cccchhhhcchh-cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc--ccccHH Q lcl|NC_019916. 294 DEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS--GNSSGV 370 (513) Q Consensus 294 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~n~Sg~ 370 (513) ..=......... ++.+.+ ..+.+++.++.......+....+...+.|+..-++|++..+... +..++. T Consensus 271 ~~~~~~~~g~~n~g~~~vl---------~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~s 341 (466) T protein:vir:81 271 DEVNSKHAGVDNAWKNLNL---------YPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYS 341 (466) T ss_pred HHHHHHhcCccccccceEc---------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccc Confidence 100000000000 112222 23344444544444555667778888999999999987654211 112222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeC--CCCCcCHHHHHHHHH-------HH Q lcl|NC_019916. 371 AMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFR--DNLPTDDVAIITALV-------QA 441 (513) Q Consensus 371 Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~--~~~p~d~~e~a~~~~-------kl 441 (513) .++-..... +...|...++.+...+...--. ......+.+.|+ .-+-.|..+.+++.. .+ T Consensus 342 n~eq~~~~f----------~~~tl~P~~~~ie~~l~~~L~~-~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~ 410 (466) T protein:vir:81 342 NYGQARRRL----------ADGTAHPLWQNLSGCIGHVMPD-MGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTL 410 (466) T ss_pred cHHHHHHHH----------HHHHHHHHHHHHHHHHHhhcCC-cccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHH Confidence 222111111 1222222222222212111000 001112345553 334446666555422 11 Q ss_pred --hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 442 --GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 442 --~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) +|+ ....+....+.-++ .+ +.... +...+....... .+...+.+...|. ++..+ T Consensus 411 ~~~g~-t~nE~r~~~~~gd~---~~--~~~~~---~~~~~~~~~~~~------~~~~~~~~~~~Gg--~~ngn 466 (466) T protein:vir:81 411 ITAGY-EPESVVAAVNSGDL---RL--LKHTG---LTSVQLLPPGVS------ASASSDTPTSGGA--DDNGN 466 (466) T ss_pred HHcCC-ChhhccccccCCcc---cc--ccCCC---cchhhhcccccc------cccCCCCcccCCC--CcCCC Confidence 232 33323222211110 00 00000 000000000000 0000000011111 11111 No 227 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=79.19 E-value=0.11 Score=25.91 Aligned_cols=372 Identities=10% Similarity=0.025 Sum_probs=125.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc---HHHH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS---SDRL 100 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~---~~~l 100 (513) ..+++ +.+..+...................-+.......+|+..++-+-+-|+.+-... +..+ T Consensus 1 Mg~f~--------------~lf~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~ 66 (395) T protein:vir:10 1 MSILE--------------KIFKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDV 66 (395) T ss_pred Cchhh--------------hhhccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchH Confidence 11111 111111111000000000000001112345556677777777777777643222 1223 Q ss_pred HHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce--EEEecCCCCcceEEEEEEEeeccc Q lcl|NC_019916. 101 DDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC--FIIYDRSVNPKPIMAVRYHAVQTV 173 (513) Q Consensus 101 ~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~--~~~~d~~~~~~~~~~ir~~~~~~~ 173 (513) ..++. . |. .......+..+.+..|.+|+++.. ++.. +.+++..+ ..++++. ...+..+ . T Consensus 67 ~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~-----~~~~~~~---~- 132 (395) T protein:vir:10 67 YYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSD--SKEL---LIADSFYREEYALYDDI-----FKDVTVK---D- 132 (395) T ss_pred HHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEec--CCCe---EecCCccceeEeecCcc-----eeEEEEc---C- Confidence 33332 2 33 223444566677777877755432 2221 22333222 2222211 0111100 0 Q ss_pred ccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 174 VDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANY 253 (513) Q Consensus 174 ~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~ 253 (513) . . ....+.+..+++++.. ++ .....|.|-++.....++... +. T Consensus 133 -~----~--~~~~~~~~evih~~~~--------------~~---------~~~~~G~spi~~~~~~~~~~~-------~~ 175 (395) T protein:vir:10 133 -Y----T--YQRTFTMQEVIYLKYN--------------NN---------KVTHFVESLFEDYGKIFGRMI-------GA 175 (395) T ss_pred -c----e--eeeeeccccEEEEccC--------------CC---------CcccccchHHHHHHHHHHHHH-------HH Confidence 0 0 0112333333333210 00 011234555544444443222 12 Q ss_pred HHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch--hcceeeccccccccccccCCceeEEee Q lcl|NC_019916. 254 MTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR--QANMILLKTGMAPNGQQTSADANYIHK 331 (513) Q Consensus 254 ~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~ 331 (513) +...+.+--+++-... ....+... .++..-........ ...++.+ +++++|... T Consensus 176 ~~~~~~~~gii~~~~~----------~~~~e~~~---~~~~~~~~~~~~~~~~~~~v~~l-----------~~g~~~~~l 231 (395) T protein:vir:10 176 QLKNYQIRGILKSASS----------AYDEKNIE---KLQAFTNKLFNTFNKNQLAIAPL-----------IEGFDYEEL 231 (395) T ss_pred HHhcCCCceEEEeCCC----------CCCHHHHH---HHHHHHHHHhccccccCcceEEc-----------CCCceeeec Confidence 2222222111111000 00001110 00000000000000 0011111 223333332 Q ss_pred cCC-------HHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 332 EYD-------SAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 332 ~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) +.+ ...+....+...+.|+..-++|+.......+|.+..... ....+|...+..+... T Consensus 232 ~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~---------------~~~~~l~P~~~~ie~~ 296 (395) T protein:vir:10 232 SNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLV---------------FEKFCLTPLLKKIQNE 296 (395) T ss_pred cccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHH---------------HHHHHHHHHHHHHHHH Confidence 222 123566677778889999999876543221222211121 2222333333333332 Q ss_pred HHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 405 EERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDT 480 (513) Q Consensus 405 l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~ 480 (513) +...--........+++.++.-+-.|..+.++++.++ +|+++.-.+.++++. +++.. ..+ ...+ T Consensus 297 l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~----------~~~~ 364 (395) T protein:vir:10 297 LNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDE----------YLIT 364 (395) T ss_pred HHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cce----------eeec Confidence 2221000000111234556666677888899988876 578887666666543 22210 000 0000 Q ss_pred hcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 481 KGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ......+.....+.. .+.+...|.+.++ ..| T Consensus 365 ~n~~~~~~~~~~~~~-~~~~~~kgg~~~~-~g~ 395 (395) T protein:vir:10 365 KNYEKANSGENDEKE-KDENTLKGGDEDE-SGD 395 (395) T ss_pred cccccccccccccCc-ccccccCCCCCCC-CCC Confidence 000000000000000 0011111111111 122 No 228 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=79.19 E-value=0.11 Score=25.91 Aligned_cols=372 Identities=10% Similarity=0.025 Sum_probs=125.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc---HHHH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS---SDRL 100 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~---~~~l 100 (513) ..+++ +.+..+...................-+.......+|+..++-+-+-|+.+-... +..+ T Consensus 1 Mg~f~--------------~lf~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~ 66 (395) T protein:vir:95 1 MSILE--------------KIFKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDV 66 (395) T ss_pred Cchhh--------------hhhccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchH Confidence 11111 111111111000000000000001112345556677777777777777643222 1223 Q ss_pred HHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce--EEEecCCCCcceEEEEEEEeeccc Q lcl|NC_019916. 101 DDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC--FIIYDRSVNPKPIMAVRYHAVQTV 173 (513) Q Consensus 101 ~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~--~~~~d~~~~~~~~~~ir~~~~~~~ 173 (513) ..++. . |. .......+..+.+..|.+|+++.. ++.. +.+++..+ ..++++. ...+..+ . T Consensus 67 ~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~-----~~~~~~~---~- 132 (395) T protein:vir:95 67 YYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSD--SKEL---LIADSFYREEYALYDDI-----FKDVTVK---D- 132 (395) T ss_pred HHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEec--CCCe---EecCCccceeEeecCcc-----eeEEEEc---C- Confidence 33332 2 33 223444566677777877755432 2221 22333222 2222211 0111100 0 Q ss_pred ccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 174 VDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANY 253 (513) Q Consensus 174 ~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~ 253 (513) . . ....+.+..+++++.. ++ .....|.|-++.....++... +. T Consensus 133 -~----~--~~~~~~~~evih~~~~--------------~~---------~~~~~G~spi~~~~~~~~~~~-------~~ 175 (395) T protein:vir:95 133 -Y----T--YQRTFTMQEVIYLKYN--------------NN---------KVTHFVESLFEDYGKIFGRMI-------GA 175 (395) T ss_pred -c----e--eeeeeccccEEEEccC--------------CC---------CcccccchHHHHHHHHHHHHH-------HH Confidence 0 0 0112333333333210 00 011234555544444443222 12 Q ss_pred HHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch--hcceeeccccccccccccCCceeEEee Q lcl|NC_019916. 254 MTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR--QANMILLKTGMAPNGQQTSADANYIHK 331 (513) Q Consensus 254 ~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~ 331 (513) +...+.+--+++-... ....+... .++..-........ ...++.+ +++++|... T Consensus 176 ~~~~~~~~gii~~~~~----------~~~~e~~~---~~~~~~~~~~~~~~~~~~~v~~l-----------~~g~~~~~l 231 (395) T protein:vir:95 176 QLKNYQIRGILKSASS----------AYDEKNIE---KLQAFTNKLFNTFNKNQLAIAPL-----------IEGFDYEEL 231 (395) T ss_pred HHhcCCCceEEEeCCC----------CCCHHHHH---HHHHHHHHHhccccccCcceEEc-----------CCCceeeec Confidence 2222222111111000 00001110 00000000000000 0011111 223333332 Q ss_pred cCC-------HHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 332 EYD-------SAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 332 ~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) +.+ ...+....+...+.|+..-++|+.......+|.+..... ....+|...+..+... T Consensus 232 ~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~---------------~~~~~l~P~~~~ie~~ 296 (395) T protein:vir:95 232 SNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLV---------------FEKFCLTPLLKKIQNE 296 (395) T ss_pred cccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHH---------------HHHHHHHHHHHHHHHH Confidence 222 123566677778889999999876543221222211121 2222333333333332 Q ss_pred HHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 405 EERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDT 480 (513) Q Consensus 405 l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~ 480 (513) +...--........+++.++.-+-.|..+.++++.++ +|+++.-.+.++++. +++.. ..+ ...+ T Consensus 297 l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~----------~~~~ 364 (395) T protein:vir:95 297 LNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDE----------YLIT 364 (395) T ss_pred HHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cce----------eeec Confidence 2221000000111234556666677888899988876 578887666666543 22210 000 0000 Q ss_pred hcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 481 KGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ......+.....+.. .+.+...|.+.++ ..| T Consensus 365 ~n~~~~~~~~~~~~~-~~~~~~kgg~~~~-~g~ 395 (395) T protein:vir:95 365 KNYEKANSGENDEKE-KDENTLKGGDEDE-SGD 395 (395) T ss_pred cccccccccccccCc-ccccccCCCCCCC-CCC Confidence 000000000000000 0011111111111 122 No 229 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=79.19 E-value=0.11 Score=25.91 Aligned_cols=372 Identities=10% Similarity=0.025 Sum_probs=125.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc---HHHH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS---SDRL 100 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~---~~~l 100 (513) ..+++ +.+..+...................-+.......+|+..++-+-+-|+.+-... +..+ T Consensus 1 Mg~f~--------------~lf~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~ 66 (395) T protein:vir:10 1 MSILE--------------KIFKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDV 66 (395) T ss_pred Cchhh--------------hhhccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchH Confidence 11111 111111111000000000000001112345556677777777777777643222 1223 Q ss_pred HHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccce--EEEecCCCCcceEEEEEEEeeccc Q lcl|NC_019916. 101 DDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMEC--FIIYDRSVNPKPIMAVRYHAVQTV 173 (513) Q Consensus 101 ~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~--~~~~d~~~~~~~~~~ir~~~~~~~ 173 (513) ..++. . |. .......+..+.+..|.+|+++.. ++.. +.+++..+ ..++++. ...+..+ . T Consensus 67 ~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~-----~~~~~~~---~- 132 (395) T protein:vir:10 67 YYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSD--SKEL---LIADSFYREEYALYDDI-----FKDVTVK---D- 132 (395) T ss_pred HHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEec--CCCe---EecCCccceeEeecCcc-----eeEEEEc---C- Confidence 33332 2 33 223444566677777877755432 2221 22333222 2222211 0111100 0 Q ss_pred ccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 174 VDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANY 253 (513) Q Consensus 174 ~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~ 253 (513) . . ....+.+..+++++.. ++ .....|.|-++.....++... +. T Consensus 133 -~----~--~~~~~~~~evih~~~~--------------~~---------~~~~~G~spi~~~~~~~~~~~-------~~ 175 (395) T protein:vir:10 133 -Y----T--YQRTFTMQEVIYLKYN--------------NN---------KVTHFVESLFEDYGKIFGRMI-------GA 175 (395) T ss_pred -c----e--eeeeeccccEEEEccC--------------CC---------CcccccchHHHHHHHHHHHHH-------HH Confidence 0 0 0112333333333210 00 011234555544444443222 12 Q ss_pred HHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch--hcceeeccccccccccccCCceeEEee Q lcl|NC_019916. 254 MTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR--QANMILLKTGMAPNGQQTSADANYIHK 331 (513) Q Consensus 254 ~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~ 331 (513) +...+.+--+++-... ....+... .++..-........ ...++.+ +++++|... T Consensus 176 ~~~~~~~~gii~~~~~----------~~~~e~~~---~~~~~~~~~~~~~~~~~~~v~~l-----------~~g~~~~~l 231 (395) T protein:vir:10 176 QLKNYQIRGILKSASS----------AYDEKNIE---KLQAFTNKLFNTFNKNQLAIAPL-----------IEGFDYEEL 231 (395) T ss_pred HHhcCCCceEEEeCCC----------CCCHHHHH---HHHHHHHHHhccccccCcceEEc-----------CCCceeeec Confidence 2222222111111000 00001110 00000000000000 0011111 223333332 Q ss_pred cCC-------HHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 332 EYD-------SAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 332 ~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) +.+ ...+....+...+.|+..-++|+.......+|.+..... ....+|...+..+... T Consensus 232 ~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~---------------~~~~~l~P~~~~ie~~ 296 (395) T protein:vir:10 232 SNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLV---------------FEKFCLTPLLKKIQNE 296 (395) T ss_pred cccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHH---------------HHHHHHHHHHHHHHHH Confidence 222 123566677778889999999876543221222211121 2222333333333332 Q ss_pred HHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 405 EERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDT 480 (513) Q Consensus 405 l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~ 480 (513) +...--........+++.++.-+-.|..+.++++.++ +|+++.-.+.++++. +++.. ..+ ...+ T Consensus 297 l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~----------~~~~ 364 (395) T protein:vir:10 297 LNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDE----------YLIT 364 (395) T ss_pred HHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cce----------eeec Confidence 2221000000111234556666677888899988876 578887666666543 22210 000 0000 Q ss_pred hcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 481 KGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ......+.....+.. .+.+...|.+.++ ..| T Consensus 365 ~n~~~~~~~~~~~~~-~~~~~~kgg~~~~-~g~ 395 (395) T protein:vir:10 365 KNYEKANSGENDEKE-KDENTLKGGDEDE-SGD 395 (395) T ss_pred cccccccccccccCc-ccccccCCCCCCC-CCC Confidence 000000000000000 0011111111111 122 No 230 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=76.91 E-value=0.13 Score=25.43 Aligned_cols=414 Identities=8% Similarity=-0.060 Sum_probs=155.4 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHH-hcCCC-----ccccccccccCCCCCCcceeecchh Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDY-YRGQN-----DGILSPASRRNEKGKADHRAVHSFA 74 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~Y-Y~G~~-----~i~~~~~~~~~~~~~~~~ri~~n~~ 74 (513) |-.=++....-......+.+.....+... + ...-.| |.|-. +|+.. ... -....... ..... T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~----~----~~~~~~~~~g~~~~~~~~iLr~-~~~--~~ly~~m~-~D~hi 68 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGA----S----VPVMSTSYDVVVDREFDELLQG-KDG--LLVYHKML-SDGTV 68 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccc----h----hhhcccccccccccchhHhhcc-ccc--hHHHHHHh-hChHH Confidence 11000000000000011111111111000 0 000001 12211 11100 000 00000011 24556 Q ss_pred HHHHHHHHHHhhcCCeeecCC--c--HH----HHHHHHHh-------cCHHHHHHHHHHHHhhCCeEE-EEeee-cCCCc Q lcl|NC_019916. 75 RYIADFQTSYSVGNAIAMSGP--S--SD----RLDDFNRR-------NDIDTLNYELYLDMTVTGRAY-EYVYR-DPSQK 137 (513) Q Consensus 75 ~~ivd~~~~~l~g~p~~~~~~--~--~~----~l~~~~~~-------n~~~~~~~~~~~~a~~~G~~~-~~v~~-d~~~~ 137 (513) .-++.+....+.+.++++... + +. .+.+++.. ..|......+ .+|..+|.+. +++|. ..+|. T Consensus 69 ~s~l~~Rk~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~ 147 (448) T protein:vir:77 69 KNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGK 147 (448) T ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCc Confidence 667777778888888888531 1 11 23334332 1466666666 6899999886 56663 34454 Q ss_pred eeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCc-cccccccccccCc Q lcl|NC_019916. 138 GEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGS-VPTLEVAEHSAQF 216 (513) Q Consensus 138 ~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~-~~~~~~~~~~~~g 216 (513) ..+. .+.+.. .+. ++++.. +++...+++....... ........+.+++ T Consensus 148 ~~~~-~l~~r~----------~~~---~~~f~~-----------------~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~ 196 (448) T protein:vir:77 148 LILD-KIVPIH----------PFN---IDEVLY-----------------DEEGGPKALKLSGEVKGGSQFVNGLEIPIW 196 (448) T ss_pred eeec-cccccC----------CCc---cceeee-----------------ecCCceEEEecCCcccccccCCCccccccc Confidence 3211 111100 000 111111 1111111110000000 0000011122334 Q ss_pred ccceEEecC----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhcc Q lcl|NC_019916. 217 GFPMIEYRN----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKL 292 (513) Q Consensus 217 ~vPvv~~~n----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l 292 (513) .+ |.+.. +..|.|.+..+-...=-=+..+.+.+...+.|+.|+++.+-..+...+. .+.. .+ T Consensus 197 ~~--i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~---------~~~~---~l 262 (448) T protein:vir:77 197 KT--VVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGT---------KQWE---AA 262 (448) T ss_pred eE--EEEecCCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCH---------HHHH---HH Confidence 43 22222 2346777777666555556777888999999999999877432221100 0000 00 Q ss_pred ccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHH Q lcl|NC_019916. 293 ADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAM 372 (513) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai 372 (513) . ..+..+..+..+. .. ...+..++++........+...++...+.|...--.-.++.+..+| .++.|. T Consensus 263 ~-~av~~i~~g~~a~-~i---------iP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g-~~~~~~ 330 (448) T protein:vir:77 263 K-EIVKNFVQKPRHG-II---------LPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMG-VQAVNI 330 (448) T ss_pred H-HHHHHHhcCCceE-EE---------ecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhccccccccccc-hhhhhh Confidence 0 0000111111111 11 2345677888876666667778898888887764433343333222 222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCHHHHH Q lcl|NC_019916. 373 KYKVLGTVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQEYLY 451 (513) Q Consensus 373 ~~~~~~l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~ 451 (513) .....-.......-.+.+...+. ++++-++.+ +-+ .+.....+.|....+.|..+.++.+.++.+.+ . T Consensus 331 ~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~l----Nfg--~~~~~P~~~f~~~e~eDl~~~a~~~~~l~~~~-----~ 399 (448) T protein:vir:77 331 GEFVSLTQQTIISLQREFASAVNLYLIPKLVLP----NWP--GATRFPRLTFEMEERNDFSAAANLMGMLINAV-----K 399 (448) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC--CCCCCCEEEecCCChhhHHHHHHHhHHHHHHH-----H Confidence 21111011111111122333332 233333322 211 11123468899999999999999888875421 1 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 452 QYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 452 ~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.+ .+..+ .............+..+..++..+....+.+++--- T Consensus 400 ~~~-~ip~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (448) T protein:vir:77 400 DSE-DIPTE-----------------LKALIDALPSKMRRALGVVDEVREAVRQPADSRYLY 443 (448) T ss_pred HHh-cCCcc-----------------CCcCCCCCchhcccccCCCCCCCchhhcchhhHHHH Confidence 111 01100 000000011111111111111111111111111111 No 231 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=72.10 E-value=0.19 Score=24.57 Aligned_cols=476 Identities=11% Similarity=0.007 Sum_probs=157.9 Q ss_pred Cccc----------hhhceeccCCcccCCHHHHH-----HHHHHH-------HHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019916. 1 MIDM----------QQANMNYQEDADKLTPTRIA-----AFIRHH-------YNNQRPRLEMLYDYYRGQNDGILSPASR 58 (513) Q Consensus 1 ~~~~----------~~~~~~~~~~~~~~~~~~i~-----~~i~~~-------~~~~~~~~~~~~~YY~G~~~i~~~~~~~ 58 (513) |-.| +++.-..+.+.+. ....|. .++.-. .-....+|+.+..+++- T Consensus 1 m~~lfgf~i~~~~~~~~~S~vpp~~~~-~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEV----------- 68 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQSPVPPNDEA-SVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEV----------- 68 (564) T ss_pred CcchhcceeeeeccCCCCCcccCCcCC-ChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccch----------- Confidence 1111 1111111111111 111110 000000 00111122222222221 Q ss_pred cCCCCCCcceeecchhHHHHHHHH-HHhhcCCeeecCCc-----------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeE Q lcl|NC_019916. 59 RNEKGKADHRAVHSFARYIADFQT-SYSVGNAIAMSGPS-----------SDRLDDFNRRNDIDTLNYELYLDMTVTGRA 126 (513) Q Consensus 59 ~~~~~~~~~ri~~n~~~~ivd~~~-~~l~g~p~~~~~~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~ 126 (513) .+-...||+..+ .--..+||.+.-++ -++.+.+++--+|+....+..|.+.+.|+. T Consensus 69 ------------d~Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi 136 (564) T protein:vir:10 69 ------------DSAIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRS 136 (564) T ss_pred ------------hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceE Confidence 122222333322 22234556554332 134556677778999999999999999999 Q ss_pred EEEeeecCCCc---eeEEEEEcccceEEEecCCCCc--ceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccC Q lcl|NC_019916. 127 YEYVYRDPSQK---GEVSVKLDPMECFIIYDRSVNP--KPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVA 201 (513) Q Consensus 127 ~~~v~~d~~~~---~~~~~~~~p~~~~~~~d~~~~~--~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~ 201 (513) |.+.-+|.+.. +.-...++|+.+-.++..-... .-...++-+.... . .......-+|.+....--.....+ T Consensus 137 ~fHkiid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~-~---y~~~~Eyy~Ynp~~~~g~~~~~~~ 212 (564) T protein:vir:10 137 HYHKVIDLDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQY-D---YGDFIEYYIYNPKGFAGNIPMVTG 212 (564) T ss_pred EEEEEeeCCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeec-c---ccccccceeeccccccCccccccc Confidence 99988874331 1111236888776666322111 1111111110000 0 000000112222210000000000 Q ss_pred CccccccccccccCcccceEEe----cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccc Q lcl|NC_019916. 202 GSVPTLEVAEHSAQFGFPMIEY----RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTL 277 (513) Q Consensus 202 ~~~~~~~~~~~~~~g~vPvv~~----~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~ 277 (513) ...........-+-..|+.++. ++...-.|-++..+....-+ +++-|.+....-...|-+=+.-......-.... T Consensus 213 ~~~~~~~~~ikI~~daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQL-kmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KA 291 (564) T protein:vir:10 213 SMDWSNQEGIKIASDAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKA 291 (564) T ss_pred ccccccccceeechhhcceecccceeCCCCceeccchhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhH Confidence 0000000000011111222211 11111223333321111111 112222222222222221111111000000000 Q ss_pred cccccchhhhh-hhccccccchhhhcchhcce----eeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 278 LQMVDPSDADA-MKKLADEKMAQLEAMRQANM----ILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKF 352 (513) Q Consensus 278 ~~~~~~~~~~~-~~~l~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 352 (513) .+....-...+ ++..-+.....++.++.-.. ++|+. -..+.+-.+..|---.|+.-. .-++-.++-+|.. T Consensus 292 eqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPR----ReGgrgTEItTLpGgqnLgem-~DV~YF~kKLY~a 366 (564) T protein:vir:10 292 EQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPR----REGGRGTEITTLPGGQNLGEL-KDVEYFKKKLYNS 366 (564) T ss_pred HHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcccc----cCCCcccceeeccccCCcchH-HHHHHHHHHHHHH Confidence 00000000000 00000000000111110000 01110 000111222222222232222 2344555666666 Q ss_pred hCcccc--ccccccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----ceeeEEeCC Q lcl|NC_019916. 353 SHTPDL--TDDNFSGNS-SGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDP----DEIGFIFRD 425 (513) Q Consensus 353 s~~p~~--~~~~~~~n~-Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~----~~i~i~f~~ 425 (513) -++|-. ..++.+-+. -+..|-.-+....+.+.+.+..|..-|..+++.=+-+ ++.....++ ..|.+.|.. T Consensus 367 LnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiL---Kgiit~eeW~~i~~~I~~~f~~ 443 (564) T protein:vir:10 367 LNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLIL---KGIITPEDWDDMEEHIQYDFLF 443 (564) T ss_pred hCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh---ccCCCHHHHHHHhhcceEEeee Confidence 688743 222211121 1223444455556667888888888888887753322 222222233 457788865 Q ss_pred CCCcCHHHHHH-------HHHHHh---c-CCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHH---HHhhhhcCCCCCCC Q lcl|NC_019916. 426 NLPTDDVAIIT-------ALVQAG---A-QIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAML---KTYDTKGGLIINGT 489 (513) Q Consensus 426 ~~p~d~~e~a~-------~~~kl~---g-~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~---~~~~~~~~~~~~~~ 489 (513) .-.-.+...++ ++..+. | .+|.+++.+.+=--+| .+++-+.|++|..+.. +....+++....+. T Consensus 444 Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~ 523 (564) T protein:vir:10 444 DNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQN 523 (564) T ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCC Confidence 55544444333 333442 3 4799999988544443 5666667777655311 00001111100000 Q ss_pred C-------C--C----CCCCCCCCCCCCCCCC--ccCCC Q lcl|NC_019916. 490 S-------G--N----DPEDEGVRGQQGEPED--ERTSD 513 (513) Q Consensus 490 ~-------~--~----~~~~~~~~~~~~~~~~--~~~~~ 513 (513) . + + +.+..........|.+ .+++- T Consensus 524 ~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 562 (564) T protein:vir:10 524 QAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKSQS 562 (564) T ss_pred CcCCcchhhhccccccccChhhhccCCCCCCCCCCcCcC Confidence 0 0 0 0000000000011111 11111 No 232 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=72.07 E-value=0.19 Score=24.57 Aligned_cols=389 Identities=10% Similarity=0.025 Sum_probs=145.8 Q ss_pred HHHHHHHHH------HHHHHHHHHH-----------HHhcCCCcccccc----ccccCCCCCCcceeecchhHHHHHHHH Q lcl|NC_019916. 24 AAFIRHHYN------NQRPRLEMLY-----------DYYRGQNDGILSP----ASRRNEKGKADHRAVHSFARYIADFQT 82 (513) Q Consensus 24 ~~~i~~~~~------~~~~~~~~~~-----------~YY~G~~~i~~~~----~~~~~~~~~~~~ri~~n~~~~ivd~~~ 82 (513) .-+++.++- ...++.+--. +.+.|-.+..... ............=+.+.-...+|+..+ T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 111111100 0001110000 0011100000000 000000000001112333445667766 Q ss_pred HHhhcCCeee-cCCc------HHHHHHHHHh--cCH---HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceE Q lcl|NC_019916. 83 SYSVGNAIAM-SGPS------SDRLDDFNRR--NDI---DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECF 150 (513) Q Consensus 83 ~~l~g~p~~~-~~~~------~~~l~~~~~~--n~~---~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~ 150 (513) +-+-+-|+.+ ..++ +..+..++.. |.. ......+..+++.+|.||+++-.+. |.+.-.+.++|..+. T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~ 159 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAK 159 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeE Confidence 6666778865 2111 1234444432 332 3445567889999999999988874 444334567888777 Q ss_pred EEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCC Q lcl|NC_019916. 151 IIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQ 230 (513) Q Consensus 151 ~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~ 230 (513) +..++.. .+. |......+. ...+....+.+++... + +...|. T Consensus 160 ~~~~~~~--~~~-----y~~~~~~g~-------~~~~~~~dViHir~~~---------------~---------dg~~G~ 201 (431) T protein:vir:10 160 GRLTSTW--QIV-----YDYTTPTGD-------KIELPAREVFHLRDLS---------------I---------DGVSGV 201 (431) T ss_pred EEEcCCC--eEE-----EEEEeCCce-------EEEEchhhEEEecCcC---------------C---------CCcccc Confidence 7665432 221 111111110 0123333443332110 0 112355 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceee Q lcl|NC_019916. 231 GDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMIL 310 (513) Q Consensus 231 sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 310 (513) |.++-+...+.....+..-..+.+...+.|-.+++-... +..+....++.......-. T Consensus 202 spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~----------------------ls~e~~~~~~~~~~~~~~g 259 (431) T protein:vir:10 202 SRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPKE----------------------LSDNAYGRMKASVQENHTG 259 (431) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCC----------------------CCHHHHHHHHHHHHHHhcC Confidence 655554444443333322233333333334333332110 1111111111111110000 Q ss_pred ccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 311 LKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQF 390 (513) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f 390 (513) ....+.......+.+++-++.......+....+...+.|+..-++|+.-.+... ..++..++...... . T Consensus 260 ~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~-~~t~sn~eq~~~~f----------~ 328 (431) T protein:vir:10 260 SENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDD-TSWGSGIEQLAIFF----------I 328 (431) T ss_pred ccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCC-CCccccHHHHHHHH----------H Confidence 000001111122334444444344445556677778899999999986554321 11222222211111 1 Q ss_pred HHHHHHHHHHHHHHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHHh--c----CCCHHHHHHhCC--CCCCHH Q lcl|NC_019916. 391 ERGLNQRYTVVAHIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQAG--A----QIPQEYLYQYLP--NVTDAD 461 (513) Q Consensus 391 ~~~l~~~~~li~~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~--g----~iS~et~~~~l~--~v~D~~ 461 (513) ...|...++.|..-++..- .........+++.+..-+-.|..+.++.+.++. | +++.-.+.++++ -++++. T Consensus 329 ~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~ 408 (431) T protein:vir:10 329 QYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPV 408 (431) T ss_pred HHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCcc Confidence 2223333333222222110 000111223455555556778899999888762 3 366655555543 343322 Q ss_pred HHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 462 EIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQ 503 (513) Q Consensus 462 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (513) .. ....+.. ..+.++ .++.+..- T Consensus 409 gD------------~~~~p~n------~~~~~~-~~~~p~~~ 431 (431) T protein:vir:10 409 AD------------QLRNPMT------QKQKGS-GDEPPATT 431 (431) T ss_pred cc------------ceecccc------cccCCC-CCCCCCCC Confidence 11 0011110 001111 11111111 No 233 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=70.88 E-value=0.2 Score=24.38 Aligned_cols=359 Identities=10% Similarity=0.033 Sum_probs=130.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCC---cHHHH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGP---SSDRL 100 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~---~~~~l 100 (513) ..+++..+.+.. .....|.+. .+.. .....-+.......+|+..++-+-+-|+.+... .+..+ T Consensus 1 Mg~f~~~f~~~~----~~~~~~~~~--~~~~--------~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~~~l 66 (385) T protein:vir:95 1 MGLFDSVFKRHS----ELSWMYDLE--FLQD--------KSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEKGTL 66 (385) T ss_pred CchhhhhhccCc----ccccccchh--hhhc--------cchhhhhhhHHHHHHHHHHHHHHcccceeeeecCccccchH Confidence 111111110000 000011110 0000 000011223445667888888777788875322 12234 Q ss_pred HHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccc Q lcl|NC_019916. 101 DDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVD 175 (513) Q Consensus 101 ~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~ 175 (513) ..++.. |. .......+..+.+.+|.||++. +.++...+.....+.....++.+ .++...... T Consensus 67 ~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~--~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~- 133 (385) T protein:vir:95 67 YYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVK--NDEGHFFVADDFEKEDELGLYSH----------RFTNVLVND- 133 (385) T ss_pred HHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEE--ecCCCeeeccccccccccccccc----------cceeeeecc- Confidence 444432 32 2355567788999999999654 33333211100000000111100 011000000 Q ss_pred ccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCC-----CCCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_019916. 176 NITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVLSLIDLYDVAQSDT 250 (513) Q Consensus 176 ~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~~liD~~~~~~S~~ 250 (513) . .....+....+ ++|+.. ..|.|.++.....+ +.. T Consensus 134 ---~--~~~~~~~~~ei----------------------------ih~~~~~~~~~~~G~s~~~~~~~~i-------~~~ 173 (385) T protein:vir:95 134 ---F--EFKRVFTMDDV----------------------------IYLKYNNQKLDAFSLGLFEDYGEIF-------GRM 173 (385) T ss_pred ---c--ceeeeeccccE----------------------------EEecCCCCCcccccchHHHHHHHHH-------HHH Confidence 0 00011222222 333221 12444443333222 222 Q ss_pred HHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcc--hhcceeeccccccccccccCCceeE Q lcl|NC_019916. 251 ANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM--RQANMILLKTGMAPNGQQTSADANY 328 (513) Q Consensus 251 ~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (513) .+...+.+.|--++....... . ..+....++..-...+... ....++.+ ..+.+++. T Consensus 174 ~~~~~~~~~~~g~l~~~~~~~--------~----~~e~~~~~~~~~~~~~~g~~~~~~~i~~l---------~~g~~~~~ 232 (385) T protein:vir:95 174 IDLQMLNNQIRGILKVDATKF--------Y----NKEKQKELQAYIDTLFDAFQNNTIAVVPL---------TEGLAYEE 232 (385) T ss_pred HHHHHhcCCCceEEEeCCccC--------C----CHHHHHHHHHHHHHHhhhhhhcCCceEEc---------CCCceeEe Confidence 222222222211111100000 0 0000001110000000000 01112222 22333333 Q ss_pred Eeec------CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 329 IHKE------YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVA 402 (513) Q Consensus 329 l~~~------~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~ 402 (513) ++.. .....+....+...+.|+..-++|+.......+|.+. .....+...+...++.+. T Consensus 233 l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~sn~e~---------------~~~~~~~~~l~P~~~~ie 297 (385) T protein:vir:95 233 HSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGEMADLEK---------------TIESYLQFCINPLLRKIE 297 (385) T ss_pred ecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHH---------------HHHHHHHHHHHHHHHHHH Confidence 3321 1234567778888889999999987554221122111 222334444555555554 Q ss_pred HHHHhcc-cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 403 HIEERVN-GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNV--TDADEIVKMMDKQRKAMLKT 477 (513) Q Consensus 403 ~~l~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v--~D~~~E~~ri~~E~~~~~~~ 477 (513) ..+...- .........+++.+..-+..|..+.++++.++ .|+++.-.+.+.++.- +++. -. .. T Consensus 298 ~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~--gd----------~~ 365 (385) T protein:vir:95 298 AELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPE--LD----------KF 365 (385) T ss_pred HHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cc----------ee Confidence 4443211 11111112455666677778899999998887 5788877776666432 1100 00 00 Q ss_pred hhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 478 YDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ..+......+.. ..+++++ | T Consensus 366 ~~~~n~~~~~~~--kgge~~~--------------e 385 (385) T protein:vir:95 366 IITKNLQSADAF--KGGESNE--------------E 385 (385) T ss_pred eecccceecccc--cCCCCCC--------------C Confidence 000000000000 0000000 0 No 234 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=69.29 E-value=0.22 Score=24.14 Aligned_cols=421 Identities=9% Similarity=0.028 Sum_probs=162.1 Q ss_pred HHHHHHHHHH--HH---HHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC--Ce----- Q lcl|NC_019916. 23 IAAFIRHHYN--NQ---RPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN--AI----- 90 (513) Q Consensus 23 i~~~i~~~~~--~~---~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~--p~----- 90 (513) .++.+.+... ++ ..+.+.+.+|..-. +..... ......-.++..+-....+++.++.|++- |+ T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~---~~~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc---cccCCC--CcccccccCcccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 2333222221 12 23344555554431 111110 01111122345566677778877777642 22 Q ss_pred eecCCcH-----------------------HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEccc Q lcl|NC_019916. 91 AMSGPSS-----------------------DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPM 147 (513) Q Consensus 91 ~~~~~~~-----------------------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~ 147 (513) ++...+. ..+...+..++|.....++.++...+|.+. +|.++++.. +. .-|. T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~~~~~~~-~~--~~pl 150 (510) T protein:vir:78 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEAT-VV--AWSL 150 (510) T ss_pred ccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEE--EEEeCCCCe-EE--EEEc Confidence 2221110 112233455789999999999999999874 455665442 32 2344 Q ss_pred ceEEEecCCCCcceEEEEEEEeecc------------cccccceeEEEEEEEcC-----Cc----EEEEEeeccCCcccc Q lcl|NC_019916. 148 ECFIIYDRSVNPKPIMAVRYHAVQT------------VVDNITQTKYEVETWTE-----ND----YTRYKPIVVAGSVPT 206 (513) Q Consensus 148 ~~~~~~d~~~~~~~~~~ir~~~~~~------------~~~~~~~~~~~ve~yt~-----~~----~~~~~~~~~~~~~~~ 206 (513) .-+++--|.. +++...+|.++... ......+....+++|+. +. ...|... .+... . T Consensus 151 ~~y~v~~d~~-G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~-dg~~i-~ 227 (510) T protein:vir:78 151 RSYAVRRDAT-GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEI-DGVRV-G 227 (510) T ss_pred ceeEEeeCCC-cCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEe-cCeee-c Confidence 5455554433 34544554433320 00001111223344431 11 1111111 11111 1 Q ss_pred ccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccc Q lcl|NC_019916. 207 LEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMV 281 (513) Q Consensus 207 ~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~ 281 (513) ..-..++..+|++.++- +.+|+|-.++..+-+-.+|.+.-...........|.+.+.- .+. T Consensus 228 --~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p-~g~----------- 293 (510) T protein:vir:78 228 --ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKG----------- 293 (510) T ss_pred --cccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCC-ccc----------- Confidence 11123356688877653 45799988888888888887766666655555555433211 000 Q ss_pred cchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_019916. 282 DPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKFSHTPDLT 359 (513) Q Consensus 282 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 359 (513) .+ +.....+. .+.. ..+...+++.+. +..+.......++.++..|...-. -++. T Consensus 294 --------~~--------~~~l~~~~-----~g~~--v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~-~~l~ 349 (510) T protein:vir:78 294 --------AV--------VDDYQDAE-----MGDY--VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGAN 349 (510) T ss_pred --------cc--------hhhhccCC-----Ccee--ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHh-hccc Confidence 00 00000000 0000 011122333332 223445556666666666644311 1111 Q ss_pred cccccccccHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---ccceeeEEeCCCCCcCHHHHH Q lcl|NC_019916. 360 DDNFSGNSSGVAMKYKVLGTV-ELASTKRKQFERGLNQRYTVVAHIEERVNGKWDI---DPDEIGFIFRDNLPTDDVAII 435 (513) Q Consensus 360 ~~~~~~n~Sg~Ai~~~~~~l~-~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~---~~~~i~i~f~~~~p~d~~e~a 435 (513) ..-+...++.-++..-..+. ..--...+.-.+.+.-+++.++.++...+ .... ......|++..++-+ ++.+ T Consensus 350 -~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g-l~p~p~~~~~~~~v~~is~Lar--aq~~ 425 (510) T protein:vir:78 350 -QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL-LQGLITKQHKPAIETGLPALSR--SAAV 425 (510) T ss_pred -cCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCCcccccceeeecccHHHH--HHHH Confidence 11123346654443311111 11122222222333344455555554322 1111 112222333222222 1111 Q ss_pred HHHH-------HHhc---C---CCHHHH----HHhCCCC-C----CHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCC Q lcl|NC_019916. 436 TALV-------QAGA---Q---IPQEYL----YQYLPNV-T----DADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGND 493 (513) Q Consensus 436 ~~~~-------kl~g---~---iS~et~----~~~l~~v-~----D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 493 (513) +.+. .+.+ + +....+ ...++ | + -.++|++.+.+++.+..............+...-. T Consensus 426 ~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G-v~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~ 504 (510) T protein:vir:78 426 QSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMT 504 (510) T ss_pred HHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhC-CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 1111 1111 1 222222 23333 3 1 13566666655443222111100000000000000 Q ss_pred CCCCCC Q lcl|NC_019916. 494 PEDEGV 499 (513) Q Consensus 494 ~~~~~~ 499 (513) +.-.+. T Consensus 505 ~~~~g~ 510 (510) T protein:vir:78 505 NALAGV 510 (510) T ss_pred ccCCCC Confidence 000000 No 235 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=69.01 E-value=0.23 Score=24.09 Aligned_cols=398 Identities=10% Similarity=-0.027 Sum_probs=143.3 Q ss_pred CccccccccccCC----CCCCcc---eeecchhHHHHHHHHHHhhcCCeeecCCc-----HHHHHHHHHh--cC---HHH Q lcl|NC_019916. 49 NDGILSPASRRNE----KGKADH---RAVHSFARYIADFQTSYSVGNAIAMSGPS-----SDRLDDFNRR--ND---IDT 111 (513) Q Consensus 49 ~~i~~~~~~~~~~----~~~~~~---ri~~n~~~~ivd~~~~~l~g~p~~~~~~~-----~~~l~~~~~~--n~---~~~ 111 (513) -..+......... ...... -..+......|+..++-+-+-|+.+...+ +..+-.++.. |. ... T Consensus 1 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL~~~PN~~~t~~~ 80 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLWNVMPNRAMPAQV 80 (723) T ss_pred CcccccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHHhhCCCCCCCHHH Confidence 1111111100000 000000 01233445566776666667787753221 1123344432 32 334 Q ss_pred HHHHHHHHHhhCCeEEEEeeecC---CCceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEc Q lcl|NC_019916. 112 LNYELYLDMTVTGRAYEYVYRDP---SQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWT 188 (513) Q Consensus 112 ~~~~~~~~a~~~G~~~~~v~~d~---~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt 188 (513) ....+..+++.+|.||+++..+. .|.+.-.+.++|..+.++..+............|.....++ ....+. T Consensus 81 f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~G-------~~~~~~ 153 (723) T protein:vir:94 81 LKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTDG-------VRVPVL 153 (723) T ss_pred HHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecCc-------eeEEec Confidence 45567778889999998887542 23332233355554444433222111111111111110000 000112 Q ss_pred CCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCc Q lcl|NC_019916. 189 ENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDI 268 (513) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~ 268 (513) .+.+++++.. ++ .+...|.|.++.....|+....+.....+.+...+.|-.+++- . T Consensus 154 ~~dIiHir~~--------------~~---------~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~-~ 209 (723) T protein:vir:94 154 ADEMLWLRFS--------------DP---------YDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNL-G 209 (723) T ss_pred ccceEEecCC--------------CC---------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEc-C Confidence 2222222110 00 1122466766655544443333322223333333334444431 1 Q ss_pred ccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccccccccccCCceeEEeecC--CHHHHHHHHHHH Q lcl|NC_019916. 269 DTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGMAPNGQQTSADANYIHKEY--DSAGTELYKKRL 345 (513) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l 345 (513) . .. .+....++..-...+.... .++.+.+........ ..+.+++|..... ....+....+.. T Consensus 210 ~----------l~----~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~-vl~~G~~~~~l~~s~~D~q~le~r~~~ 274 (723) T protein:vir:94 210 D----------MD----EQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGG-AAGKGATFTSLSMSPAEMDYINSRMHS 274 (723) T ss_pred C----------CC----HHHHHHHHHHHHHHhhchhhcCcceeecccccccc-cccCCceEEEccCCHHHHHHHHHHHHh Confidence 0 00 0011111111000111101 122333322111111 1122344444333 344566667777 Q ss_pred HHHHHHHhCcccccccccc--ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEe Q lcl|NC_019916. 346 AADIHKFSHTPDLTDDNFS--GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIF 423 (513) Q Consensus 346 ~~~i~~~s~~p~~~~~~~~--~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f 423 (513) .+.|+..-++|+......+ +|.+ .+.+. .+...|...++.+...++..-- ... ...+.+.| T Consensus 275 ~~eIa~afgVPp~~i~~~st~sN~e-~~~~~--------------f~~~tL~P~~~~ie~~ln~~Ll-~~~-g~~~~~~f 337 (723) T protein:vir:94 275 AEEVMLAFGIRKDALLGGSTYENQA-EAKAA--------------VWTETLIPQMEVMASITDLQLL-PDI-GWTVEWDF 337 (723) T ss_pred HHHHHHHhCCChhHcCCCCCcccHH-HHHHH--------------HHHHHHHHHHHHHHHHHhHhhc-ccc-cCceEEee Confidence 8889999999975432211 1211 11111 1222333333333332222100 011 12456777 Q ss_pred CC--CCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHH--HHH-------------HHHHHHHHHHhhhhc Q lcl|NC_019916. 424 RD--NLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIV--KMM-------------DKQRKAMLKTYDTKG 482 (513) Q Consensus 424 ~~--~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~--~ri-------------~~E~~~~~~~~~~~~ 482 (513) +. -+-.|..+.++.+.++ +|+++.-.+.+.++. +++-+..+ .-. .+|....+. .... T Consensus 338 ~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~--~~~~ 415 (723) T protein:vir:94 338 NSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARML--ALLE 415 (723) T ss_pred cchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhhHhhh--hhcc Confidence 64 3457888889888875 578888777666532 33222111 000 001000000 0000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 483 GLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ....+. +..+.........+.+....+++ T Consensus 416 ~~~~~~--p~~~~~~~~~~~~~~~~~~~~~~ 444 (723) T protein:vir:94 416 RVAADR--PLPELPVRATTVLHHDPGPDPQQ 444 (723) T ss_pred cccccc--CcCCCCCCCCCCCCCCcccCCch Confidence 000000 00000000000111112222222 No 236 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=67.38 E-value=0.25 Score=23.86 Aligned_cols=416 Identities=10% Similarity=-0.056 Sum_probs=166.6 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCC-ccccccccccCCCCCCccee--ecchhHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYR-GQN-DGILSPASRRNEKGKADHRA--VHSFARY 76 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~-G~~-~i~~~~~~~~~~~~~~~~ri--~~n~~~~ 76 (513) ||.+-+. -.+.. .+.+....-+ ...+ ..|+ -++ +.+... ........+ ......- T Consensus 1 ~~~~~~~-~~p~~--------~~g~~~~~~~---~~~~----~~~~~~e~~~~lr~~-----~~~~ly~~m~e~D~~i~s 59 (469) T protein:vir:10 1 MTERVKT-AAPVS--------EAGYVFGSGV---VDGW----TVWDPFEQTPELQWP-----QSVAVYSRMDNEDSRVTS 59 (469) T ss_pred CCCcccC-CCCcc--------chhhhhhccc---ccch----hhccccccccccccc-----cchHHHHHHHhhChHHHH Confidence 3332221 11111 1111111100 0000 1111 000 111000 000000111 2466666 Q ss_pred HHHHHHHHhhcCCeeecCC--cHHH---HHH----HHH-------------hcCHHHHHHHHHHHHhhCCeEE-EEeeec Q lcl|NC_019916. 77 IADFQTSYSVGNAIAMSGP--SSDR---LDD----FNR-------------RNDIDTLNYELYLDMTVTGRAY-EYVYRD 133 (513) Q Consensus 77 ivd~~~~~l~g~p~~~~~~--~~~~---l~~----~~~-------------~n~~~~~~~~~~~~a~~~G~~~-~~v~~d 133 (513) ++++....+.+.++++... +++. +.+ .+. ...+...+.++..+++-||.++ ++||.. T Consensus 60 ~l~~rk~av~~~~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~ 139 (469) T protein:vir:10 60 LLEAISLPIRSTPWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRP 139 (469) T ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeec Confidence 7777778888888887532 2211 111 111 1134566777777788899886 577743 Q ss_pred C----CCceeEEEEEc--ccceEE--EecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccc Q lcl|NC_019916. 134 P----SQKGEVSVKLD--PMECFI--IYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVP 205 (513) Q Consensus 134 ~----~~~~~~~~~~~--p~~~~~--~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~ 205 (513) . +|...+. .+. |..++- .|++. ..++ .++. .....-... .... T Consensus 140 ~~~~~dG~~~~~-~l~~rp~~~i~~~~~~~~--~~l~-~~~~-------------------~~~~~~~~~------~~~~ 190 (469) T protein:vir:10 140 RNQSPDGRFWLR-KLAPRPQWTISKFNVAPD--GGLE-SIEQ-------------------IAPPARTRG------SLYV 190 (469) T ss_pred ccccCCCceeee-eeeecCcccceeeeeccC--Ccee-eeee-------------------cCccccccc------cccc Confidence 2 2222111 111 111110 11111 0111 0000 000000000 0000 Q ss_pred cccccccccCcccceEEec-----CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccccc Q lcl|NC_019916. 206 TLEVAEHSAQFGFPMIEYR-----NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQM 280 (513) Q Consensus 206 ~~~~~~~~~~g~vPvv~~~-----n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~ 280 (513) ......+-+..+ +|.++ .+..|.|.+..+-..-=-=+..+.+.+..++.|+.|+++.+-..+.... T Consensus 191 ~~~~~~~lp~~k--~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~------- 261 (469) T protein:vir:10 191 ANIAPPEIPVNR--LVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDED------- 261 (469) T ss_pred CCCCccccccCc--EEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHH------- Confidence 000001111111 23332 2445778888776666666677888999999999999887643221111 Q ss_pred ccchhhhhhhccccccchhhhcch--hcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_019916. 281 VDPSDADAMKKLADEKMAQLEAMR--QANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDL 358 (513) Q Consensus 281 ~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 358 (513) +.. .| ...+..+. .+..+. ...+.+++++....+...++..++.+.+.|...--...+ T Consensus 262 ----ek~---~l----~~a~~~~~~g~~a~~i---------ip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tl 321 (469) T protein:vir:10 262 ----EVR---KM----AALARSVRGGINAGVG---------LAQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFL 321 (469) T ss_pred ----HHH---HH----HHHHHHHhcCCceEEE---------ccCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccc Confidence 000 00 01111111 011111 234678888888888888999999999999776544444 Q ss_pred ccccccc-cccHH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHH Q lcl|NC_019916. 359 TDDNFSG-NSSGV-AMKYKVLGTVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAII 435 (513) Q Consensus 359 ~~~~~~~-n~Sg~-Ai~~~~~~l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a 435 (513) +.+.-+| ...|. .-+....-+ +.-.+.+...+. ++++-++.+ +-. .+.....+.|.... .+....+ T Consensus 322 Ts~~~gGS~a~~~vh~ev~~d~~----~sDa~~i~~tln~~li~~l~~l----N~g--~~~~~P~~~~~~~e-~~~~~~a 390 (469) T protein:vir:10 322 NLDGKGGSYALASVLEDPFTQAV----HAYATSICRIANQHIIEDLVDI----NFG--VDTPAPVLTFDPIG-SRQDLTA 390 (469) T ss_pred cccCccchhhHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHh----cCC--CCCCccEEEecCCC-CcHHHHH Confidence 4332122 11122 222222222 222233344442 344433332 211 12223466775443 4556677 Q ss_pred HHHHHH--hcC-----CCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 436 TALVQA--GAQ-----IPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPED 508 (513) Q Consensus 436 ~~~~kl--~g~-----iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (513) +.+.++ .|+ ++.+.+.+.++ ++.++.+-.-+..++ +... +.....+..............+..++. T Consensus 391 ~~i~~l~~~G~~~~~~~~~~~~~e~~g-ip~~~~~~~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (469) T protein:vir:10 391 AAVKLLYDAGVFDDDPAVKRAIRQRFN-LPSELNDTPSAEPEE----PAAV--PNQSAAPARTRSSGNADARARAPKADQ 463 (469) T ss_pred HHHHHHHhcCCccCccccHHHHHHHhC-CCCCCCCcccccchh----cccC--CCCCccccccCCCCCcccccccCCChH Confidence 888776 354 44566667764 343322211111111 1110 111111111111111111122223334 Q ss_pred ccCCC Q lcl|NC_019916. 509 ERTSD 513 (513) Q Consensus 509 ~~~~~ 513 (513) ..-.| T Consensus 464 ~~l~d 468 (469) T protein:vir:10 464 GVLFD 468 (469) T ss_pred Hhhcc Confidence 44444 No 237 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=66.81 E-value=0.26 Score=23.77 Aligned_cols=375 Identities=10% Similarity=-0.032 Sum_probs=144.9 Q ss_pred HHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc---- Q lcl|NC_019916. 24 AAFIRHHYNNQRP---RLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS---- 96 (513) Q Consensus 24 ~~~i~~~~~~~~~---~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~---- 96 (513) .-+++.+.....+ .-..+...+-+.... ........+=+.+.-...+|+..++-+-.-|+.+...+ T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~ 73 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRY-------SGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGNEI 73 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhccccc-------CccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCccc Confidence 1122211111000 000111111121100 00000011113345567788888888888888764322 Q ss_pred -HHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeec Q lcl|NC_019916. 97 -SDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQ 171 (513) Q Consensus 97 -~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~ 171 (513) +..+..++.. |. .......+..+++.+|.||+++-. +.. . -+..+.|..++.. +.+|... T Consensus 74 ~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~----~~~--~--~~~~~~~~~~~~~-------~~~~~~~ 138 (394) T protein:vir:62 74 KDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNG----AQI--H--LASNVFTELDDNL-------VEHFNIG 138 (394) T ss_pred chhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEec----cee--e--ccccceEEECCce-------EEEEeeC Confidence 1223344433 32 234555678889999999987632 111 1 1223334333211 1111100 Q ss_pred ccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 172 TVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTA 251 (513) Q Consensus 172 ~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~ 251 (513) . ..+..+.+.+++... + +.-.|.|.++.+...++....+..-.. T Consensus 139 ------~------~~~~~~eiih~r~~~---------------~---------d~~~G~s~~~~~~~~i~~~~~~~~~~~ 182 (394) T protein:vir:62 139 ------G------HEIPPCMIRHVKNIG---------------A---------DHLRGKGILDLGRDTLEGVMSAEKTLT 182 (394) T ss_pred ------C------EEechhheEEecCcC---------------C---------CCccccChHHHHHHHHHHHHHHHHHHH Confidence 0 012222222221100 0 111366666655555554444433333 Q ss_pred HHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCCceeEEe Q lcl|NC_019916. 252 NYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSADANYIH 330 (513) Q Consensus 252 ~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~ 330 (513) ..+...+.|-.+++-...... . ......++..-...+..... ++...+ ..+.++++.. T Consensus 183 ~~~~ng~~~~~il~~~~~~~~----------~--~~~~~~~~~~~~~~~~g~~n~g~~~vl---------~~g~~~~~~~ 241 (394) T protein:vir:62 183 DKYKKGGLLTFLLNLDAHINP----------Q--NGAQSKLINAILDQLESIDEARSVKMI---------PLGKGYSIDT 241 (394) T ss_pred HHHHccCCcceEEEeCCCCCc----------C--HHHHHHHHHHHHHHhccccccCceeEe---------eCCCceeEEe Confidence 434444445444432110000 0 00000111000001111111 111122 1234455544 Q ss_pred ec--CCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019916. 331 KE--YDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERV 408 (513) Q Consensus 331 ~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~ 408 (513) .. .....+....+...+.|+..-++|+...+... ..+.. ......+...|..+++.+..-+... T Consensus 242 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~sn~e-------------~~~~~~~~~~l~P~~~~ie~~l~~k 307 (394) T protein:vir:62 242 LKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELI-KEDIE-------------KAMMYIHNKAVRPIMKNFEDHLSLL 307 (394) T ss_pred cCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CcCHH-------------HHHHHHHHHHHHHHHHHHHHHHhhh Confidence 33 23344556677788899999999976554221 11111 1112223334444444444333321 Q ss_pred ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhhhcCC Q lcl|NC_019916. 409 NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDTKGGL 484 (513) Q Consensus 409 ~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~ 484 (513) --. ......+.+.|+.....+....++++.++ +|+++.-.+.++++. ++++.. ..+.-- ..+.+.... T Consensus 308 ll~-~~~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~g--d~~~~~-----~n~~~~~~~ 379 (394) T protein:vir:62 308 FYA-QNSGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKES--QAIYIS-----NDVTEIGKK 379 (394) T ss_pred hcC-ccccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--Ceeecc-----ccccccccc Confidence 100 11223577888777766677778887776 578888777776643 222111 000000 000000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 485 IINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) +..++ +.+.+++.++ T Consensus 380 --------~~~~~-----~~kgge~~en 394 (394) T protein:vir:62 380 --------EATDG-----SLGGGEENEN 394 (394) T ss_pred --------ccccc-----cCCCCCCCCC Confidence 00000 0011111111 No 238 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=65.63 E-value=0.28 Score=23.61 Aligned_cols=378 Identities=12% Similarity=0.046 Sum_probs=136.1 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM 92 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~ 92 (513) |. -.+||.. +++.-. ..+.++ .++-.+............ ....-....|+..++-+..-|+++ T Consensus 1 mg----~~~~~~~----~~~~~~---~~~~~~----~~~~~~~~~~~~~t~~~~--~~~~~v~~cv~~Ia~~ia~~p~~v 63 (403) T protein:vir:10 1 MG----FKSWITE----KLNPGQ---RIIRDM----EPVSHRTNRKPFTTGQAY--SKIEILNRTANMVIDSAAECSYTV 63 (403) T ss_pred Cc----chhhhhh----ccchhh---hhhhcc----cccccccCCcccccHHHH--HHHHHHHHHHHHHHHHHhhCceeE Confidence 11 1122221 111000 000111 111000000000000000 112223334555555555667664 Q ss_pred cCC-----c-----HHHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCC Q lcl|NC_019916. 93 SGP-----S-----SDRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSV 157 (513) Q Consensus 93 ~~~-----~-----~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~ 157 (513) ... + ...+..++.. |. .......+..+++.+|.||+++ + +.. .+.++|..+.+..+.. T Consensus 64 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~--~~~--l~~l~~~~~~v~~~~~- 136 (403) T protein:vir:10 64 GDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--D--GTS--LYHVPAALMQVEADAN- 136 (403) T ss_pred eecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--e--Cce--eEeecCcceEEEEcCC- Confidence 211 1 1234444432 32 2355556778889999999654 2 221 2334444443332211 Q ss_pred CcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEec-CCCCCCcchhHH Q lcl|NC_019916. 158 NPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYR-NNEYRQGDFENV 236 (513) Q Consensus 158 ~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-n~~~~~sd~e~v 236 (513) .. +++|.... . + .|..+.+.+++ ...++++. +...|.|.+..+ T Consensus 137 --~~---~~~~~~~~------~----~-~~~~~eiih~~--------------------~~~~~~~~~~~~~G~s~i~~~ 180 (403) T protein:vir:10 137 --KF---IKKFIFNN------Q----I-NYRVDEIIFIK--------------------DNSYVCGTNSQISGQSRVATV 180 (403) T ss_pred --ce---EEEEEecC------c----e-eecccceEEec--------------------ccccccCCCCCcccccHHHHH Confidence 11 11111000 0 0 01112222211 11111111 233566766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch-hcceeeccccc Q lcl|NC_019916. 237 LSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR-QANMILLKTGM 315 (513) Q Consensus 237 ~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~ 315 (513) ...++..+.+..-..+.+...+.|-.+++..... . .+....++..=........ .++.+.++ T Consensus 181 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l----------~----~e~~~~~~~~~~~~~~g~~n~g~~~vl~--- 243 (403) T protein:vir:10 181 IDSLEKRSKMLNFKEKFLDNGTVIGLILETDEIL----------N----KKLRERKQEELQLDYNPSTGQSSVLILD--- 243 (403) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC----------C----HHHHHHHHHHHHHHhCCcccCcceeecC--- Confidence 6666555554444444444444454444432110 0 0011111110000000000 11122221 Q ss_pred cccccccCCceeEEeecCC--HHHHHHHHHHHHHHHHHHhCccccccccc-cccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 316 APNGQQTSADANYIHKEYD--SAGTELYKKRLAADIHKFSHTPDLTDDNF-SGNSSGVAMKYKVLGTVELASTKRKQFER 392 (513) Q Consensus 316 ~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~ 392 (513) .+.+++.++...+ ...+....+...+.|+..-++|+...+.. .+|.+...+.+. .. T Consensus 244 ------~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~f~---------------~~ 302 (403) T protein:vir:10 244 ------GGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRPNIELFY---------------YM 302 (403) T ss_pred ------CCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHH---------------HH Confidence 2223344432222 23445666777889999999998665321 122222222222 22 Q ss_pred HHHHHHHHHHHHHHhcccccccccceeeEEeCCC--CCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHH Q lcl|NC_019916. 393 GLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDN--LPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMD 468 (513) Q Consensus 393 ~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~--~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~ 468 (513) .+...++.+..-+...-+ ..+.+.++.- +-.|..+.++++.++ .|+++.-.+.+.++.-.=++... T Consensus 303 tl~P~~~~ie~~l~~~L~------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~---- 372 (403) T protein:vir:10 303 TIIPMLNKLTSSLTFFFG------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQM---- 372 (403) T ss_pred HHHHHHHHHHHHHHHhcC------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccc---- Confidence 333333333332222111 1233333322 444677778887776 68888877777764322000111 Q ss_pred HHHHHHHHHhhhhcCCCCCC-CCCCCCCCCCCCCCCCC Q lcl|NC_019916. 469 KQRKAMLKTYDTKGGLIING-TSGNDPEDEGVRGQQGE 505 (513) Q Consensus 469 ~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 505 (513) .....+........ ..+.++. ++.....|+ T Consensus 373 ------d~~~~p~n~~~~~~~~~~~e~~-~~~~~~~g~ 403 (403) T protein:vir:10 373 ------NKIRIPANVAGSATGVSGQEGG-RPKGSTEGD 403 (403) T ss_pred ------cccccccccccccccCCCCcCC-CCCCCcCCC Confidence 01111111111111 1111111 111111111 No 239 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=64.87 E-value=0.29 Score=23.51 Aligned_cols=420 Identities=10% Similarity=0.052 Sum_probs=159.3 Q ss_pred CCHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcC--Ce---- Q lcl|NC_019916. 18 LTPTRIAAFIRH-HYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGN--AI---- 90 (513) Q Consensus 18 ~~~~~i~~~i~~-~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~--p~---- 90 (513) |- +....+..+ ....-..+.+.+.+|..-. ....+...... .....+...+-....++..++-|++. |+ T Consensus 1 m~-~~~~~l~~k~~R~~~e~~w~e~a~~~lP~--~~~~~~~~~~~-~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 76 (514) T protein:vir:80 1 MR-QQASAMWAEYRDSTAIRKAEDFAKFTIAS--LMVDPLDKTHQ-AEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPS 76 (514) T ss_pred Cc-cchHHHHHHhhcchHHHHHHHHHHHhccc--ccCCCCCCccc-ccccccccchhHHHHHHHHHHHHHhhhcCCCCcc Confidence 11 111112111 1112234456666664321 00000000000 11112223455666777777766542 21 Q ss_pred -eecCCcH------------HH-----------HHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcc Q lcl|NC_019916. 91 -AMSGPSS------------DR-----------LDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDP 146 (513) Q Consensus 91 -~~~~~~~------------~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p 146 (513) ++..+++ .. +...+..++|.....++.++...+|.+.++ .+++.. .+. .-| T Consensus 77 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~--~~~~~~-~~~--~~p 151 (514) T protein:vir:80 77 FQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFY--REPGTG-KML--VWT 151 (514) T ss_pred cccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--EecCCC-cEE--EEE Confidence 2222110 11 223345578999999999999999987544 455433 333 234 Q ss_pred cceEEEecCCCCcceEEEEEEEeecccc------------cccceeEEEEEEEc-----CCc----EEEEEeeccCCccc Q lcl|NC_019916. 147 MECFIIYDRSVNPKPIMAVRYHAVQTVV------------DNITQTKYEVETWT-----END----YTRYKPIVVAGSVP 205 (513) Q Consensus 147 ~~~~~~~d~~~~~~~~~~ir~~~~~~~~------------~~~~~~~~~ve~yt-----~~~----~~~~~~~~~~~~~~ 205 (513) ..-+++--|.. +++...+|..+..... ....+....+++|+ ++. ..+|... .+... T Consensus 152 l~~y~v~~d~~-G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~-~g~~i- 228 (514) T protein:vir:80 152 MQSYTVRRTSH-GDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHEL-EGKRV- 228 (514) T ss_pred cCeEEEeeCCC-cCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEec-cceee- Confidence 44455554443 3444444433221100 00011112233333 111 1112111 11111 Q ss_pred cccccccccCcccceEEecC-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCccccccccccccc Q lcl|NC_019916. 206 TLEVAEHSAQFGFPMIEYRN-----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQM 280 (513) Q Consensus 206 ~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~ 280 (513) ..+. ..++..+|++.++- +.+|+|-.++..+-+-.+|.+.-...........|.+.+--.+.. T Consensus 229 ~~es--~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~---------- 296 (514) T protein:vir:80 229 GPES--SYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGG---------- 296 (514) T ss_pred cccC--ccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCccccc---------- Confidence 1111 12234578777653 467999988888888888877666666666555555433110000 Q ss_pred ccchhhhhhhccccccchhhhcchhcceeeccccccccccccCCceeEEee--cCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_019916. 281 VDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHK--EYDSAGTELYKKRLAADIHKFSHTPDL 358 (513) Q Consensus 281 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~ 358 (513) + ... ....+.+.. ..+...+++.+.. ..+.......++.++..|...-..-. T Consensus 297 ----------~--------~~~-----l~~~~~g~~--v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~- 350 (514) T protein:vir:80 297 ----------A--------VDD-----YRDAETGDF--VPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTG- 350 (514) T ss_pred ----------c--------hhh-----hcccCCcee--ecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhc- Confidence 0 000 000000000 1112233444332 34556667777777777743211100 Q ss_pred ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhcc-cc-cccccceeeEEeCCCCC Q lcl|NC_019916. 359 TDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR--------YTVVAHIEERVN-GK-WDIDPDEIGFIFRDNLP 428 (513) Q Consensus 359 ~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~--------~~li~~~l~~~~-~~-~~~~~~~i~i~f~~~~p 428 (513) .. .-+.+.++..++. ++.+++..+|..+.++ ++.++.++.... +. ......-+++++.-++. T Consensus 351 ~~-rd~~rvTAtEV~~-------r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la 422 (514) T protein:vir:80 351 QV-RDAERVTVEEIRT-------VAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIP 422 (514) T ss_pred cC-CCCCCCCHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeecHH Confidence 00 0123356655543 2344445555544432 222333332211 11 11111223444432221 Q ss_pred -cCHHHHHH-------HHHHHhcC-------CCHHHHHHhC---CCC------CCHHHHHHHHHHHHHHHHHHh-hhhcC Q lcl|NC_019916. 429 -TDDVAIIT-------ALVQAGAQ-------IPQEYLYQYL---PNV------TDADEIVKMMDKQRKAMLKTY-DTKGG 483 (513) Q Consensus 429 -~d~~e~a~-------~~~kl~g~-------iS~et~~~~l---~~v------~D~~~E~~ri~~E~~~~~~~~-~~~~~ 483 (513) -...+.++ .+..+++. +....++..+ -+| .+.+....+.++++++.++.. ..... T Consensus 423 ~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 502 (514) T protein:vir:80 423 ALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGA 502 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111 12222222 2233333332 122 222222222222222111111 11111 Q ss_pred CCCCCCCCCCCC Q lcl|NC_019916. 484 LIINGTSGNDPE 495 (513) Q Consensus 484 ~~~~~~~~~~~~ 495 (513) .....+.+--.+ T Consensus 503 ~~~~~~~~~~~~ 514 (514) T protein:vir:80 503 LAAETSAGVLTS 514 (514) T ss_pred HHHhhhccccCC Confidence 111111111111 No 240 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=56.83 E-value=0.45 Score=22.50 Aligned_cols=372 Identities=10% Similarity=0.021 Sum_probs=124.5 Q ss_pred CCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeee Q lcl|NC_019916. 13 EDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAM 92 (513) Q Consensus 13 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~ 92 (513) |. -.++|.... ..... . .+..+... ..... ......-+.+.-...+|+..++-+-.-|+.+ T Consensus 1 Mg----~~~~~~~~~----~~~~~---~-----~~~~~~~~--~~~~~-~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~ 61 (395) T protein:vir:40 1 MG----FKSWVSGFF----NEEQR---T-----LNLTDTVW--CSIPS-EKLKELSIKKWAIDSCANKIANTLSCAEVLT 61 (395) T ss_pred Cc----hHHHHHhhh----ccccc---c-----cccccchh--hcccc-ccchhhhhhhHHHHHHHHHHHHHHhhCceee Confidence 11 011111111 11000 0 00000000 00000 0001111223344556666666666678775 Q ss_pred cCCcH---HHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEE Q lcl|NC_019916. 93 SGPSS---DRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMA 164 (513) Q Consensus 93 ~~~~~---~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ 164 (513) -.++. ..+..++.. |. .......+..+++.+|.||+++..+. . + + |...... .....- T Consensus 62 ~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~---~---~-~-~~~~~~~------~~~~~~ 127 (395) T protein:vir:40 62 YEKGEEVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY---I---Y-V-ADSFTKN------DKSLYE 127 (395) T ss_pred ccCCccccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc---e---e-e-cCCcccc------cccccc Confidence 33221 223333322 32 23445567888899999997664331 1 1 1 1111000 000000 Q ss_pred EEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCC-CCCCcchhHHHHHHHHH Q lcl|NC_019916. 165 VRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNN-EYRQGDFENVLSLIDLY 243 (513) Q Consensus 165 ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-~~~~sd~e~v~~liD~~ 243 (513) .+++... .++ .. . -..|....+++ |+.+ ..+.+.. .++...+ T Consensus 128 ~~~~~v~-~~~---~~-~-~~~~~~~evih----------------------------~r~~~~~~~~~~---~~l~~~~ 170 (395) T protein:vir:40 128 NTYTEVT-LKD---LT-L-KKEFKESEVLH----------------------------LTLNNESIKSII---DGFYLLY 170 (395) T ss_pred ceeeeee-ecC---ce-e-eeeeccccEEE----------------------------eecCCCCccccc---hhHHHHH Confidence 1111000 000 00 0 01122333333 2211 1111111 1222233 Q ss_pred HHHHHHHHHHHHHhhh--hhhheecCcccccccccccccccchhhhhhhccccccchhhhcc--hhcceeeccccccccc Q lcl|NC_019916. 244 DVAQSDTANYMTDLNE--AMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAM--RQANMILLKTGMAPNG 319 (513) Q Consensus 244 ~~~~S~~~~~~~~~~~--~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 319 (513) ....+...+....... +.+++..... .... ....++..-...+... ..++++.+ T Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~----------~~~~----~~~~~~~~~~~~~~~~~~~~~~~~vl-------- 228 (395) T protein:vir:40 171 GDLLTAAVNKYKKLNSRKIIVKLKAMFG----------QTPE----AEEKLRLMLSERMKKFLAEGDSALPV-------- 228 (395) T ss_pred HHHHHHHHHHHHhcCCCCceEEEecccC----------CCHH----HHHHHHHHHHHHHHHhhccCCceeec-------- Confidence 3333333332222221 2222211110 0000 0001100000001110 01111222 Q ss_pred cccCCceeEEeecCCHHHHH---HHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTE---LYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQ 396 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~---~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~ 396 (513) ..+.+++.+........+. .+.+.+.+.|+..=++|+.......+|.+... ...+...|.. T Consensus 229 -~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~~~sn~e~~~---------------~~f~~~~L~P 292 (395) T protein:vir:40 229 -EDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKGDTVGLSEQV---------------NSFLMFSINP 292 (395) T ss_pred -CCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHHHH---------------HHHHHHHHHH Confidence 2233333333332222222 22344456788888888755432222222211 2233344444 Q ss_pred HHHHHHHHHHhc--ccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHH Q lcl|NC_019916. 397 RYTVVAHIEERV--NGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQ 470 (513) Q Consensus 397 ~~~li~~~l~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E 470 (513) .++.|..-+... ..........+++.+..-+-.|..+.++++.++ .|+++.-.+.+.++. ++++... T Consensus 293 ~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD------- 365 (395) T protein:vir:40 293 IAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQ------- 365 (395) T ss_pred HHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCc------- Confidence 444444333321 111011123455656677777899999998887 577887777666643 2221100 Q ss_pred HHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 471 RKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) ....+. +..+.+...+...++++++ .++| T Consensus 366 -----~~~~~~------n~~~~~~~~~~~kgge~~~---~~~~ 394 (395) T protein:vir:40 366 -----ERFVTK------NYAPLGENEEDLKGGDINE---NKGD 394 (395) T ss_pred -----eeeecc------ccccccccccccCCCCCCC---CcCC Confidence 000000 0000111111111111111 1122 No 241 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=55.02 E-value=0.49 Score=22.29 Aligned_cols=385 Identities=9% Similarity=0.021 Sum_probs=137.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceee-cchhHHHHHHHHHHhhcCCeeec-CCc----- Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAV-HSFARYIADFQTSYSVGNAIAMS-GPS----- 96 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~-~n~~~~ivd~~~~~l~g~p~~~~-~~~----- 96 (513) ..+++.+..+.+..-.....++.... . ..........++. .+.....|+..+.-+-+-|+.+- ..+ T Consensus 1 Mg~~~~f~~k~~~~~~~~~~~~~~~~------~-~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~ 73 (403) T protein:vir:80 1 MGLFNFFRRKTRSEPTNAISWFLTQE------A-YDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIR 73 (403) T ss_pred Ccccccccccccccccchhhhhcccc------c-ccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCceee Confidence 11111110000000000000110000 0 0000000111121 23345577777777777788741 111 Q ss_pred -HHHHHHHHH--hcCHH---HHHHHHHHHHhhC--CeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEE Q lcl|NC_019916. 97 -SDRLDDFNR--RNDID---TLNYELYLDMTVT--GRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYH 168 (513) Q Consensus 97 -~~~l~~~~~--~n~~~---~~~~~~~~~a~~~--G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~ 168 (513) +..+..++. =|... .....+..+.+.. |.||+++..+..|.+.-.+.++|..+-++.+++.. .+ +| T Consensus 74 ~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~-----~~-~y 147 (403) T protein:vir:80 74 IKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGY-----QI-WY 147 (403) T ss_pred cCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCce-----EE-EE Confidence 122333332 23322 3334455666665 55677666666666554556777776665554320 01 11 Q ss_pred eecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHH Q lcl|NC_019916. 169 AVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQS 248 (513) Q Consensus 169 ~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S 248 (513) . ...|..+.+.+++.. ..|.- ...|.|-++.+...+........ T Consensus 148 ~--------------~~~~~~~eiih~~~~------------------~~~~~----~~~G~s~~~~~~~~i~~~~~~~~ 191 (403) T protein:vir:80 148 Q--------------GKAYNYDEVLHFIVN------------------PDPEK----PYMGRGYRVVLKDIVNNLKQATT 191 (403) T ss_pred e--------------ecccchhhEEEEecc------------------CCCcC----ccccccHHHHHHHHHHHHHHHHH Confidence 1 011233333332210 00100 01355544444444433332222 Q ss_pred HHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhh-hcchhcceeeccccccccccccCCcee Q lcl|NC_019916. 249 DTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQL-EAMRQANMILLKTGMAPNGQQTSADAN 327 (513) Q Consensus 249 ~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (513) -....+...+.|-.+++-..... ........... .... .....++.+.++.+ .....++. T Consensus 192 ~~~~~~~ng~~p~~il~~~~~~~----------~~~~~~~~~~~----~~~~~~~~~~g~~~~~~~~-----~~~~~~~~ 252 (403) T protein:vir:80 192 TKKSFMSGKYMPSLIVKVDAATA----------ELSSEEGRNAV----FKKYLEASEAGQPWIIPAE-----LLDVEQVK 252 (403) T ss_pred HHHHHHhccCCcceEEEeCCCCC----------hHHHHHHHHHH----HHHHhhhhhcCCeeeeccc-----ccccceec Confidence 22222332233333332211100 00000000000 0000 00111122222111 00011111 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 328 YIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEER 407 (513) Q Consensus 328 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~ 407 (513) .+ ......+....+.....|+..-++|+.-.+. +.+.+..... .....|...++.+..-+.. T Consensus 253 ~l--~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~~~~~---------------f~~~~l~P~~~~ie~~l~~ 314 (403) T protein:vir:80 253 PL--SLKDLAIHETVELDKRTVAGIFGVPAFLLGV-GKYDKDEYNN---------------FINSTILPIAKGIEQELTR 314 (403) T ss_pred cC--CHHHHHHHHHHHHhHHHHHHHhCCCHHHcCC-CCccHHHHHH---------------HHHHHHHHHHHHHHHHHHH Confidence 11 2233445566777788899998998754431 1122222111 2223344444444333332 Q ss_pred cccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCC Q lcl|NC_019916. 408 VNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLI 485 (513) Q Consensus 408 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~ 485 (513) .-- .+.+ ..+++....-+..|..+.++++.++ +|+++.-.+.+.++.-..+... ....+..... T Consensus 315 kll-~~~~-~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd------------~~~~~~n~~p 380 (403) T protein:vir:80 315 KLL-ISPD-LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLS------------ELVILENYIP 380 (403) T ss_pred hcc-CCCC-cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC------------eEeecccccc Confidence 110 1111 1233333566677888999988876 6888887777776432211000 0000000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 486 INGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) . ...++.+...+++...++.+.+ T Consensus 381 l----~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 381 L----DKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred h----hhccchhhccCCCCCCCCCCCC Confidence 0 0011111111111111111111 No 242 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=54.04 E-value=0.51 Score=22.17 Aligned_cols=422 Identities=10% Similarity=0.089 Sum_probs=155.5 Q ss_pred CccchhhceeccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHH Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADF 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~ 80 (513) +..-.-+...|....+--..-...++ ..+|+.+-.+++-+.. ...||+. T Consensus 39 ~~~~~~~g~~~~~~~~~~~~~~~~eL--------I~~YR~ma~~pEvd~A-----------------------v~eIvne 87 (511) T protein:vir:56 39 LLAPQLGHAIIPSDAQSEGTIPVKEL--------IKSYRALAEYHEVDDA-----------------------IQEIVDE 87 (511) T ss_pred cccceecceeccccccccCccchHHH--------HHHHHHHhhccchhhH-----------------------HHHhhcc Confidence 11111111112221110000000112 2234444443333221 1112222 Q ss_pred HH-HHhhcCCeeecCCc-----------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccc Q lcl|NC_019916. 81 QT-SYSVGNAIAMSGPS-----------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPME 148 (513) Q Consensus 81 ~~-~~l~g~p~~~~~~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~ 148 (513) .+ .--...||.+.-++ -++.+.+++--+|+....+..+.+.+.|+.|.+.-.|+...+.-...++|+. T Consensus 88 ~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~ 167 (511) T protein:vir:56 88 AIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMK 167 (511) T ss_pred eeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeehhhcCccc Confidence 21 12234455543322 1345566777789999999999999999999998888654332222368876 Q ss_pred eEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccce-----EE- Q lcl|NC_019916. 149 CFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPM-----IE- 222 (513) Q Consensus 149 ~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-----v~- 222 (513) +-.|..-.. +....+.. .......-+|++............+ ..+.--+||. ++ T Consensus 168 i~~vr~i~~--~~~~~~~v----------~~~~~ey~~Y~~~~~~~~~~~~~~~--------~~~~~vkI~~daI~y~hS 227 (511) T protein:vir:56 168 MELVREIQK--ETIDGVEV----------VKGTLEYYVYKQSDYKMPSWMSATN--------RAQTSFRIPKDAIVFAHS 227 (511) T ss_pred chhhhhhhc--cccccccc----------ccceeeeeEecCCCcccCccccccc--------ccccceeechhheeeecc Confidence 544432110 11111000 0001112244443211111000000 0001111221 11 Q ss_pred ----e-cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhh-hhcccccc Q lcl|NC_019916. 223 ----Y-RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADA-MKKLADEK 296 (513) Q Consensus 223 ----~-~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~ 296 (513) + .|+....|-++..+....-+ +++-|.+....-...|-+=+.-............+....-...+ ++..=+.. T Consensus 228 GL~d~~~~~g~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~ 306 (511) T protein:vir:56 228 GLMRGCADDPYIIGYLDRAIKPANQL-KMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQ 306 (511) T ss_pred cceeccCCCCeeeccchhhhHHHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEecc Confidence 1 12222344444422221111 11222222222222222111111100000000000000000000 00000000 Q ss_pred chhhhcchhcce----eeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccc--cc----cccc Q lcl|NC_019916. 297 MAQLEAMRQANM----ILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTD--DN----FSGN 366 (513) Q Consensus 297 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~~----~~~n 366 (513) ...++.++.-.. ++|+.- ..+.+-.+..|--..|+.-. .-++-.++-+|..-++|-.-. +. |... T Consensus 307 TGev~ddrk~msMlEDyWLpRR----eGgrgTEItTLpGgqnlgem-~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~G 381 (511) T protein:vir:56 307 TGQVKNTTNAMSMLEDYYLPRR----EGSKGTEVSTLPGGQSLGDI-EDVLYFNRKLYKAMRIPTSRAASEDQTGGINFG 381 (511) T ss_pred CceeccchhhhhhHhhhccccc----CCCCccceeeccccCCcChH-HHHHHHHHHHHHHhCCCcccccCCCCccccccc Confidence 001111110000 011100 00111222222222222222 234555666677678884332 21 2111 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----ceeeEEeCCCCCcCHHHHHH------ Q lcl|NC_019916. 367 SSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDP----DEIGFIFRDNLPTDDVAIIT------ 436 (513) Q Consensus 367 ~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~----~~i~i~f~~~~p~d~~e~a~------ 436 (513) -|..|-.-+....+.+.+.+..|..-|.++++.=+-+ ++.....++ ..|.+.|...-.-.+...++ T Consensus 382 -r~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLil---Kgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl 457 (511) T protein:vir:56 382 -QGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIV---NNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRM 457 (511) T ss_pred -cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh---ccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHH Confidence 2234555555566677888888888888887753322 222222233 45778886555544444333 Q ss_pred -HHHHHh---c-CCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCCC Q lcl|NC_019916. 437 -ALVQAG---A-QIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGLI 485 (513) Q Consensus 437 -~~~kl~---g-~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~ 485 (513) ++..+. | .+|.+++++.+=--+| .+++.+.|++|..+ +.+......+ T Consensus 458 ~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~--~~~~~~e~~f 511 (511) T protein:vir:56 458 NAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEEETN--PRFQQDDQGF 511 (511) T ss_pred HHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHhhcC--CCCCCcccCC Confidence 344443 3 3799999988644443 34444455544333 1111111111 No 243 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=50.55 E-value=0.61 Score=21.77 Aligned_cols=330 Identities=10% Similarity=0.002 Sum_probs=108.8 Q ss_pred HHHHHHHHHHHHHHhcCCCcccccc--ccccCCCCCCcceeecchhHH------HHHHH----HHHhhcCCeeecCC--- Q lcl|NC_019916. 31 YNNQRPRLEMLYDYYRGQNDGILSP--ASRRNEKGKADHRAVHSFARY------IADFQ----TSYSVGNAIAMSGP--- 95 (513) Q Consensus 31 ~~~~~~~~~~~~~YY~G~~~i~~~~--~~~~~~~~~~~~ri~~n~~~~------ivd~~----~~~l~g~p~~~~~~--- 95 (513) ..+++.|. +.+-..+........ .........+..-+..+=+.. +.+-. .+.....||.+.+= T Consensus 1 m~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~ 78 (368) T protein:vir:79 1 MSRNKTRR--AARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARS 78 (368) T ss_pred CCcccccc--chhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHH Confidence 01111100 000011100000000 000000000000000000000 11110 01112223332110 Q ss_pred ------c-------HHHHHHHHHhcC-H-HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcc Q lcl|NC_019916. 96 ------S-------SDRLDDFNRRND-I-DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPK 160 (513) Q Consensus 96 ------~-------~~~l~~~~~~n~-~-~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~ 160 (513) . -+.+..+..-|. + ......++.+.+.+|.||+.+..+..|++.-.+.++|..+-+.-+.. T Consensus 79 ~~~~~~h~~~~~~~~n~l~l~~~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~---- 154 (368) T protein:vir:79 79 FRAAAHHSSAVYVKRNILVSTFIPHPLLSRATFERLVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLDLN---- 154 (368) T ss_pred HhhccccchhhhhhcchhhhhcCCCcCCCHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeeccCC---- Confidence 0 001111112222 1 13345678899999999999988887876555556666553322211 Q ss_pred eEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHH Q lcl|NC_019916. 161 PIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLI 240 (513) Q Consensus 161 ~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~li 240 (513) +||.... . . . .-.|.++.+++++. +++. +...|.|.+.....-+ T Consensus 155 -----~~~~~~~-~--~-~----~~~~~~~dIihir~--------------~~~~---------~~~yGlsp~~~a~~si 198 (368) T protein:vir:79 155 -----TYFFVQN-W--Q-Q----PYTFAAGSVFHLQE--------------PDIN---------QEVYGLPEYLSALNAT 198 (368) T ss_pred -----EEEEEec-C--C-e----EEEEccccEEEecC--------------CCCC---------CCcccccHHHHHHHHH Confidence 1111110 0 0 0 00122222222211 0000 0125677666555444 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhe--ecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccc Q lcl|NC_019916. 241 DLYDVAQSDTANYMTDLNEAMLVI--KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAP 317 (513) Q Consensus 241 D~~~~~~S~~~~~~~~~~~~~l~~--~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~ 317 (513) +.-+.+..-....+...+.|-.++ +|.. ............++. ...... ++++.+.++ T Consensus 199 ~l~~aa~~~~~~~~~NGa~~~gil~~~~~~-----------l~~e~~~~lk~~~~~-----~~G~~N~g~~~vl~~~--- 259 (368) T protein:vir:79 199 WLNESATLFRRRYYKNGSHAGFILYMTDAA-----------QKQEDVDTLREAMKS-----AKGPGNFRNLFMYAPN--- 259 (368) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCC-----------CCHHHHHHHHHHHHH-----hcCCcccCceeEecCC--- Confidence 432222222222222223333222 2210 011111111111111 111111 223333221 Q ss_pred cccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG-VAMKYKVLGTVELASTKRKQFERGL 394 (513) Q Consensus 318 ~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~f~~~l 394 (513) +..++++|.. -......+.+..+...+.|+..-++|+.-.+-..++.++ .-++-. .+..+...+ T Consensus 260 ---g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~----------~~~f~~~~l 326 (368) T protein:vir:79 260 ---GKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKA----------AMVFARNEV 326 (368) T ss_pred ---CCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHH----------HHHHHHHHH Confidence 1233444444 333445567788888899999999998655433233221 111111 111222333 Q ss_pred HHHHHHHHHHHHhcccccccccceeeEEeCCCC--CcCHHHHHHHHHHHh Q lcl|NC_019916. 395 NQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNL--PTDDVAIITALVQAG 442 (513) Q Consensus 395 ~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~--p~d~~e~a~~~~kl~ 442 (513) .-+++.+.++...... . .+.|++.. -.|.+..|+...+.+ T Consensus 327 ~Pl~~~ie~ln~~l~~----e----~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 327 KPLQDRLLAINDWIGD----E----VVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred HHHHHHHHHHHhccCc----c----eeeechhHhhcccccccCCcccccC Confidence 3333333322221111 1 23454321 223333333333333 No 244 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=47.36 E-value=0.7 Score=21.42 Aligned_cols=295 Identities=11% Similarity=0.040 Sum_probs=98.0 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeec-C----------- Q lcl|NC_019916. 27 IRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMS-G----------- 94 (513) Q Consensus 27 i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~-~----------- 94 (513) ++++ ..++... ..+..+.- ......|-||.|..+- . T Consensus 1 ~~~~-------------------------~~~~~~~---~~~~~~~~----~~~~~~~~~~~p~~v~~~~~~~~~~~~~~ 48 (344) T protein:vir:56 1 MSKK-------------------------KGKTPQP---AAKTMTAS----APKMEAFTFGEPVPVLDRRDILDYVECIS 48 (344) T ss_pred CCCC-------------------------CCCCCch---hhHHhhcC----CCceEEEEcCCceeecCcchhhhHHHhhh Confidence 0110 0000000 00000000 0000122223322210 0 Q ss_pred -------C-cHHHHHHH---------------------HHhcCH--HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEE Q lcl|NC_019916. 95 -------P-SSDRLDDF---------------------NRRNDI--DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVK 143 (513) Q Consensus 95 -------~-~~~~l~~~---------------------~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~ 143 (513) . +-..|.++ ++-|.. ......++.+.+.+|.||+.+-.+..|++.-.+. T Consensus 49 ~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k~n~l~~~~~Pnp~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~p 128 (344) T protein:vir:56 49 NGRWYEPPVSFTGLAKSLRAAVHHSSPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLET 128 (344) T ss_pred cCccccCCCCHHHHHHHHhhhhhhCccceehhhhHHhhcCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEE Confidence 0 00111111 111211 1234567788899999999888888777654444 Q ss_pred EcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEe Q lcl|NC_019916. 144 LDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEY 223 (513) Q Consensus 144 ~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 223 (513) ++|..+-+.-+.. +||.... .+ . .-.|.++.+++++.. ++. T Consensus 129 l~~~~v~~~~~~~---------~~~~~~~-~g---~----~~~~~~~dIiHir~~--------------~~~-------- 169 (344) T protein:vir:56 129 SPAKYTRRGVEED---------VYWWVPS-FN---E----PTAFAPGSVFHLLEP--------------DIN-------- 169 (344) T ss_pred eCCceeEEeecCC---------EEEEEec-CC---e----EEEEcCccEEEECCC--------------CCC-------- Confidence 5555443322211 1111110 00 0 001233333332110 000 Q ss_pred cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhh---hhhhhee--cCcccccccccccccccchhhhhhhccccccch Q lcl|NC_019916. 224 RNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLN---EAMLVIK--GDIDTLFDDSTLLQMVDPSDADAMKKLADEKMA 298 (513) Q Consensus 224 ~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~---~~~l~~~--G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 298 (513) +.-.|.|.+.....-++ ...+-..-...+|. .|-.+++ |.. ............++... T Consensus 170 -~~~~Gls~~~~a~~si~---l~~~a~~~~~~~f~NGa~pg~Il~~~d~~-----------ls~e~~~~lk~~~~~~~-- 232 (344) T protein:vir:56 170 -QELYGLPEYLSALNSAW---LNESATLFRRKYYENGAHAGYIMYVTDAV-----------QDRNDIEMLRENMVKSK-- 232 (344) T ss_pred -CCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEecCCC-----------CCHHHHHHHHHHHHHhc-- Confidence 11246666554333222 21111111122332 2332222 210 01111111111111100 Q ss_pred hhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHH-HHHHHH Q lcl|NC_019916. 299 QLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVA-MKYKVL 377 (513) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A-i~~~~~ 377 (513) .....+.+.+.. +.+..++.++..++-......+.+..+..+++|+..-++|+.-.+-..++.++-+ ++-... T Consensus 233 ---g~~~~r~l~l~~---p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~ 306 (344) T protein:vir:56 233 ---GRNNFKNLFLYA---PQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK 306 (344) T ss_pred ---CCCCccceEEec---CCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHH Confidence 000111122211 1111223334444433445557778888889999999999866543323322111 111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHH Q lcl|NC_019916. 378 GTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVA 433 (513) Q Consensus 378 ~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e 433 (513) ......+.-+++.+..+....... .+.|.+.......+ T Consensus 307 ----------~f~~~tL~Pl~~~ie~~n~~l~~~--------~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 307 ----------VFVRNELIPLQDRIREINGWIGQE--------VIRFKNYSLDTDNG 344 (344) T ss_pred ----------HHHHHHHHHHHHHHHHHHhhhccc--------cccCCCccccccCC Confidence 111222222222222222111110 13454433332222 No 245 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=44.18 E-value=0.82 Score=21.07 Aligned_cols=466 Identities=12% Similarity=0.069 Sum_probs=161.2 Q ss_pred Cccchhhce------------eccCCcccCCHHHH-----HHHHH---H--HHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019916. 1 MIDMQQANM------------NYQEDADKLTPTRI-----AAFIR---H--HYNNQRPRLEMLYDYYRGQNDGILSPASR 58 (513) Q Consensus 1 ~~~~~~~~~------------~~~~~~~~~~~~~i-----~~~i~---~--~~~~~~~~~~~~~~YY~G~~~i~~~~~~~ 58 (513) |-.|-.--+ ..+.+.++-+.... ..++. . ..-....+|+.+..+++-+ T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd---------- 70 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEAD---------- 70 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchh---------- Confidence 211110000 00111111000000 00000 0 0011122233333333222 Q ss_pred cCCCCCCcceeecchhHHHHHHHH-HHhhcCCeeecCCc-----------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeE Q lcl|NC_019916. 59 RNEKGKADHRAVHSFARYIADFQT-SYSVGNAIAMSGPS-----------SDRLDDFNRRNDIDTLNYELYLDMTVTGRA 126 (513) Q Consensus 59 ~~~~~~~~~ri~~n~~~~ivd~~~-~~l~g~p~~~~~~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~ 126 (513) +-...||+..+ .-....||.+.-++ -++.+.+++--+|+....+..|.+.+.|+. T Consensus 71 -------------~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi 137 (558) T protein:vir:10 71 -------------GAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRV 137 (558) T ss_pred -------------hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEE Confidence 12222333222 22234555553222 123455667778999999999999999999 Q ss_pred EEEeeecCC----CceeEEEEEcccceEEEecCCCC---cceEEEEEEEeecccccc-cceeEEEEEEEcCCcEEEEEee Q lcl|NC_019916. 127 YEYVYRDPS----QKGEVSVKLDPMECFIIYDRSVN---PKPIMAVRYHAVQTVVDN-ITQTKYEVETWTENDYTRYKPI 198 (513) Q Consensus 127 ~~~v~~d~~----~~~~~~~~~~p~~~~~~~d~~~~---~~~~~~ir~~~~~~~~~~-~~~~~~~ve~yt~~~~~~~~~~ 198 (513) |.+...|.+ |-..+. .++|+.+-.|..-... ......++. ..+. ........-+|++...++... T Consensus 138 yfHKiid~k~pk~GI~ELr-~lDPr~i~~Vr~i~~~~~~~~~~~~~~~-----~~~~~~~~~~~eyy~Y~~~~~~~~~~- 210 (558) T protein:vir:10 138 FYLKVIDTKNPQEGIQDLR-YIDPLKIKFIRQEKRKPGNQDPAIRVRS-----EQDVVPNPEFEEFYIYTPKVQHPTGM- 210 (558) T ss_pred EEEEEEeCCCccccceeee-eeCcccceeeeeeccccccccceeeeec-----ccceeeccceeEeeeecCCccccccc- Confidence 999998754 322232 3788876554431110 111111110 0000 000001112444443332211 Q ss_pred ccCCccccccccccccCcccc--eEEec-------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcc Q lcl|NC_019916. 199 VVAGSVPTLEVAEHSAQFGFP--MIEYR-------NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDID 269 (513) Q Consensus 199 ~~~~~~~~~~~~~~~~~g~vP--vv~~~-------n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~ 269 (513) ++.... .++ -+|| .|.|+ |...-.|-++..+....-+ +++-|.+....-...|-+=+.-... T Consensus 211 --~~~~~~-----~~~-vkI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQL-kmlEDAlVIYRitRAPERRvFYIDV 281 (558) T protein:vir:10 211 --VGQMGG-----KNS-IKIAKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQL-RMIEDSLVIYRLSRAPERRIFYIDV 281 (558) T ss_pred --ceeecC-----CCc-eeechhheeeecccceecCCCeeeecchHhhHhHHhh-HHHHhhHHHHhhhccccceEEEEec Confidence 110000 000 1222 12221 1111123333321111111 1122222222222222211111110 Q ss_pred cccccccccccccchhhhh-hhccccccchhhhcchhcce----eeccccccccccccCCceeEEeecCCHHHHHHHHHH Q lcl|NC_019916. 270 TLFDDSTLLQMVDPSDADA-MKKLADEKMAQLEAMRQANM----ILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKR 344 (513) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 344 (513) ...-.....+....-...+ ++..-+.....++.++.-.. ++|+.- ..+.+-.+..|--..|+.-. .-++- T Consensus 282 GnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRR----eGgrgTEItTLpGgqnLgem-~DV~Y 356 (558) T protein:vir:10 282 GNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRR----EGGRGTEITTLPGGQNLGEL-SDVDY 356 (558) T ss_pred CCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhccccc----CCCCccceeeccccCCcchH-HHHHH Confidence 0000000000000000000 00000000000111110000 011100 00111222222222222222 23445 Q ss_pred HHHHHHHHhCccccccccccc-cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----ce Q lcl|NC_019916. 345 LAADIHKFSHTPDLTDDNFSG-NS-SGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDP----DE 418 (513) Q Consensus 345 l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~----~~ 418 (513) .++-+|..-++|-.-.+.-++ +. -|..|-.-+....+.+.+.+..|..-|..+++.=+-+ ++.....++ .. T Consensus 357 F~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLil---Kgiit~eeW~~i~~~ 433 (558) T protein:vir:10 357 FQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVL---KNIVTPEDWKTMEDH 433 (558) T ss_pred HHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh---ccCCCHHHHHHHhhc Confidence 556666666787432221111 11 2223544555566677888888888888887753322 222222233 45 Q ss_pred eeEEeCCCCCcCHHHHHH-------HHHHHh---c-CCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHH---H-Hhhhh Q lcl|NC_019916. 419 IGFIFRDNLPTDDVAIIT-------ALVQAG---A-QIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAML---K-TYDTK 481 (513) Q Consensus 419 i~i~f~~~~p~d~~e~a~-------~~~kl~---g-~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~---~-~~~~~ 481 (513) |.+.|...-.-.+...++ ++..+. | .+|.+++++.+=--+| .+++-+.|++|..+-. + ..+++ T Consensus 434 I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~ 513 (558) T protein:vir:10 434 IQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPI 513 (558) T ss_pred ceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChh Confidence 778886555544443333 344443 3 3799999988544443 5555666666654310 0 01111 Q ss_pred c-CCCCCCCC------CCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 482 G-GLIINGTS------GNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 482 ~-~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) . +..+.+.. +....+++....+.+.+.+.+-+ T Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (558) T protein:vir:10 514 TGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSKD 552 (558) T ss_pred hccccCccCCchhccCCCCCcccccccchhhhhhhhhhh Confidence 1 11111000 11111111111111111111111 No 246 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=42.19 E-value=0.9 Score=20.85 Aligned_cols=409 Identities=11% Similarity=0.018 Sum_probs=164.1 Q ss_pred CCcc---cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee--ecchhHHHHHHHHHHhhc Q lcl|NC_019916. 13 EDAD---KLTPTRIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA--VHSFARYIADFQTSYSVG 87 (513) Q Consensus 13 ~~~~---~~~~~~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~~l~g 87 (513) |+.+ --+|....+..... ..+....-|. +.-+++.+.............++ ......-.+.+...-+.+ T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~-----~~~~~~~g~~-~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~ 74 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAM-----EHLGLATSYL-SEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLN 74 (446) T ss_pred CcccccCCCchhhhhhhhhcc-----ccchhhcccC-CcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhc Confidence 2221 13444444433221 0112222222 22122211110000000011111 245666677777777788 Q ss_pred CCeeecCCcHHH---HHHHHHhcCHHHHHHHHHHHHhhCCeEE-EEeeecCCCceeEEEEE------cccceEEEecCCC Q lcl|NC_019916. 88 NAIAMSGPSSDR---LDDFNRRNDIDTLNYELYLDMTVTGRAY-EYVYRDPSQKGEVSVKL------DPMECFIIYDRSV 157 (513) Q Consensus 88 ~p~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~d~~~~~~~~~~~------~p~~~~~~~d~~~ 157 (513) -++++...+++. +.+++..-.+...... ..++..+|.++ +++|.-..|.......+ .|...--.|+... T Consensus 75 ~~w~V~p~~~~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~~ 153 (446) T protein:vir:98 75 KVGPYQHGDKRIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDNG 153 (446) T ss_pred CCceecCccHHHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeecccccccchhhccccccccccceeeeccCC Confidence 888887765543 5566655555554444 57888899886 56775433322111101 1111111111111 Q ss_pred CcceEEEEEEEeecccccccceeEEEE--EEEcCCcEEEEEeeccCCccccccccccccCcccceEEecC-----CCCCC Q lcl|NC_019916. 158 NPKPIMAVRYHAVQTVVDNITQTKYEV--ETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQ 230 (513) Q Consensus 158 ~~~~~~~ir~~~~~~~~~~~~~~~~~v--e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~ 230 (513) ....+. ..+.... -.|.+-..+++....... .......+-|..++ +.++. +..|. T Consensus 154 --~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~g~~~~iP~~kf--i~~~~~~~~~~p~G~ 215 (446) T protein:vir:98 154 --RIVDGD------------TVTASQYKSGYWVPLPPYRIGDPPKKV--DVVGSHVRLPSHKR--LFINYNTKGNNPWGT 215 (446) T ss_pred --cccccc------------ccchhhcccccccCcccchhhhhhhhc--ccCcccccccccce--EEEEecCCCCCcccc Confidence 000000 0000000 001110000000000000 00001111122222 33322 34577 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceee Q lcl|NC_019916. 231 GDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMIL 310 (513) Q Consensus 231 sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 310 (513) |.+..+--.-=-=+..+-+.+...+.|+.|+++.+-..+....+...+.. ..........-...+..+..+.... T Consensus 216 gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~-----~~~~~~~~~~L~~av~~~~~da~~i 290 (446) T protein:vir:98 216 SCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDG-----TEITTTIAEQAEDALRRLSTDSGLV 290 (446) T ss_pred chHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhH-----HHHHHHHHHHHHHHHHhccccceee Confidence 77766555444446667778889999999999887543322111110000 0000000000001111111111111 Q ss_pred ccccccccccccCCceeEEeecCCH-HHHHHHHHHHHHHHHHHhCccccccccc----ccc-ccHHHHHHHHHHHHHHHH Q lcl|NC_019916. 311 LKTGMAPNGQQTSADANYIHKEYDS-AGTELYKKRLAADIHKFSHTPDLTDDNF----SGN-SSGVAMKYKVLGTVELAS 384 (513) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~i~~~s~~p~~~~~~~----~~n-~Sg~Ai~~~~~~l~~k~~ 384 (513) +... ...++..+++++..... ..++..++.+.+.|...--...+..+.. +++ ++.+--+.....+..-+ T Consensus 291 i~~~----~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa- 365 (446) T protein:vir:98 291 LTQL----SKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIF- 365 (446) T ss_pred eecc----cCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHH- Confidence 1110 11345677888766443 4589999999999987654443332211 111 11121222111111122 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHhcccccccccce---eeEEeCCCCCcCHHHHHHHHHHHh--cC-CC--HHHHHHhCC Q lcl|NC_019916. 385 TKRKQFERGL-NQRYTVVAHIEERVNGKWDIDPDE---IGFIFRDNLPTDDVAIITALVQAG--AQ-IP--QEYLYQYLP 455 (513) Q Consensus 385 ~~~~~f~~~l-~~~~~li~~~l~~~~~~~~~~~~~---i~i~f~~~~p~d~~e~a~~~~kl~--g~-iS--~et~~~~l~ 455 (513) +.+...+ +++++-++.+ +......... -.++|....+.|....++++.++. |+ ++ .+.+.+.++ T Consensus 366 ---~~i~~tln~~Li~~l~~l----Nf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~g 438 (446) T protein:vir:98 366 ---DTVIHAFTEQVIGNLIRL----NFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITG 438 (446) T ss_pred ---HHHHHHHHHHHHHHHHHh----CCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhC Confidence 2333333 2344444332 2211111111 134566678889999999999874 54 44 344555554 Q ss_pred CCCCHHHHH Q lcl|NC_019916. 456 NVTDADEIV 464 (513) Q Consensus 456 ~v~D~~~E~ 464 (513) +.+++.-- T Consensus 439 -iP~~~~~~ 446 (446) T protein:vir:98 439 -LPDAISST 446 (446) T ss_pred -cCCCCCCC Confidence 33211111 No 247 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=40.82 E-value=0.95 Score=20.70 Aligned_cols=301 Identities=13% Similarity=0.074 Sum_probs=97.8 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcce---eecchhH------HHHHHHHHHhhc----CCeeec Q lcl|NC_019916. 27 IRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHR---AVHSFAR------YIADFQTSYSVG----NAIAMS 93 (513) Q Consensus 27 i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~r---i~~n~~~------~ivd~~~~~l~g----~p~~~~ 93 (513) ++++ + ...+. +.... ......+ +..+=|. .+.+-.--+..| -|+... T Consensus 1 m~~~----~----------~~~~~----~~~~~--~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~ 60 (344) T protein:vir:60 1 MSKK----K----------GKTLQ----PAAKK--MTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFT 60 (344) T ss_pred CCcc----c----------CCCCC----chHHh--hcCCcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHH Confidence 0000 0 00000 00000 0000000 0001000 011111001111 122221 Q ss_pred CCc----------------HHHHHHHHHhcC-H-HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecC Q lcl|NC_019916. 94 GPS----------------SDRLDDFNRRND-I-DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDR 155 (513) Q Consensus 94 ~~~----------------~~~l~~~~~~n~-~-~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~ 155 (513) +=. -+.+...++-|. + ......++.+.+.+|.||+.+-.+..|++.-...++|..+-+..+. T Consensus 61 ~la~~~~a~~~h~~~i~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~ 140 (344) T protein:vir:60 61 GLAKSLRAAVHHSSPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEE 140 (344) T ss_pred HHHHHHHhhhhhccchhhhhhHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecC Confidence 100 011111112232 1 1234567788899999999888888787654445555554333221 Q ss_pred CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecC-----CCCCC Q lcl|NC_019916. 156 SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQ 230 (513) Q Consensus 156 ~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~ 230 (513) . +||.+... + . .-.|.++.+ +++++ .-.|. T Consensus 141 ~---------~~~~v~~~-~---~----~~~~~~~eI----------------------------iHir~~~~~~~~yGl 175 (344) T protein:vir:60 141 D---------VYWWVPSF-N---E----PTAFAPGSV----------------------------FHLLEPDINQELYGL 175 (344) T ss_pred C---------eEEEEccC-C---e----EEEEcCccE----------------------------EEEcCCCCCCCcccc Confidence 1 12211100 0 0 001222222 33322 12466 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhh---hhhhhe--ecCcccccccccccccccchhhhhhhccccccchhhhcchh Q lcl|NC_019916. 231 GDFENVLSLIDLYDVAQSDTANYMTDLN---EAMLVI--KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ 305 (513) Q Consensus 231 sd~e~v~~liD~~~~~~S~~~~~~~~~~---~~~l~~--~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 305 (513) |.+.....-++ ...+-..-...+|. .|-.++ +|.. ....+.......++.... ... T Consensus 176 sp~~~a~~si~---l~~~a~~~~~~~f~NG~~pg~il~~~~~~-----------ls~e~~~~ik~~~~~~~g-----~~~ 236 (344) T protein:vir:60 176 PEYLSALNSAW---LNESATLFRRKYYENGAHAGYIMYVTDAV-----------QDRNDIEMLRENMVKSKG-----RNN 236 (344) T ss_pred cHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEecCcC-----------CCHHHHHHHHHHHHHhcC-----CCC Confidence 66554333322 22111111112222 222222 2210 111111111111111100 001 Q ss_pred cceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHH-HHHHHHHHHHHHH Q lcl|NC_019916. 306 ANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVA-MKYKVLGTVELAS 384 (513) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A-i~~~~~~l~~k~~ 384 (513) .+.+.+.. +.+..++.++..++-......+.+..+..++.|+..=++|+.-.+-..++.++-+ ++-. T Consensus 237 ~r~~~l~~---p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~--------- 304 (344) T protein:vir:60 237 FKNLFLYA---PQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKV--------- 304 (344) T ss_pred CcceEEec---CCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHH--------- Confidence 11122111 1111222333333333444557788888999999999999865543322222110 1110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCC-CCcCHH Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDN-LPTDDV 432 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~-~p~d~~ 432 (513) .+......+.-+++.+..+-...+.. .+.|.+. +..+.+ T Consensus 305 -~~~f~~~~L~Pl~~~~e~ln~~lg~~--------~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 305 -AKVFVRNELIPLQDRIREINGWLGQE--------VIRFKNYSLDTDNG 344 (344) T ss_pred -HHHHHHHHHHHHHHHHHHHHHhcCCc--------ccccCccccCCCCC Confidence 00111222222222222222222111 1345432 222233 No 248 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=40.11 E-value=0.99 Score=20.62 Aligned_cols=291 Identities=11% Similarity=0.056 Sum_probs=96.5 Q ss_pred cCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecC-------------------C-cHHHHHHHH- Q lcl|NC_019916. 46 RGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSG-------------------P-SSDRLDDFN- 104 (513) Q Consensus 46 ~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~-------------------~-~~~~l~~~~- 104 (513) ..+... ++... ....+..+. ..|.||.|..+.. . +-..|.+++ T Consensus 1 m~~~~~--~~~~~--~~~~~~~~~------------~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~ 64 (340) T protein:vir:98 1 MSKRKP--RKAVA--MTASAPQKM------------EAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLR 64 (340) T ss_pred CCCCCC--Ccccc--ccccCccce------------eEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHH Confidence 111100 00000 000000000 1122222222100 0 001111111 Q ss_pred --------------------HhcCH--HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceE Q lcl|NC_019916. 105 --------------------RRNDI--DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPI 162 (513) Q Consensus 105 --------------------~~n~~--~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~ 162 (513) .-|.. ......++.+.+.+|.||+.+-.+..|++.-...++|..+-+..+. . T Consensus 65 a~~~h~s~i~~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~~----~-- 138 (340) T protein:vir:98 65 SAVHHSSPIYVKRNVLASTYIPHPLLSRQDFSRFALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVDD----S-- 138 (340) T ss_pred hccccchhhhhhhhHHhhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcccC----c-- Confidence 11211 1334567788899999999988887776544444454443332111 0 Q ss_pred EEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHH Q lcl|NC_019916. 163 MAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDL 242 (513) Q Consensus 163 ~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~ 242 (513) ++|..... . . ...|.++.+++++.. ++. ....|.|.+.....-++. T Consensus 139 ---~~~~~~~~---~-~----~~~~~~~eViHir~~--------------~~~---------~~~~Gls~~~~a~~si~l 184 (340) T protein:vir:98 139 ---VFWFVENF---T-Q----PHEFAPDTVFHLLEP--------------DIN---------QEIYGLPEYLSALNSAWL 184 (340) T ss_pred ---EEEEEecC---C-e----EEEEccccEEEEcCC--------------CCC---------CCcccccHHHHHHHHHHH Confidence 12221110 0 0 011233333333210 000 011455655543332221 Q ss_pred HHHHHHHHHHHHHHhhhhh--hheecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccc Q lcl|NC_019916. 243 YDVAQSDTANYMTDLNEAM--LVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNG 319 (513) Q Consensus 243 ~~~~~S~~~~~~~~~~~~~--l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 319 (513) -+.+..-....+..-+.|- +.++|... ...........++. ...... .+++.+.++ + T Consensus 185 ~~aa~~~~~~~f~NGa~pg~il~~~~~~l-----------s~e~~~~lk~~~~~-----~~G~~n~~~~~vl~~~----g 244 (340) T protein:vir:98 185 NESATLFRRKYYQNGAHAGYIMYVTDPAQ-----------SATDVESLRDAMRN-----SKGLGNFKNLFFYSPN----G 244 (340) T ss_pred HHHHHHHHHHHHhccCCCceEEEecCCCC-----------CHHHHHHHHHHHHH-----hcCccccCceeEecCC----C Confidence 1111101111111111222 22222110 00000111111110 010111 122332221 1 Q ss_pred cccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG--VAMKYKVLGTVELASTKRKQFERGLNQR 397 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg--~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 397 (513) ...+.++..++-......+.+..+..++.|+..=++|+.-.+-..++.++ .+-+.. +..+...+.-+ T Consensus 245 ~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~-----------~~f~~~~l~Pl 313 (340) T protein:vir:98 245 KPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVA-----------KVFVRNELSPL 313 (340) T ss_pred CccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHH-----------HHHHHHHHHHH Confidence 12233444444444555677888888999999999998654432222221 111111 11222223333 Q ss_pred HHHHHHHHHhcccccccccceeeEEeCCCC-CcCH Q lcl|NC_019916. 398 YTVVAHIEERVNGKWDIDPDEIGFIFRDNL-PTDD 431 (513) Q Consensus 398 ~~li~~~l~~~~~~~~~~~~~i~i~f~~~~-p~d~ 431 (513) ++.+.++....... -++|++.. .+.+ T Consensus 314 ~~~iee~n~~L~~e--------~~rF~~~~l~~~d 340 (340) T protein:vir:98 314 QDRFREVNDWLGME--------VIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHhccccc--------ccccCccccccCC Confidence 33332222111110 13453222 2222 No 249 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=38.90 E-value=1 Score=20.48 Aligned_cols=318 Identities=9% Similarity=-0.026 Sum_probs=98.0 Q ss_pred HHHHHHHHHHHHHHhcCCCccccccccccCCCC-CCc----ceeecchhHHHHHHHHHHhhcCCeeecCCcHHHHHHHH- Q lcl|NC_019916. 31 YNNQRPRLEMLYDYYRGQNDGILSPASRRNEKG-KAD----HRAVHSFARYIADFQTSYSVGNAIAMSGPSSDRLDDFN- 104 (513) Q Consensus 31 ~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~----~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~~~~l~~~~- 104 (513) .+++.. ......-..+.... ..+ .+. ++-...+...+-+.. +...--|+.+ ..|-+++ T Consensus 1 ~~~~~~---------~~~~~~~~~~~~~~-~~~~~p~~~~~~~~~~~~~~~~~~~~-~~~~epp~~~-----~~La~l~~ 64 (348) T protein:vir:26 1 MTEQLI---------HSHTTDGTESKSVY-SFDPNPEPVDTNSWMTRYCELFYNDF-DDYWEPPISL-----KGLAEIAN 64 (348) T ss_pred CCcccc---------chhhccccCCceEE-EecCCCeeecCcchHHHHHHHHhcCC-CccccCCCCH-----HHHHHHHh Confidence 000000 00000000000000 000 000 000011111111000 0000111111 1111111 Q ss_pred --------------------HhcCH--HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceE Q lcl|NC_019916. 105 --------------------RRNDI--DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPI 162 (513) Q Consensus 105 --------------------~~n~~--~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~ 162 (513) .-|.. .....+++.+.+.+|.||+.+-.+..|++.-...++|..+-+.-| . . T Consensus 65 ~n~~h~~~i~~k~N~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d-~---~-- 138 (348) T protein:vir:26 65 ANGYHGSLLKARANYVAGRFMNGGGLPMYKMNSACWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKN-G---D-- 138 (348) T ss_pred hhhhhhhhHhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeeec-C---c-- Confidence 11211 134456778889999999999888888765444455544332111 0 0 Q ss_pred EEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHH Q lcl|NC_019916. 163 MAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDL 242 (513) Q Consensus 163 ~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~ 242 (513) +|.... .+ . ...|.++.+++++.. ++. +...|.|.+...+.-+.. T Consensus 139 ----~~~~~~-~g---~----~~~f~~~dIiHir~~--------------~~~---------~~~~Gls~~~~a~~si~l 183 (348) T protein:vir:26 139 ----FVQLLR-NN---E----QKVFKAKDVIFIPQY--------------DPQ---------QQIYGLPDYLGSIQSSLL 183 (348) T ss_pred ----EEEEEe-cC---e----EEEEcCccEEEEcCC--------------CCC---------CCcccccHHHHHHHHHHH Confidence 111100 00 0 012333333332210 000 112466665543333321 Q ss_pred HHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeecccccccccccc Q lcl|NC_019916. 243 YDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAPNGQQT 322 (513) Q Consensus 243 ~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (513) -+.+..-....+..-+.|-.+++-.. ..............++..... ..-.+.+.+.++ +... T Consensus 184 ~~~a~~~~~~~f~NGa~pg~Il~~~~---------~~ls~e~~~~lk~~~~~~~G~----~n~~~~~vl~~~----g~~~ 246 (348) T protein:vir:26 184 NRDATLFRRRYYLNGAHMGFIFYATD---------PNLSEADEKALKEKIASSKGI----GNFRSMFVNIPN----GKEK 246 (348) T ss_pred HHHHHHHHHHHHhccCCCceEEEecC---------CCCCHHHHHHHHHHHHHhcCc----ccccceeEEcCC----CCcc Confidence 11111111111222222222221100 001111111111111111000 001122322221 1122 Q ss_pred CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 323 SADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSS--GVAMKYKVLGTVELASTKRKQFERGLNQRYTV 400 (513) Q Consensus 323 ~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S--g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~l 400 (513) +.++..++-......+.+..+.-+++|+..-++|+.-.+-...+.+ +.+-+.. +..+...+.-+++. T Consensus 247 Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~-----------~~f~~~~l~P~~~~ 315 (348) T protein:vir:26 247 GIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVS-----------QVYDFYEVIPVCKR 315 (348) T ss_pred ceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHH-----------HHHHHHHHHHHHHH Confidence 2333333333344456777788888999999999765432211111 1111111 11122223333333 Q ss_pred HHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHH Q lcl|NC_019916. 401 VAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITAL 438 (513) Q Consensus 401 i~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~ 438 (513) +...++..-.. .....+++.|++..-++.+. ++ T Consensus 316 ie~~ln~~l~~--~~~~~~~fdl~~~~e~~~~~---a~ 348 (348) T protein:vir:26 316 FMDAVNNDPEI--PDNLKLKFNLNPGVESANGS---AV 348 (348) T ss_pred HHHHHhhhhCC--CCccEEEEecCcccccchhh---cC Confidence 33222211110 11122333344333222222 22 No 250 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=38.71 E-value=1.1 Score=20.46 Aligned_cols=353 Identities=9% Similarity=-0.010 Sum_probs=123.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHHHHHHHHHHhhcCCeeecCCc---HHHH Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARYIADFQTSYSVGNAIAMSGPS---SDRL 100 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~p~~~~~~~---~~~l 100 (513) .-+++..+.++.. .....++-.+.. .....-+.......+|+..++-+-+-|+.+.... +..+ T Consensus 1 Mg~f~~l~~~~~~-~~~~~~~~~~~~-------------~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~l 66 (376) T protein:vir:78 1 MGFFSELFKRNKE-IEWMWDLDFLED-------------KTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGETSVRDKL 66 (376) T ss_pred CchhhhhhccCCc-cccccchhhccc-------------cchhhhhhhHHHHHHHHHHHHhhcccceeeccccccccchH Confidence 1111110000000 000000000000 0000112234455677777777777788763222 2233 Q ss_pred HHHHH-h-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEEeeccccc Q lcl|NC_019916. 101 DDFNR-R-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVD 175 (513) Q Consensus 101 ~~~~~-~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~ 175 (513) ..++. . |. .......+..+.+.+|.||+++..+..+.....+.+.|..+ +. .+++...... T Consensus 67 ~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~---~~----------~~~~~~~~~~- 132 (376) T protein:vir:78 67 YYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAF---FP----------DVFEGVTVKD- 132 (376) T ss_pred HHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccce---ee----------eeeeeeeeec- Confidence 33332 2 32 33455667788888999998877665543322222222211 11 1111110000 Q ss_pred ccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 176 NITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMT 255 (513) Q Consensus 176 ~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~ 255 (513) . .....+..+.+++++. +..| +.+ ...+++..+..+.+....... T Consensus 133 ---~--~~~~~~~~~evih~~~------------------~~~~---------~~~---~~~~~~~~~~~~~~~~~~~~~ 177 (376) T protein:vir:78 133 ---Y--RYNRNFSMDDVIFLEY------------------GNER---------LSA---FTDGMFEDYGELFGKMIRAQM 177 (376) T ss_pred ---c--eeeeeeccccEEEecc------------------CCCC---------chh---hhhHHHHHHHHHHHHHHHHHH Confidence 0 0011233333333321 1111 111 112233333333333222221 Q ss_pred Hhh--hhhhheecCcccccccccccccccchhhhhhhccccccchhhhcch--hcceeeccccccccccccCCceeEEee Q lcl|NC_019916. 256 DLN--EAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMR--QANMILLKTGMAPNGQQTSADANYIHK 331 (513) Q Consensus 256 ~~~--~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~ 331 (513) ... .+.+++.. ......+ ....++..=........ ...++.+ +++++|... T Consensus 178 ~~~~~~~~~~~~~-----------~~~~~~e---~~~~~~~~~~~~~~g~~~~~~~v~~l-----------~~g~~~~~l 232 (376) T protein:vir:78 178 RNFQIRGAVNFKM-----------AGVADKD---KQTKLQEYIDKVYASFNNNEIAIVPQ-----------LEGFNYEEF 232 (376) T ss_pred hcCCCceeEEEcc-----------CCCCCHH---HHHHHHHHHHHHhccccccCcceEEc-----------CCCceEEee Confidence 111 11111110 0000000 00111110000010000 0111212 233343333 Q ss_pred cCC-------HHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 332 EYD-------SAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 332 ~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) +.+ ...+....+...+.|+..-++|+.-.....+|.+...+. .+..++...++.+..- T Consensus 233 ~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~~~~---------------f~~~~l~P~~~~ie~~ 297 (376) T protein:vir:78 233 GTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNNMKA---------------YMEYCIDPLTKKLEDE 297 (376) T ss_pred ccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHHHHH---------------HHHHHHHHHHHHHHHH Confidence 222 234566777778889999999876543222222222222 2223333333333333 Q ss_pred HHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 405 EERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMMDKQRKAMLKTYDT 480 (513) Q Consensus 405 l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~ 480 (513) +...--. .....+.+.+...+-.|..+.++++.++ .|+++.-.+.+.++. +++.. .. ....+ T Consensus 298 l~~kll~--~~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~--~d----------~~~~~ 363 (376) T protein:vir:78 298 LNAKLFT--FSEFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPE--LD----------KYLIT 363 (376) T ss_pred HHhhhCC--cccceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cc----------eeeec Confidence 3211100 0111222333334456888889988876 577887666666533 11110 00 00000 Q ss_pred hcCCCCCCCCCCCCCCCC Q lcl|NC_019916. 481 KGGLIINGTSGNDPEDEG 498 (513) Q Consensus 481 ~~~~~~~~~~~~~~~~~~ 498 (513) ... .....+++++ T Consensus 364 ~n~-----~~~~~~~e~g 376 (376) T protein:vir:78 364 KNY-----QSADEGGEDG 376 (376) T ss_pred cCc-----eehhccccCC Confidence 000 0000111111 No 251 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=38.05 E-value=1.1 Score=20.39 Aligned_cols=316 Identities=15% Similarity=0.079 Sum_probs=101.5 Q ss_pred HHHHHhcCCCccccccccccCCCCCCccee-ecch--hHH------HHHHHHHHhhcC----CeeecCCcH--------- Q lcl|NC_019916. 40 MLYDYYRGQNDGILSPASRRNEKGKADHRA-VHSF--ARY------IADFQTSYSVGN----AIAMSGPSS--------- 97 (513) Q Consensus 40 ~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri-~~n~--~~~------ivd~~~~~l~g~----p~~~~~~~~--------- 97 (513) +-++.+...+.-...+... ...+...+. +.-| +.. +.+-.--+..|+ |+.+.+-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~ 78 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPS--AGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHS 78 (351) T ss_pred CCCCCCCCCCCCCCCCchh--hhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhh Confidence 0000011110000000000 000000010 1110 111 111111111222 222211000 Q ss_pred HHHH-------HHHHhcCH--HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEE Q lcl|NC_019916. 98 DRLD-------DFNRRNDI--DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYH 168 (513) Q Consensus 98 ~~l~-------~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~ 168 (513) ..|. ..+.-|.. .....+++.+.+.+|.||+.+-.+..|.+.-.+.++|..+-+..+.. +|| T Consensus 79 ~~l~~k~n~l~~~~~Pnp~~t~~~f~~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~---------~~~ 149 (351) T protein:vir:79 79 SALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS---------GFV 149 (351) T ss_pred hhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCC---------eEE Confidence 0010 01111211 12345677899999999999988888876555556666554433221 111 Q ss_pred eecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHH Q lcl|NC_019916. 169 AVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQS 248 (513) Q Consensus 169 ~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S 248 (513) .... . . ..-.|.++.+++++.. ++. +...|.|.+.....-+..-+.+.. T Consensus 150 ~~~~-~--g-----~~~~~~~~eIihir~~--------------~~~---------~~~yGl~~~~~a~~si~l~~~a~~ 198 (351) T protein:vir:79 150 YVNG-W--Q-----ERHEFEPDSVFQLVRP--------------DIN---------QEVYGLPEYLSSLHSAWLNESSTL 198 (351) T ss_pred EEec-C--c-----eEEEEcCccEEEeCCC--------------CCC---------CCcccccHHHHHHHHHHHHHHHHH Confidence 1110 0 0 0012333333332210 000 112466655543333321111111 Q ss_pred HHHHHHHHhhhhhhhe--ecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCCc Q lcl|NC_019916. 249 DTANYMTDLNEAMLVI--KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSAD 325 (513) Q Consensus 249 ~~~~~~~~~~~~~l~~--~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 325 (513) -....+...+.|-.++ +|.. ....+.......++. ...... .+.+.+.++ +...+.+ T Consensus 199 ~~~~~f~NGa~pg~il~~~~~~-----------ls~e~~~~lk~~~~~-----~~G~~N~~~~~v~~~~----g~~~gi~ 258 (351) T protein:vir:79 199 FRRKYYENGSHAGFILYMTDAA-----------QKQDDVDNMRDALKN-----AKGPGNFRNVFMYAPG----GKKDGIQ 258 (351) T ss_pred HHHHHHhccCCCceEEEecCCC-----------CCHHHHHHHHHHHHH-----hcCccccCceeEecCC----CCccceE Confidence 1111112222222222 2210 011111111111110 111111 122222221 1122233 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 326 ANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG-VAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 326 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) +..+.-......+.+..+..++.|+..-++|+.-.+-..++.++ .-++-. .+..+...|.-+++.+.++ T Consensus 259 ~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~----------~~~f~~~~l~Pl~~~ie~l 328 (351) T protein:vir:79 259 LIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTA----------ARVFGRNEIRPLQARFAEL 328 (351) T ss_pred EEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHHHH Confidence 33334333445577788888899999999997655433222221 111111 1112222233333333222 Q ss_pred HHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH Q lcl|NC_019916. 405 EERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA 441 (513) Q Consensus 405 l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl 441 (513) ....+. + -+.|++.. +..+..++ T Consensus 329 n~~lg~----~----~~~F~~~~------llr~d~~a 351 (351) T protein:vir:79 329 NDWLGD----E----VVTFDDYE------IPPAPVAA 351 (351) T ss_pred HhhcCc----c----eeeeChhh------hccccccC Confidence 111111 1 14564422 11111121 No 252 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=36.22 E-value=1.2 Score=20.18 Aligned_cols=316 Identities=13% Similarity=0.062 Sum_probs=105.3 Q ss_pred HHHHHhcCCCccccccccccCCCCCCccee-ecch--hHHH------HHHHHHHhhcC----CeeecCCcH--------- Q lcl|NC_019916. 40 MLYDYYRGQNDGILSPASRRNEKGKADHRA-VHSF--ARYI------ADFQTSYSVGN----AIAMSGPSS--------- 97 (513) Q Consensus 40 ~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri-~~n~--~~~i------vd~~~~~l~g~----p~~~~~~~~--------- 97 (513) +-++.+...+.-...+... ...+...+. +.-| +..+ ++-.--+..|+ |+.+.+-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~ 78 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPS--AGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHS 78 (351) T ss_pred CCCCCCCCCCCCCCCCchh--hhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhh Confidence 0000011110000000000 000000010 1110 1111 11111111122 122111000 Q ss_pred HHHH-------HHHHhcCH--HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEEEEEEE Q lcl|NC_019916. 98 DRLD-------DFNRRNDI--DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIMAVRYH 168 (513) Q Consensus 98 ~~l~-------~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~ 168 (513) ..|. ..+.-|.. .....+++.+.+.+|.||+.+-.+..|.+.-.+.++|..+.+..+.. +|| T Consensus 79 ~~l~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~---------~~~ 149 (351) T protein:vir:78 79 SALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS---------GFV 149 (351) T ss_pred hhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCC---------eEE Confidence 0010 01111221 23355677899999999999988888876555556666655443321 111 Q ss_pred eecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHH Q lcl|NC_019916. 169 AVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQS 248 (513) Q Consensus 169 ~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S 248 (513) .... . . ....|.++.+++++.. ++. +...|.|.+.....-++.-+.+.. T Consensus 150 ~~~~-~--~-----~~~~~~~~eVihir~~--------------~~~---------~~~yGl~~~~~a~~si~l~~~a~~ 198 (351) T protein:vir:78 150 YVNG-W--Q-----ERHEFAPDSVFQLVRP--------------DIN---------QEVYGLPEYLSSLHSAWLNESSTL 198 (351) T ss_pred EEec-C--C-----eEEEEccccEEEEcCC--------------CCC---------CCcccccHHHHHHHHHHHHHHHHH Confidence 1110 0 0 0012333333332210 000 112466666554444332222111 Q ss_pred HHHHHHHHhhhhhhhe--ecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeeccccccccccccCCc Q lcl|NC_019916. 249 DTANYMTDLNEAMLVI--KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLKTGMAPNGQQTSAD 325 (513) Q Consensus 249 ~~~~~~~~~~~~~l~~--~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 325 (513) -..+.+...+.|-.++ +|.. ....+.......++. ...... ++++.+.++ +...+.+ T Consensus 199 ~~~~~f~NGa~pggIl~~~~~~-----------ls~e~~~~lr~~~~~-----~~G~~N~~~~~v~~~~----g~~~g~k 258 (351) T protein:vir:78 199 FRRKYYENGSHAGFILYMTDAA-----------QKQDDVDNMRDALKN-----AKGPGNFRNVFMYAPG----GKKDGIQ 258 (351) T ss_pred HHHHHHhccCCCceEEEecCCC-----------CCHHHHHHHHHHHHH-----hcCcccccceeeecCC----CCcccee Confidence 1122222222232222 2210 001111111111110 111111 122222211 1122334 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 326 ANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG-VAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHI 404 (513) Q Consensus 326 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~ 404 (513) +..++-......+.+..+..+++|+..-++|+.-.+-..++.++ .-++-. .+..+...+..+++.+..+ T Consensus 259 ~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~----------~~~f~~~~l~P~~~~iee~ 328 (351) T protein:vir:78 259 LIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTA----------ARVFGRNEIRPLQARFAEL 328 (351) T ss_pred EEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHHHH Confidence 44444444455577788888899999999997654433222221 111110 1112222333333333332 Q ss_pred HHhcccccccccceeeEEeCCCCCcCHHHHH Q lcl|NC_019916. 405 EERVNGKWDIDPDEIGFIFRDNLPTDDVAII 435 (513) Q Consensus 405 l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a 435 (513) ...... + .+.|++..-..-.+.+ T Consensus 329 n~~l~~----~----~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 329 NDWLGD----E----VVRFDDYEIPPAPVAA 351 (351) T ss_pred HhhcCc----c----ceecChhhhccccccC Confidence 222211 1 1455443222111111 No 253 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=36.08 E-value=1.2 Score=20.16 Aligned_cols=315 Identities=11% Similarity=0.019 Sum_probs=99.4 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcce---eecchhH------HHHHHHHHHhhcC----Ceeec Q lcl|NC_019916. 27 IRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHR---AVHSFAR------YIADFQTSYSVGN----AIAMS 93 (513) Q Consensus 27 i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~r---i~~n~~~------~ivd~~~~~l~g~----p~~~~ 93 (513) +++|. +.-......... ........+...+ +..+=|. .+.+-.--+..|+ |+... T Consensus 1 m~~~~-----------~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~ 68 (350) T protein:vir:11 1 MSKRR-----------SHRRQQPVTVQS-AQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSME 68 (350) T ss_pred CCccc-----------cCCCcCccccCC-cchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHH Confidence 11110 000000000000 0000000000000 0000000 0111111111121 22221 Q ss_pred CCcH----------------HHHHHHHHhcC-H-HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecC Q lcl|NC_019916. 94 GPSS----------------DRLDDFNRRND-I-DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDR 155 (513) Q Consensus 94 ~~~~----------------~~l~~~~~~n~-~-~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~ 155 (513) +=.. +.+-..+.-|. + .....+++.+.+.+|.||+.+..+..|++.-.+.++|..+-+.-+. T Consensus 69 ~la~~~~~~~~h~~~l~~k~n~l~~~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~ 148 (350) T protein:vir:11 69 GLAKSVGSSVYLQSGLKFKRNMLAKTFIPHRLLSRATFEQFSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDL 148 (350) T ss_pred HHHHHHhhhhhhccchhhhhhhhhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecC Confidence 1000 00101111122 1 1334567788999999999998888877654445666554332221 Q ss_pred CCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhH Q lcl|NC_019916. 156 SVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFEN 235 (513) Q Consensus 156 ~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~ 235 (513) . +||..... .. ...|.++.+++++.. ++. +.-.|.|.+.. T Consensus 149 ~---------~~~~~~~~---~~-----~~~~~~~eVihir~~--------------~~~---------~~~yGls~~~~ 188 (350) T protein:vir:11 149 E---------TFYQVRSW---KD-----EHEFEKGSVIQLREA--------------DIN---------QEIYGVPEWFC 188 (350) T ss_pred C---------eEEEEeeC---Ce-----EEEECcccEEEeCCC--------------CCC---------CCcccccHHHH Confidence 1 11211110 00 012233333332110 000 11246666655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhe--ecCcccccccccccccccchhhhhhhccccccchhhhcchh-cceeecc Q lcl|NC_019916. 236 VLSLIDLYDVAQSDTANYMTDLNEAMLVI--KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQ-ANMILLK 312 (513) Q Consensus 236 v~~liD~~~~~~S~~~~~~~~~~~~~l~~--~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~ 312 (513) .+.-++.-+.+..-....+...+.|-.++ +|.. ....+.......++.. ..... ++.+.+. T Consensus 189 a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~-----------ls~e~~~~l~~~~~~~-----~G~~N~~~~~v~~ 252 (350) T protein:vir:11 189 ALQSALLNESATLFRRKYYNNGSHAGFILYMTDAA-----------QNEEDIDALRTALKTA-----KGPGNFRNLFVYA 252 (350) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCC-----------CCHHHHHHHHHHHHHh-----cCccccCceeeec Confidence 44433321111111112222222222222 2210 0001111111111110 00111 1222222 Q ss_pred ccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 313 TGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGV-AMKYKVLGTVELASTKRKQFE 391 (513) Q Consensus 313 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~-Ai~~~~~~l~~k~~~~~~~f~ 391 (513) ++ +..++.++..++-......+.+..+..+++|+..=++|+.-.+-..++.++- .++-. .+..+. T Consensus 253 ~~----g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~----------~~~f~~ 318 (350) T protein:vir:11 253 PN----GKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDA----------AAVWAS 318 (350) T ss_pred CC----CCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHH----------HHHHHH Confidence 21 1112223333443344455778888888999999999976544332222211 11110 011122 Q ss_pred HHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCH Q lcl|NC_019916. 392 RGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDD 431 (513) Q Consensus 392 ~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~ 431 (513) ..|.-+++.+.++...... + .+.|.+.....+ T Consensus 319 ~~L~P~~~~ie~ln~~l~~----~----~~~F~~~~~~~l 350 (350) T protein:vir:11 319 LELAPMQTRLQQVNEMIGE----E----VVRFAQFDAPGL 350 (350) T ss_pred HHHHHHHHHHHHHHhhcCc----c----ccccCcccccCC Confidence 2223333222222211111 0 123433322223 No 254 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=30.91 E-value=1.5 Score=19.56 Aligned_cols=391 Identities=10% Similarity=0.033 Sum_probs=139.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-hcCCCccccccccccCCCCCCccee--ecchhHHHHHHHHHHhhcCCeee---cCCc- Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDY-YRGQNDGILSPASRRNEKGKADHRA--VHSFARYIADFQTSYSVGNAIAM---SGPS- 96 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~Y-Y~G~~~i~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~~l~g~p~~~---~~~~- 96 (513) .-+++... .+.......... ..|. +....... . .......+ .++....+|+..++-+-+-|+.+ ..+. T Consensus 1 Mg~~~~~~-~~~~~~~~~~~~~~~~~--~~~~~~~~-~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~ 75 (423) T protein:vir:81 1 MGFLQKLG-LAPSVVATPEPIELVGP--IFESLKLS-T-KNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGG 75 (423) T ss_pred CchhHhhc-cccccccCccccccccc--cccccccc-c-chhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCc Confidence 11221110 000000000000 0000 00000000 0 00000111 23556677888888887888864 1111 Q ss_pred -----HHHHHHHHHh-cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEE---ecCCCCcceEEE Q lcl|NC_019916. 97 -----SDRLDDFNRR-ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFII---YDRSVNPKPIMA 164 (513) Q Consensus 97 -----~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~---~d~~~~~~~~~~ 164 (513) +..+..++.. |. .......+..+.+.+|.||+++..+..+...+ +.+.|..+-.+ ........+ T Consensus 76 ~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~-~~l~p~~~~~v~~~~~~~~~~~~--- 151 (423) T protein:vir:81 76 RERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPT-LDIRPIPVSWVQRRAYKDGWGSL--- 151 (423) T ss_pred eeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcce-EEEeecccceeeeeeccCCCcce--- Confidence 1234455543 32 34555667788999999999888775544433 23333322111 000000111 Q ss_pred EEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCC-----CCCCcchhHHHHH Q lcl|NC_019916. 165 VRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNN-----EYRQGDFENVLSL 239 (513) Q Consensus 165 ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~v~~l 239 (513) +|......... . ..+ .+.++.+ +++++. ..|.|.+..+... T Consensus 152 -~Y~~~~~~~~~-g---~~~-~~~~~ev----------------------------ih~r~~~~~~~~~G~spi~~~~~~ 197 (423) T protein:vir:81 152 -DYIIIESGDND-G---RSV-KVPGERV----------------------------IHRHGYNPKTMKRGKSPVQSLRDI 197 (423) T ss_pred -EEEEEEecCCC-c---eEE-EEcccce----------------------------EEecCCCCCCccccccHHHHHHHH Confidence 11000000000 0 000 1122222 223221 1466766655555 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc--chhcceeeccccccc Q lcl|NC_019916. 240 IDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA--MRQANMILLKTGMAP 317 (513) Q Consensus 240 iD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~ 317 (513) ++....+..-....+...+.|-.+++-..... ...............+.. .+.. ...++.+.+ T Consensus 198 i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~-----~~~l~~e~~~~~~~~~~~----~~~~~~~n~g~~~vl------ 262 (423) T protein:vir:81 198 LGEQIEAAIFRAQMWRNGPRPGMVIMRDPESK-----AGKWDAESRTRFMANLRA----SFSPKSSDVGGTLLL------ 262 (423) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEecCccc-----CccCCHHHHHHHHHHHHH----HhccccccCCcceec------ Confidence 54444433333333333344444443211100 000000110011000000 0000 001122222 Q ss_pred cccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQR 397 (513) Q Consensus 318 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 397 (513) ..+.++..++.......+....+.....|+..=++|+...+...+ .+...++.+.. ..+...|.-. T Consensus 263 ---~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~-~t~sn~e~~~~----------~f~~~~L~P~ 328 (423) T protein:vir:81 263 ---EDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDN-ANYSNVREFRK----------ALYGDNLGSW 328 (423) T ss_pred ---CCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCC-CCcccHHHHHH----------HHHHHHHHHH Confidence 223344444433333345555677788899999999765543222 11111111111 1122223333 Q ss_pred HHHHHHHHHhcc-cccccccceeeEEe--CCCCCcCHHHHHHHHHHH---hcCCCHHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_019916. 398 YTVVAHIEERVN-GKWDIDPDEIGFIF--RDNLPTDDVAIITALVQA---GAQIPQEYLYQYLPNVTDADEIVKMMDKQR 471 (513) Q Consensus 398 ~~li~~~l~~~~-~~~~~~~~~i~i~f--~~~~p~d~~e~a~~~~kl---~g~iS~et~~~~l~~v~D~~~E~~ri~~E~ 471 (513) ++.+..-+...- .....+.....+.| ..-+..|..++++++.++ +|+++.-.+.+.++.-..+. T Consensus 329 ~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~g---------- 398 (423) T protein:vir:81 329 IRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDG---------- 398 (423) T ss_pred HHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCC---------- Confidence 333322222111 11111122223444 455667888888887763 47777665655553311100 Q ss_pred HHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_019916. 472 KAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTS 512 (513) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) ++....+.+-..+. ++..++++.++ T Consensus 399 ----------GD~~~~p~n~~~~~------~~~~~~~~~~t 423 (423) T protein:vir:81 399 ----------GDDLARPLNTEFGD------SEDAPGEEVET 423 (423) T ss_pred ----------cceeecccccccCc------cCCCCCCCCCC Confidence 00000000000000 01111122222 No 255 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=25.68 E-value=2 Score=18.90 Aligned_cols=342 Identities=8% Similarity=0.033 Sum_probs=122.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee--ecchhHHHHHHHHHHhhcCCeee-c--CC--- Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA--VHSFARYIADFQTSYSVGNAIAM-S--GP--- 95 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~~l~g~p~~~-~--~~--- 95 (513) .-+ ..+...+-.+.-. ........ .....+ .......+|+..++-+-.-|+.+ . .. T Consensus 1 Mg~-----------f~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~ 64 (378) T protein:vir:16 1 MNL-----------FGKVVSFSRGKLN---NDTQRVTA--WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred Ccc-----------chhhhhhhccccc---CCcceeee--cccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccc Confidence 000 0111111111000 00000000 000111 12334456666666666678753 1 10 Q ss_pred -------cHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEE Q lcl|NC_019916. 96 -------SSDRLDDFNRR--N---DIDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIM 163 (513) Q Consensus 96 -------~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~ 163 (513) .+..+..++.. | ........+..+++.+|.||++..++.. .+.+. .+.|... T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~-~g~~~-~l~~~~~-------------- 128 (378) T protein:vir:16 65 SDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDN-TGELL-DLLFADD-------------- 128 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecC-CceEE-EEEecCC-------------- Confidence 12345555542 3 2345556678889999999976544321 11111 1111100 Q ss_pred EEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHH Q lcl|NC_019916. 164 AVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLY 243 (513) Q Consensus 164 ~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~ 243 (513) . ..|..+.++++ ++.-.+......+..+.+++ T Consensus 129 ---------------~-----~~~~~~diih~----------------------------r~~~~~~~~~s~l~~~~~~i 160 (378) T protein:vir:16 129 ---------------K-----KEYKPEELVRL----------------------------TSPFYINEDTSILDNALASI 160 (378) T ss_pred ---------------e-----eEecccceEEe----------------------------cCccCccchhHHHHHHHHHH Confidence 0 01122233332 21111112222333344433 Q ss_pred HHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc----chhcceeeccccccccc Q lcl|NC_019916. 244 DVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA----MRQANMILLKTGMAPNG 319 (513) Q Consensus 244 ~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 319 (513) +..++. +.+--+++-.. ..... .....+..-...+.. ...++++.+ T Consensus 161 ~~~~~~--------~~~~g~l~~~~-----------~l~~~---~~~~~~~~~~~~~~~~~~~~~~g~~~vl-------- 210 (378) T protein:vir:16 161 QTKLEQ--------GKLRGLLKINA-----------FLDID---NTQEYREKALTTIKNMQEGSSYNGLTPV-------- 210 (378) T ss_pred HHHHhc--------CccceeeEeCC-----------cCCHH---HHHHHHHHHHHHHHHhhcccccccceEc-------- Confidence 332221 11111111100 00000 000000000011110 011222332 Q ss_pred cccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYT 399 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~ 399 (513) ..+.++.-++.+.....+ ..++.+++.|+..=++|+.-. .+..|.... ...+...|...++ T Consensus 211 -~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l---~g~~~e~~~--------------~~f~~~tl~P~~~ 271 (378) T protein:vir:16 211 -DNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENIL---LGTASQEQQ--------------IYFYNSTIIPLLI 271 (378) T ss_pred -CCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHh---cCCchHHHH--------------HHHHHHHHHHHHH Confidence 223333333333333333 345677889999989987433 222221111 1233344444444 Q ss_pred HHHHHHHhcc--------cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHH Q lcl|NC_019916. 400 VVAHIEERVN--------GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMM 467 (513) Q Consensus 400 li~~~l~~~~--------~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri 467 (513) .|..-+...= +........+++.+..-+-.|..+.++++.++ +|+++.-.+.++++. +++-+.=+-.. T Consensus 272 ~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~ 351 (378) T protein:vir:16 272 QLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANL 351 (378) T ss_pred HHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecc Confidence 4444333210 00011123455666777778899999998886 578888777777643 22111000000 Q ss_pred HHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 468 DKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 468 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) ...+....... .....++..++.++++ T Consensus 352 ---------n~~~~~~~~~~-------~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 352 ---------NAVAVKNLSDL-------QGSRKDVTSTDETNNQ 378 (378) T ss_pred ---------ccccccchhhh-------cCccCCCCCCCCCCCC Confidence 00000000000 0000111111111111 No 256 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=23.41 E-value=2.3 Score=18.59 Aligned_cols=356 Identities=11% Similarity=0.033 Sum_probs=134.4 Q ss_pred HHHHHHHHH----HHHHHHHHHHHHhcCCCccccc-ccccc----------------------CCCCCC-cce--eecch Q lcl|NC_019916. 24 AAFIRHHYN----NQRPRLEMLYDYYRGQNDGILS-PASRR----------------------NEKGKA-DHR--AVHSF 73 (513) Q Consensus 24 ~~~i~~~~~----~~~~~~~~~~~YY~G~~~i~~~-~~~~~----------------------~~~~~~-~~r--i~~n~ 73 (513) .-+++..+. ...+.-.-..+|..|+.++..- .+... ...+.. ..+ +.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 111111000 0000111122333333222210 00000 000000 000 11233 Q ss_pred hHHHHHHHHHHhhcCCeeecC--CcHHHHHHHHHh--cCH---HHHHHHHHHHHhhCCeEEEE-eeecCCCceeEEEEEc Q lcl|NC_019916. 74 ARYIADFQTSYSVGNAIAMSG--PSSDRLDDFNRR--NDI---DTLNYELYLDMTVTGRAYEY-VYRDPSQKGEVSVKLD 145 (513) Q Consensus 74 ~~~ivd~~~~~l~g~p~~~~~--~~~~~l~~~~~~--n~~---~~~~~~~~~~a~~~G~~~~~-v~~d~~~~~~~~~~~~ 145 (513) ....|+..++-+-+-|+..-. ...+.+..++.. |.. ......+..+.+. |.+|++ +..+.+|.+.-.+.++ T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl~ 159 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVVP 159 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEEC Confidence 445677777777777876421 111222222221 221 2223334444444 889876 4577778766556678 Q ss_pred ccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecC Q lcl|NC_019916. 146 PMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN 225 (513) Q Consensus 146 p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 225 (513) |..+-+..++.. ..+|.... .+..+ .|++++. T Consensus 160 p~~v~v~~~~~g-------~~~y~~~~-------------~~~~~----------------------------eiiHir~ 191 (409) T protein:vir:83 160 PWLVNVELKKGA-------RREYRIGG-------------LNVTD----------------------------EILHIRY 191 (409) T ss_pred CcceEEEEcCCc-------eEEEEEcc-------------ccCcc----------------------------ceEEeCC Confidence 877666555431 11121100 01111 2333332 Q ss_pred -----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhh Q lcl|NC_019916. 226 -----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQL 300 (513) Q Consensus 226 -----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 300 (513) .-.|.|-++.....++..+.+..-..+.+...+.|-.+++-... ............... .. T Consensus 192 ~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~~----------ls~e~~~~~~~~~~~----~~ 257 (409) T protein:vir:83 192 QGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVERR----------LSETEAVDLMDRWIE----SR 257 (409) T ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCCC----------CCHHHHHHHHHHHHH----hh Confidence 12466767666666654444333333333333444444332110 001111111111100 00 Q ss_pred hcchhcceeeccccccccccccCCce-eEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-------ccccHHHH Q lcl|NC_019916. 301 EAMRQANMILLKTGMAPNGQQTSADA-NYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFS-------GNSSGVAM 372 (513) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-------~n~Sg~Ai 372 (513) ....++.+.+. .+.+. +.++.......+....+...+.|+..-++|++..+... +|.....+ T Consensus 258 -~~nag~~~il~---------~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~ 327 (409) T protein:vir:83 258 -SKYAGHPALVT---------GGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFS 327 (409) T ss_pred -CCccCccceec---------CCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHH Confidence 00011111211 12221 12222222333455566678889999999976654221 11111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHH Q lcl|NC_019916. 373 KYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYL 450 (513) Q Consensus 373 ~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~ 450 (513) .+....|.-.+.+.+..+...| + . ....+++.+..-+-.|.++.++++.++ +|+++.-.+ T Consensus 328 ~f~~~tL~P~~~~ie~~l~~~L----------l---~-----~~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~ 389 (409) T protein:vir:83 328 FHDRSSLRPKATAVMAALDRWA----------L---P-----SPQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEA 389 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHhh----------C---C-----CCcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 2222222222222222221111 1 1 112455555566667888888888776 467776544 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCC Q lcl|NC_019916. 451 YQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGV 499 (513) Q Consensus 451 ~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (513) .+.++. .+ ...+++-++.++ T Consensus 390 R~~~gl----------------------pp-------~~ggd~l~~~gv 409 (409) T protein:vir:83 390 RAMERL----------------------HS-------EAAAVRLSGGGV 409 (409) T ss_pred HHHhCC----------------------CC-------CCCCcccCCCCC Confidence 433211 00 001111111111 No 257 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=22.91 E-value=2.4 Score=18.52 Aligned_cols=410 Identities=10% Similarity=-0.018 Sum_probs=154.9 Q ss_pred Cccchhhcee-ccCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHH-hcCCC-----ccccccccccCCCCCCcceeecch Q lcl|NC_019916. 1 MIDMQQANMN-YQEDADKLTPTRIAAFIRHHYNNQRPRLEMLYDY-YRGQN-----DGILSPASRRNEKGKADHRAVHSF 73 (513) Q Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~Y-Y~G~~-----~i~~~~~~~~~~~~~~~~ri~~n~ 73 (513) |..=++.+.. .+.. ....+....++.... .+ .-.| |.|-. +++..+ .. -....... .... T Consensus 1 m~k~~~k~~~~~~~~-~~~~~~~~~~~~~~~----~~----~~~~~~~g~~~~~~~~iLr~~-~~--~~ly~~m~-~D~h 67 (448) T protein:vir:79 1 MAKRGRKPKELVPGP-GSIDPSDVPKLEGAS----VP----VMSTSYDVVVDREFDELLQGK-DG--LLVYHKML-SDGT 67 (448) T ss_pred CCCCCCCCccccCcc-cccccccchhhhhhh----hh----hcccccccccccchhHhhccc-cc--hHHHHHHh-hChH Confidence 2211111110 0000 111111111111000 00 0000 11111 011000 00 00000011 1455 Q ss_pred hHHHHHHHHHHhhcCCeeecCC--c--HHH----HHHHHHh-------cCHHHHHHHHHHHHhhCCeEE-EEeee-cCCC Q lcl|NC_019916. 74 ARYIADFQTSYSVGNAIAMSGP--S--SDR----LDDFNRR-------NDIDTLNYELYLDMTVTGRAY-EYVYR-DPSQ 136 (513) Q Consensus 74 ~~~ivd~~~~~l~g~p~~~~~~--~--~~~----l~~~~~~-------n~~~~~~~~~~~~a~~~G~~~-~~v~~-d~~~ 136 (513) ..-++.+....+.+.++++... + +.. +.+++.. ..|..+. .-..+|.-+|.++ +++|. ..+| T Consensus 68 i~s~l~~Rk~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~-~~~lda~~~G~s~~Eivw~~~~~g 146 (448) T protein:vir:79 68 VKNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLF-AIYENAYIYGMAAGEIVLTLGADG 146 (448) T ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHH-HHHHHhhhhcceeEEEEeeecCCC Confidence 5667777778888888888531 1 111 2333322 1344433 3346688889886 56663 3444 Q ss_pred ceeEEEEE---cccce-EEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccccccc Q lcl|NC_019916. 137 KGEVSVKL---DPMEC-FIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEH 212 (513) Q Consensus 137 ~~~~~~~~---~p~~~-~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~ 212 (513) ...+. .+ +|+.. .-.|+... .+. +.+.... ..+.... ....+ T Consensus 147 ~~~~~-~l~~r~~~~~~~f~~~~d~--~l~-----------------------~~~~~~~-------~~~~~~~-~~~~~ 192 (448) T protein:vir:79 147 KLILD-KIVPIHPFNIDEVLYDEEG--GPK-----------------------ALKLSGE-------VKGGSQF-VSGLE 192 (448) T ss_pred ceecc-cccccCCccccceeeecCC--ceE-----------------------EeecCCc-------ccccccC-CCccc Confidence 32211 11 11110 01111110 000 0010000 0000000 01112 Q ss_pred ccCcccceEEecC----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhh Q lcl|NC_019916. 213 SAQFGFPMIEYRN----NEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADA 288 (513) Q Consensus 213 ~~~g~vPvv~~~n----~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~ 288 (513) .+++.+ |.+.. +..|.|.+..+-...=--+..+.+.+..++.|+.|+++.+-..+...+.. +.. T Consensus 193 lP~~~~--i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~---------~~~- 260 (448) T protein:vir:79 193 IPIWKT--VVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTK---------QWE- 260 (448) T ss_pred cccceE--EEEecCccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHH---------HHH- Confidence 233333 33332 23467777777666666677888899999999999998775432211100 000 Q ss_pred hhccccccchhhhcchhcceeeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccc Q lcl|NC_019916. 289 MKKLADEKMAQLEAMRQANMILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSS 368 (513) Q Consensus 289 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 368 (513) .+. ..+..+.....+. .. ...+.+++++....+...+...++...+.|...--.-.++.+..+| .+ T Consensus 261 --~l~-~av~~i~~g~~a~-~i---------iP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g-~~ 326 (448) T protein:vir:79 261 --AAK-EIVKNFVQKPRHG-II---------LPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMG-VQ 326 (448) T ss_pred --HHH-HHHHHHhcCCceE-EE---------ecCCceEEEEecCCCcccHHHHHHHHHHHHHHHHhhhhhccccccc-hh Confidence 000 0000011011111 11 2345677888776665666678888888886653333233332222 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHHHHHHHHHhcCCCH Q lcl|NC_019916. 369 GVAMKYKVLGTVELASTKRKQFERGLN-QRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAIITALVQAGAQIPQ 447 (513) Q Consensus 369 g~Ai~~~~~~l~~k~~~~~~~f~~~l~-~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~ 447 (513) ..+......-....+..-.+.+...+. ++++-++.+ +-. .......+.|....+.|..+.|+.+.++.+... T Consensus 327 ~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~l----Nfg--~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~- 399 (448) T protein:vir:79 327 AINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLP----NWP--SATRFPRLTFEMEERNDFSAAANLMGMLINAVK- 399 (448) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC--CcCCCcEEEecCCChHHHHHHHHHhhhhhccch- Confidence 222211101011111111122333332 233333332 211 111234788888889999999998888754311 Q ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_019916. 448 EYLYQYLPNVTDADEIVKMMDKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDERTSD 513 (513) Q Consensus 448 et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) +.+.+-+++ ...+.........+.+..+...+....|.+++.-= T Consensus 400 ---------------~~~~~~~~~-------~~~p~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 443 (448) T protein:vir:79 400 ---------------DSEDIPTEL-------KALIDALPSKMRRALGVVDEVREAVRQPADSRYLY 443 (448) T ss_pred ---------------hhHHHHHHh-------hcCCCCCCCccccccCCCCcccccccCCccccchh Confidence 111111111 00111111111111111122222222333333332 No 258 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=22.26 E-value=2.5 Score=18.43 Aligned_cols=445 Identities=10% Similarity=0.041 Sum_probs=145.1 Q ss_pred CccchhhceeccCCcccCCHHHHHHHH-HHHHHH------------------HHHHHHHHHHHhcCCCccccccccccCC Q lcl|NC_019916. 1 MIDMQQANMNYQEDADKLTPTRIAAFI-RHHYNN------------------QRPRLEMLYDYYRGQNDGILSPASRRNE 61 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~i~~~i-~~~~~~------------------~~~~~~~~~~YY~G~~~i~~~~~~~~~~ 61 (513) |-.|+. |. ..+.....+-+ ++...- ..+..-....||-|.-.-.+.-...... T Consensus 1 ~~~~~~----w~----~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ 72 (533) T protein:vir:58 1 MPSLEK----YK----KLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDR 72 (533) T ss_pred CCCcch----hh----hhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHH Confidence 111111 00 01111111100 000000 0000011112232210000000000000 Q ss_pred CCCC-cceeecchhHHHHHHHH-HHhhcCCeeecCCcH----HHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeec-C Q lcl|NC_019916. 62 KGKA-DHRAVHSFARYIADFQT-SYSVGNAIAMSGPSS----DRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRD-P 134 (513) Q Consensus 62 ~~~~-~~ri~~n~~~~ivd~~~-~~l~g~p~~~~~~~~----~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d-~ 134 (513) . + .+--+.+-...||+..+ ..-...||.+..++. .....+....+|+....+..|.+.+.|+.|.+.-.+ + T Consensus 73 m--a~~~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~ 150 (533) T protein:vir:58 73 M--DYTDPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGS 150 (533) T ss_pred h--hccCcchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCc Confidence 0 0 00011122233333322 233456776654432 223456667889999999999999999999988543 2 Q ss_pred CCce-eEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccc Q lcl|NC_019916. 135 SQKG-EVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHS 213 (513) Q Consensus 135 ~~~~-~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (513) .+.+ .+. .++|+.+-.+++.-.. ..| -+|++...... ++... T Consensus 151 k~GI~elr-~lDPr~i~~vr~~~t~------~ey-----------------yvy~~~~~~~~-----s~~~~-------- 193 (533) T protein:vir:58 151 DGTIEKFQ-VVSPYIFSKRYNPETD------TWY-----------------YVITDVYRNVV-----SGYFN-------- 193 (533) T ss_pred ccchhhhe-ecCCeeeEEEEeeccc------eEE-----------------Eeecccccccc-----cCccc-------- Confidence 2222 222 3688877666643221 111 13333322110 00000 Q ss_pred cCcccc---eEEec------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccch Q lcl|NC_019916. 214 AQFGFP---MIEYR------NNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPS 284 (513) Q Consensus 214 ~~g~vP---vv~~~------n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~ 284 (513) -.|| |+++. +...+.|-++..+....-+- ++-|.+..-.-...|-+=+.-......-.....+....- T Consensus 194 --~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLk-miEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~i 270 (533) T protein:vir:58 194 --EDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLR-LMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNI 270 (533) T ss_pred --cccchhheeeeeeccccCCCCceehhhhHHHHHHHHHH-HHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHH Confidence 0111 12221 12233455554322221111 122222222222222111111110000000000000000 Q ss_pred hhhh-hhccccccchhhhcchhcc---e----eeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_019916. 285 DADA-MKKLADEKMAQLEAMRQAN---M----ILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTP 356 (513) Q Consensus 285 ~~~~-~~~l~~~~~~~~~~~~~~~---~----~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 356 (513) ...+ +...-+.....+.+++.-. . ++|+.- ..+.+-.+..|-- .+. +...-+.-.++.+|..-.+| T Consensus 271 m~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRR----eGgrgTEI~TLpG-g~l-gemeDV~YF~kkLy~ALnVP 344 (533) T protein:vir:58 271 AMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRR----GDRRAVEIDILQG-SKV-DLAEDVEYMLNRLISALKVP 344 (533) T ss_pred HHhcccceEEeccCCeEeeccchhhhhhhHhhhccccc----CCCccceeeecCC-CCC-CcHHHHHHHHHHHHHHhCCC Confidence 0000 0000000000011000000 0 001000 0011111222211 121 22344556667777777888 Q ss_pred cccccccc--ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHHH Q lcl|NC_019916. 357 DLTDDNFS--GNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVAI 434 (513) Q Consensus 357 ~~~~~~~~--~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e~ 434 (513) -.-.+.-+ |..| .|-.-+....+.+.+.+..|..-|+.-+ ++ ++..... +..+.|...-.-.+... T Consensus 345 ~sRl~~e~~fgr~~--eItRDEiKF~KFI~rLR~rF~~ll~~qL--il-----k~iit~e---ew~~~f~~Dn~f~ElKe 412 (533) T protein:vir:58 345 KAFIGYEGDVNAKN--TLATQDIKFNNTIKRIQGFFVEELERMV--RM-----NKEFADQ---DFRLVMNRSNSIVEGER 412 (533) T ss_pred eeecCCCCCCccch--hhhHHHHHHHHHHHHHHHHHHHHHhccc--cc-----ccCcchh---heeeeeeccchHHHHHH Confidence 53322111 2222 2322333334445555666766665422 11 1222222 33567755444333333 Q ss_pred -------HHHHHHHhcCCCHHHHHHhC-CCCCCHHHHHHHHHHHHHH----------HHHHhhhhcCCCCCCCCCCCCCC Q lcl|NC_019916. 435 -------ITALVQAGAQIPQEYLYQYL-PNVTDADEIVKMMDKQRKA----------MLKTYDTKGGLIINGTSGNDPED 496 (513) Q Consensus 435 -------a~~~~kl~g~iS~et~~~~l-~~v~D~~~E~~ri~~E~~~----------~~~~~~~~~~~~~~~~~~~~~~~ 496 (513) ++++..+.+.+++.++++.+ -..+|...+.+.|++|..+ ++...+..+..+..-+.+.+++. T Consensus 413 ~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~ 492 (533) T protein:vir:58 413 FAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTE 492 (533) T ss_pred HHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhh Confidence 34455566789999988774 4444555555556655332 00000000000000000000000 Q ss_pred ----C-------CCCCCCCCCCCccCCC Q lcl|NC_019916. 497 ----E-------GVRGQQGEPEDERTSD 513 (513) Q Consensus 497 ----~-------~~~~~~~~~~~~~~~~ 513 (513) . +.-.-.+.++.+.+.| T Consensus 493 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 520 (533) T protein:vir:58 493 FDFGTEGGEELGGELNLGGAFEEFEEET 520 (533) T ss_pred HhcccCCcccccccccccccchhhhhhc Confidence 0 0000001112222222 No 259 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=22.01 E-value=2.5 Score=18.39 Aligned_cols=302 Identities=13% Similarity=0.048 Sum_probs=94.9 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCcceeecchhHH------HHHHHHHHhhcC----CeeecCCc Q lcl|NC_019916. 27 IRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRAVHSFARY------IADFQTSYSVGN----AIAMSGPS 96 (513) Q Consensus 27 i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri~~n~~~~------ivd~~~~~l~g~----p~~~~~~~ 96 (513) ++++...+ +. ...+.. ...... ..-+..+=+.. +.+-.--+..|+ |+.+.+=. T Consensus 1 ~~~~~~~~-~~--------~~~~~~-----~~~~~~---~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la 63 (344) T protein:vir:20 1 MSKKKGKT-PQ--------PAAKTM-----TASGPK---MEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLA 63 (344) T ss_pred CCcccCCC-Cc--------chhhhh-----hccCCc---eEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHH Confidence 11100000 00 000000 000000 00000000000 100000000111 11111000 Q ss_pred ----------------HHHHHHHHHhcCH--HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCC Q lcl|NC_019916. 97 ----------------SDRLDDFNRRNDI--DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVN 158 (513) Q Consensus 97 ----------------~~~l~~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~ 158 (513) -+.+...+.-|.. ......++.+.+.+|.||+.+-.+..|++.-.+.++|..+-+-.+.. T Consensus 64 ~~~~a~~~h~~~i~~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~~~~-- 141 (344) T protein:vir:20 64 KSLRAAVHHSSPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED-- 141 (344) T ss_pred HHHhhhhhhCccceehhhhHHHhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeEeeecCC-- Confidence 0111111222221 12345677888999999998888877765444445554433221110 Q ss_pred cceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecC-----CCCCCcch Q lcl|NC_019916. 159 PKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRN-----NEYRQGDF 233 (513) Q Consensus 159 ~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd~ 233 (513) +||.... .+ . ...|.++. |+++++ .-.|.|.+ T Consensus 142 -------~~~~~~~-~~--~-----~~~~~~~e----------------------------IiHir~~~~~~~~yGls~~ 178 (344) T protein:vir:20 142 -------VYWWVPS-FN--E-----PTAFAPGS----------------------------VFHLLEPDINQELYGLPEY 178 (344) T ss_pred -------EEEEEcc-CC--e-----EEEEcCcc----------------------------EEEeCCCCCCCCcccccHH Confidence 1111110 00 0 00122222 233332 12466655 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHh---hhhhhhee--cCcccccccccccccccchhhhhhhccccccchhhhcchhcce Q lcl|NC_019916. 234 ENVLSLIDLYDVAQSDTANYMTDL---NEAMLVIK--GDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANM 308 (513) Q Consensus 234 e~v~~liD~~~~~~S~~~~~~~~~---~~~~l~~~--G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 308 (513) .....-+ ....+-..-...+| +.|-.+++ |.. ....+.......++... .....+. T Consensus 179 ~~a~~si---~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~-----------l~~e~~~~ik~~~~~~~-----g~~n~r~ 239 (344) T protein:vir:20 179 LSALNSA---WLNESATLFRRKYYENGAHAGYIMYVTDAV-----------QDRNDIEMLRENMVKSK-----GRNNFKN 239 (344) T ss_pred HHHHHHH---HHHHHHHHHHHHHHhccCCCceEEEecCcC-----------CCHHHHHHHHHHHHHhc-----CCCCccc Confidence 5433322 22222111112223 22333322 210 01111111111111110 0001111 Q ss_pred eeccccccccccccCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccH--HHHHHHHHHHHHHHH Q lcl|NC_019916. 309 ILLKTGMAPNGQQTSADANYIH--KEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG--VAMKYKVLGTVELAS 384 (513) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg--~Ai~~~~~~l~~k~~ 384 (513) +.+.. +.+ ...+++|.. -......+.+..+..++.|+..=++|+.-.+-..++.++ .+-+.. T Consensus 240 l~l~~---p~g--~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~--------- 305 (344) T protein:vir:20 240 LFLYA---PQG--KADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVA--------- 305 (344) T ss_pred eEEec---CCC--CccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHH--------- Confidence 22211 111 223444444 333445577788888899999999998655432222221 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHH Q lcl|NC_019916. 385 TKRKQFERGLNQRYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVA 433 (513) Q Consensus 385 ~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e 433 (513) +......+.-+++.+..+....+.. .+.|..+......| T Consensus 306 --~~f~~~~l~P~~~~~e~in~~lg~~--------~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 306 --KVFVRNELIPLQDRIREINGWLGQE--------VIRFKNYSLDTDND 344 (344) T ss_pred --HHHHHHHHHHHHHHHHHHHHhcCCc--------ccccCccccccCCC Confidence 0111222222222222222222211 13344333322222 No 260 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=21.85 E-value=2.5 Score=18.37 Aligned_cols=465 Identities=11% Similarity=0.055 Sum_probs=158.1 Q ss_pred Cccchhhcee---ccCCcccCCHH-----------HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCc Q lcl|NC_019916. 1 MIDMQQANMN---YQEDADKLTPT-----------RIAAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKAD 66 (513) Q Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~-----------~i~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~ 66 (513) +.+-++.... .+.+..+-... .+..-++. ......+|+.+-.+++-+ T Consensus 9 i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~-~~eLI~~YR~ma~~pEvd------------------ 69 (533) T protein:vir:10 9 LERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRN-EYQLISRYREMVLQPECD------------------ 69 (533) T ss_pred cccccccccCCCCCCCCcccccceeecccccceeeecccccch-HHHHHHHHHHHhhccchh------------------ Confidence 1111111100 00000000000 00000000 001122233333333222 Q ss_pred ceeecchhHHHHHHHH-HHhhcCCeeecCCc-----------HHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEeeecC Q lcl|NC_019916. 67 HRAVHSFARYIADFQT-SYSVGNAIAMSGPS-----------SDRLDDFNRRNDIDTLNYELYLDMTVTGRAYEYVYRDP 134 (513) Q Consensus 67 ~ri~~n~~~~ivd~~~-~~l~g~p~~~~~~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~d~ 134 (513) +-...||+..+ .-....||.+.-++ -++...+++--+|+....+..|.+.+.|+.|.+.-+|. T Consensus 70 -----~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~ 144 (533) T protein:vir:10 70 -----SAVDDIVNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDP 144 (533) T ss_pred -----hHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecC Confidence 11222333222 22234555554332 12455667777899999999999999999999988875 Q ss_pred CC----ceeEEEEEcccceEEEecCCCCcceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCcccccccc Q lcl|NC_019916. 135 SQ----KGEVSVKLDPMECFIIYDRSVNPKPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVA 210 (513) Q Consensus 135 ~~----~~~~~~~~~p~~~~~~~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~ 210 (513) +. -..+ ..++|+.+-+|..-. ++....++++...... ..... ..-+|++..... ....+-.+...... T Consensus 145 ~~pk~GI~EL-r~lDPr~i~~vr~i~--~~~~~~~~~~~~~~~v--~~~~~-eyf~Ynp~g~~~--~~~~~vkI~~dAI~ 216 (533) T protein:vir:10 145 DNPQGGLIEL-RYIDPRKIRKINETE--QKRPEQLRGLPLNQQL--SPKSA-EYFLYDPKGLKN--STTQGLKIAPDSIC 216 (533) T ss_pred CCccccceee-eeccccceeeeeeee--ccCCCccceeecchhh--hccce-eeeeeccccccc--cCCCceecchhhee Confidence 42 2222 236887765543210 0111111111100000 00001 112444433221 00000000000000 Q ss_pred ccccCcccceEEecCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhh-h Q lcl|NC_019916. 211 EHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLYDVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADA-M 289 (513) Q Consensus 211 ~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~-~ 289 (513) ..| -|-+| ++...-.|-++..+....-+ +++-|.+....-...|-+=+.-......-.....+....-...+ + T Consensus 217 y~h-SGl~d----~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KN 290 (533) T protein:vir:10 217 YVH-SGIMD----LNKNMTLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRN 290 (533) T ss_pred eee-cccee----CCCCceeccchHhHHHHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccc Confidence 000 11111 11111123343322111111 11222222222222222111111100000000000000000000 0 Q ss_pred hccccccchhhhcchhcce----eeccccccccccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc Q lcl|NC_019916. 290 KKLADEKMAQLEAMRQANM----ILLKTGMAPNGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSG 365 (513) Q Consensus 290 ~~l~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~ 365 (513) +..-+.....++.++.-.. ++|+.- ..+.+-.+..|--..|+.-. .-++-.++-+|..-++|-.-.+.-++ T Consensus 291 klVYDa~TGev~ddrk~msMlEDyWLPRR----eGgrgTEItTLpGgqnLgem-~DV~YF~kKLY~aLnVP~SRl~~e~~ 365 (533) T protein:vir:10 291 KLVYDANTGEIKDDKKFMSMLEDFWLPRR----EGGRGTEITTLPGGQNLGEL-EDVKYFQKKLYKSLNVPGSRLETETT 365 (533) T ss_pred eEEEeccCceecccchhhhhHhhhccccc----CCCCccceeeccccCCcChH-HHHHHHHHHHHHHhCCCccccCCCCc Confidence 0000000000111110000 011100 00111222222222222222 23444556666666787433222111 Q ss_pred -cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----ceeeEEeCCCCCcCHHHHHH--- Q lcl|NC_019916. 366 -NS-SGVAMKYKVLGTVELASTKRKQFERGLNQRYTVVAHIEERVNGKWDIDP----DEIGFIFRDNLPTDDVAIIT--- 436 (513) Q Consensus 366 -n~-Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~l~~~~~~~~~~~----~~i~i~f~~~~p~d~~e~a~--- 436 (513) |+ -|..|-.-+....+.+.+.+..|..-|..+++.=+-+ ++.....++ ..|.+.|...-.-.+...++ T Consensus 366 f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiL---Kgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 442 (533) T protein:vir:10 366 FNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVL---KGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRN 442 (533) T ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh---ccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHH Confidence 11 1223555555566677888888888888887753322 222222233 45778886555544444333 Q ss_pred ----HHHHHh---c-CCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhhhcCC-----CCCCCCCCCCCCCCCCC Q lcl|NC_019916. 437 ----ALVQAG---A-QIPQEYLYQYLPNVTD--ADEIVKMMDKQRKAMLKTYDTKGGL-----IINGTSGNDPEDEGVRG 501 (513) Q Consensus 437 ----~~~kl~---g-~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 501 (513) ++..+. | .+|.+++.+.+=--+| .+++-++|++|..+ ..+..-... +...+.......++... T Consensus 443 ~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~--~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (533) T protein:vir:10 443 ERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMES--GIIADPAAEMDPAMAAGDPDAGGAPAEEVAP 520 (533) T ss_pred HHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhC--CCCCCCcchhhHHhcCCCCCcCCcccccCCC Confidence 333442 3 4799999988644443 45555556555442 011000000 00000000001111111 Q ss_pred CCCCCCCccCCC Q lcl|NC_019916. 502 QQGEPEDERTSD 513 (513) Q Consensus 502 ~~~~~~~~~~~~ 513 (513) +.-+|++|+.-. T Consensus 521 ~~~~~~~~~~~~ 532 (533) T protein:vir:10 521 EGPDPSDERKAE 532 (533) T ss_pred CCCCcchhhccC Confidence 122333333333 No 261 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=21.65 E-value=2.6 Score=18.34 Aligned_cols=316 Identities=11% Similarity=0.022 Sum_probs=103.1 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccc--cccccCCCCCCcceeecchhHHHHHHHHHH-h-hcC----CeeecC---- Q lcl|NC_019916. 27 IRHHYNNQRPRLEMLYDYYRGQNDGILS--PASRRNEKGKADHRAVHSFARYIADFQTSY-S-VGN----AIAMSG---- 94 (513) Q Consensus 27 i~~~~~~~~~~~~~~~~YY~G~~~i~~~--~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~-l-~g~----p~~~~~---- 94 (513) ++++.. + ....+.-... .... -..+.+. .+.+-. .+.+- ++. . .|. |+...+ T Consensus 1 m~~~~~----~-------~~~~~~~~~~~~~~~~-~~~~~p~--~~~~~~-~~~~~-~~~~~~~~~~~~pp~~~~~la~l 64 (346) T protein:vir:10 1 MKKQLR----K-------NLTQNDRLQPQAQTEI-FSFGDPI--PVLDRA-DILNY-LECSAMYEKWYNPPMSFDGLAKS 64 (346) T ss_pred CCcccC----C-------CCCcccccccccCeEE-EecCCcc--eecCch-hHHHH-HHHhhcCCceEecCCCHHHHHHH Confidence 010000 0 0011100000 0000 0000000 000000 01000 111 1 111 111110 Q ss_pred ------------CcHHHHHHHHHh-cCH--HHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCc Q lcl|NC_019916. 95 ------------PSSDRLDDFNRR-NDI--DTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNP 159 (513) Q Consensus 95 ------------~~~~~l~~~~~~-n~~--~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~ 159 (513) ...+.+..+++. |.. .....+++.+.+.+|.||+.+..+..|++.-.+.++|..+.+.-++.. T Consensus 65 ~~~~~~h~~~i~~k~n~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~-- 142 (346) T protein:vir:10 65 LRSSTHHESAIITKANILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQ-- 142 (346) T ss_pred HHhhhhcchhhhhhhhhHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCe-- Confidence 001123333332 211 233456778889999999999888888766555677777655333211 Q ss_pred ceEEEEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHH Q lcl|NC_019916. 160 KPIMAVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSL 239 (513) Q Consensus 160 ~~~~~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~l 239 (513) .. |.....++ . ...|.++.+++++... + .....|.|.+...... T Consensus 143 -~~-----~~~~~~~g---~----~~~~~~~dIih~r~~~--------------~---------~~~~~G~~~~~~a~~s 186 (346) T protein:vir:10 143 -FY-----YVPQRFDH---Q----EHEFAKGSIYHLLEPD--------------I---------NQDIYGLPQYLSALQS 186 (346) T ss_pred -EE-----EEEEccCC---e----EEEEecccEEEecCCC--------------C---------CCCeeeccHHHHHHHH Confidence 11 11100000 0 0112333333322100 0 0112466655544333 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhhe--ecCcccccccccccccccchhhhhhhccccccchhhhcchhcceeeccccccc Q lcl|NC_019916. 240 IDLYDVAQSDTANYMTDLNEAMLVI--KGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEAMRQANMILLKTGMAP 317 (513) Q Consensus 240 iD~~~~~~S~~~~~~~~~~~~~l~~--~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (513) +..-+.+..-..+.+...+.|-.++ ++.. ....+.......++... -...-++++.+.+++ T Consensus 187 i~l~~~a~~~~~~~~~NG~~~~~il~~~d~~-----------l~~e~~~~i~~~~~~~~----g~~n~~~~~vl~~~~-- 249 (346) T protein:vir:10 187 AWLNESATLFRRKYFLNGAHAGFVFYMSDAS-----------QKQEDVENIRQQLKQSK----GVGNFKNLFVHAPNG-- 249 (346) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCC-----------CCHHHHHHHHHHHHHhc----CccccCceeEecCCC-- Confidence 3322111111111112112222222 2210 00111111111111100 000112223332221 Q ss_pred cccccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 318 NGQQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSG-VAMKYKVLGTVELASTKRKQFERGLNQ 396 (513) Q Consensus 318 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~f~~~l~~ 396 (513) ..++.++..++-......+.+..+..+++|+..=++|+...+-..++.++ ..++-+ ........+.- T Consensus 250 --~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~----------~~~f~~~~l~P 317 (346) T protein:vir:10 250 --KKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADA----------AEVFFITEIEP 317 (346) T ss_pred --CccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHH----------HHHHHHHHHHH Confidence 11222333343333445567778888999999999998654422222211 111110 01112222333 Q ss_pred HHHHHHHHHHhcccccccccceeeEEeCCCCCcCHHH Q lcl|NC_019916. 397 RYTVVAHIEERVNGKWDIDPDEIGFIFRDNLPTDDVA 433 (513) Q Consensus 397 ~~~li~~~l~~~~~~~~~~~~~i~i~f~~~~p~d~~e 433 (513) +++.+..+...... + .|.|++...-...| T Consensus 318 ~~~~iee~n~~L~~----e----~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 318 LQERLKEFNQWLGQ----E----VIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHhhccc----c----eeeechhhhcccCC Confidence 33333222222211 0 24565443332222 No 262 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=21.06 E-value=2.7 Score=18.25 Aligned_cols=342 Identities=8% Similarity=0.016 Sum_probs=120.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccCCCCCCccee--ecchhHHHHHHHHHHhhcCCeee-cC--C--- Q lcl|NC_019916. 24 AAFIRHHYNNQRPRLEMLYDYYRGQNDGILSPASRRNEKGKADHRA--VHSFARYIADFQTSYSVGNAIAM-SG--P--- 95 (513) Q Consensus 24 ~~~i~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~~l~g~p~~~-~~--~--- 95 (513) .-+..+ ...+-.+.- ......... .....+ .......+|+..++-+-+-|+++ .. . T Consensus 1 Mg~f~~-----------~~~f~~~~~---~~~~~~~~~--~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~ 64 (378) T protein:vir:93 1 MNLFGK-----------VVSFSRGKL---NNDTQRVTA--WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred Cccchh-----------hhhhhcccc---CCCcceeee--cccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccc Confidence 000000 000000000 000000000 000111 12234456677777777778764 11 0 Q ss_pred -------cHHHHHHHHHh--cC---HHHHHHHHHHHHhhCCeEEEEeeecCCCceeEEEEEcccceEEEecCCCCcceEE Q lcl|NC_019916. 96 -------SSDRLDDFNRR--ND---IDTLNYELYLDMTVTGRAYEYVYRDPSQKGEVSVKLDPMECFIIYDRSVNPKPIM 163 (513) Q Consensus 96 -------~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~d~~~~~~~~~~~~p~~~~~~~d~~~~~~~~~ 163 (513) .+..+..++.. |. .......+..+.+.+|.||+++..+.. .+.+. .+-|.. . T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~-~g~~~-~l~~~~--------~------ 128 (378) T protein:vir:93 65 SDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDN-TGELL-DLLFAD--------D------ 128 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecC-CceEE-EEEecC--------C------ Confidence 11345555542 32 335555678899999999976544321 11111 111100 0 Q ss_pred EEEEEeecccccccceeEEEEEEEcCCcEEEEEeeccCCccccccccccccCcccceEEecCCCCCCcchhHHHHHHHHH Q lcl|NC_019916. 164 AVRYHAVQTVVDNITQTKYEVETWTENDYTRYKPIVVAGSVPTLEVAEHSAQFGFPMIEYRNNEYRQGDFENVLSLIDLY 243 (513) Q Consensus 164 ~ir~~~~~~~~~~~~~~~~~ve~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~v~~liD~~ 243 (513) . + .|..+.+.++ ++.-.+......+..+..++ T Consensus 129 ---------------~----~-~~~~~diih~----------------------------r~~~~~~~~~s~l~~~~~~i 160 (378) T protein:vir:93 129 ---------------K----K-EYKTEELVRL----------------------------TSPFYINEDTSILDNALASI 160 (378) T ss_pred ---------------e----e-EeccceeEEe----------------------------cCccccchhhHHHHHHHHHH Confidence 0 0 1122222222 21111111122233333333 Q ss_pred HHHHHHHHHHHHHhhhhhhheecCcccccccccccccccchhhhhhhccccccchhhhc----chhcceeeccccccccc Q lcl|NC_019916. 244 DVAQSDTANYMTDLNEAMLVIKGDIDTLFDDSTLLQMVDPSDADAMKKLADEKMAQLEA----MRQANMILLKTGMAPNG 319 (513) Q Consensus 244 ~~~~S~~~~~~~~~~~~~l~~~G~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 319 (513) +..++. +.+--+++-.. ..... ....+++.-...+.. ...++++.+ T Consensus 161 ~~~~~~--------~~~~g~l~~~~-----------~l~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l-------- 210 (378) T protein:vir:93 161 QTKLEQ--------GKLRGLLKINA-----------FLDID---NTQEYREKALTTIKNMQEGSSYNGLTPV-------- 210 (378) T ss_pred HHHHhc--------CcccceeeeCC-----------cCCHH---HHHHHHHHHHHHHHHhhcccccccceEc-------- Confidence 222111 11111111100 00000 000001100011111 011223332 Q ss_pred cccCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 320 QQTSADANYIHKEYDSAGTELYKKRLAADIHKFSHTPDLTDDNFSGNSSGVAMKYKVLGTVELASTKRKQFERGLNQRYT 399 (513) Q Consensus 320 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~~ 399 (513) ..+.+++.++.......+ ...+.+.+.|+..-++|+.-. .+..|... ....+...|...++ T Consensus 211 -~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l---~g~~~e~~--------------~~~f~~~tl~P~~~ 271 (378) T protein:vir:93 211 -DNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENIL---LGTATQEQ--------------QIYFYNSTIIPLLI 271 (378) T ss_pred -CCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHh---cCCcHHHH--------------HHHHHHHHHHHHHH Confidence 222333433333333333 445677889999999987433 12222111 11233444555554 Q ss_pred HHHHHHHhcc--------cccccccceeeEEeCCCCCcCHHHHHHHHHHH--hcCCCHHHHHHhCCC--CCCHHHHHHHH Q lcl|NC_019916. 400 VVAHIEERVN--------GKWDIDPDEIGFIFRDNLPTDDVAIITALVQA--GAQIPQEYLYQYLPN--VTDADEIVKMM 467 (513) Q Consensus 400 li~~~l~~~~--------~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ri 467 (513) .+..-+...- +........+++.+..-+-.|..+.++++.++ +|+++.-.+.++++. +++.+.=+-. T Consensus 272 ~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~- 350 (378) T protein:vir:93 272 QLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIAN- 350 (378) T ss_pred HHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeec- Confidence 4444433210 00011122345556677778899999998887 578887777766643 2221100000 Q ss_pred HHHHHHHHHHhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019916. 468 DKQRKAMLKTYDTKGGLIINGTSGNDPEDEGVRGQQGEPEDER 510 (513) Q Consensus 468 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (513) ....+.......+ ....++..++.++++ T Consensus 351 --------~n~~~~~~~~~~~-------~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 351 --------LNAVAVKNLSDLQ-------GSRKDVTSTDETNNQ 378 (378) T ss_pred --------cccccccchhhhc-------CccCCCCCCCCCCCC Confidence 0000000000000 000001111111111 Done!