Query         lcl|NC_021297.1_cdsid_YP_008050779.1 [gene=14] [protein=portal protein] [protein_id=YP_008050779.1] [location=8957..10399]
Match_columns 480
No_of_seqs    150 out of 499
Neff          9.8
Searched_HMMs 1612
Date          Thu Nov 7 16:18:46 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_14 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_14_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:78227 Length: 480  100.0  2E-119  1E-122  671.7  54.8  480   1-480    1-480 (480)
  2 protein:vir:78537 Length: 480  100.0  2E-119  1E-122  671.0  54.9  480   1-480    1-480 (480)
  3 protein:vir:4223 Length: 486 # 100.0  2E-105  1E-108  595.2  53.8  468   1-476   11-486 (486)
  4 protein:vir:2427 Length: 485 # 100.0  2E-104  1E-107  589.0  52.8  467   1-477   11-485 (485)
  5 protein:vir:104082 Length: 485 100.0  1E-103  8E-107  584.5  53.7  467   1-477   11-485 (485)
  6 protein:vir:7768 Length: 484 # 100.0  1E-103  7E-107  584.8  53.0  470   1-479   10-484 (484)
  7 protein:vir:2341 Length: 488 # 100.0  2E-102  1E-105  578.1  52.7  463   1-474    7-488 (488)
  8 protein:vir:99916 Length: 504  100.0  7E-100  5E-103  564.0  52.2  464   1-475   18-504 (504)
  9 protein:vir:8184 Length: 474 # 100.0 8.9E-98  5E-101  552.7  48.4  445   1-455   12-474 (474)
 10 protein:vir:80680 Length: 441  100.0 2.8E-96 1.8E-99  544.4  49.4  434   1-455    1-441 (441)
 11 protein:vir:2500 Length: 501 # 100.0 1.5E-94 9.2E-98  535.0  50.4  455   1-477   23-501 (501)
 12 protein:vir:9568 Length: 410 # 100.0 1.4E-92 8.7E-96  524.2  41.4  399  15-438    1-410 (410)
 13 protein:vir:94742 Length: 409  100.0 2.7E-91 1.7E-94  517.1  42.7  399   1-425    1-409 (409)
 14 protein:vir:99072 Length: 479  100.0 1.4E-89 8.6E-93  507.7  49.6  460   1-479    9-479 (479)
 15 protein:vir:1634 Length: 409 # 100.0 1.2E-90 7.2E-94  513.7  42.1  399   1-425    1-409 (409)
 16 protein:vir:9751 Length: 422 # 100.0 4.2E-89 2.6E-92  505.1  43.5  412   1-439    1-422 (422)
 17 protein:vir:105819 Length: 456 100.0 2.2E-87 1.4E-90  495.7  48.2  432   1-462    1-456 (456)
18 protein:vir:102602 Length: 456 100.0 2.2E-87 1.4E-90 495.7 48.2 432 1-462 1-456 (456) 19 protein:vir:7987 Length: 456 # 100.0 9.2E-87 5.7E-90 492.3 48.7 431 1-462 3-456 (456) 20 protein:vir:98444 Length: 434 100.0 2.1E-83 1.3E-86 473.9 46.9 416 37-473 1-434 (434) 21 protein:vir:106571 Length: 499 100.0 1.6E-81 9.6E-85 463.6 48.7 461 1-480 13-497 (499) 22 protein:vir:2732 Length: 501 # 100.0 1.1E-81 6.6E-85 464.5 46.2 440 1-469 37-501 (501) 23 protein:vir:96494 Length: 501 100.0 7.3E-82 4.5E-85 465.4 45.1 441 1-470 37-501 (501) 24 protein:vir:3964 Length: 453 # 100.0 2.8E-81 1.7E-84 462.2 46.4 428 1-469 11-453 (453) 25 protein:vir:97171 Length: 512 100.0 4.4E-81 2.8E-84 461.1 47.3 449 1-470 31-512 (512) 26 protein:vir:94805 Length: 492 100.0 3.4E-81 2.1E-84 461.7 45.1 435 1-476 42-492 (492) 27 protein:vir:99781 Length: 511 100.0 1.5E-80 9.2E-84 458.2 48.5 448 1-469 39-511 (511) 28 protein:vir:78805 Length: 511 100.0 1.6E-80 9.8E-84 458.1 48.0 448 1-470 39-511 (511) 29 protein:vir:96366 Length: 511 100.0 1.6E-80 9.8E-84 458.1 48.0 448 1-470 39-511 (511) 30 protein:vir:97336 Length: 492 100.0 5.4E-81 3.4E-84 460.6 45.4 435 1-469 42-492 (492) 31 protein:vir:9306 Length: 511 # 100.0 1.7E-80 1E-83 457.9 47.8 448 1-469 39-511 (511) 32 protein:vir:4898 Length: 502 # 100.0 1.8E-80 1.1E-83 457.8 46.0 443 1-480 38-502 (502) 33 protein:vir:103951 Length: 511 100.0 4.4E-80 2.8E-83 455.6 47.8 448 1-470 39-511 (511) 34 protein:vir:1236 Length: 483 # 100.0 2.9E-80 1.8E-83 456.7 45.5 435 1-468 33-483 (483) 35 protein:vir:93747 Length: 472 100.0 4.7E-80 2.9E-83 455.5 45.4 435 1-468 22-472 (472) 36 protein:vir:5961 Length: 503 # 100.0 2.2E-79 1.3E-82 451.8 47.0 451 1-479 27-503 (503) 37 protein:vir:96240 Length: 511 100.0 3.3E-79 2.1E-82 450.8 47.9 448 1-470 39-511 (511) 38 protein:vir:733 Length: 453 # 100.0 1.4E-79 8.9E-83 452.8 45.3 429 1-463 11-453 (453) 39 protein:vir:99522 Length: 470 100.0 4.9E-79 3.1E-82 449.9 48.1 438 1-468 19-470 (470) 40 protein:vir:3609 Length: 452 # 
100.0 3.3E-79 2E-82 450.9 45.6 426 1-470 17-452 (452) 41 protein:vir:96266 Length: 474 100.0 5.2E-79 3.2E-82 449.8 46.6 434 1-470 25-474 (474) 42 protein:vir:95899 Length: 474 100.0 5.2E-79 3.2E-82 449.8 46.6 434 1-470 25-474 (474) 43 protein:vir:94498 Length: 474 100.0 8.8E-79 5.5E-82 448.5 45.3 434 1-476 25-474 (474) 44 protein:vir:97447 Length: 474 100.0 8.8E-79 5.5E-82 448.5 45.3 434 1-476 25-474 (474) 45 protein:vir:9871 Length: 429 # 100.0 1.1E-78 6.7E-82 448.0 45.4 420 1-462 1-429 (429) 46 protein:vir:106639 Length: 481 100.0 1.2E-77 7.7E-81 442.2 48.4 437 1-465 29-481 (481) 47 protein:vir:105292 Length: 478 100.0 7.3E-78 4.5E-81 443.5 46.4 433 1-466 1-478 (478) 48 protein:vir:102330 Length: 451 100.0 1E-77 6.3E-81 442.7 45.4 427 2-453 1-451 (451) 49 protein:vir:95806 Length: 440 100.0 8.2E-78 5.1E-81 443.2 44.2 424 10-464 1-440 (440) 50 protein:vir:107112 Length: 478 100.0 2.6E-77 1.6E-80 440.5 46.9 433 1-466 24-478 (478) 51 protein:vir:105461 Length: 470 100.0 2.2E-77 1.4E-80 440.8 44.6 431 1-464 3-470 (470) 52 protein:vir:95113 Length: 474 100.0 5E-77 3.1E-80 438.9 46.1 435 1-466 25-474 (474) 53 protein:vir:9922 Length: 489 # 100.0 1.6E-76 9.7E-80 436.2 47.7 445 1-471 13-489 (489) 54 protein:vir:102950 Length: 471 100.0 1.4E-76 8.6E-80 436.5 45.2 428 1-466 1-471 (471) 55 protein:vir:96839 Length: 474 100.0 2.1E-76 1.3E-79 435.5 45.7 432 1-468 24-474 (474) 56 protein:vir:79043 Length: 479 100.0 1.2E-75 7.4E-79 431.3 47.7 432 1-465 18-479 (479) 57 protein:vir:94546 Length: 506 100.0 1E-75 6.4E-79 431.7 46.3 443 1-470 16-506 (506) 58 protein:vir:96179 Length: 468 100.0 3.2E-75 2E-78 429.0 45.2 423 1-457 24-468 (468) 59 protein:vir:105889 Length: 474 100.0 1E-74 6.3E-78 426.2 46.1 430 1-466 14-474 (474) 60 protein:vir:94101 Length: 474 100.0 1E-74 6.3E-78 426.2 46.1 430 1-466 14-474 (474) 61 protein:vir:78083 Length: 537 100.0 3.7E-75 2.3E-78 428.6 43.6 451 1-479 1-537 (537) 62 protein:vir:38 Length: 496 # N 100.0 7.7E-56 4.8E-59 322.7 42.0 440 1-468 19-496 
(496) 63 protein:vir:80959 Length: 499 100.0 8.5E-52 5.3E-55 300.6 43.1 443 1-466 1-499 (499) 64 protein:vir:101494 Length: 527 100.0 1.9E-51 1.2E-54 298.7 33.7 458 1-474 21-527 (527) 65 protein:vir:102239 Length: 527 100.0 2.2E-51 1.4E-54 298.3 33.7 458 1-474 21-527 (527) 66 protein:vir:1587 Length: 508 # 100.0 9.7E-49 6E-52 283.8 42.7 442 1-464 4-508 (508) 67 protein:vir:79703 Length: 505 100.0 1.7E-48 1E-51 282.5 43.5 439 1-462 4-505 (505) 68 protein:vir:9815 Length: 500 # 100.0 5.2E-47 3.2E-50 274.3 42.5 441 1-470 4-500 (500) 69 protein:vir:3028 Length: 500 # 100.0 5.2E-47 3.2E-50 274.3 42.5 441 1-470 4-500 (500) 70 protein:vir:4782 Length: 522 # 100.0 2.7E-44 1.7E-47 259.5 42.9 451 1-476 4-522 (522) 71 protein:vir:7430 Length: 563 # 100.0 9.5E-45 5.9E-48 261.9 34.1 465 1-480 9-558 (563) 72 protein:vir:98883 Length: 517 100.0 5.7E-41 3.5E-44 241.2 42.0 445 1-476 4-517 (517) 73 protein:vir:78907 Length: 518 100.0 1.8E-40 1.1E-43 238.5 38.4 440 1-473 4-518 (518) 74 protein:vir:97265 Length: 513 100.0 1.8E-28 1.1E-31 172.7 37.6 459 1-479 1-513 (513) 75 protein:vir:94956 Length: 452 99.9 3E-26 1.9E-29 160.5 34.8 423 1-465 1-452 (452) 76 protein:vir:80453 Length: 535 99.9 3.8E-25 2.4E-28 154.5 36.2 453 1-476 32-535 (535) 77 protein:vir:95149 Length: 501 99.9 1.9E-23 1.2E-26 145.2 36.7 441 1-476 1-501 (501) 78 protein:vir:105461 Length: 470 99.9 3.9E-21 2.4E-24 132.5 44.8 410 3-478 1-470 (470) 79 protein:vir:78393 Length: 489 99.9 5.7E-22 3.6E-25 137.0 34.2 438 1-470 1-489 (489) 80 protein:vir:95014 Length: 491 99.9 2.1E-20 1.3E-23 128.5 35.4 441 1-469 1-491 (491) 81 protein:vir:96783 Length: 488 99.8 2.2E-18 1.3E-21 117.4 36.6 424 1-451 16-488 (488) 82 protein:vir:93630 Length: 776 99.8 6.7E-20 4.1E-23 125.7 25.8 466 1-480 38-688 (776) 83 protein:vir:108295 Length: 711 99.7 7.9E-18 4.9E-21 114.3 29.0 468 1-480 25-672 (711) 84 protein:vir:80040 Length: 461 99.7 2.9E-17 1.8E-20 111.2 28.2 439 1-473 1-461 (461) 85 protein:vir:2764 Length: 714 # 99.7 1.1E-14 7.1E-18 
97.0 37.8 450 1-480 1-632 (714) 86 protein:vir:10117 Length: 714 99.7 1.1E-14 7.1E-18 97.0 37.8 450 1-480 1-632 (714) 87 protein:vir:9950 Length: 714 # 99.7 1.1E-14 7.1E-18 97.0 37.8 450 1-480 1-632 (714) 88 protein:vir:3296 Length: 714 # 99.7 1.1E-14 7.1E-18 97.0 37.8 450 1-480 1-632 (714) 89 protein:vir:817 Length: 714 # 99.7 1.1E-14 7.1E-18 97.0 37.8 450 1-480 1-632 (714) 90 protein:vir:79538 Length: 502 99.7 6.6E-14 4.1E-17 92.8 41.6 443 1-478 1-502 (502) 91 protein:vir:100920 Length: 725 99.7 9.5E-16 5.9E-19 102.9 29.5 472 1-480 1-640 (725) 92 protein:vir:105619 Length: 772 99.7 2.5E-16 1.6E-19 106.1 25.3 453 1-480 14-617 (772) 93 protein:vir:9263 Length: 725 # 99.6 3.3E-15 2E-18 100.0 29.0 457 1-480 1-624 (725) 94 protein:vir:104437 Length: 714 99.6 4.5E-15 2.8E-18 99.2 29.5 449 1-480 15-628 (714) 95 protein:vir:77597 Length: 725 99.6 3.2E-15 2E-18 100.1 28.5 467 1-480 1-649 (725) 96 protein:vir:8846 Length: 705 # 99.6 1.1E-13 6.6E-17 91.7 32.5 453 1-480 10-622 (705) 97 protein:vir:172 Length: 708 # 99.6 9.5E-14 5.9E-17 92.0 32.1 471 1-480 1-652 (708) 98 protein:vir:80165 Length: 651 99.6 4.7E-14 2.9E-17 93.7 30.2 456 1-480 15-637 (651) 99 protein:vir:95449 Length: 584 99.6 1.1E-13 7.1E-17 91.5 31.3 436 1-460 1-584 (584) 100 protein:vir:105429 Length: 708 99.5 1.6E-13 9.7E-17 90.8 30.6 469 1-480 1-652 (708) 101 protein:vir:96738 Length: 505 99.5 2E-12 1.2E-15 84.7 38.5 431 1-475 14-505 (505) 102 protein:vir:5249 Length: 437 # 99.5 5.5E-13 3.4E-16 87.8 32.4 410 1-478 1-437 (437) 103 protein:vir:94049 Length: 532 99.5 4.2E-14 2.6E-17 93.9 24.6 434 1-480 27-527 (532) 104 protein:vir:107742 Length: 537 99.5 4.5E-14 2.8E-17 93.8 23.3 427 1-480 70-536 (537) 105 protein:vir:3420 Length: 533 # 99.5 3.9E-12 2.4E-15 83.1 33.6 443 1-478 1-533 (533) 106 protein:vir:389 Length: 530 # 99.5 7.4E-12 4.6E-15 81.6 38.0 439 1-477 1-530 (530) 107 protein:vir:95542 Length: 548 99.5 8.7E-12 5.4E-15 81.2 36.6 454 1-480 1-545 (548) 108 protein:vir:105520 Length: 706 99.5 6.6E-12 
4.1E-15 81.9 34.0 463 1-480 1-637 (706) 109 protein:vir:6382 Length: 553 # 99.4 1.2E-11 7.3E-15 80.5 34.1 455 1-480 1-552 (553) 110 protein:vir:79647 Length: 435 99.4 5.8E-13 3.6E-16 87.6 26.8 407 1-476 5-435 (435) 111 protein:vir:3520 Length: 720 # 99.4 6.1E-13 3.8E-16 87.5 25.8 469 1-480 1-659 (720) 112 protein:vir:107662 Length: 427 99.4 1.2E-12 7.7E-16 85.9 26.5 402 12-475 1-427 (427) 113 protein:vir:10321 Length: 495 99.4 7E-12 4.4E-15 81.7 30.5 437 1-476 3-495 (495) 114 protein:vir:104338 Length: 422 99.3 1.3E-11 8.1E-15 80.3 28.0 402 1-474 1-422 (422) 115 protein:vir:99563 Length: 862 99.3 1.1E-12 6.9E-16 86.1 21.3 424 1-480 100-601 (862) 116 protein:vir:3139 Length: 599 # 99.3 5.2E-11 3.2E-14 77.0 27.9 447 1-470 15-599 (599) 117 protein:vir:95821 Length: 763 99.2 1.5E-11 9.1E-15 80.0 22.5 451 1-480 26-673 (763) 118 protein:vir:96068 Length: 765 99.2 3.9E-11 2.4E-14 77.6 24.7 426 1-480 71-564 (765) 119 protein:vir:3153 Length: 467 # 99.1 3.5E-09 2.2E-12 66.9 29.8 381 46-480 1-452 (467) 120 protein:vir:80644 Length: 551 99.0 4E-09 2.5E-12 66.6 29.1 413 1-480 54-538 (551) 121 protein:vir:63755 Length: 547 99.0 5.1E-09 3.2E-12 66.0 28.5 404 1-480 50-527 (547) 122 protein:vir:95315 Length: 559 99.0 9.1E-09 5.7E-12 64.7 36.6 449 1-480 1-559 (559) 123 protein:vir:94599 Length: 641 99.0 9.3E-09 5.8E-12 64.6 30.2 462 1-479 20-641 (641) 124 protein:vir:6240 Length: 457 # 98.9 1.6E-08 1E-11 63.3 25.9 423 5-480 1-454 (457) 125 protein:vir:79772 Length: 648 98.9 1.9E-08 1.2E-11 62.9 30.0 422 1-480 51-511 (648) 126 protein:vir:10447 Length: 536 98.9 2.2E-08 1.3E-11 62.6 38.4 445 1-474 1-536 (536) 127 protein:vir:1538 Length: 535 # 98.9 2.2E-08 1.4E-11 62.5 39.4 438 1-479 1-535 (535) 128 protein:vir:102668 Length: 547 98.9 2.5E-08 1.6E-11 62.3 38.2 433 1-468 1-547 (547) 129 protein:vir:7321 Length: 556 # 98.9 2.5E-08 1.6E-11 62.2 36.1 446 1-473 1-556 (556) 130 protein:vir:102080 Length: 429 98.8 1.6E-08 1E-11 63.3 23.0 399 5-476 1-429 (429) 131 protein:vir:107822 Length: 
555 98.8 4.6E-08 2.9E-11 60.8 35.6 449 1-479 1-555 (555) 132 protein:vir:98506 Length: 555 98.8 4.6E-08 2.9E-11 60.8 35.6 449 1-479 1-555 (555) 133 protein:vir:107404 Length: 555 98.8 4.6E-08 2.9E-11 60.8 35.6 449 1-479 1-555 (555) 134 protein:vir:94709 Length: 522 98.8 5.8E-08 3.6E-11 60.3 37.8 435 1-471 1-522 (522) 135 protein:vir:2198 Length: 536 # 98.8 6.2E-08 3.8E-11 60.1 38.9 444 1-474 1-536 (536) 136 protein:vir:103765 Length: 549 98.7 6.9E-08 4.3E-11 59.8 33.7 436 1-475 1-549 (549) 137 protein:vir:3361 Length: 535 # 98.7 7.8E-08 4.8E-11 59.6 40.3 438 1-477 1-535 (535) 138 protein:vir:3843 Length: 397 # 98.7 7.4E-08 4.6E-11 59.7 23.1 385 1-476 1-397 (397) 139 protein:vir:101648 Length: 518 98.7 1.2E-07 7.4E-11 58.5 28.9 403 1-480 9-451 (518) 140 protein:vir:107605 Length: 432 98.7 1.3E-07 8.2E-11 58.3 25.4 401 5-476 1-432 (432) 141 protein:vir:105002 Length: 432 98.7 1.3E-07 8.2E-11 58.3 25.4 401 5-476 1-432 (432) 142 protein:vir:102855 Length: 432 98.7 1.3E-07 8.2E-11 58.3 25.4 401 5-476 1-432 (432) 143 protein:vir:99312 Length: 563 98.7 1.4E-07 8.5E-11 58.2 25.5 423 1-480 26-544 (563) 144 protein:vir:95599 Length: 563 98.7 1.4E-07 8.5E-11 58.2 25.5 423 1-480 26-544 (563) 145 protein:vir:8883 Length: 543 # 98.6 1.6E-07 1E-10 57.8 36.4 443 1-480 1-542 (543) 146 protein:vir:80796 Length: 574 98.6 1.6E-07 1E-10 57.8 26.2 422 1-480 13-540 (574) 147 protein:vir:99672 Length: 532 98.6 1.7E-07 1.1E-10 57.7 33.5 438 1-464 1-532 (532) 148 protein:vir:3648 Length: 695 # 98.6 2E-07 1.3E-10 57.3 27.0 430 1-480 60-573 (695) 149 protein:vir:78589 Length: 695 98.6 2.1E-07 1.3E-10 57.2 27.1 418 1-480 60-552 (695) 150 protein:vir:8418 Length: 409 # 98.6 2.3E-07 1.4E-10 57.0 26.4 386 5-478 1-409 (409) 151 protein:vir:1326 Length: 457 # 98.6 2.4E-07 1.5E-10 56.8 26.5 417 5-480 1-452 (457) 152 protein:vir:99452 Length: 651 98.6 4.4E-08 2.8E-11 60.9 19.0 446 1-480 1-549 (651) 153 protein:vir:103177 Length: 533 98.6 1.1E-07 6.8E-11 58.8 21.0 445 1-480 27-529 (533) 154 
protein:vir:101541 Length: 694 98.6 2.9E-07 1.8E-10 56.5 27.0 435 1-480 61-572 (694) 155 protein:vir:94572 Length: 535 98.6 3E-07 1.9E-10 56.3 35.4 441 1-472 1-535 (535) 156 protein:vir:7853 Length: 518 # 98.6 3E-07 1.9E-10 56.3 28.6 406 1-480 9-451 (518) 157 protein:vir:100039 Length: 522 98.5 3.4E-07 2.1E-10 56.1 35.8 431 1-475 1-522 (522) 158 protein:vir:7407 Length: 392 # 98.5 3.7E-07 2.3E-10 55.8 28.7 371 7-471 1-392 (392) 159 protein:vir:78696 Length: 542 98.5 4.2E-07 2.6E-10 55.6 38.6 431 1-476 1-542 (542) 160 protein:vir:1785 Length: 555 # 98.5 4.4E-07 2.7E-10 55.4 35.3 435 4-477 1-555 (555) 161 protein:vir:4156 Length: 542 # 98.5 5.3E-07 3.3E-10 55.0 23.0 412 1-480 11-465 (542) 162 protein:vir:104892 Length: 558 98.4 6.2E-07 3.8E-10 54.6 23.4 449 1-480 43-551 (558) 163 protein:vir:106999 Length: 564 98.4 4.4E-07 2.7E-10 55.4 21.2 453 1-480 25-563 (564) 164 protein:vir:106716 Length: 698 98.4 8.3E-07 5.2E-10 53.9 25.0 430 1-480 60-570 (698) 165 protein:vir:104500 Length: 537 98.4 8.8E-07 5.4E-10 53.8 23.6 445 1-479 28-537 (537) 166 protein:vir:3989 Length: 392 # 98.4 1E-06 6.4E-10 53.4 28.1 371 7-471 1-392 (392) 167 protein:vir:1023 Length: 392 # 98.4 1E-06 6.4E-10 53.4 28.1 371 7-471 1-392 (392) 168 protein:vir:93610 Length: 454 98.3 1.2E-06 7.7E-10 53.0 27.1 402 11-480 1-449 (454) 169 protein:vir:4952 Length: 386 # 98.3 1.3E-06 7.8E-10 52.9 27.7 372 5-468 1-386 (386) 170 protein:vir:1266 Length: 416 # 98.3 1.4E-06 8.8E-10 52.6 26.3 392 6-475 1-416 (416) 171 protein:vir:100691 Length: 535 98.3 1.5E-06 9.1E-10 52.6 29.2 412 1-480 53-529 (535) 172 protein:vir:105782 Length: 449 98.3 1.6E-06 9.8E-10 52.4 24.9 399 1-478 1-449 (449) 173 protein:vir:79984 Length: 441 98.3 2E-06 1.2E-09 51.8 24.5 397 1-477 11-441 (441) 174 protein:vir:9408 Length: 441 # 98.3 2E-06 1.2E-09 51.8 24.5 397 1-477 11-441 (441) 175 protein:vir:103330 Length: 517 98.2 2.2E-06 1.4E-09 51.6 36.1 424 1-475 1-517 (517) 176 protein:vir:9359 Length: 348 # 98.2 2.9E-06 1.8E-09 50.9 26.8 334 
35-469 1-348 (348) 177 protein:vir:4995 Length: 384 # 98.2 3.1E-06 1.9E-09 50.8 21.6 366 5-458 1-384 (384) 178 protein:vir:93943 Length: 409 98.2 3.2E-06 2E-09 50.7 28.6 394 1-469 1-409 (409) 179 protein:vir:101647 Length: 460 98.2 3.3E-06 2.1E-09 50.6 25.9 406 1-478 1-460 (460) 180 protein:vir:81095 Length: 416 98.1 3.9E-06 2.4E-09 50.2 26.8 386 12-469 1-416 (416) 181 protein:vir:4598 Length: 416 # 98.1 3.9E-06 2.4E-09 50.2 26.8 386 12-469 1-416 (416) 182 protein:vir:98396 Length: 441 98.1 3.9E-06 2.4E-09 50.2 25.4 398 1-469 1-441 (441) 183 protein:vir:94426 Length: 409 98.1 4.4E-06 2.7E-09 50.0 28.3 395 1-469 1-409 (409) 184 protein:vir:81152 Length: 411 98.1 4.4E-06 2.7E-09 49.9 27.6 381 5-475 1-411 (411) 185 protein:vir:96579 Length: 576 98.1 4.5E-06 2.8E-09 49.9 28.9 396 1-480 54-540 (576) 186 protein:vir:960 Length: 413 # 98.1 4.9E-06 3E-09 49.7 23.8 380 1-467 1-413 (413) 187 protein:vir:102727 Length: 945 98.1 5.2E-06 3.2E-09 49.6 28.5 419 1-480 40-537 (945) 188 protein:vir:100650 Length: 395 98.1 5.2E-06 3.2E-09 49.5 20.6 367 22-476 1-395 (395) 189 protein:vir:9507 Length: 395 # 98.1 5.2E-06 3.2E-09 49.5 20.6 367 22-476 1-395 (395) 190 protein:vir:101289 Length: 395 98.1 5.2E-06 3.2E-09 49.5 20.6 367 22-476 1-395 (395) 191 protein:vir:4194 Length: 540 # 98.0 6.3E-06 3.9E-09 49.1 24.7 419 1-480 1-472 (540) 192 protein:vir:4454 Length: 414 # 98.0 6.6E-06 4.1E-09 49.0 30.4 393 5-479 1-414 (414) 193 protein:vir:4828 Length: 382 # 98.0 6.7E-06 4.2E-09 48.9 23.3 369 5-468 1-382 (382) 194 protein:vir:94666 Length: 723 98.0 7E-06 4.3E-09 48.9 29.4 400 1-480 1-446 (723) 195 protein:vir:95378 Length: 406 97.9 1E-05 6.5E-09 47.9 23.9 382 5-466 1-406 (406) 196 protein:vir:81017 Length: 521 97.9 1.1E-05 6.8E-09 47.8 21.5 417 1-463 46-521 (521) 197 protein:vir:9702 Length: 406 # 97.9 1.2E-05 7.8E-09 47.5 22.4 385 12-479 1-406 (406) 198 protein:vir:106282 Length: 521 97.9 1.3E-05 7.9E-09 47.4 20.1 407 1-463 47-521 (521) 199 protein:vir:100150 Length: 437 97.8 1.5E-05 
9.3E-09 47.0 28.4 397 1-480 1-437 (437) 200 protein:vir:6322 Length: 510 # 97.8 1.7E-05 1.1E-08 46.7 36.3 421 4-461 1-510 (510) 201 protein:vir:2683 Length: 412 # 97.8 1.7E-05 1.1E-08 46.7 30.0 395 1-469 1-412 (412) 202 protein:vir:100598 Length: 516 97.8 2E-05 1.2E-08 46.4 23.3 400 1-460 64-516 (516) 203 protein:vir:4854 Length: 386 # 97.8 2.1E-05 1.3E-08 46.3 28.7 372 5-468 1-386 (386) 204 protein:vir:6596 Length: 521 # 97.7 2.3E-05 1.4E-08 46.0 20.7 418 1-463 46-521 (521) 205 protein:vir:6896 Length: 523 # 97.7 2.3E-05 1.4E-08 46.0 18.5 409 1-460 36-523 (523) 206 protein:vir:108049 Length: 524 97.7 2.5E-05 1.5E-08 45.8 18.5 419 1-460 45-524 (524) 207 protein:vir:95254 Length: 488 97.7 2.7E-05 1.7E-08 45.7 30.0 429 1-480 1-484 (488) 208 protein:vir:103458 Length: 524 97.7 2.7E-05 1.7E-08 45.7 19.8 408 1-460 69-524 (524) 209 protein:vir:7208 Length: 524 # 97.7 2.8E-05 1.8E-08 45.5 19.8 408 1-463 69-524 (524) 210 protein:vir:78942 Length: 510 97.7 3E-05 1.9E-08 45.4 37.5 420 4-474 1-510 (510) 211 protein:vir:7017 Length: 515 # 97.6 3.3E-05 2E-08 45.2 32.8 420 1-469 1-515 (515) 212 protein:vir:96988 Length: 516 97.6 3.4E-05 2.1E-08 45.1 33.7 419 1-467 5-516 (516) 213 protein:vir:103860 Length: 528 97.6 3.6E-05 2.2E-08 44.9 33.4 413 1-480 1-461 (528) 214 protein:vir:1380 Length: 422 # 97.6 4E-05 2.5E-08 44.7 30.6 390 1-478 1-422 (422) 215 protein:vir:105641 Length: 516 97.6 4.1E-05 2.5E-08 44.6 33.6 421 1-467 1-516 (516) 216 protein:vir:99853 Length: 488 97.6 4.1E-05 2.6E-08 44.6 30.2 391 8-480 1-417 (488) 217 protein:vir:102118 Length: 409 97.6 4.3E-05 2.6E-08 44.5 26.9 382 10-464 1-409 (409) 218 protein:vir:98265 Length: 524 97.6 4.4E-05 2.7E-08 44.5 21.4 412 1-460 51-524 (524) 219 protein:vir:4337 Length: 434 # 97.6 4.6E-05 2.9E-08 44.4 27.0 398 1-469 1-434 (434) 220 protein:vir:100882 Length: 383 97.6 4.6E-05 2.9E-08 44.4 27.2 368 5-476 1-383 (383) 221 protein:vir:98816 Length: 446 97.5 5.1E-05 3.2E-08 44.1 23.0 410 1-428 1-446 (446) 222 protein:vir:107880 
Length: 491 97.5 5.2E-05 3.2E-08 44.1 30.7 393 1-480 15-430 (491) 223 protein:vir:79233 Length: 526 97.5 5.7E-05 3.6E-08 43.8 33.5 406 1-480 1-459 (526) 224 protein:vir:81072 Length: 432 97.5 5.8E-05 3.6E-08 43.8 26.9 390 11-478 1-432 (432) 225 protein:vir:104259 Length: 403 97.5 5.9E-05 3.6E-08 43.8 27.9 369 1-470 1-403 (403) 226 protein:vir:483 Length: 413 # 97.5 5.9E-05 3.6E-08 43.8 32.8 389 6-479 1-413 (413) 227 protein:vir:99232 Length: 526 97.5 5.9E-05 3.7E-08 43.8 34.6 411 1-480 1-456 (526) 228 protein:vir:3868 Length: 417 # 97.4 7.8E-05 4.8E-08 43.1 23.2 377 26-478 1-417 (417) 229 protein:vir:96980 Length: 409 97.3 9.7E-05 6E-08 42.6 29.8 395 1-469 1-409 (409) 230 protein:vir:8317 Length: 409 # 97.3 0.00011 6.5E-08 42.4 20.5 350 12-460 1-409 (409) 231 protein:vir:5665 Length: 511 # 97.3 0.00011 6.7E-08 42.3 18.2 406 1-460 55-511 (511) 232 protein:vir:101806 Length: 516 97.3 0.00011 6.9E-08 42.3 22.7 400 1-460 64-516 (516) 233 protein:vir:101189 Length: 516 97.3 0.00011 6.9E-08 42.3 22.7 400 1-460 64-516 (516) 234 protein:vir:5737 Length: 419 # 97.3 0.00011 7E-08 42.2 28.1 372 23-480 1-418 (419) 235 protein:vir:189 Length: 424 # 97.2 0.00013 8.1E-08 41.9 25.0 383 1-478 1-424 (424) 236 protein:vir:100187 Length: 385 97.1 0.00016 9.6E-08 41.5 27.8 366 1-467 1-385 (385) 237 protein:vir:97060 Length: 432 97.1 0.00016 9.8E-08 41.4 26.0 398 1-480 1-432 (432) 238 protein:vir:10362 Length: 432 97.0 0.00023 1.4E-07 40.5 26.2 400 1-480 1-432 (432) 239 protein:vir:1884 Length: 424 # 97.0 0.00023 1.5E-07 40.5 27.5 386 1-477 1-424 (424) 240 protein:vir:1986 Length: 512 # 96.9 0.00024 1.5E-07 40.4 31.7 389 1-480 39-448 (512) 241 protein:vir:4509 Length: 424 # 96.9 0.00029 1.8E-07 40.0 26.0 379 7-473 1-424 (424) 242 protein:vir:105064 Length: 421 96.7 0.00044 2.7E-07 39.0 27.1 389 10-480 1-420 (421) 243 protein:vir:80134 Length: 403 96.6 0.00044 2.7E-07 39.0 25.8 373 1-466 1-403 (403) 244 protein:vir:79063 Length: 491 96.6 0.0005 3.1E-07 38.7 30.3 396 1-480 1-430 (491) 245 
protein:vir:81218 Length: 423 96.6 0.0005 3.1E-07 38.7 28.0 392 5-467 1-423 (423) 246 protein:vir:6210 Length: 394 # 96.5 0.00053 3.3E-07 38.6 24.1 372 5-467 1-394 (394) 247 protein:vir:78641 Length: 278 96.2 0.00089 5.5E-07 37.3 23.9 262 64-389 1-278 (278) 248 protein:vir:1082 Length: 359 # 96.2 0.00089 5.6E-07 37.3 27.0 341 1-425 1-359 (359) 249 protein:vir:108215 Length: 469 96.0 0.0011 6.9E-07 36.8 30.0 419 1-480 1-467 (469) 250 protein:vir:95965 Length: 385 95.9 0.0012 7.7E-07 36.5 20.5 361 1-466 1-385 (385) 251 protein:vir:5839 Length: 533 # 95.7 0.0016 9.9E-07 35.9 25.2 424 1-480 1-525 (533) 252 protein:vir:100249 Length: 431 95.4 0.0021 1.3E-06 35.3 30.4 385 1-474 1-431 (431) 253 protein:vir:80333 Length: 419 95.3 0.0022 1.4E-06 35.1 26.2 381 10-480 1-416 (419) 254 protein:vir:78161 Length: 355 95.1 0.0027 1.7E-06 34.7 23.4 308 137-480 1-349 (355) 255 protein:vir:103219 Length: 201 94.9 0.0012 7.3E-07 36.7 10.1 196 218-476 1-201 (201) 256 protein:vir:1431 Length: 419 # 94.1 0.0055 3.4E-06 33.0 30.7 385 10-480 1-417 (419) 257 protein:vir:8100 Length: 466 # 93.8 0.0062 3.9E-06 32.7 27.9 414 5-478 1-466 (466) 258 protein:vir:98643 Length: 395 93.3 0.0079 4.9E-06 32.1 22.1 365 22-467 1-395 (395) 259 protein:vir:4089 Length: 395 # 92.8 0.0098 6.1E-06 31.6 25.4 375 5-475 1-395 (395) 260 protein:vir:80211 Length: 514 92.3 0.012 7.5E-06 31.1 36.5 419 1-457 1-514 (514) 261 protein:vir:77981 Length: 448 91.9 0.014 8.4E-06 30.8 29.1 404 1-480 1-440 (448) 262 protein:vir:100328 Length: 346 90.3 0.021 1.3E-05 29.7 17.8 296 1-394 26-346 (346) 263 protein:vir:2013 Length: 344 # 89.6 0.025 1.6E-05 29.3 19.9 289 1-394 28-344 (344) 264 protein:vir:103971 Length: 376 89.4 0.027 1.7E-05 29.2 21.4 295 1-400 58-376 (376) 265 protein:vir:78191 Length: 351 89.1 0.028 1.8E-05 29.1 21.2 304 1-400 33-351 (351) 266 protein:vir:79511 Length: 448 89.1 0.029 1.8E-05 29.1 27.8 400 1-480 5-440 (448) 267 protein:vir:98853 Length: 219 88.2 0.034 2.1E-05 28.7 13.2 201 154-393 1-219 (219) 268 
protein:vir:78310 Length: 376 87.2 0.04 2.5E-05 28.2 26.8 352 1-469 1-376 (376) 269 protein:vir:5691 Length: 344 # 85.9 0.05 3.1E-05 27.7 21.3 293 1-394 28-344 (344) 270 protein:vir:1661 Length: 378 # 85.1 0.056 3.5E-05 27.5 19.9 352 12-478 1-378 (378) 271 protein:vir:79207 Length: 351 84.5 0.06 3.7E-05 27.3 21.5 302 1-402 33-351 (351) 272 protein:vir:6058 Length: 344 # 84.2 0.063 3.9E-05 27.2 21.2 288 1-394 1-344 (344) 273 protein:vir:267 Length: 348 # 83.9 0.064 4E-05 27.1 21.4 309 1-399 1-348 (348) 274 protein:vir:93867 Length: 378 83.2 0.071 4.4E-05 26.9 20.7 350 12-478 1-378 (378) 275 protein:vir:98567 Length: 340 83.1 0.072 4.4E-05 26.9 19.8 292 1-397 25-340 (340) 276 protein:vir:9641 Length: 395 # 80.0 0.099 6.1E-05 26.1 26.8 365 22-467 1-395 (395) 277 protein:vir:1150 Length: 350 # 75.4 0.15 9.2E-05 25.1 21.6 293 1-392 36-350 (350) 278 protein:vir:94869 Length: 378 71.5 0.19 0.00012 24.5 22.2 345 5-478 1-378 (378) 279 protein:vir:345 Length: 663 # 67.7 0.25 0.00015 23.9 26.3 461 1-480 1-617 (663) 280 protein:vir:94002 Length: 378 67.1 0.26 0.00016 23.8 20.2 348 12-468 1-378 (378) 281 protein:vir:78749 Length: 337 63.6 0.32 0.0002 23.3 19.7 297 1-392 22-337 (337) 282 protein:vir:3780 Length: 345 # 61.5 0.35 0.00022 23.1 22.5 302 1-394 1-345 (345) 283 protein:vir:3609 Length: 452 # 51.2 0.59 0.00036 21.8 17.8 399 1-476 11-452 (452) 284 protein:vir:858 Length: 378 # 38.3 1.1 0.00067 20.4 21.2 348 5-478 1-378 (378) 285 protein:vir:97336 Length: 492 32.8 1.4 0.00087 19.8 22.4 414 1-476 38-492 (492) 286 protein:vir:96839 Length: 474 30.8 1.5 0.00096 19.5 15.1 398 1-459 28-474 (474) 287 protein:vir:79150 Length: 368 26.5 1.9 0.0012 19.0 19.9 302 1-412 41-368 (368) No 1 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=1.7e-119 Score=671.73 Aligned_cols=480 Identities=100% Similarity=1.436 
Sum_probs=460.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) |+|+.++|++|+++|..+++|++++++||+|+|+|++.+..+++.++++++++|||++|||++++||+++||++++|++. T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~~ 80 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.++++|++|+|+.++.++++++++||+||++||++...+.|++|.++++++||.+++++||+...+++.+++++|... T Consensus 81 ~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~ 160 (480) T protein:vir:78 81 LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee Confidence 99999999999999999999999999999999999988888889999999999999999999999899999999999988 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHH Q lcl|NC_021297. 
161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s 240 (480) ++.+.++++++|+++.+++|+..++....|++..+..+|+||+||||+|+|++++.++||+|+|+++|++|||+||+++| T Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s 240 (480) T protein:vir:78 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) T ss_pred cCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHH Confidence 88888999999999999999998888888888889999999999999999999999999999999899999999999999 Q ss_pred HHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCCh Q lcl|NC_021297. 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) Q Consensus 241 ~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~ 320 (480) ++++++++|++|+++|+|.+.+.+.++.+..+++...+++|.++|+++++++++.+++++|+++++.++++|++++++|+ T Consensus 241 ~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~ 320 (480) T protein:vir:78 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) T ss_pred HHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCh Confidence 99999999999999999999888888777888888999999999999999999999999999999999999999999999 Q ss_pred HHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH Q lcl|NC_021297. 
321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) Q Consensus 321 ~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~ 400 (480) +.||+++.|++||+||++++..|+.||+++++.|+.+|++++++++++.|.....++..++++|+++.++|+++.+|+++ T Consensus 321 ~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~ 400 (480) T protein:vir:78 321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) T ss_pred HHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999877778889999999999999999999999 Q ss_pred HHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 401 kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) |++++|.+++|++|+++++||+++++++|+++++++++++++++.+..++.++..+++.+++.+++++.+|+|.|.++|| T Consensus 401 kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCccccccCCCCcccCC Confidence 99999999999999999999999999999999999999999999988888888888899999899999999999999999 No 2 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=2.3e-119 Score=671.03 Aligned_cols=480 Identities=100% Similarity=1.437 Sum_probs=460.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) |+|+.++|++|+++|..+++|++++++||+|+|+|++.+..++++++++++++|||++|||++++||+++||++++|++. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~~ 80 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.++++|++|+|+.++.++++++++||+||++||++...+.|++|.++|+++||.+++|+||+...+++.+++++|... T Consensus 81 ~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~ 160 (480) T protein:vir:78 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee Confidence 99999999999999999999999999999999999988877889999999999999999999999899999999999988 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHH Q lcl|NC_021297. 
161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s 240 (480) ++.+.++++++|+++.+++|...++...++++..++.+|+||.||||+|+|++++.++||+|+|+++|++|+|+||+++| T Consensus 161 d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s 240 (480) T protein:vir:78 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) T ss_pred cCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHH Confidence 88888999999999999999998888888888889999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCCh Q lcl|NC_021297. 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) Q Consensus 241 ~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~ 320 (480) ++++++++|++|+++++|.+.+.+.++.+..++....+++|.++|++++|++++.+++++|+++++.++++|++++++|+ T Consensus 241 ~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~ 320 (480) T protein:vir:78 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) T ss_pred HHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCH Confidence 99999999999999999999888877777788888999999999999999999999999999999999999999999999 Q ss_pred HHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH Q lcl|NC_021297. 
321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) Q Consensus 321 ~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~ 400 (480) +.||++++|++||+||++++.+|+.||+++++.|+.+|++++++++++.|.....++..++++|+++.++|+++.+|+++ T Consensus 321 ~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~ 400 (480) T protein:vir:78 321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) T ss_pred HHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999877778889999999999999999999999 Q ss_pred HHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 401 kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) |++++|.+++|++|+++++||+++++++|+++++++.+++.+++.+...+++++.+++.+++...+++.+|+|.|.+.+| T Consensus 401 kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCccCCCcccCCCcCCC Confidence 99999999999999999999999999999999999999999999998888888889999999889999999999999999 No 3 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=1.6e-105 Score=595.17 Aligned_cols=468 Identities=44% Similarity=0.748 Sum_probs=400.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) -.++.+++++|+++|..+++|++++.+||+|+|+|++++..++++++++++++|||++|||++++||+++||+++++++. T Consensus 11 ~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~~~~~~ 90 (486) T protein:vir:42 11 IEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLGDADEA 90 (486) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhcccceecCCCchh Confidence 45557789999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc--cCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES--GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~--~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ++.++++|++|+|+.++.++++++++|||||++||+++... .+.++.++++++||.+++++||+.. +++.+++++|. T Consensus 91 ~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~-~~~~~~~~~~~ 169 (486) T protein:vir:42 91 DEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRI-NRVSKAIRVAY 169 (486) T ss_pred HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCC-CCeEEEEEEEE Confidence 88999999999999999999999999999999999876443 3568889999999999999999875 46888888776 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) + ++++.++++++|+++.+++|...++. 
| ...+..+|+||.||||+|+|++++.++||+|+|+++|++|||+||++ T Consensus 170 ~-~~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~ 244 (486) T protein:vir:42 170 D-KEGNEIQAATLYTPMETIGWFRADGE---W-AEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARI 244 (486) T ss_pred e-cCCCeEEEEEEEcCCcEEEEEecCCc---E-EeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHH Confidence 4 45666788999999999998776543 2 23456789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhcc Q lcl|NC_021297. 239 LMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|++++++++|++|+++|+|.+++.+..+ .....++...+++|.+++++++|+|++.+++++|+++++.++++|+.++ T Consensus 245 ~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~ 324 (486) T protein:vir:42 245 LMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYT 324 (486) T ss_pred HHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhcccCCCCceEEeecccCHHHHHHHHHHHHHHHhccc Confidence 99999999999999999999988766433 3445678888999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021297. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~i~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++..|++||+++++.|+++|++++++++++.+. ....+..++++.|+++.++|+++. 
T Consensus 325 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ 404 (486) T protein:vir:42 325 GLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAK 404 (486) T ss_pred CCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHH Confidence 9999999999999999999999999999999999999999999999999998764 445677899999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHH---HHhcccccCcccccCCCCCCCCcccccCCC Q lcl|NC_021297. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMID---TLYSTTKAQADATPKPTVTDTKTETQTSPS 472 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~ 472 (480) ||+++|+++++.+++|++|+++++||+++++++|+++++++.+.... .+.+...+..+ .+.++.++ .++....++ T Consensus 405 ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~ 482 (486) T protein:vir:42 405 ADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPG-SPSPTAPP-KPQPAIESS 482 (486) T ss_pred HHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCC-CCCCCCCC-CCCcccCCC Confidence 99999999999899999999999999999999988776666444333 33333222111 11111111 111121222 Q ss_pred CCCC Q lcl|NC_021297. 473 GFNR 476 (480) Q Consensus 473 ~~~~ 476 (480) |.++ T Consensus 483 ~~~~ 486 (486) T protein:vir:42 483 GGDA 486 (486) T ss_pred CCCC Confidence 3333 No 4 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=2.1e-104 Score=589.03 Aligned_cols=467 Identities=44% Similarity=0.734 Sum_probs=399.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) -.+...++..|+++|..+++|++++.+||+|+|+|++.+..++++++++++++|||++|||++++||+++||+++++++. T Consensus 11 ~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~ 90 (485) T protein:vir:24 11 IADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEGFRLGDADEA 90 (485) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhccCceecCCCchh Confidence 34456788999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc--cCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES--GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~--~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ++.++++|++|+|+.++.++++++++||+||++||.+.... .+.++.++|+++||.+++++||+..+ ++.+++++|. T Consensus 91 ~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~-~~~~~~~~~~ 169 (485) T protein:vir:24 91 DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRIG-RPAKAIRVAY 169 (485) T ss_pred HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCcC-ceeEEEEEEE Confidence 99999999999999999999999999999999999876432 34568899999999999999998865 5666666554 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) . ++.+..+++++|+++.+++|...++. 
| ...+..+|+||.||||+|+|++++.++||+|+|+++|++|||+||++ T Consensus 170 ~-~~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~ 244 (485) T protein:vir:24 170 D-AEGNEIQAATLYTPNETFGWFRAEGE---W-VEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARI 244 (485) T ss_pred e-ecCCeEEEEEEEcCCcEEEEEecCCc---e-EeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHH Confidence 4 34556788999999999998876543 2 23456789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhcc Q lcl|NC_021297. 239 LMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|++++++++|++|+++++|.+++.+... .+...++...+++|.+++++++|++++.+++++|+++|+.++++|++++ T Consensus 245 ~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~ 324 (485) T protein:vir:24 245 LMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYT 324 (485) T ss_pred HHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccCCCCceEEeecccchHHHHHHHHHHHHHHhccc Confidence 99999999999999999999988766433 3455678889999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021297. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-REVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~i~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++.+|++||+++++.|+++|++++++++++.+ .....+..+++++|+++.++|+++. 
T Consensus 325 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ 404 (485) T protein:vir:24 325 GLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAK 404 (485) T ss_pred CCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHH Confidence 999999999999999999999999999999999999999999999999998865 4556678899999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHH---HHHhcccccCcccccCCCCCCCCcccccCCC Q lcl|NC_021297. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMI---DTLYSTTKAQADATPKPTVTDTKTETQTSPS 472 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~ 472 (480) +|+++|++++|.+++|++|+++++||+++++++|+++.+++.+... +.+.+..... ...+..++ +.+.+..++ T Consensus 405 ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~---~~~~~~~e-~~~~~~~~~ 480 (485) T protein:vir:24 405 ADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTV---PGSPNPTP-APKPQPAIE 480 (485) T ss_pred HHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCC---CCCCCCCC-CCCCccCCC Confidence 9999999999999999999999999999999998877766654333 3332222211 11111111 223333445 Q ss_pred CCCCC Q lcl|NC_021297. 473 GFNRT 477 (480) Q Consensus 473 ~~~~~ 477 (480) |.+++ T Consensus 481 ~~~~a 485 (485) T protein:vir:24 481 GGDSA 485 (485) T ss_pred CCCCC Confidence 55555 No 5 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=1.4e-103 Score=584.54 Aligned_cols=467 Identities=42% Similarity=0.716 Sum_probs=399.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) -+|+..++++|+++|..+++|++++.+||+|+|+|++++...++.++++++++|||++|||++++||+++||+++++++. T Consensus 11 ~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~ 90 (485) T protein:vir:10 11 IEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQAVEGFRFGDADEA 90 (485) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhcccceecCCCchh Confidence 67788899999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccccc--CCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG--DPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~--d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ++.++++|++|+|+.++.++++++++||+||++||+++.... +.++.++|++++|.+++++||+..++ +.+++++|. T Consensus 91 ~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~-~~~~~~~~~ 169 (485) T protein:vir:10 91 DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGR-VSKAIRVAY 169 (485) T ss_pred HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCCCc-eeEEEEEEE Confidence 999999999999999999999999999999999998764432 35688999999999999999987654 556666654 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) . ++++..+++++|+++.+++|...++. 
| ...+..||+||.||||+|+|++++.++||+|+|+++|++|||+||++ T Consensus 170 ~-~~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~ 244 (485) T protein:vir:10 170 D-AEGNEIQAATLYTPNDIFGWYRVENE---W-QEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARI 244 (485) T ss_pred e-eCCCeEEEEEEEeCCeEEEEEEcCCc---e-EEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHH Confidence 3 34555778899999999999876543 2 23456789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCcccccc--ccccchhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhcc Q lcl|NC_021297. 239 LMNLQSASQILGTPLRVISGVTTDELTN--DGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|++++++++|++|+++++|.+.+.+.. ..+...++...+++|.+++++++|+|++.+++++|+++|+.++++|++++ T Consensus 245 ~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~ 324 (485) T protein:vir:10 245 LMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYT 324 (485) T ss_pred HHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceeccCCCCceEEeecccchHHHHHHHHHHHHHHhccc Confidence 9999999999999999999998776543 33455678888999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021297. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~i~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++..|++||+++++.|+.+|++++++++++.+. ....+..+++|.|+++.|+|+++. 
T Consensus 325 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ 404 (485) T protein:vir:10 325 GLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAK 404 (485) T ss_pred CCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHH Confidence 9999999999999999999999999999999999999999999999999988764 445567899999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHH---HHHHHHhcccccCcccccCCCCCCCCcccccCCC Q lcl|NC_021297. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETE---DMIDTLYSTTKAQADATPKPTVTDTKTETQTSPS 472 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~ 472 (480) +|+++|++++|.+++|++|+++++||+++++++++++.+++.+ ...+.+.+...+.+++...+ +++++.+.-+ T Consensus 405 ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 480 (485) T protein:vir:10 405 ADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPA----PAPKPAALES 480 (485) T ss_pred HHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCcc----ccccCcCCCC Confidence 9999999999989999999999999999999888766555433 34445544433322222111 1122222223 Q ss_pred CCCCC Q lcl|NC_021297. 473 GFNRT 477 (480) Q Consensus 473 ~~~~~ 477 (480) |.+++ T Consensus 481 ~~~~~ 485 (485) T protein:vir:10 481 GGDAA 485 (485) T ss_pred CCCCC Confidence 33344 No 6 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=1.2e-103 Score=584.84 Aligned_cols=470 Identities=44% Similarity=0.767 Sum_probs=398.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) =.|++++++.|++.|..+.+|++++.+||+|+|+|++++...+++++++++++|||++|||++++||+++||+++++++. T Consensus 10 ~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~ 89 (484) T protein:vir:77 10 NVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELEGFRLGGADKA 89 (484) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhccCceecCCcchh Confidence 34567899999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccC--CCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD--PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d--~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ++.++++|++|+|+.++.++++++++||+||++||.+..+... ..+.++|+++||.+++++||+.. +++.+++++|. T Consensus 90 ~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~-~~~~~a~~~~~ 168 (484) T protein:vir:77 90 DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPRT-RQVMRAIRAIE 168 (484) T ss_pred HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCCC-CceEEEEEEEE Confidence 9999999999999999999999999999999999987654332 34568899999999999999874 57888888876 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) ..+ .+.++++++|+++.+++|...++. 
| ...+..+|+||+||||+|+|+++.+++||+|+|+++|++|+|+||++ T Consensus 169 ~~~-~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~ 243 (484) T protein:vir:77 169 DEE-GNEVIGATLYLPNNTVIWNREDGQ---W-VQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAART 243 (484) T ss_pred eec-CCcEEEEEEEecCeEEEEEecCCc---e-EeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHH Confidence 554 455778899999999998776543 2 23456789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhcc Q lcl|NC_021297. 239 LMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|+++++++++++|+++++|.+++.+..+ .+...++...+++|..+++++++++++.+++++|+++|+.++++|++++ T Consensus 244 ~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~ 323 (484) T protein:vir:77 244 LMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFEDHESKAQQFSAAELRNFVDALDALDRKAAAYT 323 (484) T ss_pred HHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhhcccCCCCceeEeecCCChHHHHHHHHHHHHHHhccc Confidence 99999999999999999999987765433 3455678888999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021297. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~i~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++.+|++||+++++.|+++|++++++++++.+. ....+..++++.|+++.++|+++. 
T Consensus 324 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ 403 (484) T protein:vir:77 324 GLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAK 403 (484) T ss_pred CCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHH Confidence 9999999999999999999999999999999999999999999999999988764 445567889999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCC Q lcl|NC_021297. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFN 475 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~ 475 (480) +|+++|++++|.+++|++|+++++||+++++++|+++++++.......+.......++.++.++.++ ..++.|+... T Consensus 404 ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~ 480 (484) T protein:vir:77 404 ADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPE---TPEPQPNPAE 480 (484) T ss_pred HHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCC---cccccCCCcc Confidence 9999999999999999999999999999999998877766655443333332222222222222222 1222222222 Q ss_pred CCCC Q lcl|NC_021297. 476 RTKT 479 (480) Q Consensus 476 ~~~~ 479 (480) +++- T Consensus 481 ~~~~ 484 (484) T protein:vir:77 481 EAAA 484 (484) T ss_pred ccCC Confidence 2222 No 7 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=2.1e-102 Score=578.06 Aligned_cols=463 Identities=44% Similarity=0.758 Sum_probs=392.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC----- Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS----- 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~----- 75 (480) ++ ++++|++|+.+|..+++|++++++||+|+|+|++.+..+++.++++++++|||++|||++++||.++||+++ T Consensus 7 ~d-~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~~ 85 (488) T protein:vir:23 7 ID-PEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSANGE 85 (488) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccceeccCCccc Confidence 55 578999999999999999999999999999999999999999999999999999999999999999999754 Q ss_pred -----CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc--cCCCceeEEEEEccceeEEEEeCCCCC Q lcl|NC_021297. 76 -----EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES--GDPAGIPLIRVESPLYMYAELDPRNTR 148 (480) Q Consensus 76 -----~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~--~d~~~~~~i~~~~p~~~~~v~d~~~~~ 148 (480) +|++..+.++++|++|+|+.++.++++++++||+||++|+++.... .++++.++|++++|.+++++||+..+ T Consensus 86 ~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~d~~~~- 164 (488) T protein:vir:23 86 EPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAEVDPRTR- 164 (488) T ss_pred ccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeEEEEecCCC- Confidence 3566778899999999999999999999999999999999875432 36788899999999999999998754 Q ss_pred eeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhH Q lcl|NC_021297. 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) ++.+++++|... +++..+++++|+++.+++|...++. 
| ...+..+|+||+||||+|.|+++..++||+|+|+++| T Consensus 165 ~~~~~~~~~~~~-~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v 239 (488) T protein:vir:23 165 KVLYAIRAIYGA-DGNEIVSATLYLPDTTMTWLRAEGE---W-EAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPEL 239 (488) T ss_pred ceEEEEEEEEec-CCCcEEEEEEEecCcEEEEEecCCc---e-EeccccccCCCCcceEEeccccccCCcCCccchhhhH Confidence 677778777554 4455778899999999998776543 2 2345678999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhhhhhcccC-CCCceEEEeccccHHHHHHHH Q lcl|NC_021297. 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 229 ~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~--~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l 305 (480) ++|+|+||+++|+++++++||++|+++|+|++++.+... .....++...+++|.++ |++++|++++.+++++|+++| T Consensus 240 ~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~~~~~~~~~~l 319 (488) T protein:vir:23 240 RSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFSAAELRNFVDAL 319 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCCCCCCceeEecCCCChHHHHHHH Confidence 999999999999999999999999999999988766543 33456788888898875 567999999999999999999 Q ss_pred HHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEe Q lcl|NC_021297. 306 EVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVW 384 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~~i~v~f 384 (480) +.++++|++++++|+++||+++.|++||+||++++..|++||+++++.|+++|++++++++++.++. 
...++.+++++| T Consensus 320 ~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~f 399 (488) T protein:vir:23 320 DALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEYYRMETVW 399 (488) T ss_pred HHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhhccceEEe Confidence 9999999999999999999999999999999999999999999999999999999999999987643 345678899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHH---HHHHhcccccCcccccCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDM---IDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 461 (480) +++.++|+++.+|+++|++++|.+++|+||+++++||+++++++++++++++.+.. ++++.+..+..+..+..+.+ T Consensus 400 ~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 478 (488) T protein:vir:23 400 RDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVG- 478 (488) T ss_pred cCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCC- Confidence 99999999999999999999999999999999999999999988877655554433 34443333322222211111 Q ss_pred CCCcccccCCCCC Q lcl|NC_021297. 462 DTKTETQTSPSGF 474 (480) Q Consensus 462 d~~~~~~~~p~~~ 474 (480) +.++++++ -. T Consensus 479 --~~~~~e~~-~a 488 (488) T protein:vir:23 479 --EPPAPEPD-AA 488 (488) T ss_pred --CCCCCCCC-CC Confidence 11111111 11 No 8 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=7.4e-100 Score=564.05 Aligned_cols=464 Identities=22% Similarity=0.295 Sum_probs=391.7 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 
1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |++. .++|++|+++|..+++|+.++.+||+|+|++++++..++++++++++++|||++|||+++++|.++||+++++++ T Consensus 18 l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~d~~~ 97 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFVWPDGDY 97 (504) T ss_pred CCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhccceeeCCCCCh Confidence 6554 568999999999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) .+..+++||+.|+|+.+..++++++++|||||++||+++ +.++.++|+++||.++|++||+.. +++.+++++|.. T Consensus 98 ~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~----d~~~~~~I~~~sP~~~~~iyD~~~-~~~~~a~~~~~~ 172 (504) T protein:vir:99 98 GSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGG----AGEPDSLIHVKSAMQATGEWNSRR-NAMDSLLSITSR 172 (504) T ss_pred hhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCC----CCCceeEEEEeccceeEEEEeCCC-CceeEEEEEEEe Confidence 888999999999999999999999999999999999753 335568899999999999999875 466667766543 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) +..+..+.+++|+++.+++|...++. 
.+..+..+|++| ||||+|.|++++.+++|+|+|+++|++|+|+||+++ T Consensus 173 -d~~g~~~~~~~y~~~~~~~~~~~~~~----~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~ 246 (504) T protein:vir:99 173 -DAEGHPTGIALYEDGVTVTADMDDDG----DWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGC 246 (504) T ss_pred -cCCCeEEEEEEEcCCcEEEEEEcCCc----eeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHH Confidence 45667788999999999998876543 344567789998 999999999999999999999989999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccc--cchhhhhhhhhcccCC---------CCceEEEeccccHHHHHHHHHHH Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDGE--NTTLDIYYGRILTLAS---------EAAKISEFKAAELRNFAEEMEVF 308 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~--~~~~~~~~~~~~~~~g---------~~~~~~~~~~~~~~~~~~~l~~~ 308 (480) ++++++++||++|+++++|++++++.++.+ ...++...+++|.+++ .++++++++++++++|+++|+.+ T Consensus 247 ~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~ 326 (504) T protein:vir:99 247 IRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQI 326 (504) T ss_pred HHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHH Confidence 999999999999999999998877654433 3467888889988753 35889999999999999999999 Q ss_pred HHHHHhccCCChHHhcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEec Q lcl|NC_021297. 309 RKEAASITGLPPQYLSSSS-ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWR 385 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~~-~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~ 385 (480) +++|+++++||+++||..+ .|++||+||++++.+|..|++++++.|+.+|++++++++++.+. 
....++.++++.|+ T Consensus 327 i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~ 406 (504) T protein:vir:99 327 AMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFR 406 (504) T ss_pred HHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEec Confidence 9999999999999999654 58899999999999999999999999999999999999988764 34566788999999 Q ss_pred CCCCCCHHHHHHHHHHHHhcccc-CCCHHHHHHhCCCChhHHHHHHH-HHHHHHHHHHHHHhcccccCcccccCCCCC-- Q lcl|NC_021297. 386 DPSTPTVAAKADAVSKLYANGQG-PIPKEQARIDLGYTATQREQMRD-WDKQETEDMIDTLYSTTKAQADATPKPTVT-- 461 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~-~~s~et~~~~lg~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 461 (480) ++.++|+++.||+++|+++++.. +++.+++++++|+++++++++++ +++++....++++.+.....+..+..++.+ T Consensus 407 d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 486 (504) T protein:vir:99 407 SPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAG 486 (504) T ss_pred CCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCC Confidence 99999999999999999998864 45678999999999999998764 445566667777765443222221111111 Q ss_pred ----CCCcccccCCCCCC Q lcl|NC_021297. 462 ----DTKTETQTSPSGFN 475 (480) Q Consensus 462 ----d~~~~~~~~p~~~~ 475 (480) ..+++..+.|+-.| T Consensus 487 e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 487 EPPANEPPAALGRPTLVG 504 (504) T ss_pred CCCCCCCCccCCCcccCC Confidence 22223344455444 No 9 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=8.9e-98 Score=552.66 Aligned_cols=445 Identities=22% Similarity=0.266 Sum_probs=389.4 Q ss_pred CCC-HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 
1 MTT-YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~-~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |++ +.+++++|+++|..+++|+.++++||+|+|++++++..+|++++++++++|||+++||+++++|.++||++++++. T Consensus 12 l~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d~~~ 91 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNLEGFVWPDGDL 91 (474) T ss_pred CChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcccceECCCCCc Confidence 654 4679999999999999999999999999999999999999999999999999999999999999999999988777 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ....++++|++|+|+....+++++|++|||||++|++++ +.++.++|+++||.+++++||+..+ ++.+++.++. T Consensus 92 ~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~----d~~~~~~i~~~sp~~~~~~~D~~~~-~~~~al~~~~- 165 (474) T protein:vir:81 92 DSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGE----DDEPEALIHVKDASEATGEWNRRRR-GLNNLLSIID- 165 (474) T ss_pred cchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCC----CCCceeEEEEeccceEEEEEeCCCC-cceeeeEEEE- Confidence 778899999999999999999999999999999999764 4566799999999999999999765 5666666554 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) .+..+..+.+++|+++.+++|...++. 
+.+..+..+|++| ||||+|.|++++.++||+|+|++.|++|+|++|+++ T Consensus 166 ~~~~g~~~~~~ly~~~~~~~~~~~~~~---~~w~~~~~~~~~g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~ 241 (474) T protein:vir:81 166 KDKEGKVLSLALYLDNETVTAQRDKAT---LKWQVDRDEHVYG-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVREL 241 (474) T ss_pred EcCCCcEEEEEEEeCCcEEEEEEcCcc---ceeeeccCCCCCC-cceEEecccccccCcCCccccchhHHHHHHHHHHHH Confidence 344556778899999999988876532 2344577889998 799999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCcccccccc--ccchhhhhhhhhcccCCC---------CceEEEeccccHHHHHHHHHHH Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDG--ENTTLDIYYGRILTLASE---------AAKISEFKAAELRNFAEEMEVF 308 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~--~~~~~~~~~~~~~~~~g~---------~~~~~~~~~~~~~~~~~~l~~~ 308 (480) +++.++++||++|+++++|++++++.++. ....|+...+++|.++++ .++|+|++++++++|+++|+.+ T Consensus 242 ~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~ 321 (474) T protein:vir:81 242 ARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDINGL 321 (474) T ss_pred HHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHHH Confidence 99999999999999999999988775443 345688888899887643 3689999999999999999999 Q ss_pred HHHHHhccCCChHHhccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccceeeeEE Q lcl|NC_021297. 309 RKEAASITGLPPQYLSSS-SENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEYTRLETV 383 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~-~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~~~i~v~ 383 (480) +++|+++++||+++||.. ++|++||+||++++..|..|++++++.|+.+|++++++++++.|+ ....++..+++. 
T Consensus 322 ~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~ 401 (474) T protein:vir:81 322 AKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAK 401 (474) T ss_pred HHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeE Confidence 999999999999999954 589999999999999999999999999999999999999999874 334557899999 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHH-HHHHHHHHHHHHHHhcccccCcccc Q lcl|NC_021297. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMR-DWDKQETEDMIDTLYSTTKAQADAT 455 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (480) |+++.++++++.||+++|+++++.++++++++++++|++++++++++ ++++++..+.++++.+....++.+. T Consensus 402 W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 402 WRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred ecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 99999999999999999999999999999999999999999999887 5556677777787765443222111 No 10 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=2.8e-96 Score=544.40 Aligned_cols=434 Identities=28% Similarity=0.411 Sum_probs=375.6 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |++. 
.++|++|+++|..+++|++++.+||+|+|++++.+...++.++++|+++|||++|||++++||.++||+.+++ T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~d~-- 78 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNGDG-- 78 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccccccCCCh-- Confidence 6654 7799999999999999999999999999999999999999999999999999999999999999999987653 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) +.++++|+.|+|+.++.+++++++++|+||++||. +++|.+++++++|.+++++||+..++...++++++. T Consensus 79 --~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~------d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~- 149 (441) T protein:vir:80 79 --YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIP------HGDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQT- 149 (441) T ss_pred --HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEe------CCCCceEEEEEccceEEEEEeCCCCceeEEEEEEEE- Confidence 46899999999999999999999999999999996 457899999999999999999887665555555543 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) .++ ...++++|+++.+++|...+... 
.+..+..+|+||+||||+|.|++++.++||+|+|+++|++|||+||+++ T Consensus 150 ~~~--~~~~~~vy~~~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~ 224 (441) T protein:vir:80 150 CDP--EVVEAELLLPDVIVQVERRGSRE---WVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTL 224 (441) T ss_pred ecC--ceEEEEEEecCeEEEEEEcCCcc---eeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHH Confidence 322 24578999999999987765432 3456778999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC----CCCceEEEeccccHHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) |++++++++|++|+++++|+.++++.... ++...+++|.++ ++.+++++++.+++++|+++|+.++++|++. T Consensus 225 s~~~~~~~~~~~~~~~i~G~~~~~~~~~~----~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~ 300 (441) T protein:vir:80 225 LGQSVNRDFYAYPQRWVTGVSADEFSQPG----WVLSMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGE 300 (441) T ss_pred HHHHHHHHhhcCceeeeecCCccccccch----hhhcccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcc Confidence 99999999999999999998876654432 455667777654 3357899999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC--cccceeeeEEecCCCCCCHH Q lcl|NC_021297. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV--TEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 316 ~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~--~~~~~~i~v~f~~~~~~~~~ 393 (480) +++|++.||+.+.|++||+||++++.+|+.||+++++.|+++|++++++++++.+... 
.....+++++|+++.|+|++ T Consensus 301 ~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~ 380 (441) T protein:vir:80 301 AAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRA 380 (441) T ss_pred cCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHH Confidence 9999999999888888999999999999999999999999999999999999987432 33457899999999999999 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccc Q lcl|NC_021297. 394 AKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT 455 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (480) +.+|+++|++++|.+++|++++++.+|+++++++++++++++ +++.+.++++....++++- T Consensus 381 e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~~~~~~e~~e-~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 381 ATADAVTKLVGAGILPADSRTVLEMLGLDDVQVEAVMRHRAE-SSDPLAVLAGAISRQTNEV 441 (441) T ss_pred HHHHHHHHHHhcCcccccHHHHHHhCCCCHHHHHHHHHHHHH-HHHHHHHHhhhhhcccccC Confidence 999999999999988899999999999999999998765544 3445666665554443333 No 11 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=1.5e-94 Score=534.99 Aligned_cols=455 Identities=23% Similarity=0.313 Sum_probs=378.2 Q ss_pred CCCH--HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhh--hccccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021297. 1 MTTY--HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~--~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |+.. .+++++|+.+|..+++|++++++||+|+|+++.+++..++.++. 
.++++|||++|||++++||+++||+.++ T Consensus 23 ~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~d 102 (501) T protein:vir:25 23 MSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRNAL 102 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhcccceecCC Confidence 4443 45788999999999999999999999999999999988888764 3578899999999999999999999875 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEE-eCCCCCeeEEEEE Q lcl|NC_021297. 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL-DPRNTRRVTRAVR 155 (480) Q Consensus 77 d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~-d~~~~~~~~~~~~ 155 (480) ++ ..+.++++|++|+|+.++.++++++++|||||++||.+ +++ ++|+++||.+++++| |+..++++.++++ T Consensus 103 ~~-~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d------e~~-~~i~~~sp~~~~~iy~D~~~~~~~~~ai~ 174 (501) T protein:vir:25 103 AK-ENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPT------DEG-PVFRTRSPRQILAVYADPSVDAWPQYALE 174 (501) T ss_pred cc-chHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecC------CCC-CeEEEeccccEEEEEecCCCCcceeEEEE Confidence 54 56789999999999999999999999999999999974 234 579999999999999 5677778999999 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCc-------c----------cccccccccccccccccceEEeecccccCcc Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGL-------N----------DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~----------~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~ 218 (480) +|....+.+...++++|+++.+++|...... . ..........+|+||.||||+|+|+++.. . 
T Consensus 175 ~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~-~ 253 (501) T protein:vir:25 175 TWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDAD-D 253 (501) T ss_pred EEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCccccC-c Confidence 9988887777888999999888777543210 0 00011234568999999999999998874 5 Q ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCCCceEEEeccccH Q lcl|NC_021297. 219 YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 219 ~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) +|+|+|++ |++|+|+||+++|+++++++|+++|+++++|++.+.+. .+++..+++|.++|++++|++++.+++ T Consensus 254 ~g~sdie~-v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~------~~~~~~~~i~~~~~~~~~~~q~~~~~~ 326 (501) T protein:vir:25 254 MIVGEVAP-LILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAE------VLKASALRVWTFEDPEVKAQAFPPASV 326 (501) T ss_pred cccchhhh-hHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccc------hhhhcccceeccCCCCceEEEecccCh Confidence 79999975 99999999999999999999999999999998765432 467888999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) ++|+++++.++++|++.+++|++.||+.++| +||+||++++.+|++|++++++.|+++|++++++++.+.|.....+.. 
T Consensus 327 ~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N-~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~ 405 (501) T protein:vir:25 327 EPYNLILEEMLQHVAMVAQISPAQVTGKMIN-VSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADS 405 (501) T ss_pred HHHHHHHHHHHHHHHhhcCCChhhhccccCC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccce Confidence 9999999999999999999999999987777 599999999999999999999999999999999999999877666778 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHHHHHHHH-HHHHHHHHhcccccCccccc Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMRDWDKQE-TEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l-g~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 456 (480) ++++.|+++.|+|+++.||+++|++++| +|.++++.++ |++++++++++++++++ ....++++.+..+.+.. + T Consensus 406 ~i~v~w~~~~~~s~~~~ada~~kl~~~g---is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~--~ 480 (501) T protein:vir:25 406 GAEVLWRDTEARSFGAVVDGITKLASAG---IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVP--P 480 (501) T ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhcC---CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCC--C Confidence 9999999999999999999999999875 6999988875 78888888887665554 33455555443332222 2 Q ss_pred CCCCCCCCcccccCCCCCCCC Q lcl|NC_021297. 457 KPTVTDTKTETQTSPSGFNRT 477 (480) Q Consensus 457 ~~~~~d~~~~~~~~p~~~~~~ 477 (480) .+.....++++.+..++.+|+ T Consensus 481 ~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 481 PPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred CCCCCCccccccccCCCCCCC Confidence 222222334555555666666 No 12 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=1.4e-92 Score=524.16 Aligned_cols=399 Identities=19% Similarity=0.215 Sum_probs=351.8 Q ss_pred HHHHHHHHHHHHHHhcccCcchhcCcccchhhh-hhccccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHHHhcCH Q lcl|NC_021297. 
15 LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQANDL 93 (480) Q Consensus 15 ~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~i~~~n~~ 93 (480) +..+++|+.++.+||+|+|++++++..+|++++ ++++++|||+++||+++++|.++||+.+++ .++++|++|+| T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~~d~-----~l~~i~~~N~l 75 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFANDDF-----NVTEIFDRNNP 75 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhccccccCCCc-----hHHHHHhhcCh Confidence 666788999999999999999999999999887 678999999999999999999999986532 48999999999 Q ss_pred HHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEe Q lcl|NC_021297. 94 DEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYL 173 (480) Q Consensus 94 ~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 173 (480) +....+++++|++|||||++|+++ ++|.|+|+++||.+++++||+.. +++.++++++.. ++++..+.+.+|+ T Consensus 76 d~~~~~~~~~al~~G~sf~~v~~~------~d~~~~i~~~sP~~~~~i~Dp~~-~~~~~al~~~~~-~~~~~~~~~~~~~ 147 (410) T protein:vir:95 76 DIFFDSAILSALIGSCSFVYISKG------EDDEVRLQVIESSNATGVIDPIT-GLLVEGYAVLAR-DDYNRPTLEAYFE 147 (410) T ss_pred HHHHHHHHHHHHHhCceeEEEecC------CCCceEEEEEcccceEEEEeCCC-CceEEEEEEEEe-cCCCeEEEEEEEe Confidence 999999999999999999999974 46789999999999999999964 678888877644 4556678899999 Q ss_pred CCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchh Q lcl|NC_021297. 174 PDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPL 253 (480) Q Consensus 174 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~ 253 (480) ++.++++..++.. 
..++|++|+||||+|+|++++.++||+|+|++.|++|+|++|+++++++++++||++|+ T Consensus 148 ~~~~~~~~~~~~~--------~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pq 219 (410) T protein:vir:95 148 PNATHFIPKDGEP--------YSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQ 219 (410) T ss_pred CCcEEEEeeCCcc--------ccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchh Confidence 9999998765432 24689999999999999999999999999998899999999999999999999999999 Q ss_pred hhhcCCCccccccccccchhhhhhhhhcccCC----CCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccccc Q lcl|NC_021297. 254 RVISGVTTDELTNDGENTTLDIYYGRILTLAS----EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN 329 (480) Q Consensus 254 ~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g----~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n 329 (480) ++++|++.+.+ ....|+...+++|.++. +.++|+|++++++++|+++|+.++++|++++++|+++||+.++| T Consensus 220 r~i~G~d~d~~----~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~N 295 (410) T protein:vir:95 220 KYILGLDPDAE----PMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDN 295 (410) T ss_pred heeeccCCCCC----cCchhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccCc Confidence 99999865433 23457788888988753 45899999999999999999999999999999999999999999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEec---CCCCCCHHHHHHHHHHHHh Q lcl|NC_021297. 330 PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWR---DPSTPTVAAKADAVSKLYA 404 (480) Q Consensus 330 ~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~~~~~i~v~f~---~~~~~~~~~~ad~~~kl~~ 404 (480) ++||+||++++.+|..|++++++.|+.+|++++++++++.+.. 
.+.++.++++.|+ ++..+++++.||+++|+++ T Consensus 296 psSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~ 375 (410) T protein:vir:95 296 PSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQ 375 (410) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999997643 3456788999998 7888999999999999999 Q ss_pred ccccCCCHHHHHHhCCCChhHHHHHH-HHHHHHHH Q lcl|NC_021297. 405 NGQGPIPKEQARIDLGYTATQREQMR-DWDKQETE 438 (480) Q Consensus 405 ~~~~~~s~et~~~~lg~~~~~~~~~~-~~~~~~~~ 438 (480) ++++++++++++++|||+++++.+++ ++++++.+ T Consensus 376 a~~g~~~~~~~~~~lg~~~~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 376 ALPGYINAETIRDLTGIAGDMSAKPVVSEGGSNGE 410 (410) T ss_pred hccCCccHHHHHHhcCCChHHHHHHHHHHHHhCCC Confidence 99999999999999999988766543 22222222 No 13 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=2.7e-91 Score=517.12 Aligned_cols=399 Identities=19% Similarity=0.214 Sum_probs=355.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhh-hhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |+ .++|.+|++++..+++|+.++.+||+|+|++++++..+|++++ ++++++|||++|||+++++|.++||+.++ T Consensus 1 ~~--~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d--- 75 (409) T protein:vir:94 1 MT--EKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDD--- 75 (409) T ss_pred CC--HHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccCCc--- Confidence 64 6899999999999999999999999999999999999998875 66899999999999999999999998542 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..++++|++|+|+....+++++|++|||||++||++ ++|.|+|+++||.+++++||+.. +++.++++++.. T Consensus 76 --~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~------~dg~~~i~~~sp~~~~~i~D~~~-~~~~~a~~~~~~ 146 (409) T protein:vir:94 76 --FTVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKG------ENDAVRLQVIEAVNATGIIDPIT-GLLTEGYAVLER 146 (409) T ss_pred --hHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecC------CCCceEEEEeccceEEEEEecCC-CceeeeEEEEEe Confidence 358999999999999999999999999999999974 47889999999999999999965 468888877643 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) +..+..+...+|++++++++...++. 
...++|++|+||||+|.|++++.++||+|+|++.|++|+|++|+++ T Consensus 147 -d~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~ 218 (409) T protein:vir:94 147 -DENNNVVLEAHFLPDRTDYYYRDSRN-------NISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTL 218 (409) T ss_pred -cCCCceEEEEEEecCcEEEEEecCce-------eEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHH Confidence 34455677889999999988776543 2345899999999999999999999999999988999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC----CCCceEEEeccccHHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) +++++.++||++|+++++|++.+. .....|+...+++|.++ |++++|+|++++++++|+++++.++++|+++ T Consensus 219 ~~~~~~~e~~a~pqr~i~G~d~d~----~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~ 294 (409) T protein:vir:94 219 ERADVTAEFYSFPQKYVTGLSDDA----EPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGE 294 (409) T ss_pred HHHHHHHHHhcChhheeEecCCCC----cccchhhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhh Confidence 999999999999999999986433 23346888888998874 5578999999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEecC---CCCC Q lcl|NC_021297. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWRD---PSTP 390 (480) Q Consensus 316 ~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~~~~~i~v~f~~---~~~~ 390 (480) +++|+++||+.++|++||+||++++.+|..|++++++.|+++|++++++++++.++. 
...++.++++.|++ +.++ T Consensus 295 t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~ 374 (409) T protein:vir:94 295 TGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADAS 374 (409) T ss_pred cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchH Confidence 999999999999999999999999999999999999999999999999999997753 34567889999994 4556 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH Q lcl|NC_021297. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ 425 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~ 425 (480) ++++.||+++|++++|+++.+++++++++||++++ T Consensus 375 ~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 375 MLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 67889999999999999999999999999999887 No 14 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=1.4e-89 Score=507.73 Aligned_cols=460 Identities=16% Similarity=0.235 Sum_probs=360.7 Q ss_pred CCCHH--H-HHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchh---hhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 1 MTTYH--E-HVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPE---LAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~--~-~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~---~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) |++++ + ++.+|+.+|..+++|++++++||+|+|+|++.+...+.. .-.+++++|||++|||+++++|+++||+. 
T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~ 88 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRK 88 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccC Confidence 77653 3 345789999999999999999999999998876544332 22345688999999999999999999997 Q ss_pred CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 75 SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 75 ~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) +++ +..+.++++|++|+|+.++.++++++++||+||++||++... .|++|.+++++++|++++++||+...+... + T Consensus 89 ~d~-~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~-~d~~g~~~i~~~~p~~~~~iydd~~~~~~~--~ 164 (479) T protein:vir:99 89 TGT-NENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISP-LDGTTVARIKCIDPRDAFAIWEDPYWDEWP--K 164 (479) T ss_pred CCc-hhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-cCCCCceEEEEechhheEEEecCCccccee--e Confidence 754 456789999999999999999999999999999999975432 467899999999999999999765543322 2 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHH Q lcl|NC_021297. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~ 234 (480) ++.+.+.. ..+.+|+...+++|...++. | ...+..+|+||.||||+|.|+++. 
+++|+|+|+ .|++|||+ T Consensus 165 -~~~~~~~~---~~~~~~~~~~~~~~~~~~~~---~-~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e-~v~~liDa 234 (479) T protein:vir:99 165 -YLLERQPN---GQYWWWTEEDYSIFEFKQGK---F-IYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVE-PLVTVAKA 234 (479) T ss_pred -EEEeecCc---eeEEEEecceEEEEEecCCc---e-eeccccccCCCCcceEEeecCCCc-CcCCcchhH-HHHHHHHH Confidence 22222222 24567888888777765443 2 334678999999999999999987 568999997 59999999 Q ss_pred HHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHh Q lcl|NC_021297. 235 ASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAAS 314 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~ 314 (480) ||+++|++++++++|++|+++++|....+.... ....++...++++..+|+++++++++.+++++|+++++.++++|++ T Consensus 235 ~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~ 313 (479) T protein:vir:99 235 IDKTGLDILLVQHHQSFQIRWATGLMLPEGANA-DQEKMRFAQESMLISQNEKASFGAIPAAPLDGLLNAYKESLLEFLA 313 (479) T ss_pred HHHHHHHHHHHHHHhhchhhhhcCCCccccccc-chhccccccccceeecCCCceEEEecccchHHHHHHHHHHHHHHhc Confidence 999999999999999999999999876543322 2334666778888889999999999999999999999999999999 Q ss_pred ccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021297. 315 ITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 315 ~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~ 394 (480) ++++|++.||. 
++| +||+||++++.+|+.||+++++.|+.+|++++++++++.|.....+...++++|+++.++|+++ T Consensus 314 ~t~~p~~~~g~-~~n-~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~ 391 (479) T protein:vir:99 314 LAQLPPHIAGQ-IVN-VAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQ 391 (479) T ss_pred cCCCCHHHccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHH Confidence 99999999985 334 7999999999999999999999999999999999999998776667788999999999999999 Q ss_pred HHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHHHHHHHHHH--HHHHHHhcccccC-cccccCCCCCC-CCccccc Q lcl|NC_021297. 395 KADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMRDWDKQETE--DMIDTLYSTTKAQ-ADATPKPTVTD-TKTETQT 469 (480) Q Consensus 395 ~ad~~~kl~~~~~~~~s~et~~~~l-g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~d-~~~~~~~ 469 (480) .+|+++||+++| ++|.||+++++ |++++++++++++++++.+ ...+.+.+..... .+.++.+.... +++.+.+ T Consensus 392 ~ad~~~kl~~ag--~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (479) T protein:vir:99 392 FADAWAKMVESL--KIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTG 469 (479) T ss_pred HHHHHHHHHhcC--CCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCc Confidence 999999999875 68999999998 5677777877655444422 2334443322211 11111211111 2223334 Q ss_pred CCCCCCCCCC Q lcl|NC_021297. 470 SPSGFNRTKT 479 (480) Q Consensus 470 ~p~~~~~~~~ 479 (480) .|.+.|.+-- T Consensus 470 ~~~~~~~~~~ 479 (479) T protein:vir:99 470 EPASLNKSGA 479 (479) T ss_pred chhccCCCCC Confidence 4544443333 No 15 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=1.2e-90 Score=513.65 Aligned_cols=399 Identities=19% Similarity=0.209 Sum_probs=355.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhh-hhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |+ .++|++|.+++..+++|+.++.+||+|+|++++++..+|++++ ++++++|||++|||+++++|.++||+.++ T Consensus 1 ~~--~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d--- 75 (409) T protein:vir:16 1 MT--EKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDD--- 75 (409) T ss_pred CC--HHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccCcc--- Confidence 64 6899999999999999999999999999999999999999885 67899999999999999999999998542 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..++++|++|+|+....+++++|++|||||++|+++ ++|.++|+++||.+++++||+.. +++.+++++|.. T Consensus 76 --~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~------~dg~~~i~~~sP~~~~~i~D~~~-~~~~~a~~~~~~ 146 (409) T protein:vir:16 76 --FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISKG------ENDAVRLQVIEATNATGIIDPIT-GLLTEGYAVLER 146 (409) T ss_pred --hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecC------CCCceEEEEEcccceEEEeeccc-ccceeeeEEEEe Confidence 358999999999999999999999999999999974 46789999999999999999865 466777776643 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) +..+..+..++|+++.++++...++.. 
...+|++|+||||+|+|++++.++||+|+|++.|++|+|++|+++ T Consensus 147 -d~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~ 218 (409) T protein:vir:16 147 -DENNNVVLEAHFLPDRTDYYYRDSRNN-------ISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTL 218 (409) T ss_pred -cCCCceEEEEEEecCcEEEEEecCccc-------cceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHH Confidence 344556778899999998887765432 345899999999999999999999999999988999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC----CCCceEEEeccccHHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) ++++++++||++|+++++|++.+. .....|+...+++|.++ |+.++|+|++++++++|+++++.++++|+++ T Consensus 219 ~~~~~~~e~~a~pqr~i~G~d~d~----~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~ 294 (409) T protein:vir:16 219 ERADVTAEFYSFPQKYVTGLSDDA----EPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGE 294 (409) T ss_pred HHHHHHHHHhcChhheeEecCCCC----CccchhhhhhhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhh Confidence 999999999999999999986533 23345788888998874 4568999999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCC---CC Q lcl|NC_021297. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPS---TP 390 (480) Q Consensus 316 ~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~---~~ 390 (480) +++|+++||++++|++||+||++++.+|..|++++++.|+.+|++++++++++.++ .......++++.|+++. 
.+ T Consensus 295 s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~ 374 (409) T protein:vir:16 295 TGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADAS 374 (409) T ss_pred cCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchh Confidence 99999999999999999999999999999999999999999999999999999875 33445678999999655 66 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH Q lcl|NC_021297. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ 425 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~ 425 (480) ++++.||+++|++++++++..++++++++||++++ T Consensus 375 s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 375 MLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred hHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 68999999999999999999999999999999887 No 16 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=4.2e-89 Score=505.11 Aligned_cols=412 Identities=18% Similarity=0.229 Sum_probs=348.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhh-hccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY-LDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~-~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |+ ...|++|++++..+++|++++.+||+|+|++++++..+|+.++. 
+++++|||+++||++++++.++||+.++ T Consensus 1 m~--~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d--- 75 (422) T protein:vir:97 1 MN--YMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTNDD--- 75 (422) T ss_pred CC--hHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceeeCCc--- Confidence 64 57899999999999999999999999999999999999998864 4788899999999999999999998653 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..++++|++|+|+....+++++|++|||||++|++++ .+|.|+|+++||.+++++||+..+ ++.+++.+|. T Consensus 76 --~~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~-----~~~~p~i~~~sp~~~~~i~D~~~~-~~~~a~~~~~- 146 (422) T protein:vir:97 76 --FNAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGA-----EDGLPKMQVIEASKATGILDPTTF-LLTEGYAILE- 146 (422) T ss_pred --hhHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCC-----CCCeeEEEEechhhEEEEEeCCCC-cceeeEEEEE- Confidence 2589999999999999999999999999999999752 357899999999999999999765 5666666654 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) .+..+..+.+.+|++..++++... +. 
....+|++|+||||+|+|++++.++||+|+|++.|++|+|+||+++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~-~~-------~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~ 218 (422) T protein:vir:97 147 SDSNGNPTLEAYFTDKDIWYYPKK-GK-------PYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTL 218 (422) T ss_pred ecCCCcEEEEEEEcCceEEEEcCC-Cc-------cccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHH Confidence 334455555566666655555432 22 2346899999999999999999999999999988999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC----CCCceEEEeccccHHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) +++.++++||++|+++++|++.+... ...++...+++|.++ |+.++|+|++++++++|+++|+.++++|+++ T Consensus 219 ~~~~~~~e~~a~pqr~i~G~d~d~~~----~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~ 294 (422) T protein:vir:97 219 ERAEVTAEFYSFPQKYVLGMDPDAKP----MEKWRATVSTLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLFAGG 294 (422) T ss_pred HHHHHHHHHhcchhhhhcccCccccc----CchhhhhhhhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHHhcc Confidence 99999999999999999998654432 335777788888874 3468999999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEecCCCCCC-- Q lcl|NC_021297. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWRDPSTPT-- 391 (480) Q Consensus 316 ~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~~~~~i~v~f~~~~~~~-- 391 (480) +++|+++||+.++|++||+||++++.+|..|++++++.|+.+|++++++++++.++. 
....+.++++.|++..+.+ T Consensus 295 s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~ 374 (422) T protein:vir:97 295 SGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADAN 374 (422) T ss_pred cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChH Confidence 999999999999999999999999999999999999999999999999999998753 3445678999999655555 Q ss_pred -HHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021297. 392 -VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 392 -~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~ 439 (480) +++.||+++|+++++++++++++++++|||++.+++. .+..+++++. T Consensus 375 s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~~-~~~~~~~~d~ 422 (422) T protein:vir:97 375 MLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADKPI-PAITEVTTDG 422 (422) T ss_pred HHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhHHH-HHHHhhhccC Confidence 8889999999999999999999999999997664322 2222222111 No 17 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=2.2e-87 Score=495.66 Aligned_cols=432 Identities=21% Similarity=0.229 Sum_probs=348.6 Q ss_pred CC--CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhh--hccccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021297. 1 MT--TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~--~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) || |+++++++|+.+|..+++|++++++||+|+|+|++++...++.+++ +|+++|||++|||+.++|++++||+++. 
T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 55 5688999999999999999999999999999999998888887764 6899999999999999999999998754 Q ss_pred --CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 77 --DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 77 --d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) |.+....++++|++|+|+.++.++++++++|||||++||. +++|.+++++++|.+++++||+..++++.+++ T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~------d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i 154 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR------RDDGTATITADSPETMVVSVDPLQPWRIRAAM 154 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEee------CCCCceEEEEEccceeEEEEcCCCCcceEEEE Confidence 4556778999999999999999999999999999999996 35788999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEec-------C-----CcccccccccccccccccccceEEeecccccCccCCcc Q lcl|NC_021297. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRN-------G-----GLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~-----~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s 222 (480) ++|.+.++. ..+.++|.++.+.++... . 
.....+ ......+|+++.|||++|.| ++|.| T Consensus 155 ~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~pvv~~~N------~~g~g 225 (456) T protein:vir:10 155 RWWRDLDAE--SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSW-VPVGDAVVTGSPPPVVVYQN------PDGMG 225 (456) T ss_pred EEEEecCCc--eeEEEEEeccceeEEEEEEEEeecccceeeeecCCce-eeccccCCCCCceeEEEecC------CCCCc Confidence 999765433 345556666554433211 0 011112 22344689999999988755 36889 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhhhhhhhcccCCCCceEEEeccc Q lcl|NC_021297. 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDIYYGRILTLASEAAKISEFKAA 296 (480) Q Consensus 223 ~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~~~~~~~~~~ 296 (480) ++++ +++|||+||+++|+++++++++++|+++++|...+.+..+..+ ..++...+++|..+ +++++++++.+ T Consensus 226 d~e~-vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~~~ 303 (456) T protein:vir:10 226 EVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQAN 303 (456) T ss_pred hhhh-hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCC-CCcceEEeccc Confidence 9975 8999999999999999999999999999999876554322222 23566777788765 66899999999 Q ss_pred cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 297 ~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) ++++|+++++.++++|++.+++|++.||+.++| +||+||++++.+|++||+++++.|+++|++++++++++.|.. 
+ T Consensus 304 ~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~ 379 (456) T protein:vir:10 304 DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---V 379 (456) T ss_pred ChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---c Confidence 999999999999999999999999999987777 599999999999999999999999999999999999988743 3 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccccc Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ...+++.|+++.|+|+++.||+++|++++| ++|.+++++++||+++++++++..+.+++++. +.+ ++.+.| T Consensus 380 ~~~~~v~w~~~~~~~~~~~ada~~kl~~~g--i~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~---~~~----~~~~~~ 450 (456) T protein:vir:10 380 EDTVDVSFESPDRVTLGEKYSAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITL---FAG----NPVQRP 450 (456) T ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHHcC--CChHHHHHhhCCCCHHHHHHHHHHHHHHHHHH---Hhh----hhhhcC Confidence 457999999999999999999999998875 68999999999999988765543222222211 111 111111 Q ss_pred CCCCCC Q lcl|NC_021297. 457 KPTVTD 462 (480) Q Consensus 457 ~~~~~d 462 (480) +++++- T Consensus 451 ~~~~~~ 456 (456) T protein:vir:10 451 QEDGSR 456 (456) T ss_pred CCCCCC Confidence 121111 No 18 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=2.2e-87 Score=495.66 Aligned_cols=432 Identities=21% Similarity=0.229 Sum_probs=348.6 Q ss_pred CC--CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhh--hccccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021297. 
1 MT--TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~--~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) || |+++++++|+.+|..+++|++++++||+|+|+|++++...++.+++ +|+++|||++|||+.++|++++||+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 55 5688999999999999999999999999999999998888887764 6899999999999999999999998754 Q ss_pred --CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 77 --DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 77 --d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) |.+....++++|++|+|+.++.++++++++|||||++||. +++|.+++++++|.+++++||+..++++.+++ T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~------d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i 154 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR------RDDGTATITADSPETMVVSVDPLQPWRIRAAM 154 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEee------CCCCceEEEEEccceeEEEEcCCCCcceEEEE Confidence 4556778999999999999999999999999999999996 35788999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEec-------C-----CcccccccccccccccccccceEEeecccccCccCCcc Q lcl|NC_021297. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRN-------G-----GLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~-----~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s 222 (480) ++|.+.++. ..+.++|.++.+.++... . 
.....+ ......+|+++.|||++|.| ++|.| T Consensus 155 ~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~pvv~~~N------~~g~g 225 (456) T protein:vir:10 155 RWWRDLDAE--SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSW-VPVGDAVVTGSPPPVVVYQN------PDGMG 225 (456) T ss_pred EEEEecCCc--eeEEEEEeccceeEEEEEEEEeecccceeeeecCCce-eeccccCCCCCceeEEEecC------CCCCc Confidence 999765433 345556666554433211 0 011112 22344689999999988755 36889 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhhhhhhhcccCCCCceEEEeccc Q lcl|NC_021297. 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDIYYGRILTLASEAAKISEFKAA 296 (480) Q Consensus 223 ~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~~~~~~~~~~ 296 (480) ++++ +++|||+||+++|+++++++++++|+++++|...+.+..+..+ ..++...+++|..+ +++++++++.+ T Consensus 226 d~e~-vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~~~ 303 (456) T protein:vir:10 226 EVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQAN 303 (456) T ss_pred hhhh-hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCC-CCcceEEeccc Confidence 9975 8999999999999999999999999999999876554322222 23566777788765 66899999999 Q ss_pred cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 297 ~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) ++++|+++++.++++|++.+++|++.||+.++| +||+||++++.+|++||+++++.|+++|++++++++++.|.. 
+ T Consensus 304 ~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~ 379 (456) T protein:vir:10 304 DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---V 379 (456) T ss_pred ChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---c Confidence 999999999999999999999999999987777 599999999999999999999999999999999999988743 3 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccccc Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ...+++.|+++.|+|+++.||+++|++++| ++|.+++++++||+++++++++..+.+++++. +.+ ++.+.| T Consensus 380 ~~~~~v~w~~~~~~~~~~~ada~~kl~~~g--i~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~---~~~----~~~~~~ 450 (456) T protein:vir:10 380 EDTVDVSFESPDRVTLGEKYSAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITL---FAG----NPVQRP 450 (456) T ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHHcC--CChHHHHHhhCCCCHHHHHHHHHHHHHHHHHH---Hhh----hhhhcC Confidence 457999999999999999999999998875 68999999999999988765543222222211 111 111111 Q ss_pred CCCCCC Q lcl|NC_021297. 457 KPTVTD 462 (480) Q Consensus 457 ~~~~~d 462 (480) +++++- T Consensus 451 ~~~~~~ 456 (456) T protein:vir:10 451 QEDGSR 456 (456) T ss_pred CCCCCC Confidence 121111 No 19 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=9.2e-87 Score=492.26 Aligned_cols=431 Identities=20% Similarity=0.219 Sum_probs=351.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhh--hccccchHHHHHHHHHhhhccCccccC--C Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRIS--E 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~--~ 76 (480) =.|+++++++|+++|..+++|++++++||+|+|+|++++...++.++. +++++|||++|||++++||+++||+++ + T Consensus 3 ~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~ 82 (456) T protein:vir:79 3 ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCCCC Confidence 446688999999999999999999999999999999998888887763 468899999999999999999999875 3 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEE Q lcl|NC_021297. 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 77 d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) |.+..+.++++|++|+|+.++.++++++++||+||+++|++ ++|.+++++++|.+++++||+...+++.+++++ T Consensus 83 d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~------edg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~ 156 (456) T protein:vir:79 83 DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR------DDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) T ss_pred CccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeC------CCCceEEEEeccceeEEEEcCCCCCceEEEEEE Confidence 45667889999999999999999999999999999999964 578899999999999999999999999999999 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCC------------cccccccccccccccccccceEEeecccccCccCCcccc Q lcl|NC_021297. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGG------------LNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i 224 (480) |.+.++ ...+.++|+++.+++|..... 
....+ ......+|+++.|||++|.| ++|.|+| T Consensus 157 ~~~~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~pvv~~~N------~~~~gd~ 227 (456) T protein:vir:79 157 WRDLDA--ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSW-VPVGDAVVTGSPPPVVVYQN------PDGMGEV 227 (456) T ss_pred EEecCC--ceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCce-eecccccCCCCceeEEEecC------CCCCchh Confidence 875543 355677888887766543211 11111 22345689999999999865 4688899 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhhhhhhhcccCCCCceEEEeccccH Q lcl|NC_021297. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDIYYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 225 ~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) ++ +++|||+||+++|+++++++++++|+++++|.....+..+..+ ..++...+++|..+ +++++++++.+++ T Consensus 228 e~-v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~-~~~~~~q~~~~~~ 305 (456) T protein:vir:79 228 EP-HIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQTNDF 305 (456) T ss_pred hh-hHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCC-CCcceeeecccCh Confidence 75 8999999999999999999999999999999876554333222 23555677777765 5689999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) ++|+++++.++++|++.+++|++.||+.++| +||+||++++.+|++||+++++.|+++|++++++++++.|.. +.. 
T Consensus 306 ~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~---~~~ 381 (456) T protein:vir:79 306 TPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---VED 381 (456) T ss_pred HHHHHHHHHHHHHHHhhcCCChhHhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---ccc Confidence 9999999999999999999999999987777 599999999999999999999999999999999999988743 345 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHH-HHHhcccccCcccccC Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMI-DTLYSTTKAQADATPK 457 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 457 (480) +++++|+++.++|+++.||+++|++++| ++|.+++++.+|++++++++++..+.+++.+.. ..+.. .++ T Consensus 382 ~i~v~w~~~~~~s~~~~ada~~kl~~~G--~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~--------~~~ 451 (456) T protein:vir:79 382 TVDVSFESPDRVTLGEKYSAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQ--------RPQ 451 (456) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHHhcC--CChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhHhh--------cCC Confidence 7999999999999999999999998875 678999999999999887664322222222211 11111 111 Q ss_pred CCCCC Q lcl|NC_021297. 458 PTVTD 462 (480) Q Consensus 458 ~~~~d 462 (480) +++.- T Consensus 452 ~~~~~ 456 (456) T protein:vir:79 452 EDGSR 456 (456) T ss_pred CCCCC Confidence 11111 No 20 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=2.1e-83 Score=473.89 Aligned_cols=416 Identities=20% Similarity=0.256 Sum_probs=329.4 Q ss_pred hcCcccchhhhhh--ccccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEE Q lcl|NC_021297. 
37 TIGIGAPPELAYL--DVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITV 114 (480) Q Consensus 37 ~~~~~~~~~~~~~--~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v 114 (480) ++++..++.++.. ++++|||++|||+++++|.++||+.++ .+.++.++++|++|+|+.++.++++++++|||||++| T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~d-~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v 79 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTGPD-GEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYMLV 79 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceecCC-CchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEE Confidence 7778887877643 468899999999999999999999764 4578889999999999999999999999999999999 Q ss_pred ecCcccc-cCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCc--cc--- Q lcl|NC_021297. 115 SHPDVES-GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGL--ND--- 188 (480) Q Consensus 115 ~~~~~~~-~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~--- 188 (480) |.+.... .+.++.++|+++||++++++||+..+ ++.+++++|....++.. +..+|+.+..++|...... .. T Consensus 80 ~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~-~~~~ai~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (434) T protein:vir:98 80 GAHPTRTEDNGRPSPLITMEHPSECIVEYDPETG-EPLVGLKVWHNDIDGFG--YARVFFDDTSFPYRTRERTGARLPWG 156 (434) T ss_pred ecCCCcccccCCceeEEEEeccceeEEEEeCCCC-ceEEEEEEEEeccCCce--EEEEEEeCcEEEEEEeeccccccccc Confidence 9764332 23456789999999999999998765 58888988876554432 3445555554444332111 11 Q ss_pred --cc---ccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccc Q lcl|NC_021297. 189 --QW---VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDE 263 (480) Q Consensus 189 --~~---~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~ 263 (480) .+ ....+..+|+||+||||+|+||++..+ +|+|+|+ .|++|||+||+++|++++++++|++|+++|+|.+++. 
T Consensus 157 ~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e-~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~ 234 (434) T protein:vir:98 157 PDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFA-GVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAK 234 (434) T ss_pred cccceecccccccccCCCCccceEEeccCCCcCc-CCcchhh-hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccc Confidence 11 112345689999999999999998766 6999996 5999999999999999999999999999999998877 Q ss_pred cccccccc-----hhhhhhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHH Q lcl|NC_021297. 264 LTNDGENT-----TLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 264 ~~~~~~~~-----~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~ 338 (480) +.++..+. .+....+++|..++++++++|++.+++++|+++|+.++++|++++++|++.||+. .+++||+||++ T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~-~~n~Sg~Al~~ 313 (434) T protein:vir:98 235 RTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATD-LVNISADTIGA 313 (434) T ss_pred ccccccccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhccc-cCChHHHHHHH Confidence 76554332 3456678899999999999999999999999999999999999999999999964 45589999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh Q lcl|NC_021297. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID 418 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~ 418 (480) ++.+|+.|++++++.|+++|++++++++++.|.. 
.+..++++.|+++.|+|+++.||+++|++++| +|.++++++ T Consensus 314 ~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~--~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g---~~~e~~~~~ 388 (434) T protein:vir:98 314 LDILHVAKVREHIASFSEGLESVLALAAAQAGVP--EDYTEAEVRWANPAHVTMAVKADAATKLKSIG---YPLDVIAEE 388 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--hhheeeeEEecCCCCCCHHHHHHHHHHHHhcC---CcHHHHHHh Confidence 9999999999999999999999999999987743 45678999999999999999999999999866 689999999 Q ss_pred CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 419 LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 419 lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) +|++++++++++++++++.. ..+..+...++ .+.+..+++ ++.+.| T Consensus 389 lg~~~~e~~r~~~e~~~~~~--~~~~~~~~~~~---~~~g~~~~~----~~~~dg 434 (434) T protein:vir:98 389 LDESPARVRRIVAGAASQAL--LAASLLPAPGA---PSAGNVPDS----GGAVDG 434 (434) T ss_pred CCCCHHHHHHHHHHHHHHHH--HHHhhhccCCC---CCCCCCCcc----cCCCCC Confidence 99999998888765544432 22222211111 111112221 122222 No 21 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=1.6e-81 Score=463.61 Aligned_cols=461 Identities=13% Similarity=0.117 Sum_probs=350.5 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc-CCCc Q lcl|NC_021297. 1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDS 78 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~ 78 (480) ++.. .++|++++++|..+.+|++++.+||+|+|+|..... 
.+...+++|+++||+++||++.++||+++++++ ++++ T Consensus 13 ~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~~~ 91 (499) T protein:vir:10 13 VNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEF-DNATVEAANVMVNHAKYITDMNVGFMTGNPVKYVAEKG 91 (499) T ss_pred hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCc-CcCCCCcceeecchHHHHHHHHhhhhcccCceeecCCh Confidence 2222 567999999999999999999999999999976432 344566899999999999999999999999875 4566 Q ss_pred hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccccc-----------CCCceeEEEEEccceeEEEEeCCCC Q lcl|NC_021297. 79 EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG-----------DPAGIPLIRVESPLYMYAELDPRNT 147 (480) Q Consensus 79 ~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~-----------d~~~~~~i~~~~p~~~~~v~d~~~~ 147 (480) +..+.++++|+.|+|+.++.++++.++++|+||++||.+..+.. .....++++.++|++++++|++... T Consensus 92 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~ 171 (499) T protein:vir:10 92 KNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVE 171 (499) T ss_pred hHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEecCCCC Confidence 77889999999999999999999999999999999997643211 1233578999999999999999888 Q ss_pred CeeEEEEEEEEEec--CCceEEEEEEEeCCcEEEEEecCCccc-ccccccccccccccccceEEeecccccCccCCcccc Q lcl|NC_021297. 148 RRVTRAVRLYTTRD--DVAVPDRATLYLPDETVPLRRNGGLND-QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 148 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i 224 (480) +++.+++++|...+ ..+..+++++||++++++|...+.... .........+|+||.||||+|.|+. 
+|.|++ T Consensus 172 ~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~~d~ 246 (499) T protein:vir:10 172 HDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRNNE-----ERQGDF 246 (499) T ss_pred cceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEecCCC-----CCCCch Confidence 89999999987654 345667899999999999987654321 2234456778999999999999964 588899 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc---CCCCceEEEec--cccHH Q lcl|NC_021297. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL---ASEAAKISEFK--AAELR 299 (480) Q Consensus 225 ~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~---~g~~~~~~~~~--~~~~~ 299 (480) ++ +++|||+||+++|++++.++++++|+++++|+..+...+. .. ....+.++.. ++.++++.+++ ....+ T Consensus 247 e~-v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~--~~--~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~ 321 (499) T protein:vir:10 247 EQ-LISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDD--IQ--RLKRGAIEAPPREEGADIEWLTKSFDETQVN 321 (499) T ss_pred Hh-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccch--hh--hhhhcceeccCCCCCCcceEEeccCCHHHHH Confidence 64 9999999999999999999999999999999876543322 11 2233344443 33456665543 44567 Q ss_pred HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccce Q lcl|NC_021297. 300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYT 378 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~ 378 (480) +++++|+..|+.++.++++++..|+++ +||+||++++++|++||.++++.|+++|++++++++++.+. +...++. 
T Consensus 322 ~~~~~l~~~I~~~s~~p~~~~~~~~gn----~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~ 397 (499) T protein:vir:10 322 LLSQSIENDIHKISYVPNMNDEKFMGN----VSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDAS 397 (499) T ss_pred HHHHHHHHHHHHHhCcccCCchhhccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc Confidence 778888888888877777776666532 69999999999999999999999999999999999988753 4455677 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCC Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) +++++|+++.|.|.++.+++++|+. +++|+||+++++|+++++.++++++++|+++.............++..+.+ T Consensus 398 ~i~i~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (499) T protein:vir:10 398 GCKISLVANIPSNLSDVVNNVKNAD----GIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELE 473 (499) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Confidence 8999999999999999999999983 468999999999999887777776666655443322222112222222222 Q ss_pred CCCCCCc--ccccCCCCCCCCCCC Q lcl|NC_021297. 459 TVTDTKT--ETQTSPSGFNRTKTR 480 (480) Q Consensus 459 ~~~d~~~--~~~~~p~~~~~~~~~ 480 (480) +..++.. +.++..+....+||| T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~ 497 (499) T protein:vir:10 474 DKQDDSSENDKEAGSNHNQSHRTR 497 (499) T ss_pred CCCcccCCCCCCCccccccCCCCC Confidence 2221111 111122344457778 No 22 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=1.1e-81 Score=464.52 Aligned_cols=440 Identities=16% Similarity=0.145 Sum_probs=340.9 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccC-cchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCC- Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d- 77 (480) +...-++|.++|..|.. +.+|++++.+||+|+| .|.......++...++|+++||+++||++.++|++++++++..+ T Consensus 37 ~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d 116 (501) T protein:vir:27 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 116 (501) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCC Confidence 55556789999999975 4789999999999985 55444444455567889999999999999999999999875432 Q ss_pred ----chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 78 ----SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 78 ----~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) +...+.++++|+.|+|+.++.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++.++ T Consensus 117 ~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~------ded~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 190 (501) T protein:vir:27 117 NDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYR------NEYDETRIKRLNPLETFVIYDNSLEDNSIAA 190 (501) T ss_pred ccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEe------CCCCceEEEEEccceeEEEecCCCCCceEEE Confidence 234567899999999999999999999999999999996 3578899999999999999999888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 
154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) +++|......+...++++||++.+++|...++ .+..+..+|+||.||||+|+|++ +|+|++++ |++||| T Consensus 191 ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~-----~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~-v~~liD 259 (501) T protein:vir:27 191 VRYYNRGTLQNAKDVVEIYTNEHIYTLDASDD-----FNEISVTTHAFGTVPITEFLNNV-----DGIGDYET-ELYLID 259 (501) T ss_pred EEEEEeeecCCcEEEEEEEeCCeEEEEEeCCc-----eeeccccccCCCcccEEEecCCC-----CCCCchhh-hHHHHH Confidence 99998887777788999999999998876543 24566789999999999999874 68999975 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc----------CCCCceEE--EeccccHHHH Q lcl|NC_021297. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL----------ASEAAKIS--EFKAAELRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~----------~g~~~~~~--~~~~~~~~~~ 301 (480) +||+++|++++.++++++|+++++|...+...+... .+...+++.+ ++.++++. +++....+++ T Consensus 260 a~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 335 (501) T protein:vir:27 260 LYDSAESDTANHMSDMADAILAIYGDLALPKGMQAS----DMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAY 335 (501) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCccCCcccchh----hhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHH Confidence 999999999999999999999999976654332211 1111112111 12245554 3445567888 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcccce Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---EVTEEYT 378 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~~~ 378 (480) +++|+..|+.++.++++++..|++ | +||+||++++.+|.+||.++++.|+++|++++++++++.+. ....+.. 
T Consensus 336 ~~~l~~~I~~~s~~p~~~~~~~~~---n-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~ 411 (501) T protein:vir:27 336 KTRLNRDIHIFTNIPDMSDTNFSG---N-TSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDES 411 (501) T ss_pred HHHHHHHHHHHhCCcccCcccccc---C-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 888888888888887776666543 3 69999999999999999999999999999999999987652 3345567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccc---cCcccc Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK---AQADAT 455 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 455 (480) +++++|+++.|.|.++.+++++|++ |++|+||+++++|+++|+.++++++++|+++.......+... +..... T Consensus 412 ~i~v~f~~~~p~n~~e~ad~~~kl~----g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~ 487 (501) T protein:vir:27 412 LLKITFTPNLPKSLNEQVSILTGLG----GQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDE 487 (501) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCC Confidence 8999999999999999999999974 468999999999999988888887777765433222222111 111112 Q ss_pred cCCCCCCCCccccc Q lcl|NC_021297. 456 PKPTVTDTKTETQT 469 (480) Q Consensus 456 ~~~~~~d~~~~~~~ 469 (480) +.+..+++..+..+ T Consensus 488 ~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 488 VKETHTDDFERAYE 501 (501) T ss_pred CCCCccccccccCC Confidence 22222222212222 No 23 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=7.3e-82 Score=465.41 Aligned_cols=441 Identities=15% Similarity=0.126 Sum_probs=338.9 Q ss_pred CCCHHHHHHHHHHHHHHH-HHHHHHHHHHhcccC-cchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCC-- Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARD-LPNLLEAEAYRNGTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE-- 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~-- 76 (480) +.+..++|.++|..|..+ .+||+++.+||+|+| .|.......+...+++|+++||+++||++.++|++++++++.. T Consensus 37 ~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~ 116 (501) T protein:vir:96 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 116 (501) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCC Confidence 666678899999999865 679999999999985 5544444444556688999999999999999999999987632 Q ss_pred ---CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 77 ---DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 77 ---d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) .+...+.|+++|+.|+|+.++.++++++++||+||+++|. +++|.+++++++|.+++|+||+...+++.++ T Consensus 117 ~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~------dedg~~~i~~~~p~~~~~v~d~~~~~~~~~~ 190 (501) T protein:vir:96 117 NDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYR------SEYDETRIKRLSPLETFVIYDNSLEDNSIAA 190 (501) T ss_pred ccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEE------cCCCceEEEEEccceeEEEEcCCCCCceEEE Confidence 2334677899999999999999999999999999999996 4578999999999999999999888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 
154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) +++|......+..+++++||++++++|...++ .+..+..+|+||.||||+|+|++ +|+|+|++ |++||| T Consensus 191 v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~-----~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~-v~~liD 259 (501) T protein:vir:96 191 VRYYNRGTLQSAKDVVEIYTDEHIYTLDASDD-----FNEISVTTHAFGTVPITEYLNNI-----DGIGDYET-ELYLID 259 (501) T ss_pred EEEEEeecCCCcEEEEEEEcCCcEEEEeeCCC-----ceeccccccCCCccceEEecCCc-----cCCCchhh-hHHHHH Confidence 99998887777788999999999999875543 24556788999999999999874 68999975 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc----------cCCCCceEE--EeccccHHHH Q lcl|NC_021297. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT----------LASEAAKIS--EFKAAELRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~----------~~g~~~~~~--~~~~~~~~~~ 301 (480) +||+++|++++.++++++|+++++|.......+... .+...+++. ..+.++++. +++..++++| T Consensus 260 a~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 335 (501) T protein:vir:96 260 LYDSAESDTANHMSDMADAILAIYGDLALPKGMQAS----DMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAY 335 (501) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecccccCcccchh----hhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHH Confidence 999999999999999999999999987654332211 111111111 122345553 4455667888 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---CCCcccce Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG---REVTEEYT 378 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~ 378 (480) +++|+..|+.++.++++++..|++ | +||+||++++.+|.+||.++++.|+.+|++++++++++.+ .....++. 
T Consensus 336 ~~~l~~~I~~~s~~p~~~~~~~~~---n-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~ 411 (501) T protein:vir:96 336 KTRLNRDIHIFTNTPDMSDTNFSG---N-TSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDES 411 (501) T ss_pred HHHHHHHHHHHhCCcccCcccccc---c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 888888777777777766655543 3 6999999999999999999999999999999999988764 33445567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCc--cccc Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQA--DATP 456 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 456 (480) +++++|+++.|.|.++.+++++|++ |++|+||+++++|+++|+.++++++++|+++.............. .... T Consensus 412 ~i~i~f~~~~p~n~~e~ad~~~kl~----g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 487 (501) T protein:vir:96 412 LLKITFTPNLPKSLNEQVSILTGLG----GQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDE 487 (501) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccccCCc Confidence 8999999999999999999999985 368999999999999998888887777665422112111111000 0000 Q ss_pred CCCCCCCCcccccC Q lcl|NC_021297. 457 KPTVTDTKTETQTS 470 (480) Q Consensus 457 ~~~~~d~~~~~~~~ 470 (480) ..+...+++++... T Consensus 488 ~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 488 VKETHTDDFEREYE 501 (501) T ss_pred CCCCCCCccccccC Confidence 01111111112111 No 24 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=2.8e-81 Score=462.23 Aligned_cols=428 Identities=12% Similarity=0.122 Sum_probs=335.0 Q ss_pred CCCH----HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC- Q lcl|NC_021297. 
1 MTTY----HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS- 75 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~- 75 (480) |+.. .++|.+|+.+|..+++|++++.+||+|+|+|...+.. .....++|+++||+++||++.++||+++|+++. T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 89 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTK-DLWKPDNRLTVNFTKYIVDTFTGYFNGIPVKKSH 89 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCc-cccCccceeecchHHHHHHHHhhhhcccCceecc Confidence 3332 5689999999999999999999999999999776432 344568899999999999999999999998764 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 76 ~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ++++..+.|+++|++|+|+.++.+++++++++|+||++||. +++|.+++++++|.+++|+||+...+++.++++ T Consensus 90 ~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir 163 (453) T protein:vir:39 90 SDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQ------NEETQTNVIYNTPENMFMVYDDTIKQEPLFAVR 163 (453) T ss_pred CChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEEcccceEEEecCCCCCeEEEEEE Confidence 56677889999999999999999999999999999999996 357889999999999999999988888999999 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) +|.. .+..+++++|+++++++|...++. 
....+..+|+||.||||+|+|++ +|+|+|++ |++|||+| T Consensus 164 ~~~~---~~~~~~~~~yt~~~i~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~~-----~g~sd~e~-v~~liDa~ 230 (453) T protein:vir:39 164 YGYD---DDYKLYGEVYTKETTYALNGTMGF----YNMTEQAPNPFDDLPVVEFYFNE-----ERMSIFES-VISLVNAF 230 (453) T ss_pred EEEe---CCeEEEEEEEeCCeEEEEEecCCc----eeeecccccCCCceeEEEecCCC-----CCCcchhh-hHHHHHHH Confidence 8853 334678999999999998866543 23456779999999999999874 68999975 99999999 Q ss_pred HHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc-------cCCCCceEEEec--cccHHHHHHHHH Q lcl|NC_021297. 236 SRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-------LASEAAKISEFK--AAELRNFAEEME 306 (480) Q Consensus 236 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-------~~g~~~~~~~~~--~~~~~~~~~~l~ 306 (480) |+++|++++.++++++|+++++|...+...... ...++++. .+++++++.+.+ ....++++++|. T Consensus 231 ~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~ 304 (453) T protein:vir:39 231 NKAISEKANDVDYFSDQYLTFLGAAVEEEDLKN------IRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLT 304 (453) T ss_pred HHHHHHHHHHHHHhhCceeeeecCCCCchhhhh------hhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHH Confidence 999999999999999999999997665432211 12222222 234556665543 345666667776 Q ss_pred HHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEec Q lcl|NC_021297. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWR 385 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~i~v~f~ 385 (480) ..|+.++.++++++..|| ++||+||++++++|+.||.++++.|+.+|++++++++++.+. 
+...+..+++++|+ T Consensus 305 ~~I~~~s~~p~~~~~~~g-----n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~ 379 (453) T protein:vir:39 305 KLIFQTTMVANISDESFG-----SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFT 379 (453) T ss_pred HHHHHHhCCccccccccc-----CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeC Confidence 666666666655544443 369999999999999999999999999999999999988763 34556778999999 Q ss_pred CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 386 DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) ++.|.|+++.+++++|+. |++|+||+++++|+++|+.++++++++|+.+............ .+.+.+. +. T Consensus 380 ~~~p~~~~~~a~~~~kl~----g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~---~~~~~~~---~~ 449 (453) T protein:vir:39 380 RNEPKDIKEQAETANILM----GITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSE---KGTDTVV---PE 449 (453) T ss_pred CCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCC---CCCCCCC---CC Confidence 999999999999999984 3689999999999999888888877777655443222111111 1111111 11 Q ss_pred cccc Q lcl|NC_021297. 466 ETQT 469 (480) Q Consensus 466 ~~~~ 469 (480) ..++ T Consensus 450 ~~~e 453 (453) T protein:vir:39 450 TNEE 453 (453) T ss_pred cCCC Confidence 1111 No 25 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=4.4e-81 Score=461.10 Aligned_cols=449 Identities=11% Similarity=0.080 Sum_probs=337.0 Q ss_pred CCCH-------HHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCccc-chhhhhhccccchHHHHHHHHHhhhccCc Q lcl|NC_021297. 1 MTTY-------HEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGA-PPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~-------~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) |++. 
.+.|.+++.+|.. +.+|++++.+||+|+|+|.+..... .+...++|+++||+++||++.++|+++++ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p 110 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccC Confidence 4332 2456678888764 5789999999999999986654443 33456889999999999999999999999 Q ss_pred ccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCee Q lcl|NC_021297. 72 FRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 72 ~~~-~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~ 150 (480) +++ +++++..+.|+++|+.|+|+.++.++++++++||+||++||. +++|.++++.++|.+++|+||+...+++ T Consensus 111 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~------ded~~~~i~~~~p~~~~~iyd~~~~~~~ 184 (512) T protein:vir:97 111 IQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNS 184 (512) T ss_pred ceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCce Confidence 876 456677889999999999999999999999999999999996 4578899999999999999999888899 Q ss_pred EEEEEEEEEecC----CceEEEEEEEeCCcEEEEEecCCcccc-cccccccccccccccceEEeecccccCccCCcccch Q lcl|NC_021297. 151 TRAVRLYTTRDD----VAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 151 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .+++++|..... .+.+.++++|+++.+++|...++.... 
..+..+..+|+||.||||+|.|++ +|.|+++ T Consensus 185 ~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~~~gd~e 259 (512) T protein:vir:97 185 IAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE 259 (512) T ss_pred EEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCC-----CCCCchh Confidence 999999865432 234678899999999999876544322 234456789999999999999974 5788886 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc---ccchhhhhhh----hh---cccCCCCceEEEe-- Q lcl|NC_021297. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG---ENTTLDIYYG----RI---LTLASEAAKISEF-- 293 (480) Q Consensus 226 ~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~---~~~~~~~~~~----~~---~~~~g~~~~~~~~-- 293 (480) .|++|||+||+++|++++.++++++|+++++|.......... ....+..... .. -..+|+++++.++ T Consensus 260 -~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 338 (512) T protein:vir:97 260 -KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY 338 (512) T ss_pred -hhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecC Confidence 599999999999999999999999999999997544332211 1111111110 00 0123445656544 Q ss_pred ccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_021297. 294 KAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-- 371 (480) Q Consensus 294 ~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-- 371 (480) +....++++++|+..|+.++.++++++..|++ ++||+||++++.+|.+||+++++.|+++|++++++++++.+. 
T Consensus 339 ~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~g----n~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~ 414 (512) T protein:vir:97 339 DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTR 414 (512) T ss_pred CHHHHHHHHHHHHHHHHHHhCCcccCcccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 34567788888877777777777766655543 369999999999999999999999999999999999887542 Q ss_pred --CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhccc- Q lcl|NC_021297. 372 --EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT- 448 (480) Q Consensus 372 --~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 448 (480) ....++.+++++|+++.|.|+++.+++++|+. |++|.||+++++|+++++.++++++++|+.+.+........ T Consensus 415 ~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~----giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~ 490 (512) T protein:vir:97 415 SIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK 490 (512) T ss_pred CcccccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccC Confidence 23455678999999999999999999999984 46899999999999998888887777776554433322211 Q ss_pred -ccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 449 -KAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 449 -~~~~~~~~~~~~~d~~~~~~~~ 470 (480) ..+.+...+++..+ +..+++. T Consensus 491 ~~~~~~~~~~~~~~~-~~~~~~~ 512 (512) T protein:vir:97 491 DPRDINDDEQDDDTK-DTVDKKE 512 (512) T ss_pred CCCCCCCCCCCCCcc-ccccccC Confidence 11111111111111 1111111 No 26 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=3.4e-81 Score=461.73 Aligned_cols=435 Identities=11% Similarity=0.077 Sum_probs=341.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) +.+..++|.+|+.+|..+++|++++.+||+|+|+|...... ..+...++|+++||+++|||+.++|++++++++ T Consensus 42 ~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~ 121 (492) T protein:vir:94 42 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 121 (492) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcccCcee Confidence 55667889999999999999999999999999998654322 122345789999999999999999999999875 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . +|++..+.|+++++ |+|+..+.++++++++||+||++||. +++|++++++++|.+++|+||+...+++.++ T Consensus 122 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~~~v~~------d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ 194 (492) T protein:vir:94 122 KHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 194 (492) T ss_pred ccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 55666777888775 88999999999999999999999996 4578999999999999999998888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc------ccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++.+++|...++... 
...+..+..+|+||.||||+|.|++ +|.|++++ T Consensus 195 ir~~~~~~~----~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~~~sd~e~- 264 (492) T protein:vir:94 195 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM- 264 (492) T ss_pred EEEEeeccc----eeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCC-----CCCCchHH- Confidence 999876543 3579999999988876543211 1234456678999999999999975 58899975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhc-ccCCCCceEE--EeccccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKIS--EFKAAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~--~~~~~~~~~~~~~ 304 (480) |++|||+||+++|++++.+++|++|+++++|.+.+...+.. . ......++ ..+++++++. +++....++++++ T Consensus 265 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 340 (492) T protein:vir:94 265 YKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFK--R--LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 340 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhH--H--HHhhccceecCCCCcceeEeccCCHHHHHHHHHH Confidence 99999999999999999999999999999998765433221 1 11222232 2345567764 4556678899999 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+.++.++++++..|+++ +||+||++++.+|+.||+++++.|+++|++++++++++.+.. 
.+..+++++| T Consensus 341 l~~~I~~~s~~p~~~~~~~~~n----~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~--~~~~~i~v~f 414 (492) T protein:vir:94 341 LYQKIMLFGQAVDFSSDKFGSA----PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK--GEHKDVDISF 414 (492) T ss_pred HHHHHHHHhCCcCCCccccccC----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--cccceeeEEe Confidence 9999999999999888777643 699999999999999999999999999999999999988743 3567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|.++.+++++|++ |++|+||+++++|+++|+.++++++++|+.+.+. ..........+..+..+.. T Consensus 415 ~~~~p~~~~e~~~~~~kl~----giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~--- 486 (492) T protein:vir:94 415 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNK-QLPNLDDGGADSAQQQERS--- 486 (492) T ss_pred cCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHh-hccccccccCCCCccccCC--- Confidence 9999999999999999984 4799999999999999888888777777654433 2222222222211111111 Q ss_pred cccccCCCCCCC Q lcl|NC_021297. 465 TETQTSPSGFNR 476 (480) Q Consensus 465 ~~~~~~p~~~~~ 476 (480) .+.-++ T Consensus 487 ------~~~e~e 492 (492) T protein:vir:94 487 ------NNKESE 492 (492) T ss_pred ------ccccCC Confidence 111111 No 27 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=1.5e-80 Score=458.23 Aligned_cols=448 Identities=11% Similarity=0.098 Sum_probs=338.4 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccc-hhhhhhccccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) +.+.+ .|.+++..|.. +++|++++.+||.|+|+|.+.....+ ....++|+++||+++||++.++|++++++++. ++ T Consensus 39 ~~~~~-~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:99 39 LQNVN-EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hccHH-HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhhcccCceeecCc Confidence 33433 36677887765 67899999999999999876544433 34568899999999999999999999998764 56 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.|+++|+.|+|+.++.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~------ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~ 191 (511) T protein:vir:99 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCceEEEEEEE Confidence 677899999999999999999999999999999999996 45788999999999999999998888899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCcccc-cccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) .... +.+.+.++++|+++.+++|...+..... 
........+|+||.||||+|.|++ +|+|+++ .|++|| T Consensus 192 ~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e-~v~~li 265 (511) T protein:vir:99 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE-KVITLI 265 (511) T ss_pred EeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCC-----CCCCchh-hhHHHH Confidence 6543 2345678899999999999876654322 233456778999999999999974 6889996 499999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc---ccchhhh------hhhhhcccCCCCceEEEe--ccccHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG---ENTTLDI------YYGRILTLASEAAKISEF--KAAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~---~~~~~~~------~~~~~~~~~g~~~~~~~~--~~~~~~~~ 301 (480) |+||.++|++++.+++|++|+++++|.......... ....+.. ........++.++++.++ +....+++ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:99 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHH Confidence 999999999999999999999999996543322111 1111111 111112233455666544 34556777 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~ 377 (480) +++|+..|+.++.++++++..|++ | +||+||++++.+|.+||.++++.|+++|++++++++++.+. 
....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~g---n-~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:99 346 KDRLNSDIHMFTNTPNMKDDNFSG---T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc---c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccc Confidence 777777777777777666555543 3 69999999999999999999999999999999999887542 234456 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccc--cCcccc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK--AQADAT 455 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 455 (480) ..+++.|+++.|.|.++.+++++|+. |++|+||+++++|+++|+.++++++++|+++.+......... ...+.. T Consensus 422 ~~i~i~f~~~~p~n~~e~~~~~~kl~----GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:99 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDD 497 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh----ccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCC Confidence 78999999999999999999999884 468999999999999988888888777776554444332221 111111 Q ss_pred cCCCCCCCCccccc Q lcl|NC_021297. 456 PKPTVTDTKTETQT 469 (480) Q Consensus 456 ~~~~~~d~~~~~~~ 469 (480) ..++..+.+.+..+ T Consensus 498 ~~~~~~~~~~d~~e 511 (511) T protein:vir:99 498 EQDDSTKDSIDKKE 511 (511) T ss_pred CCCCCCcCcccccC Confidence 11111111111112 No 28 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=1.6e-80 Score=458.07 Aligned_cols=448 Identities=11% Similarity=0.094 Sum_probs=336.2 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCccc-chhhhhhccccchHHHHHHHHHhhhccCcccc-CCC Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGA-PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d 77 (480) +.+++ .|.+++..|.. +++|++++.+||+|+|+|....... .+...++|+++||+++||++.++||+++++++ +++ T Consensus 39 ~~~~~-~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:78 39 LQNVN-EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hcCHH-HHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCc Confidence 44443 35667777764 6789999999999999986554433 34456889999999999999999999999876 456 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.++++|+.|+|+.++.++++.++++|+||++||. +++|.+++++++|++++|+||+...+++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~------d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~ 191 (511) T protein:vir:78 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 677899999999999999999999999999999999996 45789999999999999999998888899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCccccc-ccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQW-VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) .... ..+.+.++++||++.+++|...++..... 
.+..+..+|+||.||||+|.|++ +|+|++++ |++|| T Consensus 192 ~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~g~gd~e~-v~~li 265 (511) T protein:vir:78 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNE-----RRKGDYEK-VITLI 265 (511) T ss_pred EeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCC-----CCCCchhh-hHHHH Confidence 7553 22456788999999999998776543222 23456788999999999999874 58899975 99999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhhhhhh------cccCCCCceEEEe--ccccHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYGRI------LTLASEAAKISEF--KAAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~---~~~~~~~~~~~~~------~~~~g~~~~~~~~--~~~~~~~~ 301 (480) |+||.++|++++.++++++|+++++|......... .....+....... ...+++++++.+. +....+++ T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:78 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHH Confidence 99999999999999999999999999644332211 1111111111111 1123445666443 34556777 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~ 377 (480) +++|...|+.++.++++++..|++ | +||+||+++++++.+||.++++.|+++|++++++++++.+. 
....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~~---n-~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:78 346 KDRLNSDIHMFTNTPNMKDDNFSG---T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc---c-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Confidence 777777777777777766555542 3 69999999999999999999999999999999999887542 234556 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccc--cCcccc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK--AQADAT 455 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 455 (480) .+++++|+++.|.|.++.+++++|+. |++|+||+++++|+++|+.++++++++|+.+........... .+.+.. T Consensus 422 ~~i~~~f~~~~p~n~~e~~d~~~kl~----G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:78 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCC Confidence 78999999999999999999999984 468999999999999988888877777765544333322211 111111 Q ss_pred cCCCCCCCCcccccC Q lcl|NC_021297. 456 PKPTVTDTKTETQTS 470 (480) Q Consensus 456 ~~~~~~d~~~~~~~~ 470 (480) ++. ....+..+++. T Consensus 498 ~~~-~~~~~~~~e~~ 511 (511) T protein:vir:78 498 EQD-DDTKDTVDKKE 511 (511) T ss_pred CCC-CCccCcccccC Confidence 111 11111111111 No 29 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=1.6e-80 Score=458.07 Aligned_cols=448 Identities=11% Similarity=0.094 Sum_probs=336.2 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCccc-chhhhhhccccchHHHHHHHHHhhhccCcccc-CCC Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGA-PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d 77 (480) +.+++ .|.+++..|.. +++|++++.+||+|+|+|....... .+...++|+++||+++||++.++||+++++++ +++ T Consensus 39 ~~~~~-~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:96 39 LQNVN-EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hcCHH-HHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCc Confidence 44443 35667777764 6789999999999999986554433 34456889999999999999999999999876 456 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.++++|+.|+|+.++.++++.++++|+||++||. +++|.+++++++|++++|+||+...+++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~------d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~ 191 (511) T protein:vir:96 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 677899999999999999999999999999999999996 45789999999999999999998888899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCccccc-ccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQW-VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) .... ..+.+.++++||++.+++|...++..... 
.+..+..+|+||.||||+|.|++ +|+|++++ |++|| T Consensus 192 ~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~g~gd~e~-v~~li 265 (511) T protein:vir:96 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNE-----RRKGDYEK-VITLI 265 (511) T ss_pred EeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCC-----CCCCchhh-hHHHH Confidence 7553 22456788999999999998776543222 23456788999999999999874 58899975 99999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhhhhhh------cccCCCCceEEEe--ccccHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYGRI------LTLASEAAKISEF--KAAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~---~~~~~~~~~~~~~------~~~~g~~~~~~~~--~~~~~~~~ 301 (480) |+||.++|++++.++++++|+++++|......... .....+....... ...+++++++.+. +....+++ T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:96 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHH Confidence 99999999999999999999999999644332211 1111111111111 1123445666443 34556777 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~ 377 (480) +++|...|+.++.++++++..|++ | +||+||+++++++.+||.++++.|+++|++++++++++.+. 
....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~~---n-~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:96 346 KDRLNSDIHMFTNTPNMKDDNFSG---T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc---c-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Confidence 777777777777777766555542 3 69999999999999999999999999999999999887542 234556 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccc--cCcccc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK--AQADAT 455 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 455 (480) .+++++|+++.|.|.++.+++++|+. |++|+||+++++|+++|+.++++++++|+.+........... .+.+.. T Consensus 422 ~~i~~~f~~~~p~n~~e~~d~~~kl~----G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:96 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCC Confidence 78999999999999999999999984 468999999999999988888877777765544333322211 111111 Q ss_pred cCCCCCCCCcccccC Q lcl|NC_021297. 456 PKPTVTDTKTETQTS 470 (480) Q Consensus 456 ~~~~~~d~~~~~~~~ 470 (480) ++. ....+..+++. T Consensus 498 ~~~-~~~~~~~~e~~ 511 (511) T protein:vir:96 498 EQD-DDTKDTVDKKE 511 (511) T ss_pred CCC-CCccCcccccC Confidence 111 11111111111 No 30 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=5.4e-81 Score=460.62 Aligned_cols=435 Identities=11% Similarity=0.087 Sum_probs=341.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) +....++|++|+.+|..+++|+.++.+||+|+|+|...... ..+...++|+++||+++|||+.++||+++++++ T Consensus 42 ~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~ 121 (492) T protein:vir:97 42 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 121 (492) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCcee Confidence 45557789999999999999999999999999998654332 223345789999999999999999999999875 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.++++++ |+++..+.++++++++||+||+++|. +++|++++++++|++++|+||+...+++.++ T Consensus 122 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~------d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~ 194 (492) T protein:vir:97 122 KHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 194 (492) T ss_pred ccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 55666777888774 89999999999999999999999996 4578899999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc------ccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++.+++|...++... 
...+..+..+|+||.||||+|.|++ +|+|++++ T Consensus 195 vr~~~~~~~----~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~- 264 (492) T protein:vir:97 195 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM- 264 (492) T ss_pred EEEEeeccc----eeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCC-----CCCCchHh- Confidence 999976543 3679999999988876654321 1223456678999999999999975 58899975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhc-ccCCCCceEE--EeccccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKIS--EFKAAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~--~~~~~~~~~~~~~ 304 (480) |++|||+||+++|++++.++++++|+++++|...++..+.. . .+....++ ..+++++++. +++....++++++ T Consensus 265 v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 340 (492) T protein:vir:97 265 YKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFK--R--LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 340 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHH--H--HHhhccceecCCCCcceeEeccCCHHHHHHHHHH Confidence 99999999999999999999999999999998765433211 1 12222233 2345667774 4455678888999 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+.++..+++++..|++ ++||+||++++.+|+.||+++++.|+++|++++++++++.+. 
..++.+++++| T Consensus 341 L~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~--~~~~~~i~v~f 414 (492) T protein:vir:97 341 LYQKIMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHKDVDISF 414 (492) T ss_pred HHHHHHHHhCCCCCCcccccc----CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CcccceeeEEe Confidence 999999998888888777764 369999999999999999999999999999999999998874 34567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.+++++|+. |++|+||+++++|+++|+.++++++++|+.+.. ........++.+..++.+.++.. T Consensus 415 ~~~~p~~~~e~a~~~~kl~----G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 489 (492) T protein:vir:97 415 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQTEYN-KQLPNLDDGGADSAQQQERSNNK 489 (492) T ss_pred cCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH-HhhhccccCCCCCCccccccccc Confidence 9999999999999999984 469999999999999988888877777665433 22222222322222222222211 Q ss_pred ccccc Q lcl|NC_021297. 465 TETQT 469 (480) Q Consensus 465 ~~~~~ 469 (480) .++ T Consensus 490 --~~e 492 (492) T protein:vir:97 490 --ESE 492 (492) T ss_pred --ccC Confidence 111 No 31 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=1.7e-80 Score=457.93 Aligned_cols=448 Identities=11% Similarity=0.090 Sum_probs=336.9 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccc-hhhhhhccccchHHHHHHHHHhhhccCcccc-CCC Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d 77 (480) ..+ .+.|.+++..|.. +++|++++.+||.|+|+|.......+ +...++|+++||+++||++.++||+++|+++ +++ T Consensus 39 ~~~-~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:93 39 LQN-VNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hcc-HHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhhcccCeeeccCC Confidence 222 2347778888864 68999999999999999876544433 3456889999999999999999999999886 466 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.|+++|+.|+|+.++.++++++++||+||++||. +++|.+++++++|.+++|+||+...+++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~------de~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~ 191 (511) T protein:vir:93 118 KDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCceEEEEEEE Confidence 677889999999999999999999999999999999996 35788999999999999999998888899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCcccc-cccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) .... +.+.+.++++|+++.+++|...++.... .....+..+|+||.||||+|.|+. 
+|.|+++ .|++|| T Consensus 192 ~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~gd~e-~v~~li 265 (511) T protein:vir:93 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE-KVITLI 265 (511) T ss_pred EeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCC-----CCCCchh-hHHHHH Confidence 6543 2345678899999999999876654322 223456778999999999999874 5888886 499999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc---ccchhhhhh------hhhcccCCCCceEEEe--ccccHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG---ENTTLDIYY------GRILTLASEAAKISEF--KAAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~---~~~~~~~~~------~~~~~~~g~~~~~~~~--~~~~~~~~ 301 (480) |+||.++|++++.+++|++|+++++|.......... ....+.... ......+++++++.+. +....+++ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 345 (511) T protein:vir:93 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHH Confidence 999999999999999999999999996554332211 111111111 1112234456666544 34557777 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~ 377 (480) +++|+..|+.++.++++++..|++ | +||+||+++++++.+||.++++.|+++|++++++++++.+. 
....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~~---n-~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~ 421 (511) T protein:vir:93 346 KDRLNSDIHMFTNTPNMKDDNFSG---T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc---c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccc Confidence 777777777777777766655543 3 69999999999999999999999999999999999877542 233456 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcc--cccCcccc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYST--TKAQADAT 455 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 455 (480) .++++.|+++.|.|.++.+++++|+. |++|.||+++++|+++++.++++++++|+.+......... ...+.+.. T Consensus 422 ~~i~~~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:93 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred ccceEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCC Confidence 78999999999999999999999983 4689999999999999888888777776654433222111 11111111 Q ss_pred cCCCCCCCCccccc Q lcl|NC_021297. 456 PKPTVTDTKTETQT 469 (480) Q Consensus 456 ~~~~~~d~~~~~~~ 469 (480) .+++..+...+.++ T Consensus 498 ~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 498 EQDDDTKDTVDKKE 511 (511) T ss_pred CCCCcccccccccC Confidence 11111111111111 No 32 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=1.8e-80 Score=457.79 Aligned_cols=443 Identities=15% Similarity=0.120 Sum_probs=330.9 Q ss_pred CCCHHHHHHHHHHHHHHH-HHHHHHHHHHhcccC-cchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCC-- Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARD-LPNLLEAEAYRNGTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE-- 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~-- 76 (480) ..+.-++|.++|.+|..+ ++|++++.+||+|+| .|.......+....++|+++|||++||++.++||+++|+++.. T Consensus 38 ~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d 117 (502) T protein:vir:48 38 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 117 (502) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCC Confidence 444457799999999764 789999999999975 5655555555566788999999999999999999999987532 Q ss_pred C---chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 77 D---SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 77 d---~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) + +...+.++++|+.|+|+.++.++++++++||+||+++|. +++|.+++++++|.+++|+||+...+++.++ T Consensus 118 ~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~------dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ 191 (502) T protein:vir:48 118 NEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYR------SEYDETRIKRLSPLETFVIYDNSLEDNSIAA 191 (502) T ss_pred ccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 2 234567899999999999999999999999999999996 4578999999999999999998888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 
154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) +++|......+..+++++||++++++|...++ .+..+..+|+||.||||+|+|++ +|.|++++ +++||| T Consensus 192 ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~-----~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~-v~~liD 260 (502) T protein:vir:48 192 VRYYNRGTLQNAKDVVEIYTNQHIYTLDASDS-----FNEISVTPHAFGTVPITEFLNNA-----DGIGDYET-ELYLID 260 (502) T ss_pred EEEEEEeecCCcEEEEEEEeCCeEEEEEeCCc-----eeeccceecCCCccceEEecCCC-----CCCCchhh-hHHHHH Confidence 99998777777788999999999998875443 24566788999999999999874 68899975 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc----------cCCCCceEEEe--ccccHHHH Q lcl|NC_021297. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT----------LASEAAKISEF--KAAELRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~----------~~g~~~~~~~~--~~~~~~~~ 301 (480) +||+++|++++.+++|++|+++++|........... .++ ..+++. .++.++++.++ +....+++ T Consensus 261 a~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~ 336 (502) T protein:vir:48 261 LYDSAESDTANHMSDMADAILAIYGDLALPQGMQAS--DMK--RTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAY 336 (502) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCcccccccchh--hhh--hcceeeccccccccccccCcceeEeeecCCHHHHHHH Confidence 999999999999999999999999976543322211 111 111111 12335555443 34456666 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcccce Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---EVTEEYT 378 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~~~ 378 (480) +++|...|+.++.++++++..|+ +| +||+||++++++|.+|+.++++.|+++|++++++++++.+. ....+.. 
T Consensus 337 ~~~L~~~I~~~s~~p~~~~~~~~---~n-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~ 412 (502) T protein:vir:48 337 KTRLNKDIHVFTNTPDMSDNHFS---GN-ASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDES 412 (502) T ss_pred HHHHHHHHHHHhCCCCcCccccc---cC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 66666666666655555444443 33 69999999999999999999999999999999999887652 3345567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCC Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) +++++|+++.|+|.++.+++++|+. |++|+||+++++|+++|+.++++++++|+.+........... +..+.+ T Consensus 413 ~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~---~~~~~~ 485 (502) T protein:vir:48 413 RLKITFTPNLPKSLYEQVSILNDLG----GQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFY---DNVGKY 485 (502) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhccccccc---cccccc Confidence 8999999999999999999999984 368999999999999988888877776665422222111111 100000 Q ss_pred CCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 459 TVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 459 ~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .|+ +.+.+.+..+..+- T Consensus 486 --~d~---~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 486 --TDE---VKETHTDDFERVYE 502 (502) T ss_pred --CCC---ccCCCCcCcCCCCC Confidence 000 00000000000000 No 33 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=4.4e-80 Score=455.62 Aligned_cols=448 Identities=11% Similarity=0.093 Sum_probs=336.4 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccc-hhhhhhccccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) |.+.+ .|.+++..|.. +++|++++.+||.|+|+|.......+ +...++|+++||+++||++.++|++++++++. ++ T Consensus 39 ~~~~~-~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:10 39 LQNVN-EVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred ccCHH-HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCc Confidence 55544 46667777755 58999999999999999976544433 44568899999999999999999999998764 56 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.++++|+.|+|+.++.++++++++||+||++||. +++|++++++++|.+++|+||+...+++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~------dedg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~ 191 (511) T protein:vir:10 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCceEEEEEEE Confidence 677889999999999999999999999999999999996 45789999999999999999998888899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCcccc-cccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) .... +.+...++++|+++.+++|...++.... ........+|+||.||||+|+|+. 
+|.|+++ .|++|| T Consensus 192 ~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~-----~g~gd~e-~v~~li 265 (511) T protein:vir:10 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE-KVITLI 265 (511) T ss_pred EeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCC-----CCCCchh-hhHHHH Confidence 7553 2345678899999999998876544322 223456778999999999999974 5788886 599999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhhhh------hhcccCCCCceEEE--eccccHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYG------RILTLASEAAKISE--FKAAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~---~~~~~~~~~~~------~~~~~~g~~~~~~~--~~~~~~~~~ 301 (480) |+||.++|++++.++++++|+++++|......... .....+..... .....++.++++.+ ++....+++ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:10 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHH Confidence 99999999999999999999999999654332211 11111111111 11112344566544 344567778 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~ 377 (480) +++|...|+.++.++++++..|++ | +||+||+++++++.+||.++++.|+++|++++++++++.+. 
....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~~---n-~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~ 421 (511) T protein:vir:10 346 KDRLNSDIHMFTNTPNMKDDNFSG---T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc---c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccc Confidence 888877777777777766555543 3 69999999999999999999999999999999999887542 234566 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhccc--ccCcccc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT--KAQADAT 455 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 455 (480) .+++++|+++.|+|.++.+++++|+. |++|+||+++++|+++++.++++++++|+++.......... ....+.. T Consensus 422 ~~i~i~f~~~~p~d~~~~~~~~~kl~----G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:10 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred ceeeEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCC Confidence 78999999999999999999999984 46899999999999998888887777776554333322111 1111111 Q ss_pred cCCCCCCCCcccccC Q lcl|NC_021297. 456 PKPTVTDTKTETQTS 470 (480) Q Consensus 456 ~~~~~~d~~~~~~~~ 470 (480) ..++. +.+...++. T Consensus 498 ~~~~~-~~~~~~~~~ 511 (511) T protein:vir:10 498 EQDDD-TKDTVDKKE 511 (511) T ss_pred CCCCc-ccCcccccC Confidence 11111 111111111 No 34 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=2.9e-80 Score=456.68 Aligned_cols=435 Identities=11% Similarity=0.090 Sum_probs=337.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) +....++|.+|+.+|..+++|+.++.+||+|+|+|...... ..+...++|+++||+++|||+.++||+++++++ T Consensus 33 ~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~ 112 (483) T protein:vir:12 33 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 112 (483) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCcee Confidence 44557789999999999999999999999999998654322 223345789999999999999999999999886 Q ss_pred -CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 -SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 -~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) +++++..+.++++++ |+|+..+.++++++++||+||++||. |++|++++++++|.+++|+||+...+++.++ T Consensus 113 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~------d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 185 (483) T protein:vir:12 113 KHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 185 (483) T ss_pred ccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEE------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 456666778888775 78999999999999999999999996 4578999999999999999998888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcc------cccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN------DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++++++|...++.. 
....+..+..+|+||.||||+|.|++ +|+|++++ T Consensus 186 ir~~~~~~~----~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~- 255 (483) T protein:vir:12 186 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM- 255 (483) T ss_pred EEEEEeecc----eEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCC-----CCCCchhh- Confidence 999976543 357999999998887654321 11234456678999999999999874 68899975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhc-ccCCCCceEEEec--cccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||+++|++++.+++|++|+++++|...+...+.. .. +....++ ..+++++++.+++ ....++++++ T Consensus 256 v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~--~~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 331 (483) T protein:vir:12 256 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFK--RL--LRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 331 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHH--Hh--hhhccccccCCCCcceEEeecCCHHHHHHHHHH Confidence 99999999999999999999999999999998765433221 11 1222233 3456677775543 4456777788 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+.++.++++++..|++ ++||+||++++.+|+.||.++++.|+++|++++++++++.+.. 
.++.+++++| T Consensus 332 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~--~~~~~i~v~f 405 (483) T protein:vir:12 332 LYQKIMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK--GEHKDVDISF 405 (483) T ss_pred HHHHHHHHhCCCCCCcccccc----CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccceeeEEe Confidence 877777777777776665553 3699999999999999999999999999999999999988743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.+++++|+. |++|+||+++++|+++|+.++++++++|+++..... .....+..+..+..+..+ + T Consensus 406 ~~~~p~~~~~~a~~~~kl~----GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~-~~~~~~~~d~~~~~~~~~-~ 479 (483) T protein:vir:12 406 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQL-PNLDDGGADGAQQQERSN-N 479 (483) T ss_pred CCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhc-ccccccccCCcccCCCCC-c Confidence 9999999999999999984 479999999999999998888887777765543322 222222111111111111 1 Q ss_pred cccc Q lcl|NC_021297. 465 TETQ 468 (480) Q Consensus 465 ~~~~ 468 (480) .+++ T Consensus 480 ~e~e 483 (483) T protein:vir:12 480 KESE 483 (483) T ss_pred ccCC Confidence 1111 No 35 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=4.7e-80 Score=455.48 Aligned_cols=435 Identities=11% Similarity=0.097 Sum_probs=338.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) +....++|.+|+.+|..+++|+.++.+||+|+|+|...... ..+...++|+++||+++|||+.++||+++|+++ T Consensus 22 ~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~ 101 (472) T protein:vir:93 22 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 101 (472) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCeee Confidence 55668899999999999999999999999999998544322 223345789999999999999999999999876 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.+++++ .|+|+..+.++++++++||+||++||. +++|++++++++|.+++|+||+...+++.++ T Consensus 102 ~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ 174 (472) T protein:vir:93 102 KHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 174 (472) T ss_pred ccCChHHHHHHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEE------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 5566677787777 479999999999999999999999996 4578899999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcc------cccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN------DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++++++|...++.. 
....+..+..+|+||.||||+|.|++ +|+|+|++ T Consensus 175 ir~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~-----~g~s~~e~- 244 (472) T protein:vir:93 175 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM- 244 (472) T ss_pred EEEEEeecc----eeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCC-----CCCCchhh- Confidence 999876543 357899999988887654321 11234456678999999999999974 69999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhc-ccCCCCceEEE--eccccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKISE--FKAAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~--~~~~~~~~~~~~ 304 (480) |++|||+||+++|++++.+++|++|+++++|.+.++..+.. ..+ ....++ ..+++++++.+ ++....++++++ T Consensus 245 v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 320 (472) T protein:vir:93 245 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFK--RLL--RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 320 (472) T ss_pred hHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhH--HHH--hhccccccCCCCcceeEeecCCHHHHHHHHHH Confidence 99999999999999999999999999999998654432221 112 222233 34566777754 445567788888 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+++++++++++..|++ ++||+||++++.+|+.||+++++.|+++|++++++++.+.|.. 
.+..+++++| T Consensus 321 l~~~i~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~i~v~f 394 (472) T protein:vir:93 321 LYQKIMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK--GEHKDVDISF 394 (472) T ss_pred HHHHHHHHhCCCCCCcccccc----CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 888888888888877766654 3699999999999999999999999999999999999988743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) .++.|.|.++.+++++|++ |++|+||+++++|+++|+.++++++++|+.+.+.. +.....++++.+++.+.++++ T Consensus 395 ~~~~p~~~~~~~~~~~k~~----giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~d~~~~~~~~~~~ 469 (472) T protein:vir:93 395 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQQERSNNK 469 (472) T ss_pred CCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-ccCcCcccCCCCCCCCCCCcc Confidence 9999999999999999974 46899999999999998888887777766554322 222222222222222221111 Q ss_pred cccc Q lcl|NC_021297. 465 TETQ 468 (480) Q Consensus 465 ~~~~ 468 (480) +++ T Consensus 470 -~~e 472 (472) T protein:vir:93 470 -ESE 472 (472) T ss_pred -cCC Confidence 111 No 36 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=2.2e-79 Score=451.84 Aligned_cols=451 Identities=12% Similarity=0.073 Sum_probs=339.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCccc---------chhhhhhccccchHHHHHHHHHhhhccCc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA---------PPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~---------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) +....++|.+++..| +.+++.++.+||.|+|+|....... .....++|+++||+++||++.++|++++| T Consensus 27 ~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~ 104 (503) T protein:vir:59 27 AEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEP 104 (503) T ss_pred cchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhcCC Confidence 444567788888888 4678999999999999986554321 22344779999999999999999999999 Q ss_pred cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 72 FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 72 ~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) +++..+++....+.+.|..|+|+.++.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++. T Consensus 105 ~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~dg~~~i~~~~p~~~~~i~d~~~~~~~~ 178 (503) T protein:vir:59 105 VTFTSDNKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFV------DEEGEFDYVIFPAEEMIVVYKDNTRRDIL 178 (503) T ss_pred eeeccCcHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEee------cCCCceEEEEEccceeEEEEeCCCCCceE Confidence 886655444444445555699999999999999999999999986 45789999999999999999998888999 Q ss_pred EEEEEEEEec-CCceEEEEEEEeCCcEEEEEecCCccccc----------ccccccccccccccceEEeecccccCccCC Q lcl|NC_021297. 152 RAVRLYTTRD-DVAVPDRATLYLPDETVPLRRNGGLNDQW----------VVDGDVIKHGLGVVPVVPLTNDPRLGNRYG 220 (480) Q Consensus 152 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G 220 (480) +++++|...+ ++..++++++|+++++++|...++..... 
....+..+|+|+.||||+|.|++ +| T Consensus 179 ~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~-----~~ 253 (503) T protein:vir:59 179 FALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNE-----EM 253 (503) T ss_pred EEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCC-----CC Confidence 9999997665 44567789999999999988765432110 11234578999999999999875 58 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc-CCCCceEEE--ecccc Q lcl|NC_021297. 221 RSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISE--FKAAE 297 (480) Q Consensus 221 ~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~--~~~~~ 297 (480) .|++++ +++|||+||+++|++++.++++++|+++++|.+.+...+.. .++....++.. +++++++.+ ++... T Consensus 254 ~sd~~~-~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 328 (503) T protein:vir:59 254 VSDLKF-YKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFT----ANLRYHSVIKVSGDGGVDTLRAEIPVDS 328 (503) T ss_pred Ccchhh-hHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhh----hhhhcccceeccCCCcceeEeccCCHHH Confidence 889975 99999999999999999999999999999998765433221 12333334333 345566654 44567 Q ss_pred HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---CCCc Q lcl|NC_021297. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG---REVT 374 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~---~~~~ 374 (480) .+++++.|+..|++++..+++++..|++ ++||+||++++.+|.+||+++++.|+.+|++++++++.+.+ .... 
T Consensus 329 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~ 404 (503) T protein:vir:59 329 AAKELERIQDELYKSAQAVDNSPETIGG----GATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDF 404 (503) T ss_pred HHHHHHHHHHHHHHHhcccCCCcccccc----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Confidence 7889999999999999999888776654 37999999999999999999999999999999999887754 2222 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccc Q lcl|NC_021297. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 375 ~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) ....+++++|+++.|.|+++.+++++|++++| ++|+||+++++|+++++.++++++++|+++... ............ T Consensus 405 ~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~G--iiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~ 481 (503) T protein:vir:59 405 NPDKELTMTFTRTRIQNDSEIVQSLVQGVTGG--IMSKETAVARNPFVQDPEEELARIEEEMNQYAE-MQGNLLDDEGGD 481 (503) T ss_pred ccccceeEEeCCCCCCCHHHHHHHHHHHHhCC--CCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHh-hhccccCccCCC Confidence 34567999999999999999999999999876 689999999999999887887777665544322 221111111100 Q ss_pred ccCCCCCCCCcccccCCCCCCCCCC Q lcl|NC_021297. 455 TPKPTVTDTKTETQTSPSGFNRTKT 479 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~p~~~~~~~~ 479 (480) ..++. +.+... +..++..|.+| T Consensus 482 -~~~~~-~~~~~~-~~~~~~~g~~~ 503 (503) T protein:vir:59 482 -DDLEE-DDPNAG-AAESGGAGQVS 503 (503) T ss_pred -CCCCc-CCCCCC-cccCCCCCCcC Confidence 11111 111111 11123333333 No 37 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=3.3e-79 Score=450.84 Aligned_cols=448 Identities=11% Similarity=0.091 Sum_probs=335.1 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccc-hhhhhhccccchHHHHHHHHHhhhccCcccc-CCC Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d 77 (480) +.+. +.|.+++..|.. +++|++++.+||.|+|+|.......+ ....++|+++||+++||++.++|++++++++ +++ T Consensus 39 ~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~ 117 (511) T protein:vir:96 39 LQNV-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hccH-HHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhhccCCceeecCc Confidence 2233 337778888764 67899999999999999876544433 3455789999999999999999999999876 456 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.|+++|+.|+|+.++.+++++++++|+||++||. +++|.+++++++|.+++|+||+...+++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~------ded~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~ 191 (511) T protein:vir:96 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCceEEEEEEE Confidence 677899999999999999999999999999999999996 45789999999999999999998888999999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCcccc-cccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) .... +.+.+.++++|+++.+++|...++.... ........+|+||.||||+|+|+. 
+|+|+++ .|++|| T Consensus 192 ~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~-----~g~gd~e-~v~~li 265 (511) T protein:vir:96 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE-KVITLI 265 (511) T ss_pred EeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCC-----CCCCchh-hhHHHH Confidence 7543 2345668899999999999876544322 223456778999999999999974 5888886 599999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhh------hhhhcccCCCCceEEEe--ccccHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIY------YGRILTLASEAAKISEF--KAAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~---~~~~~~~~~------~~~~~~~~g~~~~~~~~--~~~~~~~~ 301 (480) |+||.++|++++.++++++|+++++|......... .....+... .......++.++++.+. +....+++ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:96 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHH Confidence 99999999999999999999999999654332211 111111111 11112223445665443 44567777 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~ 377 (480) +++|...|+.++.++++++..|++ | +||+||+++++++.+||.++++.|+++|++++++++++.+. 
....++ T Consensus 346 ~~~L~~~I~~~s~~p~~~~~~~~~---n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~ 421 (511) T protein:vir:96 346 KDRLNSDIHMFTNTPNMKDDNFSG---T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc---c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccc Confidence 788777777777777766655543 3 69999999999999999999999999999999999877542 234456 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccc--cCcccc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK--AQADAT 455 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 455 (480) .+++++|+++.|.|.++.+++++|+. |++|+||+++++|+++|+.++++++++|+.+........... .+.+.. T Consensus 422 ~~i~~~f~~~~p~n~~e~~~~~~kl~----G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:96 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred ccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCC Confidence 78999999999999999999999873 468999999999999988888877777765443333221111 111111 Q ss_pred cCCCCCCCCcccccC Q lcl|NC_021297. 456 PKPTVTDTKTETQTS 470 (480) Q Consensus 456 ~~~~~~d~~~~~~~~ 470 (480) +.++..+.. ..++. T Consensus 498 ~~~~~~~~~-~~~~~ 511 (511) T protein:vir:96 498 EQDDDTKDT-VDKKE 511 (511) T ss_pred CCCCccccc-ccccC Confidence 111111111 11111 No 38 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=1.4e-79 Score=452.84 Aligned_cols=429 Identities=13% Similarity=0.105 Sum_probs=332.9 Q ss_pred CCCH----HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC- Q lcl|NC_021297. 
1 MTTY----HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS- 75 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~- 75 (480) |+.. .+.|.+|+++|..+++|++++.+||+|+|+|..... ..+..+++|+++||+++|||+.++|++++|+++. T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 89 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKA-KDSWKPDNRLTNNFAKYIVDTFVGYFNGIPIKKTH 89 (453) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCC-CCccCccceeecchHHHHHHHhhhhhcccCceeec Confidence 2211 246888999999999999999999999999976533 3455678999999999999999999999998764 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 76 ~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ++++..+.++++|+.|+|+.++.++++++++||+||++||.+ ++|.+++++++|.+++|+||+..++.+.++++ T Consensus 90 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d------~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~ 163 (453) T protein:vir:73 90 DDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQN------ESTESEVIYCSPLNVFMVYDDSIKQKPLFAVY 163 (453) T ss_pred CChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeC------CCCceEEEEEcccceEEEEeCCCCceeEEEEE Confidence 566677899999999999999999999999999999999963 57889999999999999999988888999998 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) +|.+.+. .+++++|+++++++|...++. 
....+..+|+||.||||+|+|++ +|+|+|++ |++|||+| T Consensus 164 ~~~~~~~---~~~~~vyt~~~i~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~~-----~g~s~~~~-v~~liDa~ 230 (453) T protein:vir:73 164 YGFDEEG---NLSGTVYTLLETISITGKAGE----VKFGESTYNVYSDLPIVEYNFNE-----ERQSIFEP-VHSLINSY 230 (453) T ss_pred EEEecCc---eEEEEEEeCCeEEEEEecCCc----eEEccceeccCCceeEEEecCCC-----CCCcchhh-HHHHHHHH Confidence 8764432 467899999999998765432 24456778999999999999975 68899975 99999999 Q ss_pred HHHHHHHHHHHHHhcchhhhhcCCCccccccccc--cchhhh---hhhhhc-ccCCCCceEEEec--cccHHHHHHHHHH Q lcl|NC_021297. 236 SRTLMNLQSASQILGTPLRVISGVTTDELTNDGE--NTTLDI---YYGRIL-TLASEAAKISEFK--AAELRNFAEEMEV 307 (480) Q Consensus 236 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~--~~~~~~---~~~~~~-~~~g~~~~~~~~~--~~~~~~~~~~l~~ 307 (480) |+++|++++.+++|++|+++++|..++.+..... ...+.. ..+... ...++++++.+++ ....+++++.++. T Consensus 231 ~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~ 310 (453) T protein:vir:73 231 NKVTSEKANDVEYFSDQYLVFLGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLER 310 (453) T ss_pred HHHHHHHHHHHHHhccceeeeecCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHH Confidence 9999999999999999999999987654332221 111111 111111 1123345665544 3456677777777 Q ss_pred HHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCcccceeeeEEecC Q lcl|NC_021297. 
308 FRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-REVTEEYTRLETVWRD 386 (480) Q Consensus 308 ~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~i~v~f~~ 386 (480) .|+.++.++++++..|| ++||+||++++.+|+.||+++++.|+.+|++++++++++.+ .+...++.+++++|++ T Consensus 311 ~I~~~s~~p~~~~~~~g-----n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~ 385 (453) T protein:vir:73 311 SIFQFTMAANISDENFG-----NSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTR 385 (453) T ss_pred HHHHHhCCcccCccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCC Confidence 77777666665544332 37999999999999999999999999999999999998865 3445567789999999 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) +.|.|.++.+++++|++ |++|.||+++++|+++|+.++++++++|+++....+..+... .++...++- T Consensus 386 ~~p~~~~~~a~~~~k~~----giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~-----~~~~~~~~~ 453 (453) T protein:vir:73 386 NEPKDIKEQAETANILK----GITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNLV-----RMKQMRGNL 453 (453) T ss_pred CCCCCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCC-----cchhhhcCC Confidence 99999999999999985 469999999999999988788877777765544333222111 111222221 No 39 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=4.9e-79 Score=449.88 Aligned_cols=438 Identities=12% Similarity=0.086 Sum_probs=329.5 Q ss_pred CCCHH----HHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc- Q lcl|NC_021297. 
1 MTTYH----EHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI- 74 (480) Q Consensus 1 ~~~~~----~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~- 74 (480) |+..+ +.|.+++.+|.. +++||+++++||+|+|+|.+... ++..+++|+++||+++||++.++|++++++++ T Consensus 19 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~--~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~ 96 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPE--KETGADNRIVVNSAKYVVDVYNGYFCGIEPKLA 96 (470) T ss_pred eCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcc--cccCCcceeecchHHHHHHHHhhhhccCCeeEe Confidence 33222 235567887765 56899999999999999976533 34456889999999999999999999998764 Q ss_pred -CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 -SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 -~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) .+|.+..+.++++|++|+|+.++.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++.++ T Consensus 97 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ 170 (470) T protein:vir:99 97 LLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQ------GEDARPHLMYSSPNHAFIIYDDTVQRQPLAF 170 (470) T ss_pred eCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEe------CCCCeEEEEEEccceeEEEEcCCCCcceEEE Confidence 455666789999999999999999999999999999999996 3578999999999999999999888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) +++|....+.....++++|+++.+++|...+... 
.....+..+|+||.||||+|.|++ +|+|++++ |++||| T Consensus 171 vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vPvv~~~n~~-----~g~sd~e~-v~~liD 242 (470) T protein:vir:99 171 VHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEE--DTNAAGYAINPYGLVPAVEFFENE-----ERQGIFDS-IKTLIN 242 (470) T ss_pred EEEEEEecCCeeEEEEEEEecCeEEEEEeccccc--ccccccccccCCCccceEeecCCC-----CCCcchHh-HHHHHH Confidence 9999887777777889999999999887665433 234456678999999999999874 68999975 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC----CCCceEEEec-cccHHHHHHHHHHH Q lcl|NC_021297. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFK-AAELRNFAEEMEVF 308 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~-~~~~~~~~~~l~~~ 308 (480) +||+++|++++.++++++|+++++|...+..... .........+++.++ +.++++..+. ..+.+.+...++.+ T Consensus 243 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 320 (470) T protein:vir:99 243 ALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEG--NPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHL 320 (470) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCccccccc--chhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHH Confidence 9999999999999999999999999876543222 222233344444332 2334433332 22344455555555 Q ss_pred HHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecC Q lcl|NC_021297. 309 RKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRD 386 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~ 386 (480) ...|+..+++|+..++..++| +||+||++++.+|.+||.++++.|+++|++++++++.+.+. 
....+..+++++|++ T Consensus 321 ~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~ 399 (470) T protein:vir:99 321 TDFIFMMAMVPNIQDKNFAGN-SSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTR 399 (470) T ss_pred HHHHHHHhCCccccccccccC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCC Confidence 666666666666555543333 69999999999999999999999999999999999887653 334456789999999 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcc Q lcl|NC_021297. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTE 466 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +.|.|.++.+++++|++ |++|+||+++++|+++ +.++++++++|+.+..... ............++++ + T Consensus 400 ~~p~~~~e~a~~~~kl~----giis~et~l~~l~~vd-~~~E~eri~~E~~~~~~~~-~~~~~~~d~~~~d~~~-----e 468 (470) T protein:vir:99 400 NLPEDMASAIDNAKNAE----GIVSKKTQLGMIPDIE-PDAEMKQIAKEKADAIKQT-QQLSMPIDILKRDNNA-----E 468 (470) T ss_pred CCCcCHHHHHHHHHHHh----ccCCHHHHHHhCCCCC-HHHHHHHHHHHHHHHHHHH-HhhcCCCCcCCCCCCc-----c Confidence 99999999999999985 4689999999999984 4456666655554432222 1111111111111111 1 Q ss_pred cc Q lcl|NC_021297. 467 TQ 468 (480) Q Consensus 467 ~~ 468 (480) ++ T Consensus 469 e~ 470 (470) T protein:vir:99 469 EE 470 (470) T ss_pred CC Confidence 11 No 40 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=3.3e-79 Score=450.87 Aligned_cols=426 Identities=13% Similarity=0.122 Sum_probs=328.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc-CCCch Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~~ 79 (480) |+ .+.|.+|++.|..+++|+.++.+||+|+|+|...... .....++|+++||+++||++.++||+++|+++ +++++ T Consensus 17 ~~--~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~ 93 (452) T protein:vir:36 17 IT--VEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAK-DSWKPDNRLAVNFTKYIVDTFTGYFNGIPVKKSHSDKE 93 (452) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccc-cccCccceeecchHHHHHHHHhhhhcccCceeecCChh Confidence 53 3578889999999999999999999999999765432 33456889999999999999999999999875 45667 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..+.++++|+.|+|+.++.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++.+++++|.. T Consensus 94 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~ 167 (452) T protein:vir:36 94 ILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQ------DEDTQTNVVYNSPENMFMVYDDTVKQEPLFAVRYGVD 167 (452) T ss_pred HHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEe------cCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEe Confidence 7889999999999999999999999999999999996 4578999999999999999999888899999999875 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) .+. ..++++||++++++|...++. 
....+..+|+||.||||+|+|++ +|+|+++ .|++|||+||+++ T Consensus 168 ~~~---~~~~~vyt~~~i~~~~~~~~~----~~~~~~~~~~~g~iPvv~~~n~~-----~g~sd~e-~v~~liDa~d~~~ 234 (452) T protein:vir:36 168 EDK---KLQGEVYTLLETIKISGENDE----ISFGEGTYNPYPDLPVVEFYFNE-----ERMSIFE-SVISLVNAFNKAI 234 (452) T ss_pred cCc---eEEEEEEecCeEEEEEEcCCc----eEEecceeccCCcccEEEecCCC-----CCCcchH-HHHHHHHHHHHHH Confidence 543 467899999999998765533 24556778999999999999975 5889997 4999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC------CCCceEEEecc--ccHHHHHHHHHHHHHH Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA------SEAAKISEFKA--AELRNFAEEMEVFRKE 311 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~------g~~~~~~~~~~--~~~~~~~~~l~~~~~~ 311 (480) |++++.++++++|+++++|...+..... +...++++.++ +.++++.+++. ...+++++.++..|+. T Consensus 235 s~~~~~~~~~~~p~~~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~ 308 (452) T protein:vir:36 235 SEKANDVDYFSDQYLTFLGAAVEEEDLK------NIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQ 308 (452) T ss_pred HHHHHHHHHhcCceeEeecCCcCchhhh------hhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHH Confidence 9999999999999999999865432211 11222333322 22456655443 3455555555555555 Q ss_pred HHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCC Q lcl|NC_021297. 312 AASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTP 390 (480) Q Consensus 312 i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~i~v~f~~~~~~ 390 (480) ++ ++|...++. . +++||+||++++++|++||.++++.|+.+|++++++++++.+. +...++.++++.|+++.|. 
T Consensus 309 ~s---~~p~~~~~~-~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~ 383 (452) T protein:vir:36 309 TT---MVANISDES-F-GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRNEPK 383 (452) T ss_pred Hh---CccccCccc-c-cCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCc Confidence 55 454433332 1 2369999999999999999999999999999999999988653 3445667899999999999 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) |.++.+++++|++ |++|.||+++++|+++|+.++++++++|+++........ ..++.......++ ++.. T Consensus 384 d~~~~a~~~~k~~----g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~---~~~~~~~~~~~~~----~~~e 452 (452) T protein:vir:36 384 DIKEQAETANILM----GITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDK---QPSEKGTDTVVSE----TNEE 452 (452) T ss_pred CHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhc---cCCCCcccccCcc----ccCC Confidence 9999999999974 368999999999999888888887777765543221111 1111111111111 1111 No 41 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=5.2e-79 Score=449.75 Aligned_cols=434 Identities=11% Similarity=0.102 Sum_probs=334.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+..++|++|+++|..+++|+.++.+||+|+|+|...... 
..+...++|+++||+++||++.++||+++++++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~ 104 (474) T protein:vir:96 25 VETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY 104 (474) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCcee Confidence 67778999999999999999999999999999999765322 122345789999999999999999999999886 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.++++++ |+|+..+.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++.++ T Consensus 105 ~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ 177 (474) T protein:vir:96 105 AHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYI------NEDGELKLFRVPAEQAIPIWTDKEREQLNAF 177 (474) T ss_pred ccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeee------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 5 44556677777764 88999999999999999999999996 4578899999999999999998888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc------ccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... .+++++|+++++++|...++... 
...+..+..+|+||.||||+|+|++ +|.|++++ T Consensus 178 ir~~~~~~----~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~-----~~~~d~e~- 247 (474) T protein:vir:96 178 IRIFTFNG----ETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNP-----EEVSDIWM- 247 (474) T ss_pred EEEEeecC----eeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCC-----CCCCchHH- Confidence 99986432 45789999999999887654321 1234456678999999999999974 58888865 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhc-ccCCCCceEEEec--cccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||.++|++++.+++|++|+++++|+..++..+.. -.....+++ ..+++++++.+++ ....++++++ T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~----~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:96 248 YKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFM----EGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchh----hhhhccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 99999999999999999999999999999998765332211 122233333 3345567775543 4456777777 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+.++.++++.+..|++ ++||+||+++++++++||.++++.|+++|++++++++++.|.. 
.+..+++++| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~--~d~~~i~i~f 397 (474) T protein:vir:96 324 MRAYIVEFGQGVDFQTDKFGS----ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK--LDAKEIEITF 397 (474) T ss_pred HHHHHHHHhCCcCcccccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777776666555543 3799999999999999999999999999999999999998743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|.++.++.+++ + |++|+||+++++|+++|+.++++++++|+.+.. ..+.....+..+...++..++ T Consensus 398 ~~~~p~~~~e~a~~~~~---~--giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~-- 469 (474) T protein:vir:96 398 NFNVMVNDLEQSQIGAQ---S--QYLSKETLVRHHPWVDDPKAELERLDEEQLELN-KQLPNLDDGGADGAQQQQQSE-- 469 (474) T ss_pred cCCCccCHHHHHHHHHH---c--CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccccccCCCCCCcCCCC-- Confidence 99999999999997765 2 579999999999999998888877776665432 222222222222111111111 Q ss_pred cccccC Q lcl|NC_021297. 465 TETQTS 470 (480) Q Consensus 465 ~~~~~~ 470 (480) +++.. T Consensus 470 -~~e~~ 474 (474) T protein:vir:96 470 -NNQSK 474 (474) T ss_pred -ccccC Confidence 11111 No 42 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=5.2e-79 Score=449.75 Aligned_cols=434 Identities=11% Similarity=0.102 Sum_probs=334.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+..++|++|+++|..+++|+.++.+||+|+|+|...... ..+...++|+++||+++||++.++||+++++++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~ 104 (474) T protein:vir:95 25 VETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY 104 (474) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCcee Confidence 67778999999999999999999999999999999765322 122345789999999999999999999999886 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.++++++ |+|+..+.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++.++ T Consensus 105 ~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ 177 (474) T protein:vir:95 105 AHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYI------NEDGELKLFRVPAEQAIPIWTDKEREQLNAF 177 (474) T ss_pred ccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeee------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 5 44556677777764 88999999999999999999999996 4578899999999999999998888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc------ccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... .+++++|+++++++|...++... 
...+..+..+|+||.||||+|+|++ +|.|++++ T Consensus 178 ir~~~~~~----~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~-----~~~~d~e~- 247 (474) T protein:vir:95 178 IRIFTFNG----ETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNP-----EEVSDIWM- 247 (474) T ss_pred EEEEeecC----eeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCC-----CCCCchHH- Confidence 99986432 45789999999999887654321 1234456678999999999999974 58888865 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhc-ccCCCCceEEEec--cccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||.++|++++.+++|++|+++++|+..++..+.. -.....+++ ..+++++++.+++ ....++++++ T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~----~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:95 248 YKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFM----EGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchh----hhhhccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 99999999999999999999999999999998765332211 122233333 3345567775543 4456777777 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+.++.++++.+..|++ ++||+||+++++++++||.++++.|+++|++++++++++.|.. 
.+..+++++| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~--~d~~~i~i~f 397 (474) T protein:vir:95 324 MRAYIVEFGQGVDFQTDKFGS----ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK--LDAKEIEITF 397 (474) T ss_pred HHHHHHHHhCCcCcccccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777776666555543 3799999999999999999999999999999999999998743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|.++.++.+++ + |++|+||+++++|+++|+.++++++++|+.+.. ..+.....+..+...++..++ T Consensus 398 ~~~~p~~~~e~a~~~~~---~--giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~-- 469 (474) T protein:vir:95 398 NFNVMVNDLEQSQIGAQ---S--QYLSKETLVRHHPWVDDPKAELERLDEEQLELN-KQLPNLDDGGADGAQQQQQSE-- 469 (474) T ss_pred cCCCccCHHHHHHHHHH---c--CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccccccCCCCCCcCCCC-- Confidence 99999999999997765 2 579999999999999998888877776665432 222222222222111111111 Q ss_pred cccccC Q lcl|NC_021297. 465 TETQTS 470 (480) Q Consensus 465 ~~~~~~ 470 (480) +++.. T Consensus 470 -~~e~~ 474 (474) T protein:vir:95 470 -NNQSK 474 (474) T ss_pred -ccccC Confidence 11111 No 43 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=8.8e-79 Score=448.50 Aligned_cols=434 Identities=11% Similarity=0.092 Sum_probs=332.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCc------ccchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGI------GAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~------~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) -.+-.++|.+|+..|..+++|+.++.+||+|+|+|..... ..+...+++|+++||+++||++.++|++++|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~ 104 (474) T protein:vir:94 25 FETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTY 104 (474) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCcee Confidence 3444689999999999999999999999999999864322 1234456889999999999999999999999987 Q ss_pred CCC-chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 SED-SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~~d-~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ..+ ++..+.++.++ +|+|+..+.++++.++++|+||+++|. +++|++++++++|.+++|+||+...+++.++ T Consensus 105 ~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 177 (474) T protein:vir:94 105 SCEDENVLKVIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYI------NENGEMKLFRVPAEQAIPIWVDKEREELKSF 177 (474) T ss_pred ccCcHHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEe------cCCCeeEEEEEcccceEEEEcCCCCCceEEE Confidence 554 44555566555 589999999999999999999999996 4578899999999999999999888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccc------cccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQ------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... ..++++|+++.+++|...++.... 
........+|+||.||||+|.|++ +|+|++++ T Consensus 178 ir~~~~~~----~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~- 247 (474) T protein:vir:94 178 IRYYKFNN----EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNP-----EEVSDIWM- 247 (474) T ss_pred EEEEEecC----eEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCc-----CCCCcHHH- Confidence 99986543 347899999999998876543211 122345678999999999999974 68999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc-cCCCCceEEEec--cccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||+++|++++.++++++|+++++|...+...+.. . .+...+++. .+++++++.+++ ....++++++ T Consensus 248 v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~--~--~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:94 248 YKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFM--R--GLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDL 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhh--h--hhhccceeeccCCCceeEEeecCCHHHHHHHHHH Confidence 99999999999999999999999999999998765433211 1 122233333 345677776644 3456677777 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+.++.++++++..|++ ++||+||++++.++++||.++++.|+++|++++++++++.|.. 
.++.+++++| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~d~~~i~v~f 397 (474) T protein:vir:94 324 MRVYIMEFGQGVDFQTDKFGS----APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK--TDVKDIEISF 397 (474) T ss_pred HHHHHHHHhCccccCcccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777777666555543 3699999999999999999999999999999999999988754 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.|++++++ +++|+||++.++|+++|+.++++++++|+.+.. .........+.+..+ T Consensus 398 ~~~~p~~~~e~a~~~~~~-----g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~-------- 463 (474) T protein:vir:94 398 NFNRMMNDAEQSQIIAQS-----QYLSRETLVKSSPLVDDYKAELERIEQEQMEYN-KQLPNLDDGGADGAQ-------- 463 (474) T ss_pred ccCcccCHHHHHHHHHHc-----CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccCCCCCCCcc-------- Confidence 999999999999987763 468999999999999988888877776665432 222121221111111 Q ss_pred cccccCCCCCCC Q lcl|NC_021297. 465 TETQTSPSGFNR 476 (480) Q Consensus 465 ~~~~~~p~~~~~ 476 (480) +++++.++.++ T Consensus 464 -~~~~~~~~~~e 474 (474) T protein:vir:94 464 -QQEGSNNKESE 474 (474) T ss_pred -cCCCCcccccC Confidence 12222222222 No 44 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=8.8e-79 Score=448.50 Aligned_cols=434 Identities=11% Similarity=0.092 Sum_probs=332.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCc------ccchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGI------GAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~------~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) -.+-.++|.+|+..|..+++|+.++.+||+|+|+|..... ..+...+++|+++||+++||++.++|++++|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~ 104 (474) T protein:vir:97 25 FETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTY 104 (474) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCcee Confidence 3444689999999999999999999999999999864322 1234456889999999999999999999999987 Q ss_pred CCC-chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 SED-SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~~d-~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ..+ ++..+.++.++ +|+|+..+.++++.++++|+||+++|. +++|++++++++|.+++|+||+...+++.++ T Consensus 105 ~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 177 (474) T protein:vir:97 105 SCEDENVLKVIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYI------NENGEMKLFRVPAEQAIPIWVDKEREELKSF 177 (474) T ss_pred ccCcHHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEe------cCCCeeEEEEEcccceEEEEcCCCCCceEEE Confidence 554 44555566555 589999999999999999999999996 4578899999999999999999888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccc------cccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQ------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... ..++++|+++.+++|...++.... 
........+|+||.||||+|.|++ +|+|++++ T Consensus 178 ir~~~~~~----~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~- 247 (474) T protein:vir:97 178 IRYYKFNN----EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNP-----EEVSDIWM- 247 (474) T ss_pred EEEEEecC----eEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCc-----CCCCcHHH- Confidence 99986543 347899999999998876543211 122345678999999999999974 68999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc-cCCCCceEEEec--cccHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||+++|++++.++++++|+++++|...+...+.. . .+...+++. .+++++++.+++ ....++++++ T Consensus 248 v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~--~--~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:97 248 YKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFM--R--GLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDL 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhh--h--hhhccceeeccCCCceeEEeecCCHHHHHHHHHH Confidence 99999999999999999999999999999998765433211 1 122233333 345677776644 3456677777 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) |+..|+.++.++++++..|++ ++||+||++++.++++||.++++.|+++|++++++++++.|.. 
.++.+++++| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~d~~~i~v~f 397 (474) T protein:vir:97 324 MRVYIMEFGQGVDFQTDKFGS----APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK--TDVKDIEISF 397 (474) T ss_pred HHHHHHHHhCccccCcccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777777666555543 3699999999999999999999999999999999999988754 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.|++++++ +++|+||++.++|+++|+.++++++++|+.+.. .........+.+..+ T Consensus 398 ~~~~p~~~~e~a~~~~~~-----g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~-------- 463 (474) T protein:vir:97 398 NFNRMMNDAEQSQIIAQS-----QYLSRETLVKSSPLVDDYKAELERIEQEQMEYN-KQLPNLDDGGADGAQ-------- 463 (474) T ss_pred ccCcccCHHHHHHHHHHc-----CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccCCCCCCCcc-------- Confidence 999999999999987763 468999999999999988888877776665432 222121221111111 Q ss_pred cccccCCCCCCC Q lcl|NC_021297. 465 TETQTSPSGFNR 476 (480) Q Consensus 465 ~~~~~~p~~~~~ 476 (480) +++++.++.++ T Consensus 464 -~~~~~~~~~~e 474 (474) T protein:vir:97 464 -QQEGSNNKESE 474 (474) T ss_pred -cCCCCcccccC Confidence 12222222222 No 45 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=1.1e-78 Score=448.04 Aligned_cols=420 Identities=13% Similarity=0.095 Sum_probs=327.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCch Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~ 79 (480) || .++|.+|+++|..+.+|+.++++||+|+|+|..... ..+...++|+++||+++||++.++||+++|+++. +++. T Consensus 1 l~--~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~ 77 (429) T protein:vir:98 1 MT--KDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQ-KEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQ 77 (429) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc-cccCCCcceeecchHHHHHHHHhhhhcccCceeecCChH Confidence 66 467888999999999999999999999999865432 3344568899999999999999999999998764 4556 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..+.++++|++|+|+.++.++++++++||+||++||. +++|++++++++|.+++|+||+...+++.+++++|.. T Consensus 78 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~ 151 (429) T protein:vir:98 78 VSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFN------DENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYN 151 (429) T ss_pred HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------cCCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEe Confidence 7788999999999999999999999999999999986 4578999999999999999998888889999999864 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) .+ ...+.++|+++.+++|...++. 
....+..+|+||.||||+|+|++ +|+|+|++ |++|+|+||+++ T Consensus 152 ~~---~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~~-----~g~sd~e~-v~~liD~~d~~~ 218 (429) T protein:vir:98 152 KG---GVLEGSYSDASNITYFKDGEKG----IEIGESEPHPFDGVPMIEYVENE-----ERQSLLAS-VVTLINAFNKAI 218 (429) T ss_pred cC---ceEEEEEEeCceEEEEEecCCc----eEecccccccCCccceEEecCCC-----CCCCcHHH-HHHHHHHHHHHH Confidence 43 3567788999988887655432 34456779999999999999874 68999975 999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCC-----CCceEEEecc--ccHHHHHHHHHHHHHHH Q lcl|NC_021297. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLAS-----EAAKISEFKA--AELRNFAEEMEVFRKEA 312 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g-----~~~~~~~~~~--~~~~~~~~~l~~~~~~i 312 (480) |++++.++++++|+++++|...+.... .+....+++.+++ .++++.+++. ...+++++.+...|+.+ T Consensus 219 s~~~~~~~~~~~p~~~i~g~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 292 (429) T protein:vir:98 219 SEKANDVEYFADAYLKILGAELDDETL------KSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRT 292 (429) T ss_pred HHHHHHHHHhcCceeeeecCCCCcchh------hhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 999999999999999999986543211 1223334444432 2355555443 23444555555555555 Q ss_pred HhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCC Q lcl|NC_021297. 313 ASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPT 391 (480) Q Consensus 313 ~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~i~v~f~~~~~~~ 391 (480) +.++++++..+ +++||+||++++.+|..|+.++++.|+++|++++++++++.+. 
....+..++++.|+++.|.| T Consensus 293 s~~p~~~~~~~-----gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~ 367 (429) T protein:vir:98 293 AMVANISDESF-----GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPAN 367 (429) T ss_pred hCccccCcccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcC Confidence 54444433322 2369999999999999999999999999999999999998763 33456678999999999999 Q ss_pred HHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCC Q lcl|NC_021297. 392 VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTD 462 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) +++.+++++|+. +++|+||+++++|+++|+.++++++++|+.+. .+.......++. .+...| T Consensus 368 ~~~~a~~~~kl~----g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~-~~~~~~~~~~~~----~~~~~~ 429 (429) T protein:vir:98 368 LLEESQIAGNLA----GIVSEETQVGVLSIVENPQKEIERKNSDKSTL-ISRQAGGLNGQN----TTTILE 429 (429) T ss_pred HHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-HHHHHhhhcCCC----CCCCCC Confidence 999999999973 46899999999999998888887777766543 233222222111 111111 No 46 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1.2e-77 Score=442.21 Aligned_cols=437 Identities=14% Similarity=0.144 Sum_probs=327.9 Q ss_pred CCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCcchhcCccc---chhhhhhccccchHHHHHHHHHhhhccCcccc-C Q lcl|NC_021297. 1 MTTYHEHVERLQGLLA-RDLPNLLEAEAYRNGTRRLKTIGIGA---PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-S 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~-~~~~r~~~~~~YY~g~~~i~~~~~~~---~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~ 75 (480) |.++ +.|++++.+|. .+++|++++.+||+|+|++....... 
.....++|+++||+++||++.++|++++++++ + T Consensus 29 ~~~~-~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 107 (481) T protein:vir:10 29 LLKE-ENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITH 107 (481) T ss_pred hcCH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhccCCceEec Confidence 4444 45778888875 56799999999999998764332221 12344778999999999999999999999764 4 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 76 ~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ++++.++.++++|++|+|+.++.+++++++++|+||+++|. +++|++++++++|++++|+||+....++.++++ T Consensus 108 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~------d~dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~ 181 (481) T protein:vir:10 108 QDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYR------DFEDRDTFKVLDPKSTFVVYDQTLDKKVVAGVR 181 (481) T ss_pred CChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEe------CCCCeEEEEEEcccceEEEEcCCCCCceEEEEE Confidence 56778889999999999999999999999999999999986 457899999999999999999988888999999 Q ss_pred EEEEecC-CceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHH Q lcl|NC_021297. 156 LYTTRDD-VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 156 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~ 234 (480) +|...+. ++.++++++|+++++++|...++. ....+..||+||.||||+|.|+. 
+|+|+|++ |++|||+ T Consensus 182 ~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~~-----~g~~~~~~-v~~lida 251 (481) T protein:vir:10 182 YFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGT----YHRVEEVEHYYNDVPIIEYLNDQ-----FKQGDFEN-VIALIDL 251 (481) T ss_pred EEEEeeCCCceEEEEEEEecCeEEEEEecCCc----eeecccccccCCceeEEEeecCC-----CCCCchhh-HHHHHHH Confidence 9876554 445678999999999998876543 23456789999999999999864 68999974 9999999 Q ss_pred HHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc---chhhhhhhhhccc--CCCCceEEEec--cccHHHHHHHHHH Q lcl|NC_021297. 235 ASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN---TTLDIYYGRILTL--ASEAAKISEFK--AAELRNFAEEMEV 307 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~---~~~~~~~~~~~~~--~g~~~~~~~~~--~~~~~~~~~~l~~ 307 (480) ||+++|++++.++++++|+++++|............ ..+.......... ++.++++.+++ ...++++++.|+. T Consensus 252 ~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 331 (481) T protein:vir:10 252 YDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQN 331 (481) T ss_pred HHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHH Confidence 999999999999999999999998654332211110 0111111111111 23345554443 3445555555555 Q ss_pred HHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEec Q lcl|NC_021297. 308 FRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWR 385 (480) Q Consensus 308 ~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~~~~~i~v~f~ 385 (480) .++.+ +++|+..++..++ ++||+||++++++|.+||+++++.|+.+|++++++++++.+.. 
...+..++++.|+ T Consensus 332 ~i~~~---s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~ 407 (481) T protein:vir:10 332 DIHKY---TNTPDLNDEQFSG-VQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFT 407 (481) T ss_pred HHHHH---hCCcccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeC Confidence 55555 5555544543333 3699999999999999999999999999999999999886532 3345678999999 Q ss_pred CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC-CCC Q lcl|NC_021297. 386 DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-DTK 464 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~ 464 (480) ++.|+|.++.+++++|+. |++|.||+++++|+++++.++++++++|+++.......... +++.++++.+ |++ T Consensus 408 ~~~~~~~~~~a~~~~kl~----g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~---~~~~~~~~~~dd~~ 480 (481) T protein:vir:10 408 PNLPKSMMESINAFNALS----GGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGY---GEAFENHLNVDDSN 480 (481) T ss_pred CCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccC---CccCCCCCCCCCCC Confidence 999999999999999884 46899999999999998888888777777554433222211 1222222222 222 Q ss_pred c Q lcl|NC_021297. 465 T 465 (480) Q Consensus 465 ~ 465 (480) + T Consensus 481 g 481 (481) T protein:vir:10 481 G 481 (481) T ss_pred C Confidence 1 No 47 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=7.3e-78 Score=443.49 Aligned_cols=433 Identities=11% Similarity=0.095 Sum_probs=331.0 Q ss_pred CC-----------------------CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhcc Q lcl|NC_021297. 1 MT-----------------------TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDV 51 (480) Q Consensus 1 ~~-----------------------~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~ 51 (480) |. 
+..++|.+++..|..+++++.++.+||+|+|+|...... ..+...++|+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRM 80 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccccccccccccccee Confidence 22 337899999999999999999999999999998654322 1223457799 Q ss_pred ccchHHHHHHHHHhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEE Q lcl|NC_021297. 52 QPGWVATYLRTLSDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLI 130 (480) Q Consensus 52 ~~n~~~~iVd~~~~~l~~~~~~~-~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i 130 (480) ++||+++||++.++|++++++++ +++++..+.|+++++ |+++.++.+++++++++|+||+++|. +++|++++ T Consensus 81 ~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~------d~~g~~~~ 153 (478) T protein:vir:10 81 YTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYV------DEEGEFKT 153 (478) T ss_pred ccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEe------cCCCeeEE Confidence 99999999999999999999875 455566778888875 78999999999999999999999986 45789999 Q ss_pred EEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc----------cccccccccccc Q lcl|NC_021297. 131 RVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND----------QWVVDGDVIKHG 200 (480) Q Consensus 131 ~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~ 200 (480) ++++|.+++|+||+...+.+.+++++|.... ..++++|+++++++|...++... 
.........+|+ T Consensus 154 ~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~----~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (478) T protein:vir:10 154 FRVPAEQAVPIWTNKERDELQAFIRVYELDG----AERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMS 229 (478) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEecC----ceEEEEEeCCeEEEEEEcCCeeeccccccccccccceeccccccc Confidence 9999999999999888888999999986543 34689999999998877653321 112234566899 Q ss_pred ccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhh Q lcl|NC_021297. 201 LGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRI 280 (480) Q Consensus 201 ~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~ 280 (480) ||.||||+|+|+ ++|+|++++ |++|||+||+++|++++.+++|++|+++++|+..+...+.. ..+...++ T Consensus 230 ~~~vPvv~~~n~-----~~g~sd~~~-v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~----~~~~~~~~ 299 (478) T protein:vir:10 230 WGRVPFIPFKNN-----PQEVSDLFM-YKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFM----HNLKYYKA 299 (478) T ss_pred CCccceEEeccC-----CCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhh----hhhhhcce Confidence 999999999987 478999975 99999999999999999999999999999998765433322 22233333 Q ss_pred ccc---CCCCceEEEec--cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
281 LTL---ASEAAKISEFK--AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 281 ~~~---~g~~~~~~~~~--~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) +.+ +|+++++.+++ ..+.+++++.++..|+.++..+++++..|++ ++||+||+++++.|.+||+++++.|+ T Consensus 300 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~~~~~~~ 375 (478) T protein:vir:10 300 ISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGN----SPSGIALKFMYSNLDLKANKLKNKTL 375 (478) T ss_pred EEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccccc----ccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 45567776544 3456667777777777776666665555542 36999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHH Q lcl|NC_021297. 356 GAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQ 435 (480) Q Consensus 356 ~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~ 435 (480) ++|++++++++++.|.. .+..+++++|+++.|.|.++.|++++|+. |++|+||+++++|+++|+.++++++++| T Consensus 376 ~~l~~~~~li~~~~g~~--~~~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~D~~~E~~ri~~E 449 (478) T protein:vir:10 376 TALQELLQYIIDFYRLD--VKVQDIEITFNFNVMVNELENSQIAMNST----GLLSKETILSNHAWVEDPVAEMERIEQE 449 (478) T ss_pred HHHHHHHHHHHHHhCCC--cccccceEEecCCCCCCHHHHHHHHHHHh----CCCChHHHHHhCCCCCCHHHHHHHHHHH Confidence 99999999999998754 45668999999999999999999999873 3689999999999998888888777766 Q ss_pred HHHHHHHHHhcccccCcccccCCCCCCCCcc Q lcl|NC_021297. 436 ETEDMIDTLYSTTKAQADATPKPTVTDTKTE 466 (480) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +++.. ..+.....+.. 
..++....+.+.+ T Consensus 450 ~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~ 478 (478) T protein:vir:10 450 NIELN-QQLPDIEEGLN-GEQQRQSENNQPE 478 (478) T ss_pred HHHHH-hhccccccccC-CCCCCCCCCCCCC Confidence 54422 22211111111 1111111111111 No 48 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=1e-77 Score=442.70 Aligned_cols=427 Identities=11% Similarity=0.094 Sum_probs=329.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCccc------chhhhhhccccchHHHHHHHHHhhhccCcccc- Q lcl|NC_021297. 2 TTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI- 74 (480) Q Consensus 2 ~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~- 74 (480) -| .+.|.+|++.|..+++|+.++.+||+|+|+|....... .....++|+++||+++||++.++|+++++++. T Consensus 1 l~-~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~ 79 (451) T protein:vir:10 1 ME-LEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFD 79 (451) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceee Confidence 44 35566689999999999999999999999997654322 12235779999999999999999999999764 Q ss_pred -CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccccc--CCCceeEEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 75 -SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG--DPAGIPLIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 75 -~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~--d~~~~~~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) .++++..+. .+.|..|+|+.++.+++++++++|+||+++|.+..... ...|.++++.++|++++|+||+...+++. 
T Consensus 80 ~~~~~~~~~~-~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~ 158 (451) T protein:vir:10 80 IDNNKELNEK-VTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELE 158 (451) T ss_pred cCCcHHHHHH-HHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceE Confidence 333444444 45555699999999999999999999999997543221 22478899999999999999988888999 Q ss_pred EEEEEEEEecCC------ceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccch Q lcl|NC_021297. 152 RAVRLYTTRDDV------AVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 152 ~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) +++|+|...++. ...+++++||++.+++|...+..........+..+|+||.||||+|.|+. +|.|+++ T Consensus 159 ~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~~~~d~e 233 (451) T protein:vir:10 159 AVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNI-----KKQSDLS 233 (451) T ss_pred EEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCC-----CCCCchh Confidence 999999766542 23568899999999999887666555566778889999999999999975 4678886 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc------CCCCceEEEec--ccc Q lcl|NC_021297. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL------ASEAAKISEFK--AAE 297 (480) Q Consensus 226 ~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~------~g~~~~~~~~~--~~~ 297 (480) + |++|||+||.++|++++.+++|++|+++++|+..++..+.. .++...+++.+ .++++++.+++ ... 
T Consensus 234 ~-v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~ 308 (451) T protein:vir:10 234 K-YKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFL----KELKRYKTIKTETDSEGDSGGLKTMQIEIPTEA 308 (451) T ss_pred h-HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhH----HHHhhCCeEEecCcCCccCCcceEEeecCCHHH Confidence 4 99999999999999999999999999999998765433221 12222222222 23567775544 445 Q ss_pred HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021297. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 377 (480) .++++++|+..|+.++.++++++..|| ++||+||++++.+|++||+++++.|+++|++++++++++.+.. ++ T Consensus 309 ~~~~~~~l~~~I~~~s~~p~~~~~~~g-----n~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~---d~ 380 (451) T protein:vir:10 309 RKIILEILKKQIYESGQGLQQDTENFG-----NASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT---DY 380 (451) T ss_pred HHHHHHHHHHHHHHHhCcccccccccc-----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---Cc Confidence 667777777777777777766555443 3799999999999999999999999999999999999998743 46 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) .+++++|+++.|.|+++.+++++|++ |++|+||++.++|+++++.++++++.+++.+... 
+.....+.-.+ T Consensus 381 ~~i~i~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~-~~~~~~~~~~~ 451 (451) T protein:vir:10 381 KKIQQTYTRNMMSNDLEDADIATKSV----GIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQAS-KVSDDYNNFTE 451 (451) T ss_pred cceeEEecCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-HHHhhcCCCCC Confidence 78999999999999999999999984 4799999999999999877766655554443332 22222211111 No 49 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=8.2e-78 Score=443.21 Aligned_cols=424 Identities=16% Similarity=0.121 Sum_probs=317.6 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccCcchhcCc-ccchhhhhhccccchHHHHHHHHHhhhccCccccC----CCchhHHHH Q lcl|NC_021297. 10 RLQGLLARDLPNLLEAEAYRNGTRRLKTIGI-GAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS----EDSEGLEEL 84 (480) Q Consensus 10 ~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~-~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~----~d~~~~~~l 84 (480) -|...+..+++||+++.+||+|+|++..... .......++|+++||+++||++.++||+++++++. ++++..+.+ T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~~l 80 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLSTI 80 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHHHH Confidence 4555667789999999999999999865433 23344568899999999999999999999987653 234456689 Q ss_pred HHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCc Q lcl|NC_021297. 85 WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVA 164 (480) Q Consensus 85 ~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~ 164 (480) +++|++|+|+.++.++++++++||+||+++|. 
+++|++++++++|.+++|+||+...+++.+++++|...+ T Consensus 81 ~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~------d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~--- 151 (440) T protein:vir:95 81 KDIEWQNDINALNSDLAFDASVYGRAYEYHFR------DKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYAD--- 151 (440) T ss_pred HHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecC--- Confidence 99999999999999999999999999999996 457889999999999999999988889999999986554 Q ss_pred eEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 165 VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQS 244 (480) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~ 244 (480) ..++++|+++.+++|...+..... ....+..+|+||.||||+|+|+. +|+|++++ +++|||+||+++|++++ T Consensus 152 -~~~~~vyt~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~vPvv~~~n~~-----~g~sd~e~-v~~lida~~~~~s~~~~ 223 (440) T protein:vir:95 152 -KVNMTVYTKDKVITYKPYSNNSVR-LVVDDVKKHSYNDVPVVEWWNNR-----FRMGDYES-EISLIDAYDAGQSDTAN 223 (440) T ss_pred -ceEEEEEeCCeEEEEEEecCCccc-eeecceeeccCceeeEEEeeCCC-----CCCCchhh-hHHHHHHHHHHHHHHHH Confidence 246789999999998876554333 34567789999999999999974 58899975 99999999999999999 Q ss_pred HHHHhcchhhhhcCCCccccccccccchhhhhhhhhc---------ccCCCCceEEEeccccHHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 245 ASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL---------TLASEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 245 ~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~---------~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) .+++|++|+++++|.......+.+....++.. ..++ ...++++++.+++. +.+.+...++.+...|+.. 
T Consensus 224 ~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~lt~~~-~~~~~~~~~~~l~~~i~~~ 301 (440) T protein:vir:95 224 YMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDA-NMLFLKTGISTTGQQTTADASYIYKQY-DVNGTEAYKNRLANDIHRF 301 (440) T ss_pred HHHHhhcceeeeecccccCCCCccchhhhhhc-cceecccccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHH Confidence 99999999999999755443333322222111 1111 12234566665543 3344444444445555555 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCHH Q lcl|NC_021297. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 316 ~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~~~~ 393 (480) +++|+.+++..++| +||+||++++.+|++||+++++.|+++|++++++++.+.+. ....+..+++++|+++.|+|.+ T Consensus 302 s~~p~~~~~~~~~n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~ 380 (440) T protein:vir:95 302 SRIPNLDDDRFNST-SSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVW 380 (440) T ss_pred hCCccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHH Confidence 55555445433334 69999999999999999999999999999999999887653 3345667899999999999999 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 394 AKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +.+++++|+. +++|.||+++++|+++++ ++++++++|+.+............ 
.+.+.|.+ T Consensus 381 ~~ad~~~kl~----g~iS~et~~~~l~~~d~~-~E~~ri~~E~~~~~~~~~~~~~~~------~~~~~~~e 440 (440) T protein:vir:95 381 TEIKAYIEAG----GEISQETLMENASFTDYK-TEHSRILKQGGSSDLEIGQIVGDA------DVGQADTE 440 (440) T ss_pred HHHHHHHHHh----ccCcHHHHHHhCCCCCcH-HHHHHHHHHHHHhhhhHHhhccCC------CCCCcCCC Confidence 9999999983 368999999999998654 455555555544332221111111 11111111 No 50 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=2.6e-77 Score=440.47 Aligned_cols=433 Identities=11% Similarity=0.099 Sum_probs=332.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) =-+..++|++|+++|..+++|++++.+||+|+|+|...... ......++|+++||+++||++.++|++++++++ T Consensus 24 ~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~ 103 (478) T protein:vir:10 24 YETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTF 103 (478) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCcee Confidence 12447799999999999999999999999999998654322 123345789999999999999999999999875 Q ss_pred -CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 -SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 -~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) +++++..+.++++++ |+|+.++.++++.++++|++|++||. 
+++|++++++++|.+++|+||+...+++.++ T Consensus 104 ~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ 176 (478) T protein:vir:10 104 GVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYV------DEEGEFKTFRVPAEQAVPIWTNKERDELQAF 176 (478) T ss_pred ecCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 456667788888885 89999999999999999999999996 4578999999999999999998878889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc----------ccccccccccccccccceEEeecccccCccCCccc Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~ 223 (480) +++|...+ ..++++|+++++++|...++... .........+|+||.||||+|.|++ +|.|+ T Consensus 177 ir~~~~~~----~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~g~sd 247 (478) T protein:vir:10 177 IRVYELDG----AERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNP-----QEVSD 247 (478) T ss_pred EEEEeeeC----ceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCC-----CCCCc Confidence 99886543 34679999999998877653221 1112344568999999999999974 58899 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc---CCCCceEEEec--cccH Q lcl|NC_021297. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL---ASEAAKISEFK--AAEL 298 (480) Q Consensus 224 i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~---~g~~~~~~~~~--~~~~ 298 (480) +++ |++|||+||+++|++++.+++|++|+++++|++.++..+.. 
.++...+++.+ +|+++++.+++ ...+ T Consensus 248 ~e~-v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 322 (478) T protein:vir:10 248 LFM-YKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFM----HNLKYYKAISVAGESGSGVDTIKVEVPIDSV 322 (478) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchh----hhhhhCceeEecCCCCCcceEEeecCCHHHH Confidence 975 99999999999999999999999999999998765433321 12222333333 34567776544 3456 Q ss_pred HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) ++++++++..|+.++.++++.+..|++ ++||+||++++.+|++||.++++.|+++|++++++++++.|.. .+.. T Consensus 323 ~~~~~~l~~~I~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~d~~ 396 (478) T protein:vir:10 323 KEYTKMLRDYIIEFGQGVDFQQDKFGN----SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLD--VRVQ 396 (478) T ss_pred HHHHHHHHHHHHHHhCCcCcCcccccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccc Confidence 666666666666666666655554542 3699999999999999999999999999999999999998754 4556 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCC Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) +++++|.++.|.|.++.+++++++. |++|+||+++++|+++|+.++++++++|+.+.... ......+.. 
..+.+ T Consensus 397 ~i~i~f~~~~p~~~~e~~~~~~~~~----g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~-d~~~~ 470 (478) T protein:vir:10 397 DIEITFNFNVMVNELENSQIAMNST----GLLSKETILGNHSWVQDPVAEMERIEQENIELNQQ-LPDIEEGLN-DEQQR 470 (478) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHh----CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-ccccCCCCc-ccccc Confidence 8999999999999999999998873 46899999999999999888888877777654322 111111111 11111 Q ss_pred CCCCCCcc Q lcl|NC_021297. 459 TVTDTKTE 466 (480) Q Consensus 459 ~~~d~~~~ 466 (480) ...+++++ T Consensus 471 ~~~d~~~e 478 (478) T protein:vir:10 471 QSEDNQSE 478 (478) T ss_pred cCcCCCCC Confidence 11121112 No 51 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=2.2e-77 Score=440.83 Aligned_cols=431 Identities=13% Similarity=0.081 Sum_probs=332.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc----------cchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG----------APPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~----------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) ..+.+++|++++..|..+.+|+.++.+||+|+|+|...... .+....++|+++||+++||++.++|++++ T Consensus 3 ~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~G~ 82 (470) T protein:vir:10 3 LDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVASV 82 (470) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhheecc Confidence 66778899999999999999999999999999998654221 12234578999999999999999999999 Q ss_pred cccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCe Q lcl|NC_021297. 
71 GFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 71 ~~~~-~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~ 149 (480) |+++ +++++..+.++++++. ++...+.+++++++++|+||+++|. +++|+++++.++|.+++|+||+...++ T Consensus 83 p~~~~~~d~~~~~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~------d~~~~~~~~~~~p~~~~~v~d~~~~~~ 155 (470) T protein:vir:10 83 FPDIDVGKDADNKKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWI------DEDGNFRYGIIQPDQITPIYATTLDNK 155 (470) T ss_pred ceeeecCchHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCc Confidence 9875 3566677889999974 6788889999999999999999996 467899999999999999999988889 Q ss_pred eEEEEEEEEEecCC--ceEEEEEEEeCCcEEEEEecCCcccc----------------cccccccccccccccceEEeec Q lcl|NC_021297. 150 VTRAVRLYTTRDDV--AVPDRATLYLPDETVPLRRNGGLNDQ----------------WVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 150 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~iPvv~~~n 211 (480) +.+++++|...+.. ...+++++|+++.+++|...+..... .....+..+|+||.||||+|+| T Consensus 156 ~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 235 (470) T protein:vir:10 156 LLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSK 235 (470) T ss_pred eEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeec Confidence 99999999766543 34567899999999998766432211 1122456689999999999999 Q ss_pred ccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC------C Q lcl|NC_021297. 212 DPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA------S 285 (480) Q Consensus 212 ~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~------g 285 (480) |. +|+|+++ .|++|||+||.++|++++.+++|++|+++++|+..+...+... 
.+ .....+.++ + T Consensus 236 n~-----~g~sd~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~--~~--~~~~~i~~~~~~~~~~ 305 (470) T protein:vir:10 236 NK-----YRLPELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMN--DL--RKYKSIKINNTGNGDN 305 (470) T ss_pred CC-----CCCCchh-HHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhh--hh--hhcCeEeccCCCCCcC Confidence 85 6899996 5999999999999999999999999999999987654332211 11 111222221 2 Q ss_pred CCceEEEecc--ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 286 EAAKISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 286 ~~~~~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) .++++.+++. ...+.++++|+..|+.++..+++++..+ +++||+||++++++|++||+++++.|+++|+++++ T Consensus 306 ~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-----gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (470) T protein:vir:10 306 SGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-----SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (470) T ss_pred ceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3456655443 4456677777777777776666654433 23799999999999999999999999999999999 Q ss_pred HHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 364 IAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDT 443 (480) Q Consensus 364 ~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~ 443 (480) +++++.+.. ..++.+++++|+++.|.|..+.++.++++. |++|.||+++++|+++|+.++++++++|+.+.. .. 
T Consensus 381 ~i~~~l~~~-~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~----g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~-~~ 454 (470) T protein:vir:10 381 AIMRYLNFS-DADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDKEEND-PY 454 (470) T ss_pred HHHHHhccc-CcccceeeEEeccCCCCCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-Hh Confidence 999887753 345678999999999999999999998873 479999999999999998888888777765532 11 Q ss_pred HhcccccCcccccCCCCCCCC Q lcl|NC_021297. 444 LYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 444 ~~~~~~~~~~~~~~~~~~d~~ 464 (480) ... . ++.. ..+.+|++ T Consensus 455 ~~~-~---~~~~-~~~~dde~ 470 (470) T protein:vir:10 455 SNQ-A---DELN-GKGVNDEQ 470 (470) T ss_pred hcc-c---cccC-CCCCCCCC Confidence 111 0 1111 11111111 No 52 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=5e-77 Score=438.92 Aligned_cols=435 Identities=10% Similarity=0.071 Sum_probs=325.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc------cchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) -.+-.++|++|+.+|..+++|+.++++||+|+|+|.+.... .+...+++|+++||+++||++.++||+++++++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~ 104 (474) T protein:vir:95 25 FETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTY 104 (474) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCcee Confidence 22335689999999999999999999999999998653221 233445789999999999999999999999886 Q ss_pred CCC-chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 
75 SED-SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~~d-~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ..+ ++..+.++.++ .|+|+..+.++++.++++|+||++||. +++|++++++++|.+++|+||+...+.+.++ T Consensus 105 ~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 177 (474) T protein:vir:95 105 SCEDESVLKIIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYI------NENGEMKLFRVPAEQAIPIWVDKEREELKSF 177 (474) T ss_pred ccCchHHHHHHHHHH-hccHHHHHHHHHHHHhhcCcEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 544 44555666665 588999999999999999999999986 4578899999999999999998878889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccc------cccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQ------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|..... .++++|+++++++|...++.... ........+|+||.||||+|+|++ +|.|+|++ T Consensus 178 i~~~~~~~~----~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~-----~g~sd~e~- 247 (474) T protein:vir:95 178 IRYYKFNNE----EKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNP-----EEVSDIWM- 247 (474) T ss_pred EEEEEEcCe----eEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCC-----CCCCcHHH- Confidence 999865432 46899999999998876543211 112344578999999999999974 68999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc-cCCCCceEEEeccccHHHHHHHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFKAAELRNFAEEME 306 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~~~~~~~~~~~l~ 306 (480) |++|||+||+++|++++.++++++|+++++|...++..+... .....+++. .+++++++.+++. 
+.+++...++ T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~----~~~~~~~i~~~~~~~~~~l~~~~-~~~~~~~~~~ 322 (474) T protein:vir:95 248 YKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMR----GLKYYKAINVDGDGGVETIQVEV-PVSSTKEYID 322 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh----hhhccceeeccCCCceeEEeecC-CHHHHHHHHH Confidence 999999999999999999999999999999987654322111 122233333 3455677766543 3445555555 Q ss_pred HHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecC Q lcl|NC_021297. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRD 386 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~ 386 (480) .+...|+..+++|..+++..+ +++||+||++++.++..||+++++.|+++|++++++++.+.|.. .+..+++++|.+ T Consensus 323 ~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~--~d~~~i~v~f~~ 399 (474) T protein:vir:95 323 LMRAYIMEFGQGVDFQTDKFG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLK--MDVKDIEISFNF 399 (474) T ss_pred HHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEecc Confidence 555555555555543333222 23699999999999999999999999999999999999998753 456789999999 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC-CCCc Q lcl|NC_021297. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-DTKT 465 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~ 465 (480) +.|.|+++.+++++++ |++|++|++.++|+++|+.++++++++|+.+..... ....+..++..++.+.. 
+.++ T Consensus 400 ~~p~d~~e~a~~~~~~-----g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~ 473 (474) T protein:vir:95 400 NRMMNDAEQSQIIAQS-----QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQL-PNLDDGGADGAQQQERSNDKES 473 (474) T ss_pred CCCcCHHHHHHHHHhc-----CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcc-cccccccCCCCcCCCCCccCCC Confidence 9999999999987762 479999999999999988888877777665543322 22222222211111111 1111 Q ss_pred c Q lcl|NC_021297. 466 E 466 (480) Q Consensus 466 ~ 466 (480) + T Consensus 474 ~ 474 (474) T protein:vir:95 474 E 474 (474) T ss_pred C Confidence 1 No 53 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=1.6e-76 Score=436.17 Aligned_cols=445 Identities=14% Similarity=0.064 Sum_probs=332.5 Q ss_pred CCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc-CCCc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLA-RDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~-~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~ 78 (480) +....+.+..|+..|. .+++|++++++||+|+|+|...+.......+++|+++||+++||++.++|++++++++ ++++ T Consensus 13 ~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 92 (489) T protein:vir:99 13 SKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLGVPVEYKNENK 92 (489) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhccCCceeecCCh Confidence 4323344555677665 5679999999999999999887666666667889999999999999999999999875 4566 Q ss_pred hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 
79 EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 79 ~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) +..+.++++|++|+|+.++.++++.++++|+||+++|... ..++++++++++++|.+++|+||+....++.+++++|. T Consensus 93 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~--~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~ 170 (489) T protein:vir:99 93 DLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEK--IDDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYD 170 (489) T ss_pred hHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeecc--CcCCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 7788999999999999999999999999999999998643 23568899999999999999999888888999999886 Q ss_pred Eec-CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHH Q lcl|NC_021297. 159 TRD-DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASR 237 (480) Q Consensus 159 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~ 237 (480) ..+ +++...++++|+++.+++|....... ......+..+|+||.||||+|+|+. +|.|+++ .|++|||+||+ T Consensus 171 ~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~-~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~s~~~-~v~~liDa~d~ 243 (489) T protein:vir:99 171 IDYGSGKRKQIIKAYTSDTIYTYEDYNLET-KGMRLKDYEGHFFKGVPVNEYANNE-----ERTGAYE-SVLDNIDAYDL 243 (489) T ss_pred EecCCCceEEEEEEEeCCcEEEEEecCCCc-ccceecccccccCCceeEEEeecCC-----CCCCchh-hhHHHHHHHHH Confidence 554 34566789999999999988754332 2334567789999999999999974 5888886 49999999999 Q ss_pred HHHHHHHHHHHhcchhhhhcCCCccccccccccchhh------------hhhhhhcccC--------CCCceEEEe--cc Q lcl|NC_021297. 238 TLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLD------------IYYGRILTLA--------SEAAKISEF--KA 295 (480) Q Consensus 238 ~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~------------~~~~~~~~~~--------g~~~~~~~~--~~ 295 (480) ++|++++.++++++|+++++|................ ...++++..+ +.++++..+ +. 
T Consensus 244 ~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 323 (489) T protein:vir:99 244 SQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDT 323 (489) T ss_pred HHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCCh Confidence 9999999999999999999997654432221111111 1112222221 123444333 34 Q ss_pred ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---C Q lcl|NC_021297. 296 AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---E 372 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~ 372 (480) ...++++++|...++.++.++++++..|++ ++||+||+++++++.+||.++++.|+.+|++++++++.+.+. . T Consensus 324 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~ 399 (489) T protein:vir:99 324 AGSEAYKNRLVADILRFTFTPDTQDMKFSG----VQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNE 399 (489) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 456667777777777777666665554443 369999999999999999999999999999999999887642 1 Q ss_pred --CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCh--hHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_021297. 373 --VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTA--TQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 373 --~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ....+.+++++|+++.|.|.++.+++++|++ |++|+||+++++|+++ +..++++++++|+.+.. . .. T Consensus 400 ~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~----giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~-~-~~--- 470 (489) T protein:vir:99 400 ATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY----GIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQ-S-LP--- 470 (489) T ss_pred cccccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHh-c-cc--- Confidence 2233567999999999999999999999974 4689999999999864 44555666655543321 1 10 Q ss_pred ccCcccccCCCCCCCCcccccCC Q lcl|NC_021297. 
449 KAQADATPKPTVTDTKTETQTSP 471 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~p 471 (480) +....++..+++..++.+| T Consensus 471 ----~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 471 ----EPRLVGDASGQEEPTAEKP 489 (489) T ss_pred ----cccccCCCCCCcCCCCCCC Confidence 1111112222222334444 No 54 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=1.4e-76 Score=436.46 Aligned_cols=428 Identities=12% Similarity=0.095 Sum_probs=326.0 Q ss_pred CCC--HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCccc--------------chhhhhhccccchHHHHHHHHH Q lcl|NC_021297. 1 MTT--YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA--------------PPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~--~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~--------------~~~~~~~~~~~n~~~~iVd~~~ 64 (480) |+. ..++|.+++.+|..+++++.++.+||+|+|+|....... .....++|+++||+++||++.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 554 488999999999999999999999999999997543221 1223477999999999999999 Q ss_pred hhhccCccccCC-CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEe Q lcl|NC_021297. 65 DRLDIEGFRISE-DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELD 143 (480) Q Consensus 65 ~~l~~~~~~~~~-d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d 143 (480) +|++++++++.. +++..+.++.+++ |+|+.++.++++.++++|+||+++|... 
++|.+++..++|.+++|+|| T Consensus 81 ~yl~G~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~-----~~g~~~~~~~~p~~~~~i~d 154 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDA-----SDNSFRYACVDSKEVIPIYS 154 (471) T ss_pred hhhcccCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeC-----CCCeeEEEEEcccceEEEEc Confidence 999999987654 4455666766664 8999999999999999999999999642 46889999999999999999 Q ss_pred CCCCCeeEEEEEEEEEec--CCceEEEEEEEeCCcEEEEEecCCccc----------------ccccccccccccccccc Q lcl|NC_021297. 144 PRNTRRVTRAVRLYTTRD--DVAVPDRATLYLPDETVPLRRNGGLND----------------QWVVDGDVIKHGLGVVP 205 (480) Q Consensus 144 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~iP 205 (480) +...+++.+++++|...+ +....+++++|+++.+++|...++... ......+..+|+||.|| T Consensus 155 ~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 234 (471) T protein:vir:10 155 KSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVP 234 (471) T ss_pred CCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCcee Confidence 888889999999997654 455677899999999999887654321 12344566789999999 Q ss_pred eEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc-- Q lcl|NC_021297. 206 VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-- 283 (480) Q Consensus 206 vv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~-- 283 (480) ||+|+|+. .|.|+++ .|++|||+||.++|++++.+++|++|+++++|...+...+.. 
..+...+++.+ T Consensus 235 vv~~~n~~-----~~~sd~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~----~~~~~~~~i~~~~ 304 (471) T protein:vir:10 235 FIPFKNNE-----IETNDLK-PIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFL----EDLKRYKMIKMDN 304 (471) T ss_pred EEEeccCC-----CCCCchH-HHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhH----HHhhcCCeEEecC Confidence 99999975 4788886 599999999999999999999999999999997654332211 11222222222 Q ss_pred ----CCCCceEEEecc--ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 284 ----ASEAAKISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGA 357 (480) Q Consensus 284 ----~g~~~~~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 357 (480) .+.++++.+++. ...+.++++|+..|+.++..+++.+..+ + ++||+||++++.+|.+||+++++.|+++ T Consensus 305 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~----g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~ 379 (471) T protein:vir:10 305 DGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKL----G-NSSGVALKFLYSLLELKAGNMETQFRSG 379 (471) T ss_pred CCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc----c-CccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 233567766543 3445555555555555555554443333 2 3699999999999999999999999999 Q ss_pred HHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHH Q lcl|NC_021297. 358 WERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQET 437 (480) Q Consensus 358 l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~ 437 (480) |++++++++++.+.. +..++++.|+++.|.|.++.+++++|+. 
|++|.||+++++|+++|+.++++++++|++ T Consensus 380 l~~~~~li~~~~~~~---d~~~i~i~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~~p~v~D~~~E~eri~~E~~ 452 (471) T protein:vir:10 380 YATLVKMILKHLGLS---DKLKIKQTWTRNSINNDTEMAQVVSTLA----TITSRENVAKSNPIVEDWQDELRLQKAEQE 452 (471) T ss_pred HHHHHHHHHHHhccC---CCceeEEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 999999999988743 4567999999999999999999999873 469999999999999988888877776665 Q ss_pred HHHHHHHhcccccCcccccCCCCCCCCcc Q lcl|NC_021297. 438 EDMIDTLYSTTKAQADATPKPTVTDTKTE 466 (480) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +.. +...... +..++++. + T Consensus 453 ~~~-~~~~~~~----~~~~~~e~-----~ 471 (471) T protein:vir:10 453 GRS-EKLYDME----EVEHESEV-----E 471 (471) T ss_pred HHH-hcccccC----CCCCcccc-----C Confidence 431 1111111 11111111 1 No 55 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=2.1e-76 Score=435.47 Aligned_cols=432 Identities=12% Similarity=0.105 Sum_probs=326.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCccc------chhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+..++|++++++|..+.+|+.++.+||+|+|+|....... .+...++|+++||+++||++.++||+++++++ T Consensus 24 ~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~ 103 (474) T protein:vir:96 24 YETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTF 103 (474) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhhcccCcee Confidence 666788999999999999999999999999999987654322 22345789999999999999999999999875 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 
75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.++++++ |+++..+.+++++++++|+||++||. +++|+++++.++|++++|+||+...+.+.++ T Consensus 104 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 176 (474) T protein:vir:96 104 SSDDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYI------DENGEFKTFRVPAEQAIPIWTNKERDTLKAF 176 (474) T ss_pred ecCchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 56677788888886 68999999999999999999999996 4578999999999999999999888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccc----------cccccccccccccccceEEeecccccCccCCccc Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQ----------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~ 223 (480) +++|.... ..++++|+++++++|...++.... ........+|+||+||||+|+|++ +|.|+ T Consensus 177 vr~~~~~~----~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~-----~g~sd 247 (474) T protein:vir:96 177 IRYYRLDG----AERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNP-----QEMSD 247 (474) T ss_pred EEEEeecC----ceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccCC-----CCCCc Confidence 99996543 346799999999988765433211 112234578999999999999974 68999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCC--CceEEEeccccHHHH Q lcl|NC_021297. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASE--AAKISEFKAAELRNF 301 (480) Q Consensus 224 i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~~~~~~~~~ 301 (480) ++. +++|||+||.++|++++.++++++|+++++|..++.... ...++...+++.++++ ++++.+++. 
+.+.+ T Consensus 248 ~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~----~~~~~~~~~~i~~~~~~~~~~~l~~~~-~~~~~ 321 (474) T protein:vir:96 248 LFM-YKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDE----FMRNLKYYKAINVDGDGSGVDTIQIEV-PVQSS 321 (474) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccc----hhhhhhcCceEEecCCCCceeEEeecC-ChHHH Confidence 975 999999999999999999999999999999987653322 1223445566666544 455655432 33444 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~ 381 (480) ...++.+...|+..+++|+.+++..+ +++||+||+++++++++||.++++.|+++|++++++++.+.|.. .+..+++ T Consensus 322 ~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~--~~~~~i~ 398 (474) T protein:vir:96 322 KEYLDMLRDYVIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLN--IKVQDVE 398 (474) T ss_pred HHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceee Confidence 44444444444445555544444322 23699999999999999999999999999999999999998744 4566899 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) ++|+++.|.|+++.++.+.+ + |++|+||++.++|+++|+.++++++++|+.+.. +.... ...+. .+... 
T Consensus 399 i~f~~~~p~~~~e~~~~~~~---a--g~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~-~~~~~---~~~~~--~~~~~ 467 (474) T protein:vir:96 399 ITFNFNVMVNELEQSQIGVQ---S--QYLSKETVVTNHPWVDDPVAELERIEQDNIDFN-KQLPP---LEGDA--NGRAQ 467 (474) T ss_pred EEeccCCCcCHHHHHHHHHh---c--CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hcccc---ccccc--ccccC Confidence 99999999999999986543 3 479999999999999988888877776664432 22111 11111 11111 Q ss_pred CCCcccc Q lcl|NC_021297. 462 DTKTETQ 468 (480) Q Consensus 462 d~~~~~~ 468 (480) |.+.+++ T Consensus 468 d~~~e~~ 474 (474) T protein:vir:96 468 DNESETN 474 (474) T ss_pred CCcccCC Confidence 1111111 No 56 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=1.2e-75 Score=431.34 Aligned_cols=432 Identities=13% Similarity=0.089 Sum_probs=324.0 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcc--------cchhhhhhccccchHHHHHHHHHhhhccCc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIG--------APPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~--------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) +.+..++++.+...... +.++++++++||+|+|+|++.... .+....++|+++||+++||++.++|+++++ T Consensus 18 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p 97 (479) T protein:vir:79 18 KESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGNP 97 (479) T ss_pred cCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhcCC Confidence 55555544433333222 568899999999999999764332 222345779999999999999999999999 Q ss_pred cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 
72 FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 72 ~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) +++..+++..+.+.+.|..|+|+..+.++++.++++|++|++||. +++|++++++++|.+++|+||+...+++. T Consensus 98 ~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~ 171 (479) T protein:vir:79 98 IVFNADDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYI------NRKGEFKYVIIPAEEAIPIWDSKRQRELV 171 (479) T ss_pred ceeccCCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEe------CCCCceEEEEEccceeEEEEeCCCCCceE Confidence 987766666666777777899999999999999999999999985 45788999999999999999988888899 Q ss_pred EEEEEEEEec-CCceEEEEEEEeCCcEEEEEecCCcccc---------------cccccccccccccccceEEeeccccc Q lcl|NC_021297. 152 RAVRLYTTRD-DVAVPDRATLYLPDETVPLRRNGGLNDQ---------------WVVDGDVIKHGLGVVPVVPLTNDPRL 215 (480) Q Consensus 152 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~iPvv~~~n~~~~ 215 (480) +++++|...+ +++..+++++|+++.+++|...++.... .....+..+|+||.||||+|.|++ T Consensus 172 ~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~-- 249 (479) T protein:vir:79 172 AFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNE-- 249 (479) T ss_pred EEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCCC-- Confidence 9999987664 4556778999999999998776543211 112345678999999999999975 Q ss_pred CccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc-cCCCCceEEEec Q lcl|NC_021297. 216 GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFK 294 (480) Q Consensus 216 ~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~ 294 (480) +|+|+++ .|++|||+||.++|++++.++++++|+++++|...+...+. .. ....+.++. 
.+++++++.+++ T Consensus 250 ---~g~sd~~-~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~--~~--~~~~~~~i~~~~~~~~~~l~~~ 321 (479) T protein:vir:79 250 ---KCVSDLT-FYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEF--ID--NIRYYKSIKVDGGGGVDKLEIN 321 (479) T ss_pred ---CCCcchh-hhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccc--hh--hhhhccceecCCCCcceEEecc Confidence 5889996 59999999999999999999999999999999765433221 11 122233333 345678887655 Q ss_pred cc--cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Q lcl|NC_021297. 295 AA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR- 371 (480) Q Consensus 295 ~~--~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~- 371 (480) .. ..+++++.|+..++.++ ++|...++. . +++||+||++++++|.+||.++++.|+++|++++++++++.+. T Consensus 322 ~~~~~~~~~~~~l~~~i~~~s---~~p~~~~~~-~-gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 396 (479) T protein:vir:79 322 IPVEAKKELLDRLEKNIIIFG---QGVNPESQN-T-GDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKIS 396 (479) T ss_pred CCHHHHHHHHHHHHHHHHHHh---Ccccccccc-c-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 32 34455555555555555 455444432 2 3379999999999999999999999999999999999887653 Q ss_pred -CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_021297. 372 -EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKA 450 (480) Q Consensus 372 -~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (480) ....+..++++.|+++.|.|+++.+++++|+. |++|.||+++++|+++|+.++++++++|+.++...... 
.++ T Consensus 397 ~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~----g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~--~~~ 470 (479) T protein:vir:79 397 GNKSYDYKTVQITFNHSMIINEAEKIDMAAKST----GIVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDL--IPN 470 (479) T ss_pred CCCccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhc--cCc Confidence 34456678999999999999999999999974 46899999999999998888887777766543321111 110 Q ss_pred CcccccCCCCCCCCc Q lcl|NC_021297. 451 QADATPKPTVTDTKT 465 (480) Q Consensus 451 ~~~~~~~~~~~d~~~ 465 (480) ..++..++ + T Consensus 471 -----~~~~~~~e-~ 479 (479) T protein:vir:79 471 -----NQDGVIDE-T 479 (479) T ss_pred -----ccCCCcCc-C Confidence 11111111 1 No 57 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=1e-75 Score=431.70 Aligned_cols=443 Identities=13% Similarity=0.069 Sum_probs=323.1 Q ss_pred CCCHHH----HHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHHHHHHHhhhccCccc Q lcl|NC_021297. 1 MTTYHE----HVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~----~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |+++++ .|.+|+.+|.. +++|++++.+||+|+|++.... ........++|+++||+++||++.++||++++++ T Consensus 16 ~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~ 95 (506) T protein:vir:94 16 QESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPIN 95 (506) T ss_pred ccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcccCce Confidence 444332 24557777654 6889999999999999764222 1122334678999999999999999999999987 Q ss_pred c-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 
74 I-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 74 ~-~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) + +++++.++.|+++|+.|+|+.++.+++++++++|+||++||. +++|+++++++||.+++|+||+...+++.+ T Consensus 96 ~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~------ded~~~~i~~~~p~~~~~v~dd~~~~~~~~ 169 (506) T protein:vir:94 96 VKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYR------GEDNEEHLAKLDPLDTFVIYSTDVDPKPIM 169 (506) T ss_pred eecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEe------cCCCeeEEEEEcccceEEEecCCCCCceEE Confidence 5 445667889999999999999999999999999999999996 457899999999999999999888788999 Q ss_pred EEEEEEEecCCc-----eEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 153 AVRLYTTRDDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 153 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) ++++|...+..+ ..++.++|+++.+++|....... ...+..+|+||.||||+|+|++ .|.|+++ . T Consensus 170 ~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~----~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e-~ 239 (506) T protein:vir:94 170 AVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMG----KMQVDTTKPITTFPVVEFKNSN-----FRLGDFE-N 239 (506) T ss_pred EEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCcc----ceeccccccCCccceEEecCCC-----CCCCchh-h Confidence 999986554322 33567889999988886554321 2344568999999999999986 4677886 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc-----------cc---------chhhhhhhhhcccC--- Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG-----------EN---------TTLDIYYGRILTLA--- 284 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~-----------~~---------~~~~~~~~~~~~~~--- 284 (480) +++|||+||+++|++++.++++++|+++++|.......... .. 
.......++++.++ T Consensus 240 ~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (506) T protein:vir:94 240 VLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGM 319 (506) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccc Confidence 99999999999999999999999999999996543221100 00 00011122222222 Q ss_pred -------CCCceEEEec--cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 285 -------SEAAKISEFK--AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 285 -------g~~~~~~~~~--~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) +.++++..++ ....++++++|...|+.++.++++++..|+ +++||+||++++.++.+||.++++.|+ T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~----~n~Sg~Aik~~~~~l~~k~~~k~~~~~ 395 (506) T protein:vir:94 320 TVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFA----SNSSGVAMQYKVLGTVELASTKRRMFE 395 (506) T ss_pred cccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc----ccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 2245554433 345566666666666666666655554444 237999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhC---CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHH Q lcl|NC_021297. 
356 GAWERAMRIAMQIMG---REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDW 432 (480) Q Consensus 356 ~~l~~~~~~~~~~~~---~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~ 432 (480) ++|++++++++++.+ .....+..++++.|+++.|.|.++.|++++|++ |++|+||+++++|+++|+.++++++ T Consensus 396 ~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~lp~v~d~~~E~~ri 471 (506) T protein:vir:94 396 RGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG----ATLPQKYLYQQLPGVTNPQDIVDMM 471 (506) T ss_pred HHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHH Confidence 999999999988754 334556778999999999999999999999983 4799999999999999988888877 Q ss_pred HHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 433 DKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) ++|+.+..... ......+...+.....+ +++.+=. T Consensus 472 ~~E~~~~~~~~--~~~~~~~~~~~~~~~~~-~~~~e~~ 506 (506) T protein:vir:94 472 KEQSANGDYSF--DQNGVISNDGQTNTTAT-QTDEEVR 506 (506) T ss_pred HHHHHHHhhcc--hhhcCCCcccCcccccc-ccccCCC Confidence 77765432221 11111111111122211 1111111 No 58 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=3.2e-75 Score=429.02 Aligned_cols=423 Identities=12% Similarity=0.097 Sum_probs=323.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCccc------chhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-.-.++|.+|+..|..+.+|+.++.+||+|+|+|....... 
++..+++|+++||+++||++.++|++++++++ T Consensus 24 ~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~ 103 (468) T protein:vir:96 24 YETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTY 103 (468) T ss_pred ccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhccCCcee Confidence 222356888999999999999999999999999986543322 22335789999999999999999999999875 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) + ++++..+.++++|+ |+++..+.++++.++++|++|++||. +++|.+++..++|.+++|+||+...+++.++ T Consensus 104 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ 176 (468) T protein:vir:96 104 GTEDEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYV------DEQGEFKTFRVPAEQAIPIWTNKERDELKAF 176 (468) T ss_pred ccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEE------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 56667888999986 78999999999999999999999996 4578899999999999999998878889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcc----------cccccccccccccccccceEEeecccccCccCCccc Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN----------DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~ 223 (480) +++|.... ..++++|+++++++|...++.. 
..........+|+||+||||+|.|++ +|.|+ T Consensus 177 ir~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~-----~g~sd 247 (468) T protein:vir:96 177 IRLYELDG----GERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNP-----QEVSD 247 (468) T ss_pred EEEEEecC----ceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCC-----CCCCc Confidence 99986543 2467999999998887764321 11123345678999999999999974 58999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC---CCCceEEEecc--ccH Q lcl|NC_021297. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA---SEAAKISEFKA--AEL 298 (480) Q Consensus 224 i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~---g~~~~~~~~~~--~~~ 298 (480) +++ |++|||+||.++|++++.++++++|+++++|+..++... ....+..++++.++ ++++++.+++. ... T Consensus 248 ~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~----~~~~~~~~~~i~~~~d~~~~~~~l~~~~~~~~~ 322 (468) T protein:vir:96 248 LFM-YKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEE----FMYNLKYYKAINVDGDGSGGVDTIQIDVPVQSA 322 (468) T ss_pred hHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccch----hhhhhhcCceEEecCCCCCcceEEeecCChHHH Confidence 975 999999999999999999999999999999987643222 12233344444443 33566655443 345 Q ss_pred HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) +++++.|+..++.++.++++.+..|+ +++||+||+++++++.+||+++++.|+++|++++++++++.|.. .+.. 
T Consensus 323 ~~~~~~l~~~I~~~s~~p~~~~~~~~----~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~--~d~~ 396 (468) T protein:vir:96 323 KEYLDMLRDYVIEFGQGVDFQQDKFG----NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLS--IKVQ 396 (468) T ss_pred HHHHHHHHHHHHHHhCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccc Confidence 55666666666666555555444433 24799999999999999999999999999999999999998754 4556 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccC Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) +++++|.++.|.|.++.|+++++ + |++|+||+++++|+++|+.++++++++|+++..... ....+..+..|. T Consensus 397 ~i~i~f~~~~p~d~~e~a~~~~~---~--g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~ 468 (468) T protein:vir:96 397 DVEITFNFNVMVNELEQSQIGVN---S--QYLSKETVVTNHPWVDDPVAEMERIDQEELALPSIE--EGLNGKENNEPT 468 (468) T ss_pred eeeEEecCCCCcCHHHHHHHHHh---c--CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHh--hccCCCCCCCCC Confidence 89999999999999999987654 2 578999999999999988888877777665432211 111111111111 No 59 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=1e-74 Score=426.24 Aligned_cols=430 Identities=12% Similarity=0.020 Sum_probs=319.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcch---hcC-------------cccchhhhhhccccchHHHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK---TIG-------------IGAPPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~---~~~-------------~~~~~~~~~~~~~~n~~~~iVd~~~ 64 (480) ..|+ ++|+++|..|..+++|+.++.+||+|.+... ... 
....+..+++|+++||+++||++.+ T Consensus 14 ~~~~-e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 92 (474) T protein:vir:10 14 GILP-KHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRV 92 (474) T ss_pred CCCH-HHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHh Confidence 2332 5799999999999999999999999975421 110 0111223467999999999999999 Q ss_pred hhhccCccccCC------CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEcccee Q lcl|NC_021297. 65 DRLDIEGFRISE------DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) Q Consensus 65 ~~l~~~~~~~~~------d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~ 138 (480) +|++++|+++.. +++..+.|+++|+.|+|+.++.+++++++++|+||++||. +++|++++++++|.++ T Consensus 93 ~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~------d~~~~~~~~~i~p~~~ 166 (474) T protein:vir:10 93 GYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYI------DTNGDIRIKNIDPYNV 166 (474) T ss_pred hheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------CCCCeeEEEEEcccce Confidence 999999977532 3455678899999999999999999999999999999985 4678999999999999 Q ss_pred EEEEeCCCCCeeEEEEEEEEEecC--CceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccC Q lcl|NC_021297. 139 YAELDPRNTRRVTRAVRLYTTRDD--VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~ 216 (480) +|+||+.. .+.+++++|...++ +...+++++|++..++.|...+... 
....+..+|+||.||||+|+|++ T Consensus 167 ~~v~d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~--- 238 (474) T protein:vir:10 167 IFVGDNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDA---LQEVGRYEHLFDYNPLFGVPNNK--- 238 (474) T ss_pred EEEEcCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCc---ccccccccCCCCccceEEecCCC--- Confidence 99998643 46788888876653 3345678999999998887654332 34566789999999999999975 Q ss_pred ccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc-cCCCCceEEEecc Q lcl|NC_021297. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFKA 295 (480) Q Consensus 217 ~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~~ 295 (480) +|.|++++ |++|||+||.++|++++.++++++|+++++|+..++.. ...++. .+.++. .+++++++.+++. T Consensus 239 --~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~----~~~~~~-~~~i~~~~~~~~~~~l~~~~ 310 (474) T protein:vir:10 239 --EMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEM----IQETQK-SGAFELFDKDMDVKYLTKDV 310 (474) T ss_pred --CCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchh----hhhhhh-cceeEecCCCCceeEEeccC Confidence 58899975 99999999999999999999999999999998654321 111121 122222 3355666665543 Q ss_pred --ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_021297. 296 --AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-- 371 (480) Q Consensus 296 --~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-- 371 (480) ...++++++|+..|+. .+++|+.+++..++| +||+||++++++|++||+++++.|+++|++++++++++.+. 
T Consensus 311 ~~~~~~~~~~~l~~~I~~---~s~~p~~~~~~~~~n-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:10 311 NDTMIENHLDRIEKNIMR---FAKSVNFNSDEFNGN-VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred CHHHHHHHHHHHHHHHHH---HhCCccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 3344555555555554 555554444432333 69999999999999999999999999999999999887642 Q ss_pred --CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_021297. 372 --EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 372 --~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) ....++.++++.|+++.|.|.++.|++++|+. |++|+||+++++|+++|+.++++++++|+++. .+....... T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~-~~~~~~~~~ 461 (474) T protein:vir:10 387 YNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK----GQVSERTRLGQSQLVDDVDYELDEMEKESLEF-NDKLPDIDE 461 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-HhhcccccC Confidence 23345678999999999999999999999974 47899999999999999888888777666443 232222111 Q ss_pred cCcccccCCCCCCCCcc Q lcl|NC_021297. 450 AQADATPKPTVTDTKTE 466 (480) Q Consensus 450 ~~~~~~~~~~~~d~~~~ 466 (480) +..+..++... ++ T Consensus 462 ~~~~~~~~~~~----s~ 474 (474) T protein:vir:10 462 GDANDKSQNNQ----SE 474 (474) T ss_pred CCcCCCCcccc----CC Confidence 11111111111 11 No 60 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=1e-74 Score=426.24 Aligned_cols=430 Identities=12% Similarity=0.020 Sum_probs=319.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcch---hcC-------------cccchhhhhhccccchHHHHHHHHH Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK---TIG-------------IGAPPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~---~~~-------------~~~~~~~~~~~~~~n~~~~iVd~~~ 64 (480) ..|+ ++|+++|..|..+++|+.++.+||+|.+... ... ....+..+++|+++||+++||++.+ T Consensus 14 ~~~~-e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 92 (474) T protein:vir:94 14 GILP-KHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRV 92 (474) T ss_pred CCCH-HHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHh Confidence 2332 5799999999999999999999999975421 110 0111223467999999999999999 Q ss_pred hhhccCccccCC------CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEcccee Q lcl|NC_021297. 65 DRLDIEGFRISE------DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) Q Consensus 65 ~~l~~~~~~~~~------d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~ 138 (480) +|++++|+++.. +++..+.|+++|+.|+|+.++.+++++++++|+||++||. +++|++++++++|.++ T Consensus 93 ~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~------d~~~~~~~~~i~p~~~ 166 (474) T protein:vir:94 93 GYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYI------DTNGDIRIKNIDPYNV 166 (474) T ss_pred hheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------CCCCeeEEEEEcccce Confidence 999999977532 3455678899999999999999999999999999999985 4678999999999999 Q ss_pred EEEEeCCCCCeeEEEEEEEEEecC--CceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccC Q lcl|NC_021297. 139 YAELDPRNTRRVTRAVRLYTTRDD--VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~ 216 (480) +|+||+.. .+.+++++|...++ +...+++++|++..++.|...+... 
....+..+|+||.||||+|+|++ T Consensus 167 ~~v~d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~--- 238 (474) T protein:vir:94 167 IFVGDNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDA---LQEVGRYEHLFDYNPLFGVPNNK--- 238 (474) T ss_pred EEEEcCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCc---ccccccccCCCCccceEEecCCC--- Confidence 99998643 46788888876653 3345678999999998887654332 34566789999999999999975 Q ss_pred ccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc-cCCCCceEEEecc Q lcl|NC_021297. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFKA 295 (480) Q Consensus 217 ~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~~ 295 (480) +|.|++++ |++|||+||.++|++++.++++++|+++++|+..++.. ...++. .+.++. .+++++++.+++. T Consensus 239 --~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~----~~~~~~-~~~i~~~~~~~~~~~l~~~~ 310 (474) T protein:vir:94 239 --EMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEM----IQETQK-SGAFELFDKDMDVKYLTKDV 310 (474) T ss_pred --CCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchh----hhhhhh-cceeEecCCCCceeEEeccC Confidence 58899975 99999999999999999999999999999998654321 111121 122222 3355666665543 Q ss_pred --ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_021297. 296 --AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-- 371 (480) Q Consensus 296 --~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-- 371 (480) ...++++++|+..|+. .+++|+.+++..++| +||+||++++++|++||+++++.|+++|++++++++++.+. 
T Consensus 311 ~~~~~~~~~~~l~~~I~~---~s~~p~~~~~~~~~n-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:94 311 NDTMIENHLDRIEKNIMR---FAKSVNFNSDEFNGN-VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred CHHHHHHHHHHHHHHHHH---HhCCccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 3344555555555554 555554444432333 69999999999999999999999999999999999887642 Q ss_pred --CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_021297. 372 --EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 372 --~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) ....++.++++.|+++.|.|.++.|++++|+. |++|+||+++++|+++|+.++++++++|+++. .+....... T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~-~~~~~~~~~ 461 (474) T protein:vir:94 387 YNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK----GQVSERTRLGQSQLVDDVDYELDEMEKESLEF-NDKLPDIDE 461 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-HhhcccccC Confidence 23345678999999999999999999999974 47899999999999999888888777666443 232222111 Q ss_pred cCcccccCCCCCCCCcc Q lcl|NC_021297. 450 AQADATPKPTVTDTKTE 466 (480) Q Consensus 450 ~~~~~~~~~~~~d~~~~ 466 (480) +..+..++... ++ T Consensus 462 ~~~~~~~~~~~----s~ 474 (474) T protein:vir:94 462 GDANDKSQNNQ----SE 474 (474) T ss_pred CCcCCCCcccc----CC Confidence 11111111111 11 No 61 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=3.7e-75 Score=428.64 Aligned_cols=451 Identities=10% Similarity=0.040 Sum_probs=318.9 Q ss_pred CC---------CHHHHHHHHHHHH--HHHHHHHHHHHHHhcccCcchhcCcc---------cchhhhhhccccchHHHHH Q lcl|NC_021297. 
1 MT---------TYHEHVERLQGLL--ARDLPNLLEAEAYRNGTRRLKTIGIG---------APPELAYLDVQPGWVATYL 60 (480) Q Consensus 1 ~~---------~~~~~i~~l~~~~--~~~~~r~~~~~~YY~g~~~i~~~~~~---------~~~~~~~~~~~~n~~~~iV 60 (480) || .+++++...+..| ..+++++.++++||+|+|+|...... .....+++|+++||+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 33 3345565555544 35679999999999999999654321 1223457899999999999 Q ss_pred HHHHhhhccCccccCCCch----hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccc Q lcl|NC_021297. 61 RTLSDRLDIEGFRISEDSE----GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPL 136 (480) Q Consensus 61 d~~~~~l~~~~~~~~~d~~----~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~ 136 (480) ++.++||+++++++..+++ ..+.+.+++ .|+|+..+.++++++++||+||+++|. +++|.++++.++|. T Consensus 81 d~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~------de~~~~~~~~i~p~ 153 (537) T protein:vir:78 81 DQLAQYLLSNGVEVKVKDEDNTQLDEILQEYF-DEDFQATIDTLVTNASKKGFEGIFART------TSEGKLKFQTVDGL 153 (537) T ss_pred HHHhhhhcccCceeecCcchhHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeee------cCCCceEEEEEccc Confidence 9999999999987654332 333444444 589999999999999999999999996 46788999999999 Q ss_pred eeEEEEeCCCCCeeEEEEEEEEEec------CCceEEEEEEEeCCcEEEEEecCCccccc-------------------- Q lcl|NC_021297. 137 YMYAELDPRNTRRVTRAVRLYTTRD------DVAVPDRATLYLPDETVPLRRNGGLNDQW-------------------- 190 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------- 190 (480) ++||+||+.. ++.+++++|.... +...++++++||++.+++|...++..... 
T Consensus 154 ~~~pv~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 231 (537) T protein:vir:78 154 TLIPVFDDYG--VLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEE 231 (537) T ss_pred eeEEEEcCCC--CceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccc Confidence 9999999753 5666777664322 33456789999999999998765432111 Q ss_pred --------ccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcc Q lcl|NC_021297. 191 --------VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTD 262 (480) Q Consensus 191 --------~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~ 262 (480) .......+|+||.||||+|.||. +|.|+++ +|++|||+||.++|++++.+++|++|+++++|...+ T Consensus 232 ~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~-----~~~sd~e-~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~ 305 (537) T protein:vir:78 232 STDADFEDTDGYQVLGRSYSKFPFQLLYNNK-----DGMSDVK-RVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGD 305 (537) T ss_pred cccccccccccccccccCCcceeEEEeccCc-----cCCCchh-hhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCc Confidence 12234567999999999999985 5788886 599999999999999999999999999999998664 Q ss_pred ccccccccchhhhhhhhhccc--CCCCceEEEe--ccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHH Q lcl|NC_021297. 263 ELTNDGENTTLDIYYGRILTL--ASEAAKISEF--KAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~--~g~~~~~~~~--~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~ 338 (480) ...+. .. 
.+...+++.+ +++++++.++ +....+.++++|+..|+.++..+++++..+| ++||+||++ T Consensus 306 ~~~~~--~~--~l~~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~g-----n~SGvAlk~ 376 (537) T protein:vir:78 306 STDKL--RQ--NIKAKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDG-----NVTNVVIKS 376 (537) T ss_pred cchhH--HH--HHhhcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCcccccc-----CCcHHHHHH Confidence 32221 11 1222333333 3455666544 4556788999999999998888886554322 269999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHH Q lcl|NC_021297. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQAR 416 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~ 416 (480) ++++|++||+++++.|+++|++++++|+.+.+. ....++.+++++|++++|.|+.+.++.++++.++| ++|++|++ T Consensus 377 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~g--iiS~eT~l 454 (537) T protein:vir:78 377 RYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETE--ALKIGNIM 454 (537) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcC--cchHHHHH Confidence 999999999999999999999999999887642 23456788999999999999999999999887654 78999999 Q ss_pred HhCCCChhHHHHHHHHHHHHH----HHHHHHHhcccccC---------------cccccCCCCCCCCc--ccccCCC-CC Q lcl|NC_021297. 417 IDLGYTATQREQMRDWDKQET----EDMIDTLYSTTKAQ---------------ADATPKPTVTDTKT--ETQTSPS-GF 474 (480) Q Consensus 417 ~~lg~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~---------------~~~~~~~~~~d~~~--~~~~~p~-~~ 474 (480) +++|+++++ +++ ++++++. +...+......... .+..+.|..++.++ ...++|+ .. 
T Consensus 455 ~~~p~vdd~-e~e-k~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 532 (537) T protein:vir:78 455 TVAPRIGDD-ETL-KLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPN 532 (537) T ss_pred HhCCCCCCH-HHH-HHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCc Confidence 999999885 222 2222221 11111111111000 00111111111111 1112222 23 Q ss_pred CCCCC Q lcl|NC_021297. 475 NRTKT 479 (480) Q Consensus 475 ~~~~~ 479 (480) ++++| T Consensus 533 ~~~~~ 537 (537) T protein:vir:78 533 AVPQT 537 (537) T ss_pred cCCCC Confidence 34555 No 62 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=7.7e-56 Score=322.72 Aligned_cols=440 Identities=10% Similarity=0.049 Sum_probs=291.5 Q ss_pred CCCHHHHHHHH-HHHHHHHHHHHHHHHHHhcccCcchhcCc--ccchhhhhhccccchHHHHHHHHHhhhccCccccC-C Q lcl|NC_021297. 1 MTTYHEHVERL-QGLLARDLPNLLEAEAYRNGTRRLKTIGI--GAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-E 76 (480) Q Consensus 1 ~~~~~~~i~~l-~~~~~~~~~r~~~~~~YY~g~~~i~~~~~--~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~ 76 (480) ..++++++... +..+..+..++.++.+||+|+|++..... ...+...++++++|||++||+.+++|++++++++. + T Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~p~~i~~~ 98 (496) T protein:vir:38 19 LKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKINID 98 (496) T ss_pred chhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhCCcceEeeC Confidence 11222222111 11233455788999999999998743211 12233346678899999999999999999987654 5 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEE Q lcl|NC_021297. 
77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 77 d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) ++..++.|+++++.|+|...+.+++..++++|.+|+++|. |.+|.+++.+++|.+++|+|++.. .+..++++ T Consensus 99 d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~------D~~~~~~i~~v~~~~~~P~~~~~~--~~~~~~f~ 170 (496) T protein:vir:38 99 DKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYH------DGNKNVKVSFATADCMYPLSNDSE--NVDECVIA 170 (496) T ss_pred ChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEE------cCCCcEEEEEEcccceEEEEecCC--cEEEEEEE Confidence 6677889999999999999999999999999999999996 457889999999999999998653 34333333 Q ss_pred EEEecCCceEEEEEEEeCCc----EEE--EEecCCccccc-c--------cccccccccccccceEEee----cccccCc Q lcl|NC_021297. 157 YTTRDDVAVPDRATLYLPDE----TVP--LRRNGGLNDQW-V--------VDGDVIKHGLGVVPVVPLT----NDPRLGN 217 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~----~~~--~~~~~~~~~~~-~--------~~~~~~~~~~~~iPvv~~~----n~~~~~~ 217 (480) .....++...+.++.|+... +.+ |+...+...+. + .......++++++|++.|+ |+....+ T Consensus 171 ~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~ 250 (496) T protein:vir:38 171 NSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTS 250 (496) T ss_pred EEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCC Confidence 33333444556666665321 111 22222111110 0 0112233577888888774 5667888 Q ss_pred cCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhh----hcCCCccccccccccchhhhhhhhhc----ccCCCCce Q lcl|NC_021297. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----ISGVTTDELTNDGENTTLDIYYGRIL----TLASEAAK 289 (480) Q Consensus 218 ~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~----i~g~~~~~~~~~~~~~~~~~~~~~~~----~~~g~~~~ 289 (480) ++|.|+++ ++++|+|+||+++|++++.++.....+.+ +... .. ........+........ ...++... 
T Consensus 251 p~G~Sd~~-~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~-~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (496) T protein:vir:38 251 PLGISVYA-NALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTA-VN--LDGSTTQYFDSTDEAFFLYQGDQDDNGKA 326 (496) T ss_pred cCCCchHh-hHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhcc-CC--CCCccccCCCCccceEEEeecCCCccccc Confidence 99999997 58999999999999999998753322222 1110 00 00111111111111111 11122234 Q ss_pred EEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 290 ISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 290 ~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~ 368 (480) +.+++. ...++|++.++.+++.|+..++++++.||.+..+.+||.+++++++.|.+++..+++.|+.+|++++++++.+ T Consensus 327 i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~ 406 (496) T protein:vir:38 327 IKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEV 406 (496) T ss_pred ceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555543 3568999999999999999999999999988777789999999999999999999999999999999988754 Q ss_pred hC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCC-ChhHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 369 MG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGY-TATQREQMRDWDKQETEDMID 442 (480) Q Consensus 369 ~~-----~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~-~~~~~~~~~~~~~~~~~~~~~ 442 (480) .. .+...+...+++.|.++.+.|..+.++++++++++| ++|+++++..+++ ++++++++.++.++|.+.. 
T Consensus 407 ~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--iiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~-- 482 (496) T protein:vir:38 407 GKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQG--MIPLKIALQRAWNITEAEADEWAEMLAKEKQAE-- 482 (496) T ss_pred HHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcC--CCCHHHHHHhcCCCChHHHHHHHHHHHHhhhcc-- Confidence 21 222334467899999999999999999999999876 7899999988754 4454443333333222110 Q ss_pred HHhcccccCcccccCCCCCCCCcccc Q lcl|NC_021297. 443 TLYSTTKAQADATPKPTVTDTKTETQ 468 (480) Q Consensus 443 ~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) .|.++..+..++.+ T Consensus 483 ------------~~~~d~~~~~~~~e 496 (496) T protein:vir:38 483 ------------MPNNDMNGIFGEEE 496 (496) T ss_pred ------------CccccccCCCCCCC Confidence 00111111111111 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=8.5e-52 Score=300.57 Aligned_cols=443 Identities=12% Similarity=0.046 Sum_probs=292.1 Q ss_pred CCC-HHHHHHHHHHH------------------HHHHHHHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHH Q lcl|NC_021297. 1 MTT-YHEHVERLQGL------------------LARDLPNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~-~~~~i~~l~~~------------------~~~~~~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~i 59 (480) |-+ +..+|+.++++ +.....++.++.+||.|+|+..... ........++++++|+++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~i 80 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHH Confidence 443 33344333332 3345578899999999998753321 11223344678899999999 Q ss_pred HHHHHhhhccCccccC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEcccee Q lcl|NC_021297. 
60 LRTLSDRLDIEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) Q Consensus 60 Vd~~~~~l~~~~~~~~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~ 138 (480) |+++++|+++++.++. +++..++.|+++++.|+|...+.+++..|+.+|.+|+++|. |.+|++++.+++|.++ T Consensus 81 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~------D~~~~~~i~~v~a~~~ 154 (499) T protein:vir:80 81 AKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYH------DGNKNVKVSFATADCM 154 (499) T ss_pred HHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEE------CCCCcEEEEEEcCCce Confidence 9999999999876543 56677889999999999999999999999999999999996 4568999999999999 Q ss_pred EEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEe--CCcEEE-------EEecCCccccc-cc--------cccccccc Q lcl|NC_021297. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYL--PDETVP-------LRRNGGLNDQW-VV--------DGDVIKHG 200 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-------~~~~~~~~~~~-~~--------~~~~~~~~ 200 (480) +|+|.+.. .+..+++++....++...+.++.|+ .+.... |........+. +. ......++ T Consensus 155 ~Pi~~d~~--~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~ 232 (499) T protein:vir:80 155 YPLSNDSE--NVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPS 232 (499) T ss_pred EEEEecCC--CeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecC Confidence 99876542 3333333332233344455555543 222111 22111111110 00 11112246 Q ss_pred ccccceEEee----cccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-cchhhh Q lcl|NC_021297. 201 LGVVPVVPLT----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-NTTLDI 275 (480) Q Consensus 201 ~~~iPvv~~~----n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~~~~ 275 (480) ++++|++.|+ |+....+|+|+|+++. +++|+|+||+.+|++++.++....++.+-..+-.......+. ...+.. 
T Consensus 233 ~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 311 (499) T protein:vir:80 233 LTRPTFIYIKPNIANNKNLTSPLGISVYAN-ALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYFDS 311 (499) T ss_pred CCccceEeecCCccccccCCCccCCchHhh-HHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcccCCCc Confidence 8899998885 5567788999999975 899999999999999999986544443310000000000111 111111 Q ss_pred hhhhhc----ccCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 276 YYGRIL----TLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 276 ~~~~~~----~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~ 350 (480) ...... ..++....+.+++. -..++|++.++.++++|...+|++++.||....+..||.+++++++.+.+++..+ T Consensus 312 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~ 391 (499) T protein:vir:80 312 TDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSH 391 (499) T ss_pred ccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHH Confidence 111111 11222234555543 3568899999999999999999999999987777778999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC-CChh Q lcl|NC_021297. 351 GRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG-YTAT 424 (480) Q Consensus 351 ~~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg-~~~~ 424 (480) ++.|+.+|++++++++.+.. 
.+...+...+.+.|.+..+.|..+.++.+.+++++| ++|+++++..++ .+++ T Consensus 392 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et~l~~~~~~~d~ 469 (499) T protein:vir:80 392 SQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQG--MIPLKIALQRAWNITEA 469 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHhhcCCCChH Confidence 99999999999998875421 122334568999999999999999999999999876 789999988774 5555 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcc Q lcl|NC_021297. 425 QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTE 466 (480) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +++++.+++++|... .+ +.++..+..++.+ T Consensus 470 ea~~el~~i~~E~~~---~~---------~~~d~~g~~ge~e 499 (499) T protein:vir:80 470 EADEWAEMLAKEKQA---EI---------PNNDMTGIFGEEE 499 (499) T ss_pred HHHHHHHHHHHHhhc---CC---------CCCCccccCCCCC Confidence 554443333333210 00 0111111111111 No 64 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=1.9e-51 Score=298.66 Aligned_cols=458 Identities=11% Similarity=0.106 Sum_probs=335.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d 77 (480) |... .-..-..|+.+|+.|.+||.+++.-...-....+......+.++..+++|...-. +.+.|.. 
-..+ T Consensus 21 ~p~~------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~-~~~~g~~~~~~~~~ 93 (527) T protein:vir:10 21 FPNA------VTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMR-FLGQGLKWEFSKKD 93 (527) T ss_pred Cccc------CCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCCcce-eeccCccccccchh Confidence 3221 1223346788999999999997532211111111112356778888999986544 3455543 2346 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEE-- Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV-- 154 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~-- 154 (480) ++....+..++++|++..++.+..+++.+.|++ |.++|-+.+ .+.++++++.+||.+.||+.|+...+.+..+- T Consensus 94 e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k---~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~ 170 (527) T protein:vir:10 94 AKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEK---DEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLV 170 (527) T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCC---CcCCCceEeecCcceeeeeecCCCCCceeeEEEe Confidence 677888899999999999999999999999996 666664322 23468999999999999999987766666552 Q ss_pred EEEEEecCCceEE---------------------EEEEEeC-----CcEEEEEec---CC--cccccccccccccccccc Q lcl|NC_021297. 155 RLYTTRDDVAVPD---------------------RATLYLP-----DETVPLRRN---GG--LNDQWVVDGDVIKHGLGV 203 (480) Q Consensus 155 ~~~~~~~~~~~~~---------------------~~~~~~~-----~~~~~~~~~---~~--~~~~~~~~~~~~~~~~~~ 203 (480) .-|...++.+... -.++||. +.+...... .. 
.......+.+..|+++++ T Consensus 171 ~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~f 250 (527) T protein:vir:10 171 DEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITT 250 (527) T ss_pred eeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCCCc Confidence 2244333322111 1122222 111100000 00 000112235567899999 Q ss_pred cceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc Q lcl|NC_021297. 204 VPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL 283 (480) Q Consensus 204 iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (480) ||||+|+|.+..++.||+|+|+ ++++++|++|+++|+.+.++++.+.|+.++.|..+-+ ..+....+.++++.+|.+ T Consensus 251 iPvV~~~t~p~~~~~WG~S~La-~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd--~~G~~~~~~VgPG~iweL 327 (527) T protein:vir:10 251 LPVFHFRGHPIMNAMFGRSGLA-GLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRD--SRGNMVPWTISPLGMVEH 327 (527) T ss_pred cceEeecCCCccccccChhhHh-HHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccccc--ccCCcCccccCCceeEec Confidence 9999999999999999999998 5999999999999999999999999999999987653 234446688999999988 Q ss_pred CCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 284 ASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 284 ~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) + +++++..++. .+++.|.+++..+...|+.++++|..+||. +..++.||.||+..+++|.+++.+++..++...++. 
T Consensus 328 ~-e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~ 406 (527) T protein:vir:10 328 G-QNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQF 406 (527) T ss_pred C-CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5 5578877775 578899999999999999999999999993 456778999999999999999999999888888765 Q ss_pred HH-HHH----HHhCCCC--cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHH Q lcl|NC_021297. 362 MR-IAM----QIMGREV--TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRD 431 (480) Q Consensus 362 ~~-~~~----~~~~~~~--~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~~~~~~ 431 (480) .+ .+. ++.+... ......+.|.|.+++|.|.++.++.+++++++| ++|.++++++| |+..++-.++++ T Consensus 407 ~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aG--i~S~~tAv~~L~~~~g~eD~E~E~~~ 484 (527) T protein:vir:10 407 FYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAG--LIPAKKLTEELSKIMGFELTEEDFKQ 484 (527) T ss_pred hhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcC--chhHHHHHHHHHhccCCCChHHHHHH Confidence 43 221 2222211 122356799999999999999999999999987 78999998887 667777777777 Q ss_pred HHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCC Q lcl|NC_021297. 432 WDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGF 474 (480) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~ 474 (480) +.+++...+++...+..+-++.++...+.++++.++++.|... 
T Consensus 485 I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 485 ATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 7777777777777777777777788888889888998887766 No 65 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=2.2e-51 Score=298.32 Aligned_cols=458 Identities=11% Similarity=0.107 Sum_probs=335.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d 77 (480) |... .-..-..|+.+|+.|.+||.+++.-...-....+......+.++..+++|...-. +.+.|.. -..+ T Consensus 21 ~p~~------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~-~~~~g~~~~~~~~~ 93 (527) T protein:vir:10 21 FPNA------VTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMR-FLGQGLKWEFSKKD 93 (527) T ss_pred Cccc------CCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCCcce-eeccCccccccchh Confidence 3221 1223346788999999999997532211111111112356778888999986544 3455543 2346 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEE-- Q lcl|NC_021297. 
78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV-- 154 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~-- 154 (480) ++....+..++++|++..++.+..+++.+.|++ |.++|-+.+ .+.++++++.+||.+.||+.|+...+.+..+- T Consensus 94 e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k---~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~ 170 (527) T protein:vir:10 94 AKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEK---DEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLV 170 (527) T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCC---CcCCCceEeecCcceeeeeecCCCCCceeeEEEe Confidence 677888899999999999999999999999996 666664322 23468999999999999999987766666552 Q ss_pred EEEEEecCCceEE---------------------EEEEEeC-----CcEEEEEec---CC--cccccccccccccccccc Q lcl|NC_021297. 155 RLYTTRDDVAVPD---------------------RATLYLP-----DETVPLRRN---GG--LNDQWVVDGDVIKHGLGV 203 (480) Q Consensus 155 ~~~~~~~~~~~~~---------------------~~~~~~~-----~~~~~~~~~---~~--~~~~~~~~~~~~~~~~~~ 203 (480) .-|...++.+... -.++||. +.+...... .. .......+.+..|+++++ T Consensus 171 ~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~f 250 (527) T protein:vir:10 171 DEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITT 250 (527) T ss_pred eeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCCCc Confidence 2244333322111 1122222 111100000 00 000112234567899999 Q ss_pred cceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc Q lcl|NC_021297. 
204 VPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL 283 (480) Q Consensus 204 iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (480) ||||+|+|.+..++.||+|+|+ ++++++|++|+++|+.+.++++.+.|+.++.|..+-+ ..+....+.++++.+|.+ T Consensus 251 iPvV~~~t~p~~~~~WG~S~La-~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd--~~G~~~~~~VgPG~iweL 327 (527) T protein:vir:10 251 LPVFHFRGHPIMNAMFGRSGLA-GLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRD--SRGNMVPWTISPLGMVEH 327 (527) T ss_pred cceEeecCCCccccccChhhHh-HHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccccc--ccCCcCccccCCceeEec Confidence 9999999999999999999998 5999999999999999999999999999999987653 234446688999999988 Q ss_pred CCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 284 ASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 284 ~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) + +++++..++. .+++.|.++++.+...|+.++++|..+||. +..++.||.||+..+++|.+++.+++..++...++. T Consensus 328 ~-e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~ 406 (527) T protein:vir:10 328 G-QNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQF 406 (527) T ss_pred C-CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5 5578877775 578899999999999999999999999993 456778999999999999999999999888888765 Q ss_pred HH-HHH----HHhCCCC--cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHH Q lcl|NC_021297. 362 MR-IAM----QIMGREV--TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRD 431 (480) Q Consensus 362 ~~-~~~----~~~~~~~--~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~~~~~~ 431 (480) .+ .+. ++.+... 
......+.|.|.+++|.|.++.++.+++++++| ++|.++++++| |+..++-.++++ T Consensus 407 ~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aG--iiS~etAv~~L~~~~g~eD~E~E~~~ 484 (527) T protein:vir:10 407 FYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAG--LIPAKKLTEELSKIMGFELTEEDFRQ 484 (527) T ss_pred hhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcC--chhHHHHHHHHHhccCCCchHHHHHH Confidence 43 221 2222211 122356799999999999999999999999987 78999998887 667777777777 Q ss_pred HHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCC Q lcl|NC_021297. 432 WDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGF 474 (480) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~ 474 (480) +.+++...+++...+..+-++.++...+.++++.++++.|... T Consensus 485 I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 485 ATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 7777777777777777777777788888889888998887766 No 66 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=9.7e-49 Score=283.80 Aligned_cols=442 Identities=11% Similarity=0.056 Sum_probs=295.9 Q ss_pred CCCHHHHHHHHHHHH------------------HHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLL------------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~------------------~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~ 62 (480) ++.++.++.+.+... ...+.|+.++.+||+|+++............+..+.++|+|+.||+. 
T Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i~~~ 83 (508) T protein:vir:15 4 IQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTAARR 83 (508) T ss_pred HHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHHHHH Confidence 444555555533331 12457899999999999876432222223334556788999999999 Q ss_pred HHhhhccCc--cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEE Q lcl|NC_021297. 63 LSDRLDIEG--FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA 140 (480) Q Consensus 63 ~~~~l~~~~--~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~ 140 (480) +|+++++++ +++++++..++.|+++++.|+|...+.+++..++..|.+|+.+|.. ++.++|.+++|++++| T Consensus 84 ~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~i~~v~ad~~~P 156 (508) T protein:vir:15 84 IASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYID-------GNHIKIAWVRADQFYP 156 (508) T ss_pred HHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEe-------CCeeEEEEEcCCeeEE Confidence 999998875 5566666677889999999999999999999999999999999863 3578999999999999 Q ss_pred EEeCCCCCeeEEEE-EEEEEec--CCceEEEEEEEe-----CCcEEE--EEecCCccccccc-----------ccccccc Q lcl|NC_021297. 141 ELDPRNTRRVTRAV-RLYTTRD--DVAVPDRATLYL-----PDETVP--LRRNGGLNDQWVV-----------DGDVIKH 199 (480) Q Consensus 141 v~d~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~-----~~~~~~--~~~~~~~~~~~~~-----------~~~~~~~ 199 (480) +..+. ++...+++ +.+...+ +...++.++.|+ ...|.+ |+..+....+... 
..+...+ T Consensus 157 ~~~d~-~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~ 235 (508) T protein:vir:15 157 LQSNT-NDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTIS 235 (508) T ss_pred EEEcC-CCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEec Confidence 74333 33333333 2222222 233455666665 223322 3322211101100 0112235 Q ss_pred cccccceEEe----ecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh Q lcl|NC_021297. 200 GLGVVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI 275 (480) Q Consensus 200 ~~~~iPvv~~----~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~ 275 (480) ++.++|++.| +|+....+|+|.|+++. +++++|+||..+|++++.++ .+.+..++.-.-.. .+..+...+.. T Consensus 236 g~~~p~f~y~~~~~~N~~~~~splG~S~~~~-~~~lid~lD~~~s~~~~e~~-~~~~~i~v~~~~l~--~d~~~~~~~~~ 311 (508) T protein:vir:15 236 GLQRPLFAYFKTPGANNINIESPLGLGVVDN-AKHVLDDINDTHDQFIWEIR-LGQKHIAVQPGMLR--FDDEHKPTFDT 311 (508) T ss_pred CCCcceeEEecCCccccccCCCCcCCchHhh-hHHHHHHHHHHHHHHHHHHH-hcccceeechHHhc--CCCCCccccCC Confidence 7778888877 57778889999999975 88999999999999999996 45444444211111 11122223332 Q ss_pred hhhhhccc---CCCCceEEEeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 276 YYGRILTL---ASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKG 351 (480) Q Consensus 276 ~~~~~~~~---~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~ 351 (480) ....+... ++....+.+++.. 
..++|.+.++.+++.|...+|+++..||....+..||.+++..++.+.+++..++ T Consensus 312 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~ 391 (508) T protein:vir:15 312 EQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYL 391 (508) T ss_pred CCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHH Confidence 22212211 2223346555543 5788999999999999999999999999887777799999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhC------CC-------CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh Q lcl|NC_021297. 352 RIFGGAWERAMRIAMQIMG------RE-------VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID 418 (480) Q Consensus 352 ~~~~~~l~~~~~~~~~~~~------~~-------~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~ 418 (480) +.|+.+|++++++++.+.. .+ ......+++|.|.+.+++|..++++..++++++| ++|+++++.. T Consensus 392 ~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aG--i~s~e~~i~~ 469 (508) T protein:vir:15 392 TMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIG--ALSKQTFLQR 469 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcC--CCCHHHHHHh Confidence 9999999999998876532 11 1223456899999999999999999999999886 6899998766 Q ss_pred C-CCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 419 L-GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 419 l-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) + |+++++++++.+++++|... 
....+++.+ ..++++++ T Consensus 470 ~~g~~deea~~el~ri~~E~~~-------~~~~~~~~~-~~~g~~ge 508 (508) T protein:vir:15 470 NYGMTDEQAAEELAKIQSEAPT-------DTFEGGRSA-ILNGGDGE 508 (508) T ss_pred cCCCChHHHHHHHHHHHHhccc-------cCccccccc-cCCCCCCC Confidence 5 77777766555444444211 111111111 11111222 No 67 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=1.7e-48 Score=282.51 Aligned_cols=439 Identities=12% Similarity=0.032 Sum_probs=292.6 Q ss_pred CCCHHHHHHHHHHHH------------------HHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLL------------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~------------------~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~ 62 (480) .+-++.++.++.... ...+.++.++.+||.|+++.............+.+.++|+++.||+. T Consensus 4 ~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i~~~ 83 (505) T protein:vir:79 4 WDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLASAK 83 (505) T ss_pred HHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHHHHH Confidence 455566666644321 12456778889999999875432222222333556788999999999 Q ss_pred HHhhhccCccccC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEE Q lcl|NC_021297. 63 LSDRLDIEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAE 141 (480) Q Consensus 63 ~~~~l~~~~~~~~-~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v 141 (480) +|+++++++.++. +|+..++.|+++++.|+|...+.+++..++..|.+++.+|.. 
+++++|.+++|++++|+ T Consensus 84 ~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-------~~~~~i~~v~ad~~~P~ 156 (505) T protein:vir:79 84 LASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-------SGKIKLAWATADQVYPL 156 (505) T ss_pred HHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-------CCceEEEEEcCCeeEEE Confidence 9999998875543 467788899999999999999999999999999999998862 46789999999999998 Q ss_pred EeCCCCCeeEEEEEEEEEecCC--ceEEEEEEEeC----CcEEE--EEecCCcccc----------cccc-ccccccccc Q lcl|NC_021297. 142 LDPRNTRRVTRAVRLYTTRDDV--AVPDRATLYLP----DETVP--LRRNGGLNDQ----------WVVD-GDVIKHGLG 202 (480) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~----~~~~~--~~~~~~~~~~----------~~~~-~~~~~~~~~ 202 (480) +.+..+....+++..|...++. ..++.++.|+. ..|.+ |........+ +... .+...++++ T Consensus 157 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~ 236 (505) T protein:vir:79 157 QADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLK 236 (505) T ss_pred EEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCC Confidence 6444333233333333333322 24556777763 22221 2222211111 1000 112225677 Q ss_pred ccceEEe----ecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhh----hcCCCccc--cccccccch Q lcl|NC_021297. 203 VVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----ISGVTTDE--LTNDGENTT 272 (480) Q Consensus 203 ~iPvv~~----~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~----i~g~~~~~--~~~~~~~~~ 272 (480) ++|++.| +|+....+|+|.|+++. +++++|+||..+|++++.++....++.+ +. ..... ......... 
T Consensus 237 ~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~-~~~~~~~~~~~~~~~~ 314 (505) T protein:vir:79 237 HPLFAFYRNKGANNKNFTSPMGMSLIDN-SYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLK-TGSSYGGQASETHPPM 314 (505) T ss_pred cceEEEecCCcccccccCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHhcccceeechHHhc-ccCCCCcccccccccC Confidence 7777766 56788889999999975 8899999999999999999863333222 11 11110 001111112 Q ss_pred hhhhhhhh--cccCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 273 LDIYYGRI--LTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAER 349 (480) Q Consensus 273 ~~~~~~~~--~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~ 349 (480) +....... +..+++++.+.+++.. ..++|++.++.++++|...+|+++..||....+..+|.+++..++.+.++++. T Consensus 315 fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~ 394 (505) T protein:vir:79 315 FDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSS 394 (505) T ss_pred CCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHH Confidence 22211111 1223444556666543 56889999999999999999999999998777777899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhCC-----------CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh Q lcl|NC_021297. 350 KGRIFGGAWERAMRIAMQIMGR-----------EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID 418 (480) Q Consensus 350 ~~~~~~~~l~~~~~~~~~~~~~-----------~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~ 418 (480) +++.|+.+|+++++.++.+... ........+.|.|.+.++.|..++++...+++++| ++|.++++.. 
T Consensus 395 ~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~G--i~s~e~~l~~ 472 (505) T protein:vir:79 395 YITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQ--VMPKKQFLMR 472 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHHh Confidence 9999999999999988765321 11223357899999999999999999999999886 6899998766 Q ss_pred C-CCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCC Q lcl|NC_021297. 419 L-GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTD 462 (480) Q Consensus 419 l-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) + |+++++++++.+++++|... . .+....-++| T Consensus 473 ~~~~~eeea~~el~ri~~E~~~----------~--~p~~~~~gg~ 505 (505) T protein:vir:79 473 NYGLDEEEADEWLAQIDAENST----------A--EPEFNQFGGD 505 (505) T ss_pred cCCCChHHHHHHHHHHHHhccc----------c--CCCchhccCC Confidence 6 67777665554444443211 0 0001111111 No 68 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=5.2e-47 Score=274.35 Aligned_cols=441 Identities=9% Similarity=0.012 Sum_probs=292.9 Q ss_pred CCCHHHHHHHHHHHH-----------------HHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-----------------~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) .+-++.|+++.+... 
..+..|+.++.+||+|+++........++...+.+.++|+|+.||+.+ T Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~ 83 (500) T protein:vir:98 4 IQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKI 83 (500) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHH Confidence 445555655533221 123478899999999996543222223334456678889999999999 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEE Q lcl|NC_021297. 64 SDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL 142 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~ 142 (480) ++++++++.++ .+|+..++.++++++.|+|...+.+++..++..|.+|+.+|.. .++++|.+++|++++|+. T Consensus 84 A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~ 156 (500) T protein:vir:98 84 ASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQ 156 (500) T ss_pred hhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEE Confidence 99999877542 3466788899999999999999999999999999999998863 356899999999999986 Q ss_pred eCCCCCeeEEEEEEEEE--ecCC-ceEEEEEEEe--CCc---EEE--EEecCCcccc------ccc---ccccccccccc Q lcl|NC_021297. 143 DPRNTRRVTRAVRLYTT--RDDV-AVPDRATLYL--PDE---TVP--LRRNGGLNDQ------WVV---DGDVIKHGLGV 203 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~--~~~---~~~--~~~~~~~~~~------~~~---~~~~~~~~~~~ 203 (480) .+.. +...+++.++.. .++. ..++.++.|+ ++. 
|.+ |+.......+ .++ ..+...+++++ T Consensus 157 ~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 235 (500) T protein:vir:98 157 SNTQ-DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTR 235 (500) T ss_pred EcCC-CeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCC Confidence 5543 345555544322 2222 2344566654 333 221 3322111100 011 11222356677 Q ss_pred cceEEe----ecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CC---Cccc-cccccccchhh Q lcl|NC_021297. 204 VPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GV---TTDE-LTNDGENTTLD 274 (480) Q Consensus 204 iPvv~~----~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~-g~---~~~~-~~~~~~~~~~~ 274 (480) +|++.| +|+....+|+|.|+++. +++++|++|..+|++++.++. +.+..++. .+ +... ..+......+. T Consensus 236 p~f~~~~~~~~N~~~~~sp~G~S~~~~-~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d 313 (500) T protein:vir:98 236 PIFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFE 313 (500) T ss_pred ccEEEecCCccccccCCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccC Confidence 777665 57888899999999975 899999999999999999986 33322331 11 1100 00111111222 Q ss_pred hhhhhhccc---CCCCceEEEeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 275 IYYGRILTL---ASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 275 ~~~~~~~~~---~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~ 350 (480) ......... +++...+.+++.. 
..+.|.+.++.++++|...+++++..||....+..+|.+++++++.+.+++..+ T Consensus 314 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~ 393 (500) T protein:vir:98 314 SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSI 393 (500) T ss_pred CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHH Confidence 211111111 2333446555433 468899999999999999999999999987777778999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHH-hCCCChh Q lcl|NC_021297. 351 GRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARI-DLGYTAT 424 (480) Q Consensus 351 ~~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~-~lg~~~~ 424 (480) ++.|+.+|++++++++.+.. ........++.|.|.++.++|..++++.+++++++| ++|.++++. .+|++++ T Consensus 394 ~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aG--i~s~~~~i~~~~g~~ee 471 (500) T protein:vir:98 394 VALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAG--FGTREMAIQKVLNVTEE 471 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHHhcCCCCHH Confidence 99999999999998875531 112223457899999999999999999999999986 689999774 4589988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 425 QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) +++++.++.+++.. ...+..++ .++--|. 
T Consensus 472 ea~~~l~~i~~E~~--------~~~~~~~~---------~~~~~g~ 500 (500) T protein:vir:98 472 KAQEIAAEINTGIV--------DEINQQRT---------DTHLYGE 500 (500) T ss_pred HHHHHHHHHHHhcc--------ccCCCCCc---------cccccCC Confidence 77766555444321 00000000 0111111 No 69 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=5.2e-47 Score=274.35 Aligned_cols=441 Identities=9% Similarity=0.012 Sum_probs=292.9 Q ss_pred CCCHHHHHHHHHHHH-----------------HHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-----------------~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) .+-++.|+++.+... ..+..|+.++.+||+|+++........++...+.+.++|+|+.||+.+ T Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~ 83 (500) T protein:vir:30 4 IQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKI 83 (500) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHH Confidence 445555655533221 123478899999999996543222223334456678889999999999 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEE Q lcl|NC_021297. 64 SDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL 142 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~ 142 (480) ++++++++.++ .+|+..++.++++++.|+|...+.+++..++..|.+|+.+|.. .++++|.+++|++++|+. 
T Consensus 84 A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~ 156 (500) T protein:vir:30 84 ASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQ 156 (500) T ss_pred hhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEE Confidence 99999877542 3466788899999999999999999999999999999998863 356899999999999986 Q ss_pred eCCCCCeeEEEEEEEEE--ecCC-ceEEEEEEEe--CCc---EEE--EEecCCcccc------ccc---ccccccccccc Q lcl|NC_021297. 143 DPRNTRRVTRAVRLYTT--RDDV-AVPDRATLYL--PDE---TVP--LRRNGGLNDQ------WVV---DGDVIKHGLGV 203 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~--~~~---~~~--~~~~~~~~~~------~~~---~~~~~~~~~~~ 203 (480) .+.. +...+++.++.. .++. ..++.++.|+ ++. |.+ |+.......+ .++ ..+...+++++ T Consensus 157 ~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 235 (500) T protein:vir:30 157 SNTQ-DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTR 235 (500) T ss_pred EcCC-CeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCC Confidence 5543 345555544322 2222 2344566654 333 221 3322111100 011 11222356677 Q ss_pred cceEEe----ecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CC---Cccc-cccccccchhh Q lcl|NC_021297. 204 VPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GV---TTDE-LTNDGENTTLD 274 (480) Q Consensus 204 iPvv~~----~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~-g~---~~~~-~~~~~~~~~~~ 274 (480) +|++.| +|+....+|+|.|+++. +++++|++|..+|++++.++. +.+..++. .+ +... ..+......+. 
T Consensus 236 p~f~~~~~~~~N~~~~~sp~G~S~~~~-~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d 313 (500) T protein:vir:30 236 PIFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFE 313 (500) T ss_pred ccEEEecCCccccccCCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccC Confidence 777665 57888899999999975 899999999999999999986 33322331 11 1100 00111111222 Q ss_pred hhhhhhccc---CCCCceEEEeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 275 IYYGRILTL---ASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 275 ~~~~~~~~~---~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~ 350 (480) ......... +++...+.+++.. ..+.|.+.++.++++|...+++++..||....+..+|.+++++++.+.+++..+ T Consensus 314 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~ 393 (500) T protein:vir:30 314 SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSI 393 (500) T ss_pred CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHH Confidence 211111111 2333446555433 468899999999999999999999999987777778999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHH-hCCCChh Q lcl|NC_021297. 351 GRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARI-DLGYTAT 424 (480) Q Consensus 351 ~~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~-~lg~~~~ 424 (480) ++.|+.+|++++++++.+.. ........++.|.|.++.++|..++++.+++++++| ++|.++++. 
.+|++++ T Consensus 394 ~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aG--i~s~~~~i~~~~g~~ee 471 (500) T protein:vir:30 394 VALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAG--FGTREMAIQKVLNVTEE 471 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHHhcCCCCHH Confidence 99999999999998875531 112223457899999999999999999999999986 689999774 4589988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 425 QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) +++++.++.+++.. ...+..++ .++--|. T Consensus 472 ea~~~l~~i~~E~~--------~~~~~~~~---------~~~~~g~ 500 (500) T protein:vir:30 472 KAQEIAAEINTGIV--------DEINQQRT---------DTHLYGE 500 (500) T ss_pred HHHHHHHHHHHhcc--------ccCCCCCc---------cccccCC Confidence 77766555444321 00000000 0111111 No 70 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=2.7e-44 Score=259.46 Aligned_cols=451 Identities=9% Similarity=-0.005 Sum_probs=294.7 Q ss_pred CCCHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGL-----------------LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~-----------------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) .+.++.+++++... 
+..+..++.+..+||+|+++.........+..++.+.++|+++.||+.+ T Consensus 4 ~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~ 83 (522) T protein:vir:47 4 FQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKKI 83 (522) T ss_pred HHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHHHHH Confidence 56677777776643 3345578889999999986543221112233446678889999999999 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEE Q lcl|NC_021297. 64 SDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL 142 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~ 142 (480) |++++.++.++ .+|+..++.++++++.|+|...+.+++..++..|.+++.+|.. ++.++|.+++|.+++|+. T Consensus 84 A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-------~~~~~i~~v~ad~~~P~~ 156 (522) T protein:vir:47 84 ASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-------GDKVRVAFIQAPVFFPLE 156 (522) T ss_pred hhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-------CCceEEEEEcCCceEEEE Confidence 99999887653 3466788899999999999999999999999999988888863 467999999999999984 Q ss_pred eCCCCCeeEEEEEEEEEe--cCCc-eEEEEEEEe--C-------------CcEEE---EEecCCcccc----------cc Q lcl|NC_021297. 143 DPRNTRRVTRAVRLYTTR--DDVA-VPDRATLYL--P-------------DETVP---LRRNGGLNDQ----------WV 191 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~--~-------------~~~~~---~~~~~~~~~~----------~~ 191 (480) .+. .....+++...... ++.+ ..+.++.|+ . ..++. |+.......+ |. 
T Consensus 157 ~~~-~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~ 235 (522) T protein:vir:47 157 SNT-QDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYK 235 (522) T ss_pred EcC-CceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccccccccccc Confidence 333 33444544443322 2222 223344432 1 11111 2222111111 10 Q ss_pred c-ccccccccccccceEEe----ecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhh----hcCCCcc Q lcl|NC_021297. 192 V-DGDVIKHGLGVVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----ISGVTTD 262 (480) Q Consensus 192 ~-~~~~~~~~~~~iPvv~~----~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~----i~g~~~~ 262 (480) . .....-+++.+++++.| +|+....+|+|.|++.. .++++|++|..+|++++.++....++.+ +.-.... T Consensus 236 ~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~ 314 (522) T protein:vir:47 236 NLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQR 314 (522) T ss_pred CCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhh-hHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCC Confidence 0 11122246677777776 57778889999999975 7899999999999999999864443322 1100000 Q ss_pred ccccccccchhhhhhhhhccc---CCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHH Q lcl|NC_021297. 263 ELTNDGENTTLDIYYGRILTL---ASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~---~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~ 338 (480) ..........+......+... .++..++..++. -..+.|.+.++.+++.|...+++++..||.......+|.+++. 
T Consensus 315 ~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s 394 (522) T protein:vir:47 315 PDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVS 394 (522) T ss_pred CCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHH Confidence 000001111122111111111 122334555543 3577899999999999999999999999987777778999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHH Q lcl|NC_021297. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKE 413 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e 413 (480) ..+.+.++++++++.++.+|+++++.++.+.. .........+.|.|.+.+++|..+++....+++++| ++|.+ T Consensus 395 ~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG--~~s~e 472 (522) T protein:vir:47 395 ENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAG--FSTKK 472 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcC--CCCHH Confidence 99999999999999999999999999886542 122234467999999999999999999999999987 68999 Q ss_pred HHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 414 QARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 414 t~~~~-lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) +++.. .|+++++++++.++.++|... ..+.. .+.... 
.+++++++|-.| T Consensus 473 ~~i~~~~g~~eeea~~el~ri~~E~~~-------~~~~~------~~~~~~-~~~~~~~~d~~~ 522 (522) T protein:vir:47 473 RAIGKTLNISGVEAEKELNAINSELLP-------MNDAE------LAIYGM-HDQNEEKADDKG 522 (522) T ss_pred HHHHhcCCCChHHHHHHHHHHHHhhcc-------CCCCC------CCCCCC-CCcccccCCCCC Confidence 97655 588888766554444443210 01100 111111 123333333333 No 71 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=9.5e-45 Score=261.93 Aligned_cols=465 Identities=12% Similarity=0.091 Sum_probs=307.9 Q ss_pred CCCH-------HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc Q lcl|NC_021297. 1 MTTY-------HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~-------~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |..- ..|+. .....|+.+|+.|.+||.|++.-...-. +.-...-+..++.+++|++...+| +.|+. T Consensus 9 ~p~~~~fp~~~a~wV~---~~D~~RlaaY~ly~d~y~n~~~el~~il---~G~dr~~~~~ps~r~~V~~~~~~L-g~~~~ 81 (563) T protein:vir:74 9 DPAKPFLRGGDDNIVD---ENDKNRVRAYDLYENIYLNSAETLKLVL---RGDDSVPILMPSGRKIVEAVHRFL-GVGFD 81 (563) T ss_pred CCCcccccccccccCC---HHHHHHHHHHHHHHHhhcCchhhhhhhc---CCCceeeeccchHHHHHHHHHHhc-CCCcE Confidence 2221 11222 2345688999999999999875422100 110122233456789999976555 88765 Q ss_pred c--CC---Cc----hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeEEEEEccceeEEEEe Q lcl|NC_021297. 74 I--SE---DS----EGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRVESPLYMYAELD 143 (480) Q Consensus 74 ~--~~---d~----~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d 143 (480) + +. 
|+ ..+..|.++++++++..++.+..+++.+.|++ |.++|-+++ ...++++++.++|.+.||+-| T Consensus 82 ~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K---~~g~R~rv~~vDP~~~fp~~d 158 (563) T protein:vir:74 82 YLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNK---KAGERISVDEVDPRQIFLIED 158 (563) T ss_pred EecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecccc---ccCCCceEeecCCceeeeccC Confidence 4 32 22 12456788999999999999999999999996 666664322 245789999999999999666 Q ss_pred CCCCCeeEEEEE---EEEEecC-----------------CceE-----EEEEEEeCCcEE-------EEEecCCc--ccc Q lcl|NC_021297. 144 PRNTRRVTRAVR---LYTTRDD-----------------VAVP-----DRATLYLPDETV-------PLRRNGGL--NDQ 189 (480) Q Consensus 144 ~~~~~~~~~~~~---~~~~~~~-----------------~~~~-----~~~~~~~~~~~~-------~~~~~~~~--~~~ 189 (480) ++....+ +.++ -|...++ .+.. +-+|.|+.+.+- .+...... ... T Consensus 159 pd~v~g~-~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~ 237 (563) T protein:vir:74 159 GSTVVGF-HMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSAQ 237 (563) T ss_pred CCCcccc-eeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhhh Confidence 6543221 1111 1221211 1111 111222211100 00000000 011 Q ss_pred cccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc Q lcl|NC_021297. 190 WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE 269 (480) Q Consensus 190 ~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~ 269 (480) ...+.+..|++++.||+|+|+|.+..+++||+|.|++ ++++++++|+++++.+.++.+.++|+.++.|..+ .....+. 
T Consensus 238 ~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~-ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p-~d~~~g~ 315 (563) T protein:vir:74 238 HDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEG-METLAYALNQSLTDEDATIVFQGLGMYVTNASAP-VDPNTGE 315 (563) T ss_pred hhchhhhccccccCccEEEcCCCCCcccccchhhHHH-HHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccc-ccccccc Confidence 1123455688999999999999999999999999975 9999999999999999999999999988877553 3334455 Q ss_pred cchhhhhhhhhcccCCC--CceEEEecc-ccHHHHHHHHHHHHH-HHHhccCCChHHhcc-ccccchHHHHHHHHHHHHH Q lcl|NC_021297. 270 NTTLDIYYGRILTLASE--AAKISEFKA-AELRNFAEEMEVFRK-EAASITGLPPQYLSS-SSENPASAEAIIATDSRIV 344 (480) Q Consensus 270 ~~~~~~~~~~~~~~~g~--~~~~~~~~~-~~~~~~~~~l~~~~~-~i~~~~~~p~~~~g~-~~~n~~Sg~Al~~~~~~l~ 344 (480) ...|.++++.+|.+++. ...+-.++. .+++.+..+++.+.+ -++.++++|..+||. +.++..||.||+..+.+|+ T Consensus 316 ~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~ 395 (563) T protein:vir:74 316 LTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLL 395 (563) T ss_pred ccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHH Confidence 56788999999988654 345666664 467888888888877 568889999999994 3455679999999999999 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHH-HHhC-----C-------CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccc Q lcl|NC_021297. 345 KMAERKGRIFGGAWER----AMRIAM-QIMG-----R-------EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQ 407 (480) Q Consensus 345 ~k~~~~~~~~~~~l~~----~~~~~~-~~~~-----~-------~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~ 407 (480) ++++++++.+...+++ .+++.+ .+.+ . 
.....-..+.|+|.+++|.|.++....++.++++| T Consensus 396 a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aG- 474 (563) T protein:vir:74 396 AANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAH- 474 (563) T ss_pred HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcC- Confidence 9999999988888776 333333 1221 1 11112234788999999999999999999999987 Q ss_pred cCCCHHHHHHhC---CCChhH-HHHHHHHHHHHHHHHHH-HHhcccccCcccccCCCCCCCCcccccCCCCC-CCCC--- Q lcl|NC_021297. 408 GPIPKEQARIDL---GYTATQ-REQMRDWDKQETEDMID-TLYSTTKAQADATPKPTVTDTKTETQTSPSGF-NRTK--- 478 (480) Q Consensus 408 ~~~s~et~~~~l---g~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~-~~~~--- 478 (480) ++|.+|+.++| ||...+ ..++..++..+..+++. +..+....+.++..+++.++++.+++++|-+- +++- T Consensus 475 -iiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~ 553 (563) T protein:vir:74 475 -LILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQGNPIDQFGNPVEIP 553 (563) T ss_pred -chhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccCCchhHcCCcccCC Confidence 78999998777 876533 23333444444433222 22333455567777888888888888887532 2221 Q ss_pred ---CC Q lcl|NC_021297. 479 ---TR 480 (480) Q Consensus 479 ---~~ 480 (480) |. T Consensus 554 ~~~~~ 558 (563) T protein:vir:74 554 PDVTQ 558 (563) T ss_pred ccccc Confidence 11 No 72 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=5.7e-41 Score=241.22 Aligned_cols=445 Identities=9% Similarity=-0.013 Sum_probs=289.4 Q ss_pred CCCHHHHHHHHHHHH-----------------HHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHH Q lcl|NC_021297. 
1 MTTYHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-----------------~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) .+.++.|++++...+ .....|+.++.+||.|+++-.+.........+..+.++|+++.|+..+ T Consensus 4 ~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~~ 83 (517) T protein:vir:98 4 IQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADVL 83 (517) T ss_pred HHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHHHHh Confidence 556666666654331 123568888999999998643221111223345577889999999999 Q ss_pred HhhhccCc--cccCC----------CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE Q lcl|NC_021297. 64 SDRLDIEG--FRISE----------DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 64 ~~~l~~~~--~~~~~----------d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~ 131 (480) +++++.+. +++++ ....++.|+++++.|+|...+.+++..++..|.+++.+|.. ++.++|. T Consensus 84 A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-------~~~~~I~ 156 (517) T protein:vir:98 84 SGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD-------NGEIEFS 156 (517) T ss_pred hhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe-------CCeeEEE Confidence 99998775 33332 12356789999999999999999999999999999998863 4678999 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEE--ec-CCceEEEEEEEeCCcE--------EE---EEecCCccccccc----- Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTT--RD-DVAVPDRATLYLPDET--------VP---LRRNGGLNDQWVV----- 192 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~--------~~---~~~~~~~~~~~~~----- 192 (480) +++|++++|+-.+ .++...+++.+... .. ....++.++.|+.+.+ +. 
|........+..+ T Consensus 157 ~v~ad~~~Pl~~~-~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~ 235 (517) T protein:vir:98 157 WALANAFYPLRSN-SNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEEL 235 (517) T ss_pred EEcCCeeEEEEec-CCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccccccccc Confidence 9999999995433 34556565533322 22 2224455677766542 11 2222111111100 Q ss_pred ----ccccccccccccceEEe----ecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccc Q lcl|NC_021297. 193 ----DGDVIKHGLGVVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDE 263 (480) Q Consensus 193 ----~~~~~~~~~~~iPvv~~----~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~-g~~~~~ 263 (480) .....-+++.+++++.| +|+....+++|.|+++. +++++|++|..+|++++.++....++ ++. .+-... T Consensus 236 ~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~-a~~~~d~lD~~~s~~~~e~~~g~~~i-~vp~~~l~~~ 313 (517) T protein:vir:98 236 YEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDN-SVSTLKKINDTYDQFWWEIKMGQRTV-FVSDVMLRTV 313 (517) T ss_pred ccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhh-hHHHHHHHHHHHHHHHHHHHhCCcce-ecChhhhccc Confidence 11122245566555554 57778889999999975 78999999999999999988644332 221 110000 Q ss_pred cccc--cccchhhhhhhhh--cccCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHH Q lcl|NC_021297. 264 LTND--GENTTLDIYYGRI--LTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 264 ~~~~--~~~~~~~~~~~~~--~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~ 338 (480) .... ..+..|......+ +..+.+...+.+++.. ..+.|.+.++.+++.|...+|+++..||....+..+|.+++. 
T Consensus 314 ~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s 393 (517) T protein:vir:98 314 PDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVS 393 (517) T ss_pred cCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHH Confidence 0000 0111122111111 1222233445555432 457899999999999999999999999988777778999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH Q lcl|NC_021297. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIM------GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPK 412 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~------~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~ 412 (480) ..+.+.+++.++++.++.+|++++++++.+. ++.. ....++.|.|.+..++|..+++....+++++| ++|. T Consensus 394 ~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~-~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG--~ms~ 470 (517) T protein:vir:98 394 ENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEI-PSAEHIGVDFDDGVFQDRSALLRFYGQAKTFG--FIPT 470 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC-CCCcceEEEcCCCCCCCHHHHHHHHHHHHhcC--CCCH Confidence 9999999999999999999999999887543 2232 23357899999999999999999999999987 5899 Q ss_pred HHHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 413 EQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 413 et~~~~-lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) ++++.+ +|+++++++++.++.+++... ..+.. 
..+ .+++...|.++ T Consensus 471 ~~~i~~~~g~~eeeA~~e~~~i~~E~~~-------~~~~~----------~~~-~~~~~~~gd~e 517 (517) T protein:vir:98 471 VEAIQRIFKVPKKTAEQWLEEIRKDQIE-------LDPVT----------ISQ-RAQKRMFGDEE 517 (517) T ss_pred HHHHHHhCCCChHHHHHHHHHHHHhccc-------cCCCC----------ccc-cccCCCCCCCC Confidence 887654 599988876665444444210 00000 000 11111112222 No 73 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=1.8e-40 Score=238.53 Aligned_cols=440 Identities=11% Similarity=0.071 Sum_probs=276.0 Q ss_pred CCCHHHHHHHHHHHH--HHHHHHHHHHHHHhcccCc----c----hhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHEHVERLQGLL--ARDLPNLLEAEAYRNGTRR----L----KTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~--~~~~~r~~~~~~YY~g~~~----i----~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) .++.++.|+...+-- ..+..++..+.++|.+... + .+++...++...+.++++|+++.||+.+|++++.+ T Consensus 4 ~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll~~e 83 (518) T protein:vir:78 4 WSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYISGK 83 (518) T ss_pred hhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhhcCC Confidence 556566665544321 1233444555555544321 1 12233334455677889999999999999999988 Q ss_pred ccc--cC-----CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEe Q lcl|NC_021297. 71 GFR--IS-----EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELD 143 (480) Q Consensus 71 ~~~--~~-----~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d 143 (480) +.+ ++ +++..++.++++++.|+|...+.+++..++..|.+++.++.. 
+|+++|.+++|.+++|+|+ T Consensus 84 ~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~i~~v~ad~~~P~~~ 156 (518) T protein:vir:78 84 PLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-------NGRPSISVHSSSQFWIDFK 156 (518) T ss_pred CceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-------CCeeEEEEEcCCeeEEEee Confidence 654 32 355667899999999999999999999999999999888752 4789999999999999997 Q ss_pred CCCCCeeEEEEEEEEEec-CC-ceEEEEEEEeCC------------cEEE--EEecCCcc--cc------c---cc---- Q lcl|NC_021297. 144 PRNTRRVTRAVRLYTTRD-DV-AVPDRATLYLPD------------ETVP--LRRNGGLN--DQ------W---VV---- 192 (480) Q Consensus 144 ~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~------------~~~~--~~~~~~~~--~~------~---~~---- 192 (480) +. ++..+++...... +. ...+.++.|..+ .|.+ |+...+.. .. . .. T Consensus 157 ~g---~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~ 233 (518) T protein:vir:78 157 NN---EPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTND 233 (518) T ss_pred cC---cEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccccccccccccccccccc Confidence 53 2444443332222 22 223334444321 2211 22221110 00 0 00 Q ss_pred --ccccccccccccceEEee-----cccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc Q lcl|NC_021297. 193 --DGDVIKHGLGVVPVVPLT-----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT 265 (480) Q Consensus 193 --~~~~~~~~~~~iPvv~~~-----n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~ 265 (480) +...++++. ..|++.|. |+...++|+|.|+++. +++++|+||..+|++++.++. +.+..++.-.-..... 
T Consensus 234 ~~e~~~~~tg~-~~~~~~~~~n~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~ 310 (518) T protein:vir:78 234 IQLNHSVSIGL-KSMGAYLINNSPSNTRYPHLNLGESDLSQ-CTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKV 310 (518) T ss_pred CccceeeccCC-ccceEEeeccccccccccCCCcCcchHhh-hhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCC Confidence 011122333 34555542 5666788999999975 889999999999999999986 5554454311111011 Q ss_pred cc---cccchhhhhhhhhcccC-----CCCc--eEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHH Q lcl|NC_021297. 266 ND---GENTTLDIYYGRILTLA-----SEAA--KISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAE 334 (480) Q Consensus 266 ~~---~~~~~~~~~~~~~~~~~-----g~~~--~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~ 334 (480) +. .....+......+..+. |.++ .+.+++. -..+.|++.++.+++.|...+|+++..||..+. ..||. T Consensus 311 ~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~-~~TAT 389 (518) T protein:vir:78 311 NKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNR-EVKAT 389 (518) T ss_pred CCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccc-cccHH Confidence 11 11112332222222221 1111 2444432 356889999999999999999999999987544 46899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-------CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccc Q lcl|NC_021297. 335 AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-------EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQ 407 (480) Q Consensus 335 Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-------~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~ 407 (480) +++...+.+.+++.+++..++.+|+++++.++.+... 
....+...+.|.|.+.+++|..++++...+++++| T Consensus 390 ei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aG- 468 (518) T protein:vir:78 390 EIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSAL- 468 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcC- Confidence 9999999999999999999999999999988765432 11233467999999999999999999999999887 Q ss_pred cCCCHHHHHHhC--CCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 408 GPIPKEQARIDL--GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 408 ~~~s~et~~~~l--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) ++|++++++++ ++++++++++.+++++|... .. . +.|.+-+ +.+ +..| T Consensus 469 -imS~e~~i~~~~~~~~deea~~e~~ri~~E~~~----~~---~----~~p~~~~-----g~~-~~~g 518 (518) T protein:vir:78 469 -AMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAI----GE---V----PDPEAIG-----GME-TKGG 518 (518) T ss_pred -CCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcc----cC---C----CCCcccc-----CCC-CCCC Confidence 68999988753 67776665544333333210 00 0 0000000 000 0011 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.96 E-value=1.8e-28 Score=172.67 Aligned_cols=459 Identities=14% Similarity=0.114 Sum_probs=274.4 Q ss_pred CCCHH-HHHHHHHHHHHHHHHHHHHHHHHhcccCcchh-----cCcc---cchhhhhh---ccccchHHHHHHHHHhhhc Q lcl|NC_021297. 1 MTTYH-EHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-----IGIG---APPELAYL---DVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~-~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-----~~~~---~~~~~~~~---~~~~n~~~~iVd~~~~~l~ 68 (480) |+|-. +-+..--..|.+..++.+.+++-|.|...++. +++. ....|+.. 
-+..|+++.+|+.++++++ T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf 80 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPF 80 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhh Confidence 77753 33555555677788888899999988755543 2222 22233332 2457999999999999998 Q ss_pred cCccccCCCchhHHHHHH-HHH-----hcCHHHHHHHHHHHHhhcCeEEEEEecCccccc------------CCCceeEE Q lcl|NC_021297. 69 IEGFRISEDSEGLEELWN-WWQ-----ANDLDEESVLGHDDSLTFGRAYITVSHPDVESG------------DPAGIPLI 130 (480) Q Consensus 69 ~~~~~~~~d~~~~~~l~~-i~~-----~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~------------d~~~~~~i 130 (480) -+..+...+ ....+.+ +++ -++++.+++.++..++.+|+++++|..+..... ...-+|.+ T Consensus 81 ~k~p~~~~~--~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~ 158 (513) T protein:vir:97 81 SEPIKLNED--VPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYW 158 (513) T ss_pred hcCcccCcC--chHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceE Confidence 877655422 2233333 222 258999999999999999999999987654321 11225889 Q ss_pred EEEccceeEEEE-eCCCCCeeEEEEEE---EEEecC--CceEEEEEEEeCCcEEEEEecCCcccc--ccccccccccccc Q lcl|NC_021297. 131 RVESPLYMYAEL-DPRNTRRVTRAVRL---YTTRDD--VAVPDRATLYLPDETVPLRRNGGLNDQ--WVVDGDVIKHGLG 202 (480) Q Consensus 131 ~~~~p~~~~~v~-d~~~~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 202 (480) ..++|.+++--- +...++..+-.+++ +.+.|+ ......+.+|+++.+..|+...+.... 
.....+...|.++ T Consensus 159 ~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l~ 238 (513) T protein:vir:97 159 VMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGLN 238 (513) T ss_pred EEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcCC Confidence 999999876432 22222222222222 222332 223344567899887777665433221 1123344568999 Q ss_pred ccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc Q lcl|NC_021297. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT 282 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~ 282 (480) .||||+|.... ....-|.+.|. +|-.|..+.-+..|+...++.+.++|++++.|.+... ...+.++.+.+|. T Consensus 239 ~IP~v~~~~~~-~~~~~~~pPLl-~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~------~~~i~iG~~~~~~ 310 (513) T protein:vir:97 239 YVPLVTFYADR-QGFMMGKPPLL-DLAHLNVAHWQSASDQRHILTVSRFPILACSGASGED------SDPVVVGPNKVLY 310 (513) T ss_pred ceeEEEEecCC-CCCCCCccchH-HHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCC------CCceEeecccccc Confidence 99999987543 23344666665 4666666777889999999999999999999975432 1224577778887 Q ss_pred cC--CCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 283 LA--SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 283 ~~--g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) ++ +.++.+.+++...+....+.++.+..+|....-. 
-+..++.+ .||++.+.......+....+...+..+|.+ T Consensus 311 lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~---ll~~~~~~-~Ta~a~~~~~~~~~S~L~~~a~~le~al~~ 386 (513) T protein:vir:97 311 NPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAE---FLKRKTGG-QTATARALDSAEATSDLSAMTGLFEDALAQ 386 (513) T ss_pred CCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHH---hhccCCcc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 76 4567788988888888888888887777554321 12323333 689998888888888888889999999999 Q ss_pred HHHHHHHHhCCCCcccceeeeEEecCCCCCCH-HHHHHHHHHHHhccccCCCHHHHHHhC---CCChhH--HHHHHHHHH Q lcl|NC_021297. 361 AMRIAMQIMGREVTEEYTRLETVWRDPSTPTV-AAKADAVSKLYANGQGPIPKEQARIDL---GYTATQ--REQMRDWDK 434 (480) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~-~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~--~~~~~~~~~ 434 (480) +++++..++|.........+.-.| ....+ ++.++++.++.++| .+|.+|+++.| |+...+ .++..+.++ T Consensus 387 ~l~~~a~wlg~~~~~~~v~in~dF---~~~~~~~~~~~al~~a~~~G--~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~ 461 (513) T protein:vir:97 387 ALDITADWLRLGPNGGTVELVKDY---DLEEMDAPGLQALQVAREKR--DISRKTYLNGLRLRGVLPEDFDEDEDWEELM 461 (513) T ss_pred HHHHHHHHhCCCCCccEEEecccc---CcccCCHHHHHHHHHHHhCC--CCCHHHHHHHHHhccCCCccCCHHHHHHHHH Confidence 999999999854322122333344 33333 46778888988876 68999988765 453211 122222222 Q ss_pred HHHH-HHHHHH---hcccccCcccccCCCCCCCCcccccCCCCCC----CCCC Q lcl|NC_021297. 435 QETE-DMIDTL---YSTTKAQADATPKPTVTDTKTETQTSPSGFN----RTKT 479 (480) Q Consensus 435 ~~~~-~~~~~~---~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~----~~~~ 479 (480) ++.+ +..+.. 
....+..++.++..+.++.++.+.+ |.|-+ ++-| T Consensus 462 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 513 (513) T protein:vir:97 462 EEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGG-EGGEGGGNPGGES 513 (513) T ss_pred HhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCC-CccccCCCCCCCC Confidence 2211 111100 0000000001111111111111111 11111 1112 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.94 E-value=3e-26 Score=160.48 Aligned_cols=423 Identities=14% Similarity=0.117 Sum_probs=247.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh-----cCccc---chhhhhh---ccccchHHHHHHHHHhhhcc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-----IGIGA---PPELAYL---DVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-----~~~~~---~~~~~~~---~~~~n~~~~iVd~~~~~l~~ 69 (480) |. |..--..|.+..++.+.+++-|.|...++. +++.. ...|+.. -...|+++.+|+.++++++. T Consensus 1 m~-----V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~ 75 (452) T protein:vir:94 1 MP-----IETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLD 75 (452) T ss_pred CC-----CCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhc Confidence 43 222233455566666777787877755443 22221 1222222 23469999999999999988 Q ss_pred CccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCe Q lcl|NC_021297. 70 EGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 70 ~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~ 149 (480) +.......+ ....+..=-.-++++.++..++..++.+|+++++|..+. ..++|.+..++|.+++- |+...... 
T Consensus 76 k~p~~~~p~-~l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~-----~g~rPy~~~~~~~~Ii~-W~~~~~g~ 148 (452) T protein:vir:94 76 QPPVITHPD-AMSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPL-----TGGDPYISVYTTENILN-WEEDEDGR 148 (452) T ss_pred CCceecccH-HHHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeecc-----CCCceEEEEechhhhcC-ccccccCC Confidence 876653222 222222112347899999999999999999999997643 34689999999999874 43322233 Q ss_pred eEEEE-EEEE-EecC-----CceEEEEEEEe--CCcEEE--EEecCCcccc--cccccccccccccccceEEeecccccC Q lcl|NC_021297. 150 VTRAV-RLYT-TRDD-----VAVPDRATLYL--PDETVP--LRRNGGLNDQ--WVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 150 ~~~~~-~~~~-~~~~-----~~~~~~~~~~~--~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~iPvv~~~n~~~~~ 216 (480) +...+ +... ..+. .+....+.+|+ ++.+.. |+..++.... .....+...|+++.||+|.+..... . T Consensus 149 l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~-~ 227 (452) T protein:vir:94 149 LLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPSGL-S 227 (452) T ss_pred eeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCCCC-C Confidence 33333 3222 1221 12333445554 555443 3322222111 1122344568899999998865432 2 Q ss_pred ccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC--CCCceEEEec Q lcl|NC_021297. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA--SEAAKISEFK 294 (480) Q Consensus 217 ~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~~~~ 294 (480) ...|.+.+. ++-.|.-+..+..|+...++.+.++|++++.|.+... 
.+.++.+..|.++ |.++.+.+++ T Consensus 228 ~~~~~pPLl-~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~--------~i~iG~~~~~~lpe~~~~~~yie~~ 298 (452) T protein:vir:94 228 MTPAKPPMI-DIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS--------TMHIGSTKAWVIPEVAAKVGFLEFT 298 (452) T ss_pred CCCCccchH-HHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC--------ceEecccccccCCCCCCcceEEccC Confidence 334666665 4777777778899999999999999999999864311 2456777778776 4467788988 Q ss_pred cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021297. 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) Q Consensus 295 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 374 (480) ...++.+++.|+.+..++..... ..+-..+.+..|+.|.......-.+....+...+..++.+++++++.+.|.+.. T Consensus 299 g~~i~~~~~~l~~le~~m~~~Ga---~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~~ 375 (452) T protein:vir:94 299 GQGLQSLEKALSEKQAQLASLSA---RLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGGT 375 (452) T ss_pred chhHHHHHHHHHHHHHHHHHHHH---HhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCc Confidence 88888888888888777654432 112222333457777655444444444555556677888888998999885321 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHHHHhcccccC Q lcl|NC_021297. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRDWDKQETEDMIDTLYSTTKAQ 451 (480) Q Consensus 375 ~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (480) ..+++.-.-....-.++.++++.++.++| .+|.+|+++.| |+.+.+.+. 
+..+ .+ ..+ ++ T Consensus 376 ---~~v~~n~dF~~~~~~~~~~~al~~~~~~G--~is~~t~~~~L~~~gvl~~~~e~-~~i~-~E-------~~~--~~- 438 (452) T protein:vir:94 376 ---LNIKLNSAFLDSKLTAAELKAWVEAYLSG--GISKEIYIHALKVGKVLPPPGES-MGVI-PD-------PPA--PE- 438 (452) T ss_pred ---eEEEeccccccccCCHHHHHHHHHHHhcC--CCcHHHHHHHHHhCCCCCCccCH-HHHH-HH-------hhc--cC- Confidence 23333211122232367788888988776 68999998876 665432111 1111 11 111 00 Q ss_pred cccccCCCCCCCCc Q lcl|NC_021297. 452 ADATPKPTVTDTKT 465 (480) Q Consensus 452 ~~~~~~~~~~d~~~ 465 (480) +.+..+|..+.++. T Consensus 439 ~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 439 PSPSNTPPNPSSKA 452 (452) T ss_pred cccCCCCCCCccCC Confidence 00111111111111 No 76 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.93 E-value=3.8e-25 Score=154.46 Aligned_cols=453 Identities=14% Similarity=0.089 Sum_probs=242.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh-----cCccc--------chhhhhh---ccccchHHHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-----IGIGA--------PPELAYL---DVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-----~~~~~--------~~~~~~~---~~~~n~~~~iVd~~~ 64 (480) |.+ |..--..|....++.+.+++-+.|...++. +++-. ...|+.. -+..|+++.+|+.++ T Consensus 32 m~d----V~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~ 107 (535) T protein:vir:80 32 LPN----VGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMM 107 (535) T ss_pred CCC----CCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHh Confidence 654 222333456666777888888888755543 22211 0113322 245799999999999 Q ss_pred hhhccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeEEEEEecCccccc-------CCCceeEEEE Q lcl|NC_021297. 
65 DRLDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG-------DPAGIPLIRV 132 (480) Q Consensus 65 ~~l~~~~~~~~~d~~~~~~l~~i~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~-------d~~~~~~i~~ 132 (480) ++++.+..... ....+..++.+ ++++.+++.++..++.+|+++++|..+..+.. ....+|.+.. T Consensus 108 G~vfrk~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~ 183 (535) T protein:vir:80 108 GQVFSRDPIRQ----LPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITL 183 (535) T ss_pred chhhcCCccee----ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEE Confidence 99887765442 22344455443 58999999999999999999999975543211 1134689999 Q ss_pred EccceeEEEEeCCC-CC-eeEEEEEE--EEEecC---CceEEEEEEEeCC---c--EEEEEecCCccc--cc--cccccc Q lcl|NC_021297. 133 ESPLYMYAELDPRN-TR-RVTRAVRL--YTTRDD---VAVPDRATLYLPD---E--TVPLRRNGGLND--QW--VVDGDV 196 (480) Q Consensus 133 ~~p~~~~~v~d~~~-~~-~~~~~~~~--~~~~~~---~~~~~~~~~~~~~---~--~~~~~~~~~~~~--~~--~~~~~~ 196 (480) ++|.+++---.... ++ .+..++.. +.+.++ .+....+.+++.+ . +..|+..+.... .+ ....+. T Consensus 184 y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~ 263 (535) T protein:vir:80 184 VHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDG 263 (535) T ss_pred echhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccC Confidence 99998764322211 22 23333221 222222 1233345555553 1 222333322111 11 122334 Q ss_pred ccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhh Q lcl|NC_021297. 197 IKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIY 276 (480) Q Consensus 197 ~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~ 276 (480) ..|.++.||||.|... ......|.+.|. 
+|..|.-+.-+.-|+..+++.+.++|++++.|.+.....+......+.++ T Consensus 264 g~~~l~~IPfv~~~~~-~~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG 341 (535) T protein:vir:80 264 NGNPFKEIPFQFIGPL-DNNADIDHPPLL-DLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLG 341 (535) T ss_pred CCcccCeeEEEEeecC-CCCCCCCccchH-HHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEec Confidence 5689999999987533 223334555554 35555555556778899999999999999999875443333333345667 Q ss_pred hhhhcccC-CCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 277 YGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 277 ~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) ....|.++ +.+.++.++....+. .+.++.+..++.....- .+...+.+.+++.| +.......+....+...+. T Consensus 342 ~~~~~~lP~~~~~~~~e~~~~~~a--~~~l~~~e~qM~~lGa~---ll~~~~~~~Ta~~a-~~~~~~~~S~L~~~a~~le 415 (535) T protein:vir:80 342 SRAIIPLPQGATAGILQITPNSVP--FEAMTHKESQMIAMGAN---LLVKSGGNRTFGEA-QQEEASEQSILSACTKNVS 415 (535) T ss_pred CcccccCCCCCCcceeeeccchhH--HHHHHHHHHHHHHHHHH---hhccCcccccHHHH-HHHHHHHhHHHHHHHHHHH Confidence 77777665 556778887765543 34566665555443211 12222233333322 3333333444555566777 Q ss_pred HHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHH--HHH Q lcl|NC_021297. 356 GAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQR--EQM 429 (480) Q Consensus 356 ~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~--~~~ 429 (480) .++.+++++++.+.|.....+...+++. ++..... .++.++++.++.++| .||.+|+++.| ++...+. 
+++ T Consensus 416 ~al~~aL~~~A~w~G~~~~~~~~~i~~n-~dF~~~~ld~~~~~all~~~~~G--~Is~et~~~~L~r~gvl~~~~~~eee 492 (535) T protein:vir:80 416 MAFRKALRWANQFQTGIVNDETVEYNLN-TDFPAARLTPNERAELILEWQQG--AITFKEMRAGLRRAGVASEDDAKAET 492 (535) T ss_pred HHHHHHHHHHHHHcCCccCCCceEEEec-cccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCCCCcccchHHH Confidence 7889999998889885443332223321 2222333 356788888888875 68999998776 6654332 222 Q ss_pred HHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 430 RDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) +...+.|..+ ....++.......+.++..+-+ +-++..+..++ T Consensus 493 ~~ri~~E~~~--~~~~~g~~~d~~~~g~~~~~~~--~~~~~~~~~~~ 535 (535) T protein:vir:80 493 EGKATVEFIA--KTAAAGKVGDAASGGTNKAKLN--NGNGGGNQAGN 535 (535) T ss_pred HHHHHhhhhh--ccccCCCCCCCCCCCCCcCccc--CCccccccCCC Confidence 2222222111 0111111111011111111100 00111111111 No 77 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.91 E-value=1.9e-23 Score=145.17 Aligned_cols=441 Identities=15% Similarity=0.134 Sum_probs=239.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcch-----hcCccc----c----hhhhhh---ccccchHHHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK-----TIGIGA----P----PELAYL---DVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~-----~~~~~~----~----~~~~~~---~~~~n~~~~iVd~~~ 64 (480) |.+. ..--..|....++.+.+++-+.|...++ ++++-. + ..|+.. 
-+..|+++.+|+.++ T Consensus 1 m~~V----~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~ 76 (501) T protein:vir:95 1 MPNV----SFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLV 76 (501) T ss_pred CCCC----CCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHh Confidence 7752 2222345566667778888888876553 332210 1 123322 235799999999999 Q ss_pred hhhccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeEEEEEecCccccc---------CCCceeEE Q lcl|NC_021297. 65 DRLDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG---------DPAGIPLI 130 (480) Q Consensus 65 ~~l~~~~~~~~~d~~~~~~l~~i~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~---------d~~~~~~i 130 (480) ++++.+...+. ....+..++.+ ++++.+++.++..++.+|+++++|..+..... ...-+|.+ T Consensus 77 G~vf~k~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~ 152 (501) T protein:vir:95 77 GQVFMRDPVVK----VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTL 152 (501) T ss_pred hhhhcCCccee----CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEE Confidence 99987776553 22344444433 58999999999999999999999986543210 01225889 Q ss_pred EEEccceeEEEEeCCC-CC-eeEEEEEE--EEEecCC---ceEEEEEEEeCC--c---EEEEEecCCccc--------cc Q lcl|NC_021297. 131 RVESPLYMYAELDPRN-TR-RVTRAVRL--YTTRDDV---AVPDRATLYLPD--E---TVPLRRNGGLND--------QW 190 (480) Q Consensus 131 ~~~~p~~~~~v~d~~~-~~-~~~~~~~~--~~~~~~~---~~~~~~~~~~~~--~---~~~~~~~~~~~~--------~~ 190 (480) ..++|.+++---.... ++ .+..++.. +.+.++. .....+.+++.+ . +..|+....... 
.+ T Consensus 153 ~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~ 232 (501) T protein:vir:95 153 YVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNY 232 (501) T ss_pred EEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcc Confidence 9999998764221112 22 23333222 2222221 122233344432 1 222433322110 00 Q ss_pred -----ccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc Q lcl|NC_021297. 191 -----VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT 265 (480) Q Consensus 191 -----~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~ 265 (480) ....+...|.++.||+|.+... .....-+.+.+. ++-.|.-+.=+.-|+...++.+.++|+++++|.+.+... T Consensus 233 ~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~~~~~pPLl-~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~ 310 (501) T protein:vir:95 233 QQYVVYKPTDAQGKRLTEIPFMFIGSE-NNDSNPDNPNFY-DLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVT 310 (501) T ss_pred cccceeeeeccCCCcCCeeeEEEEecC-CCCCCCCccchH-HHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccc Confidence 0112223489999999976432 222222333332 233333333345678888999999999999998765332 Q ss_pred cccccchhhhhhhhhcccC-CCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHH Q lcl|NC_021297. 266 NDGENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIV 344 (480) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~ 344 (480) .. ....+.++....|.++ |.++++.+.+...+. .+.++.+..++....- .-+..++.+ -||++.+....... 
T Consensus 311 ~~-~~~~i~~G~~~~~~lP~~~~~~~ie~~~~~i~--~~~l~~l~~~m~~~Ga---~ll~~~~~~-~Ta~~~~~~~~~~~ 383 (501) T protein:vir:95 311 NV-LKGSVNFGSRGGIPLPVGADAKLLQASENTML--KEAMDTKERQMVALGA---KLVEQKEVQ-RTATEAELEAASEG 383 (501) T ss_pred cC-CCCceeecccccccCCCCCceeEEecChhhHH--HHHHHHHHHHHHHHHH---hhccCCccc-hhHHHHHHHHHHHh Confidence 22 2233566666666664 557778887765553 4556666665544421 112222223 57777777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCHHHHHHhC---C Q lcl|NC_021297. 345 KMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPKEQARIDL---G 420 (480) Q Consensus 345 ~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~et~~~~l---g 420 (480) +....+...+..++.+++++++.+.|... +...+++. ++..... .++.++++.++..+| .+|.+|+++.| + T Consensus 384 S~L~~~a~~le~al~~~l~~~a~w~g~~~--~~~~v~i~-~df~~~~~~~~~~~al~~~~~~G--~is~~t~~~~L~~~~ 458 (501) T protein:vir:95 384 STLSSATKNVSAAFEWALKWAARWVGQAD--SGVKFELN-TDFDIARMTPDERRSLVEEWQKG--AITFEEMRTGLRKAG 458 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCC--CceEEEEe-cccccccCCHHHHHHHHHHHhCC--CCcHHHHHHHHHhCC Confidence 77777778888899999999999988431 11233332 3333333 356788888988765 68999997655 6 Q ss_pred CChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 421 YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) +.+...+.++++++.+... ....... ++......+++. 
-|.++ T Consensus 459 v~~~~~~~e~e~i~~~~~~---~~~~~~~--~~~~~~~~gg~~--------~~~~~ 501 (501) T protein:vir:95 459 VATEDDSKAKEKIAKDTAE---AMALATP--ANVPGDGSGGDN--------VGNSE 501 (501) T ss_pred CCChhHHHHHHHHHhhhcC---ccccccc--CCCCCCCccccc--------ccCCC Confidence 6654433333332222110 0000000 000001111111 11111 No 78 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.90 E-value=3.9e-21 Score=132.49 Aligned_cols=410 Identities=12% Similarity=0.036 Sum_probs=140.4 Q ss_pred CHHHHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcCcccchhhhhh--cccc---chHHHHHHHHHhhhc--cCcccc Q lcl|NC_021297. 3 TYHEHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGIGAPPELAYL--DVQP---GWVATYLRTLSDRLD--IEGFRI 74 (480) Q Consensus 3 ~~~~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~--~~~~---n~~~~iVd~~~~~l~--~~~~~~ 74 (480) =+.+.++++++.+..+. .++.++.++++--+.-..+-.......... .... .-..+|+..++.++. ..+|.. T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 55778888888887653 344444443332222111101101110000 0000 112366777777664 346766 Q ss_pred CCCch---hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE-EEccc--eeEEEEeCCCCC Q lcl|NC_021297. 75 SEDSE---GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR-VESPL--YMYAELDPRNTR 148 (480) Q Consensus 75 ~~d~~---~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~-~~~p~--~~~~v~d~~~~~ 148 (480) +.... ..+...+.++. -+...+.............| |...+. .++.. -.+.+.+|. 
T Consensus 81 G~p~~~~~~d~~~~~~l~~-~~~~~~~~~~~~l~~~~~~~--------------G~a~~~~y~d~~~~~~~~~~~p~--- 142 (470) T protein:vir:10 81 SVFPDIDVGKDADNKKIID-VLGDDRALTLNGLLVDSSNA--------------GRAWLHYWIDEDGNFRYGIIQPD--- 142 (470) T ss_pred ccceeeecCchHHHHHHHH-HHhhhHHHHHHHHHHHHhhc--------------CeeEEEEEecCCCceEEEEEccc--- Confidence 65532 22333333322 12222222222222211111 222222 22211 112223321 Q ss_pred eeEEEEEEEEEecCCceEEEEEEEeC----C-cEEE-EEecCCcc-cccccccc--cccccccccceEEee------ccc Q lcl|NC_021297. 149 RVTRAVRLYTTRDDVAVPDRATLYLP----D-ETVP-LRRNGGLN-DQWVVDGD--VIKHGLGVVPVVPLT------NDP 213 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~-~~~~-~~~~~~~~-~~~~~~~~--~~~~~~~~iPvv~~~------n~~ 213 (480) .++-.|.....+.....+.+|.. + ..+. +..-+... ..+..... ........++..... ... T Consensus 143 ---~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (470) T protein:vir:10 143 ---QITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSN 219 (470) T ss_pred ---ceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecccccccccccccccccccccc Confidence 11223332222333334444431 1 1111 11111111 11100000 000011111111000 001 Q ss_pred ccCccCCcccch---------hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC Q lcl|NC_021297. 214 RLGNRYGRSEIS---------PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA 284 (480) Q Consensus 214 ~~~~~~G~s~i~---------~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (480) .....||.-.+. .++..+++-+|..-.-+++ +++........-.-..... .... ....... 
T Consensus 220 ~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~----~~~~~~~~~~~~lvl~g~~--~~~~---~~~~~~~- 289 (470) T protein:vir:10 220 TLKHNFGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNG----FINDLDDVQTVILVLTNYG--GADL---HQFMNDL- 289 (470) T ss_pred ccccCCCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHH----HHHHHHHhcCcceeeecCC--cccc---chhhhhh- Confidence 111233332211 2344444444444333333 3333322221100000000 0000 0000000 Q ss_pred CCCceEEEecc------cc-------H--HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 285 SEAAKISEFKA------AE-------L--RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAER 349 (480) Q Consensus 285 g~~~~~~~~~~------~~-------~--~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~ 349 (480) .......++. +. . +++...++.+...|+..+++|+. +... +|.+=...+..+...... T Consensus 290 -~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~-----~~~~-~gn~Sg~Alk~~~~~l~~ 362 (470) T protein:vir:10 290 -RKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDP-----ANFE-SSNASGVAIKMLYSHLEL 362 (470) T ss_pred -hhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCC-----Cccc-cccchHHHHHHHHHHHHH Confidence 1112222221 11 1 45788999999999999999852 2233 343334445666777777 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecC---CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH- Q lcl|NC_021297. 350 KGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRD---PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ- 425 (480) Q Consensus 350 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~---~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~- 425 (480) +.......|++.++-++++.-..... ....|.+ ...+++.....+.+. ++....|....+ T Consensus 363 k~~~~~~~~~~~l~~~~~~i~~~l~~----~~~d~~~i~i~f~~~~p~d~~e~~~------------~~~~~~g~iS~et 426 (470) T protein:vir:10 363 KAAKTQTYFEHAINELVRAIMRYLNF----SDADKRHISQHWTRTKVEDSLTKAQ------------IVSTVANYSSKEA 426 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc----cCcccceeeEEeccCCCCCHHHHHH------------HHHHHhccCcHHH Confidence 77777778888887766664322211 1222322 233444443222221 111223543322 Q ss_pred HHHHHHH---HHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 
426 REQMRDW---DKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 426 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) .-++.-. -+++.+.....-.... ...+.....++ .|.|.-+ T Consensus 427 ~l~~~p~v~D~~~E~eri~~E~~e~~----~~~~~~~~~~~--------~~~dde~ 470 (470) T protein:vir:10 427 VAKANPIVDDWQQELKDLAKDKEEND----PYSNQADELNG--------KGVNDEQ 470 (470) T ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHH----HhhccccccCC--------CCCCCCC Confidence 1111000 1122222222211111 11111111111 1222222 No 79 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.88 E-value=5.7e-22 Score=137.04 Aligned_cols=438 Identities=15% Similarity=0.079 Sum_probs=238.4 Q ss_pred CCCHHHH---HHHHHHHHHHHHHHHHHHHHHhcccCc--c-----hhcCccc-chhhhhh---ccccchHHHHHHHHHhh Q lcl|NC_021297. 1 MTTYHEH---VERLQGLLARDLPNLLEAEAYRNGTRR--L-----KTIGIGA-PPELAYL---DVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~~~~~~---i~~l~~~~~~~~~r~~~~~~YY~g~~~--i-----~~~~~~~-~~~~~~~---~~~~n~~~~iVd~~~~~ 66 (480) |-|..-- |..--..|....++.+.+++-|.|... . ++.++.. ...|+.. -+..|+++.+|++++++ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhch Confidence 3333211 111223344555666777788888542 1 1211111 1123322 23579999999999999 Q ss_pred hccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeEEEEEecCccccc------CCCceeEEEEEcc Q lcl|NC_021297. 67 LDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG------DPAGIPLIRVESP 135 (480) Q Consensus 67 l~~~~~~~~~d~~~~~~l~~i~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~------d~~~~~~i~~~~p 135 (480) ++.+...+. ....+..++++ ++++.+++.++..++.+|+++++|..+..... 
....+|.+..++| T Consensus 81 vfrk~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~ 156 (489) T protein:vir:78 81 VMRKEPEIN----IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTT 156 (489) T ss_pred hhcCCccee----ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEech Confidence 988766552 22334444443 68999999999999999999999987643211 1123699999999 Q ss_pred ceeEEEE-eCCCCC-eeEEEEEEEE--Eec-----CCceEEEEEEEeCCc-----EEEEEecCCccccc-cc--cccccc Q lcl|NC_021297. 136 LYMYAEL-DPRNTR-RVTRAVRLYT--TRD-----DVAVPDRATLYLPDE-----TVPLRRNGGLNDQW-VV--DGDVIK 198 (480) Q Consensus 136 ~~~~~v~-d~~~~~-~~~~~~~~~~--~~~-----~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~-~~--~~~~~~ 198 (480) .+++--- +...++ .+..++.... ..+ .......+.+++.+. ...|+....+.... .+ ..+... T Consensus 157 ~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~ 236 (489) T protein:vir:78 157 ENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGE 236 (489) T ss_pred hhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCC Confidence 9976532 211222 3333332221 112 122344566777652 23344443322211 11 123345 Q ss_pred ccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc--cccccchhhhh Q lcl|NC_021297. 199 HGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT--NDGENTTLDIY 276 (480) Q Consensus 199 ~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~--~~~~~~~~~~~ 276 (480) |.++.||+|.+.... ....-+.+.+. +|-.|.-+.=+.-|+...++.+.++|++++.|.+..... 
.......+.++ T Consensus 237 ~~l~~IPfv~~~~~~-~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g 314 (489) T protein:vir:78 237 SLRGVIPFTFIGATN-NDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFG 314 (489) T ss_pred CccCeeeEEEEecCC-CCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeC Confidence 889999999886432 22223444443 344444444467788889999999999999986532211 11122234455 Q ss_pred hhhhcccC-CCCceEEEeccccHHHHHHHHHHHHHHHHhc-cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 277 YGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASI-TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 277 ~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~-~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ....+.++ +++.++.+.+..++ -.+.++.+..++... ..+. . .+.+ -||.+.+.....-.+....+...+ T Consensus 315 ~~~~~~lp~~~~~~~ie~~~~~~--~r~~l~~le~qm~~lGa~l~----~-~~~~-~Ta~~~~~~~~~~~S~L~~~a~~~ 386 (489) T protein:vir:78 315 SRRGHNLGYGGSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQLI----T-PTQQ-ITAQSARIQRGADTSVMATIARNV 386 (489) T ss_pred CcccccCCCCCCcceeccCcchH--HHHHHHHHHHHHHHHhhhhc----c-CCcc-hhHHHHHHHHHHhhHHHHHHHHHH Confidence 55556554 45667778776544 244455555554433 1221 1 2222 466666666666666667777888 Q ss_pred HHHHHHHHHHHHHHhCCCCccc-ceeeeEEecCCCCCCH-HHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHH Q lcl|NC_021297. 355 GGAWERAMRIAMQIMGREVTEE-YTRLETVWRDPSTPTV-AAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQM 429 (480) Q Consensus 355 ~~~l~~~~~~~~~~~~~~~~~~-~~~i~v~f~~~~~~~~-~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~~~~ 429 (480) ..++.+++++++.+.|.....+ ...+...|. 
...+ ++.++++.++.++| .+|.+|+++.| |+.+.+.+++ T Consensus 387 e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~---~~~~d~~~~~al~~~~~~G--~is~~t~~~~L~~~gv~d~~~e~~ 461 (489) T protein:vir:78 387 SQAYTDALRWVAVMLGKPEDTEVEFRLNMDFF---LEPMTAQDRAAWMADINAG--LLPATAYYAALRKAGVTDWTDADI 461 (489) T ss_pred HHHHHHHHHHHHHHcCCCCCCceEEEeecccC---cccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCCCCccHHHH Confidence 8899999999999988543221 122333442 2222 56678888888866 68999988765 5554433333 Q ss_pred HHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 430 RDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) +++.+++ ..+. ...-.++.+++.. ++++ T Consensus 462 ~~ei~~~----------~~~~--~~~~~g~~~~~~q-~~~~ 489 (489) T protein:vir:78 462 KDAVADQ----------PLPV--ATEVQGEIPQSAQ-QQEK 489 (489) T ss_pred HHHHhhc----------CCCc--ccCCcccCCCCcc-cccC Confidence 2222211 0000 0111122222111 1111 No 80 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.86 E-value=2.1e-20 Score=128.47 Aligned_cols=441 Identities=15% Similarity=0.066 Sum_probs=232.8 Q ss_pred CCCHHHH---HHHHHHHHHHHHHHHHHHHHHhcccCc-------chhcCccc-chhhhhh---ccccchHHHHHHHHHhh Q lcl|NC_021297. 1 MTTYHEH---VERLQGLLARDLPNLLEAEAYRNGTRR-------LKTIGIGA-PPELAYL---DVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~~~~~~---i~~l~~~~~~~~~r~~~~~~YY~g~~~-------i~~~~~~~-~~~~~~~---~~~~n~~~~iVd~~~~~ 66 (480) |-|..-- |..--..|....++.+.+++-|.|... +++.++.. ...|+.. 
-...|+++.+|+.++++ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhch Confidence 3222100 111122344555666777777877431 12211111 1123322 24569999999999999 Q ss_pred hccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeEEEEEecCccccc------CCCceeEEEEEcc Q lcl|NC_021297. 67 LDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG------DPAGIPLIRVESP 135 (480) Q Consensus 67 l~~~~~~~~~d~~~~~~l~~i~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~------d~~~~~~i~~~~p 135 (480) ++.+...+. ....+..++++ ++++.+++.++..++.+|+++++|..+..... ....+|.+..++| T Consensus 81 vfrk~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~ 156 (491) T protein:vir:95 81 VMRKEPEIN----IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTT 156 (491) T ss_pred hhcCCceee----ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEech Confidence 988776552 12334444433 68999999999999999999999987653211 0133699999999 Q ss_pred ceeEEEE-eCCCCC-eeEEEEEEEE--Eec-----CCceEEEEEEEeC---Cc--EEEEEecCCccccccc---cccccc Q lcl|NC_021297. 136 LYMYAEL-DPRNTR-RVTRAVRLYT--TRD-----DVAVPDRATLYLP---DE--TVPLRRNGGLNDQWVV---DGDVIK 198 (480) Q Consensus 136 ~~~~~v~-d~~~~~-~~~~~~~~~~--~~~-----~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~~~---~~~~~~ 198 (480) .+++--- +...++ .+..++.... ..+ .....+.+.+++. +. +..|+....+...... ...... 
T Consensus 157 ~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~ 236 (491) T protein:vir:95 157 ENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGE 236 (491) T ss_pred hhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCC Confidence 9876432 111222 3333332222 111 1223344455543 32 2234443222211111 112335 Q ss_pred ccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc--cccccchhhhh Q lcl|NC_021297. 199 HGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT--NDGENTTLDIY 276 (480) Q Consensus 199 ~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~--~~~~~~~~~~~ 276 (480) |.++.||||.+.... ....-+.+.+. +|-.|.-+.=+.-|+...++.+.++|++++.|.+..... .......+.++ T Consensus 237 ~~l~~IPfv~~~~~~-~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g 314 (491) T protein:vir:95 237 SLRGVIPFTFIGATN-NDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFG 314 (491) T ss_pred cccCeeEEEEEecCC-CCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEec Confidence 789999999886432 12223444443 244444444456788888899999999999986542211 11122234455 Q ss_pred hhhhcccC-CCCceEEEeccccHHHHHHHHHHHHHHHHhc-cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 277 YGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASI-TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 277 ~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~-~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ....+.++ +.++.+.+.+..++ ....|+.+..++... 
..+ ...+.+ -||++.+.....-.+....+...+ T Consensus 315 ~~~~~~lP~~~~~~~ie~~~~~~--~~~~l~~~e~qm~~~Ga~l-----~~~~~~-~Ta~~~~~~~~~~~S~L~~~a~~~ 386 (491) T protein:vir:95 315 SRCGHNLGYGGSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQL-----ITPSQQ-ITAESARIQRGADTSVMATIARNV 386 (491) T ss_pred CcCCcCCCCCCccceeecCcchH--HHHHHHHHHHHHHHHHHHh-----ccCCcc-hhHHHHHHHHHHhhHHHHHHHHHH Confidence 55555553 45677777765544 233344443333222 111 112222 467777776666677777777888 Q ss_pred HHHHHHHHHHHHHHhCCCCccc-ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHH Q lcl|NC_021297. 355 GGAWERAMRIAMQIMGREVTEE-YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMR 430 (480) Q Consensus 355 ~~~l~~~~~~~~~~~~~~~~~~-~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~~~~~ 430 (480) ..++.+++++++.+.|.....+ ...+...|... .-.++.++++.++.++| .+|.+|+++.| ++.+...+++. T Consensus 387 e~al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~--~~~~~~~~all~~~~~G--~is~~t~~~~L~~~~vl~~~~e~~~ 462 (491) T protein:vir:95 387 SQAYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQ--PMTAQDRAAWMADINAG--LLPATAYYAALRKAGVTDWTDEDIL 462 (491) T ss_pred HHHHHHHHHHHHHHcCCCCCCceEEEeecccccc--cCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCCCCccHHHHH Confidence 8899999999999988543322 12233344221 22256788888888876 68999988765 55543333332 Q ss_pred HHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccccc Q lcl|NC_021297. 431 DWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQT 469 (480) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) ++++++. 
....+.....++.++....+++ T Consensus 463 ~~ie~~~----------~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 463 NAIEDAP----------LPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHhcC----------CCCCccccccccchhhhhhccC Confidence 2222111 0000111111111111111111 No 81 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.81 E-value=2.2e-18 Score=117.43 Aligned_cols=424 Identities=10% Similarity=-0.008 Sum_probs=216.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc----------------cchhhhhh---c-cccchHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG----------------APPELAYL---D-VQPGWVATYL 60 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~----------------~~~~~~~~---~-~~~n~~~~iV 60 (480) .++.......+...|..-+--...--+ ..|+.-++..+.. ....+.+. + +-.|+++..+ T Consensus 16 V~~~hp~y~a~~~~W~~~~d~g~~~~k-~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~tl 94 (488) T protein:vir:96 16 TPIYHPDYLVNAPQWLRNLDCVMDNIK-RKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNPTM 94 (488) T ss_pred ccccCHHHHHHhhhhhHhhhhhhHHHH-HhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHHHH Confidence 345555555555555321100000000 1122112211100 01111111 1 2459999999 Q ss_pred HHHHhhhccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeEEEEEecCccccc-----CCCceeEE Q lcl|NC_021297. 61 RTLSDRLDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG-----DPAGIPLI 130 (480) Q Consensus 61 d~~~~~l~~~~~~~~~d~~~~~~l~~i~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~-----d~~~~~~i 130 (480) ++++++++-+......++ ...+..++++ ++++.+++.++..++.+|+++++|..+..... 
-...+|.+ T Consensus 95 ~~l~G~vfrk~p~~~~~~--~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy~ 172 (488) T protein:vir:96 95 NAITGAVMRREPEFDTMD--NPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPTA 172 (488) T ss_pred HHhcchhhccCceeccCC--cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcEE Confidence 999999987776543321 1224444432 68999999999999999999999987632110 02346899 Q ss_pred EEEccceeEEEE-eCCCCCe-eEEEE-EE-EEEecCC--c--eEEEEEEEeCCcEEEEEecCCccccccccccccccccc Q lcl|NC_021297. 131 RVESPLYMYAEL-DPRNTRR-VTRAV-RL-YTTRDDV--A--VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG 202 (480) Q Consensus 131 ~~~~p~~~~~v~-d~~~~~~-~~~~~-~~-~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (480) ..++|.+++--- +...++. +..++ +- +.+.|.. . ....+..++++.+..++...+...........-.|.++ T Consensus 173 ~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e~~~~~~g~~~l~ 252 (488) T protein:vir:96 173 AFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDEWTPVLINSKQSD 252 (488) T ss_pred EEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccceEeecCCCcccC Confidence 999999977532 2222222 33222 21 2233321 1 12233346676554444433322211222233457899 Q ss_pred ccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhh--hhh Q lcl|NC_021297. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYY--GRI 280 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~--~~~ 280 (480) .||||+|.... ....-+.+.+. +|-.|.-+.=+..|+...++....+|++++.+..... .........-+.. ... 
T Consensus 253 ~IP~v~~~~~~-~~~~~~~pPLl-dLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~-~~~~~~~~~g~~~~~~~~ 329 (488) T protein:vir:96 253 TIPFFLASSQS-NEWCIDSTPLT-SLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNK-TMASEMNPLGFTLAGRMP 329 (488) T ss_pred eeEEEEEecCC-CCCCCCCCchH-HHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCc-ccccccccceeeeccccc Confidence 99999885432 22233444443 3444444444567777777777778877754322211 1111111111111 112 Q ss_pred cccCCCCceEEEeccccHHHHHHHHHHHHHHHHhcc-CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 281 LTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT-GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 281 ~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~-~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) ...+.++.++.+.+.+.+ ..+.|+.+..++.... .+. . .+.+ -||++.+.....-.+........+..++. T Consensus 330 ~~~~~g~~~~~e~~~~~l--~~~~l~~l~~qm~~~Ga~l~----~-~~~~-~Ta~~~~~~~~~~~S~L~~~a~~le~al~ 401 (488) T protein:vir:96 330 YYVKNGDVKVIQAQFSPE--TENKVEKLFEQAVKVGASLF----T-QQSN-ETATGAAIRSGSSTASMATLGNNVEDTVR 401 (488) T ss_pred ccccCCceeecCCchhHH--HHHHHHHHHHHHHHHhHhhc----c-CCCc-chHHHHHHHHHHhhHHHHHHHHHHHHHHH Confidence 222333445555443332 2455666666653322 121 1 2222 36777666666666666777788888999 Q ss_pred HHHHHHHHHhCCCCcc-cceeeeEEe-cCCCCCC-HHHHHHHHHHHHhccccCCCHHHHHHhC---CCChh--HHHHHHH Q lcl|NC_021297. 360 RAMRIAMQIMGREVTE-EYTRLETVW-RDPSTPT-VAAKADAVSKLYANGQGPIPKEQARIDL---GYTAT--QREQMRD 431 (480) Q Consensus 360 ~~~~~~~~~~~~~~~~-~~~~i~v~f-~~~~~~~-~~~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~--~~~~~~~ 431 (480) +++++++.+.|..... +....++.- ++..... 
.++.++++.++..+| .||.+|+++.| |+..+ ..+++++ T Consensus 402 ~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G--~Is~~t~~~~L~~~gvl~~d~~~e~~~~ 479 (488) T protein:vir:96 402 NMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEG--NLPQVSWFELLKRARVVRGDMSKEEFDE 479 (488) T ss_pred HHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCcCCccCCHHHHHH Confidence 9999999998854321 111222221 2333333 356788899998776 68999997765 55432 2233222 Q ss_pred HHHHHHHHHHHHHhcccccC Q lcl|NC_021297. 432 WDKQETEDMIDTLYSTTKAQ 451 (480) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~ 451 (480) +.+++. .+- T Consensus 480 ~ie~~g-----------~~~ 488 (488) T protein:vir:96 480 HIAELG-----------FGM 488 (488) T ss_pred HHhhcC-----------CCC Confidence 222211 111 No 82 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.80 E-value=6.7e-20 Score=125.71 Aligned_cols=466 Identities=12% Similarity=0.069 Sum_probs=217.2 Q ss_pred CCCHH--HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCc Q lcl|NC_021297. 1 MTTYH--EHVERLQGLLARD-------LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) +++++ ++..+|...+... +....+-.+||+|+|--... .........-.+++|.++.+|+...++..-+. T Consensus 38 ~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~nr 116 (776) T protein:vir:93 38 LDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDE-IDELKERGQAPTVYNVISQSVNWIIGSEKRGR 116 (776) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHH-HHHHHhcCCceEEecchHHHHHHHHHHHHhCC Confidence 55543 3444544433332 22234556899998743221 01111122335789999999999888654332 Q ss_pred c--c----cCCCchhHH----HHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEE Q lcl|NC_021297. 
72 F--R----ISEDSEGLE----ELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAE 141 (480) Q Consensus 72 ~--~----~~~d~~~~~----~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v 141 (480) . . -.+|.+..+ .+..++..|+++.....+..+++++|.||+-|+.... .+ .+.+++.+++|.+++ T Consensus 117 ~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~--~~-~~~~~~~~~~p~~i~-- 191 (776) T protein:vir:93 117 SDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDE--ND-GEPIYAGAESWRNIL-- 191 (776) T ss_pred cceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeecc--CC-CCceEeeccChhhee-- Confidence 1 1 123444333 3445667789999999999999999999999876432 11 223455678888755 Q ss_pred EeCCCCC----eeEEEEE-EEEEe------------------------------------------------------cC Q lcl|NC_021297. 142 LDPRNTR----RVTRAVR-LYTTR------------------------------------------------------DD 162 (480) Q Consensus 142 ~d~~~~~----~~~~~~~-~~~~~------------------------------------------------------~~ 162 (480) ||+.... ...+.++ .|.+. .. T Consensus 192 ~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (776) T protein:vir:93 192 WDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYA 271 (776) T ss_pred eccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccccccccccccccccccccccccccC Confidence 4443211 0111110 00000 00 Q ss_pred CceEEEEEEEeCCcEE--------------EEEecC---------Ccc----------------cccccccccccccccc Q lcl|NC_021297. 163 VAVPDRATLYLPDETV--------------PLRRNG---------GLN----------------DQWVVDGDVIKHGLGV 203 (480) Q Consensus 163 ~~~~~~~~~~~~~~~~--------------~~~~~~---------~~~----------------~~~~~~~~~~~~~~~~ 203 (480) ...+..+++|+..... .|.... +.. 
...+......|.+.++ T Consensus 272 ~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~ 351 (776) T protein:vir:93 272 RKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNR 351 (776) T ss_pred CCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhccCCCCCCCc Confidence 1223345555432111 000000 000 0000011223445578 Q ss_pred cceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhccc Q lcl|NC_021297. 204 VPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL 283 (480) Q Consensus 204 iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (480) +|+|+|+......+.+|.|.+ +.+++.++.+|..+|.+.+.+. +.+..+-.|.-.+. ++-... + ..++.++.. T Consensus 352 ~Pfv~~~~~~~~~~~~~~G~v-~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav~~~--d~~~~~-~-~rp~~vi~~ 424 (776) T protein:vir:93 352 YPFTPIWGFRRARDGMPYGVI-RFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAVDDI--DEFRRE-A-ARPDAVMTV 424 (776) T ss_pred cceEEecCceecccccccchH-HhhhHHHHHHHHHHHHHHHhhc--CCceeeccccccch--HHHHHh-c-ccCCceeee Confidence 999999988777777888877 5699999999999999988763 44433333432111 110000 0 123333332 Q ss_pred -CCCCceEE-EeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 284 -ASEAAKIS-EFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 284 -~g~~~~~~-~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) +|..+.+. 
+....-...+...+......|-.++|+++..+|..+ |..||+|+..+...-........+.|..+++++ T Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~-n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~ 503 (776) T protein:vir:93 425 KNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTT-NAVSGVAIQARQEQGSVATNKLFDNLRLAFQQH 503 (776) T ss_pred CCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33322221 112223456777788888888889999999999654 557999998887777777777777777777777 Q ss_pred HHHHHHHh----CC---------CCcccce----------------eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH Q lcl|NC_021297. 362 MRIAMQIM----GR---------EVTEEYT----------------RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPK 412 (480) Q Consensus 362 ~~~~~~~~----~~---------~~~~~~~----------------~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~ 412 (480) +++++.+. +. ....++. ++.|.=.+..+.-..+....++++.+....-+.. T Consensus 504 ~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~ 583 (776) T protein:vir:93 504 GEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIAL 583 (776) T ss_pred HHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHH Confidence 77655432 21 1011111 1212111111111333445555554322111111 Q ss_pred ---HHHHHhCCCC--hhHHHHHHHHHH----------------HHHH----HHHHHHhccc--ccCcccccC---CCCCC Q lcl|NC_021297. 413 ---EQARIDLGYT--ATQREQMRDWDK----------------QETE----DMIDTLYSTT--KAQADATPK---PTVTD 462 (480) Q Consensus 413 ---et~~~~lg~~--~~~~~~~~~~~~----------------~~~~----~~~~~~~~~~--~~~~~~~~~---~~~~d 462 (480) ..+++..++. ++..+++.+... ++.+ .......... ...++.... ..... 
T Consensus 584 ~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~ 663 (776) T protein:vir:93 584 TMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAE 663 (776) T ss_pred HHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHh Confidence 1122333221 222222221100 0000 0000000000 000000000 00000 Q ss_pred CCccc--ccCCC-C----CCCCCCC Q lcl|NC_021297. 463 TKTET--QTSPS-G----FNRTKTR 480 (480) Q Consensus 463 ~~~~~--~~~p~-~----~~~~~~~ 480 (480) .+... ...+. + -...++. T Consensus 664 aqa~~~~~~a~~~~~~a~q~a~qa~ 688 (776) T protein:vir:93 664 AKAKHISRMAIREGVGAVKDATDAA 688 (776) T ss_pred hhhhhhhhcchhhhhhhhhhhhhhh Confidence 00000 00000 0 0000111 No 83 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.75 E-value=7.9e-18 Score=114.34 Aligned_cols=468 Identities=11% Similarity=0.039 Sum_probs=214.8 Q ss_pred CCCHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLAR-------DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-------~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) =.+.++++.++...+.. .+....+-.+||.|.|--.... ..-+....-.+++|.++.+|+..+++..-+... T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~ 103 (711) T protein:vir:10 25 NDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVR-TERELEQRPCLVNNVLPTFVDQVLGDQRQNRPA 103 (711) T ss_pred cchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHH-HHHHhcCCCcEEEcchHHHHHHHhhhHhhCCcc Confidence 22223455555544322 2333445579999987432210 011111233578899999999988876533221 Q ss_pred ---cC-------------------------CCchhHHHHH----HHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc Q lcl|NC_021297. 
74 ---IS-------------------------EDSEGLEELW----NWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES 121 (480) Q Consensus 74 ---~~-------------------------~d~~~~~~l~----~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~ 121 (480) .+ +|.+..+.++ .++..|+.+.....+..+++++|.||+-|+.....+ T Consensus 104 ~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~ 183 (711) T protein:vir:10 104 IKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD 183 (711) T ss_pred eEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEEEecccCC Confidence 11 2333344444 345668889999999999999999998776543333 Q ss_pred cCCCceeEEEEE-ccceeEEEEeCCCCC----eeEEE-EEEEEEecC--------------------------CceEEEE Q lcl|NC_021297. 122 GDPAGIPLIRVE-SPLYMYAELDPRNTR----RVTRA-VRLYTTRDD--------------------------VAVPDRA 169 (480) Q Consensus 122 ~d~~~~~~i~~~-~p~~~~~v~d~~~~~----~~~~~-~~~~~~~~~--------------------------~~~~~~~ 169 (480) ...+|.++|..+ +|.++ +||+.... ...++ .+.|.+.++ ....... T Consensus 184 d~~~~e~~i~~v~~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~ 261 (711) T protein:vir:10 184 DSFEQDLIIEAIQNQFSV--TIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVS 261 (711) T ss_pred CCCCCCeEEeeecChhhe--eeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCcccCcceeeEE Confidence 334688888777 68875 56653211 11111 122211110 1122333 Q ss_pred EEEeCCcEE------------EEEec---------CCc-----------------ccccccccccccccccccceEEeec Q lcl|NC_021297. 170 TLYLPDETV------------PLRRN---------GGL-----------------NDQWVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 170 ~~~~~~~~~------------~~~~~---------~~~-----------------~~~~~~~~~~~~~~~~~iPvv~~~n 211 (480) ++|...... .|... .+. ..+.....++.|.+.+++|+|||.. 
T Consensus 262 E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g 341 (711) T protein:vir:10 262 EYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWG 341 (711) T ss_pred EEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCCcccEEEEee Confidence 444332111 11100 000 0000111223344456788888753 Q ss_pred c----cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccccccccccchhhhhhhhhcc-cCC Q lcl|NC_021297. 212 D----PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDIYYGRILT-LAS 285 (480) Q Consensus 212 ~----~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~-g~~~~~~~~~~~~~~~~~~~~~~~~-~~g 285 (480) . ...+.+||. + +.+++.++.+|..+|.+...+...+.+..++. |.-. ... +...... ..++.++. -+| T Consensus 342 ~r~~~d~~~~~~G~--v-r~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~-~~~-~~~~e~~-~~~~~vi~~~~~ 415 (711) T protein:vir:10 342 KSLIIKKKEIFRSI--I-RHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE-GRE-DEWEQAN-TKNFSLLTYIPQ 415 (711) T ss_pred eeeccccccccchh--h-hhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccC-ChH-HHHHhcc-ccCCCeeEeccc Confidence 2 334446663 3 56899999999999999999987777655442 3221 111 1000000 12222222 222 Q ss_pred C--CceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 286 E--AAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) Q Consensus 286 ~--~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 362 (480) . +..+...+. .-...+...++.....|-.++|+++..+|..+ |..||+|+......-..........|..+.+++. 
T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~-n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g 494 (711) T protein:vir:10 416 YQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVG 494 (711) T ss_pred ccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 123332222 22445667777777777788999999999764 4579999998877766666666667777777766 Q ss_pred HHHHHH----h---------CCCCcccce-------------------------eeeEEecCCCCCCHHHHHHHHHHHHh Q lcl|NC_021297. 363 RIAMQI----M---------GREVTEEYT-------------------------RLETVWRDPSTPTVAAKADAVSKLYA 404 (480) Q Consensus 363 ~~~~~~----~---------~~~~~~~~~-------------------------~i~v~f~~~~~~~~~~~ad~~~kl~~ 404 (480) ++++.+ . |.....++. ++.+.=.+..+....+.+..++++.+ T Consensus 495 ~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~ 574 (711) T protein:vir:10 495 KILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQ 574 (711) T ss_pred HHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHh Confidence 655543 2 111111111 11111122222222344555666654 Q ss_pred ccccCC--CHHHHHHhCCCCh-hH-HHHHHHH----------------HHHHHHHHHHHHhcc-cccCcccccCCCCCCC Q lcl|NC_021297. 405 NGQGPI--PKEQARIDLGYTA-TQ-REQMRDW----------------DKQETEDMIDTLYST-TKAQADATPKPTVTDT 463 (480) Q Consensus 405 ~~~~~~--s~et~~~~lg~~~-~~-~~~~~~~----------------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~d~ 463 (480) ..+... -...+++.+++.. ++ .+++... ..++.+......... ...+....... ... T Consensus 575 ~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~--ae~ 652 (711) T protein:vir:10 575 AVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAE--ADT 652 (711) T ss_pred hcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHH Confidence 321111 1122334443321 11 1111100 000000000000000 00000000000 000 Q ss_pred CcccccC-CCCCCCCCC--C Q lcl|NC_021297. 
464 KTETQTS-PSGFNRTKT--R 480 (480) Q Consensus 464 ~~~~~~~-p~~~~~~~~--~ 480 (480) ...+... -......+. + T Consensus 653 ~~Aqae~~qa~~e~~~~q~q 672 (711) T protein:vir:10 653 AQAQADMLKAQLETEEAQKQ 672 (711) T ss_pred HHHHHHHHHHHHHHHHHHHH Confidence 0000000 000000000 0 No 84 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.72 E-value=2.9e-17 Score=111.24 Aligned_cols=439 Identities=9% Similarity=0.039 Sum_probs=217.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CcchhcC-----cccc-hhhhhhccccchHHHHHHHHHhhhccCccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGT-RRLKTIG-----IGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~-~~i~~~~-----~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |.+..+--+.........+..+ +..+=.|. ++....+ ..+. ..+......+.+++++||..++.+.-+|+. T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~--~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~ 78 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDF--MVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWS 78 (461) T ss_pred CccchhhhhhhhhhhhhhhhHH--HhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCee Confidence 8888776665555443322211 11111121 1111111 1111 122344556788999999999999999987 Q ss_pred cCC-CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEcccee--EEEEeCCCCCee Q lcl|NC_021297. 74 ISE-DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM--YAELDPRNTRRV 150 (480) Q Consensus 74 ~~~-d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~--~~v~d~~~~~~~ 150 (480) +.. +++..+.+.+.|++-++...+.++.+.+..||.|++++...+... ........+.|... 
+...++.....+ T Consensus 79 i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~---~~~~~~~pl~~~~~~~~~~l~~~~~~~i 155 (461) T protein:vir:80 79 LKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR---EQADLSTAIDPKTIKSIPYINTFNTQKV 155 (461) T ss_pred eecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCc---cccCccCCcccccccceeEEEecccccc Confidence 654 445667788889888889999999999999999998875422110 00011111222110 110110000000 Q ss_pred EEEEEEEE--EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhH Q lcl|NC_021297. 151 TRAVRLYT--TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 151 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) . ...... ...+.+.+.++.+........+...+.. ....-.+..-++++|.+.+.....+|+|.+.+ + T Consensus 156 ~-~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~--------~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~-~ 225 (461) T protein:vir:80 156 T-QLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTT--------ASTSEQIHRSRIIHEQGLRFEGETKGRSIFES-L 225 (461) T ss_pred c-hhhhcccCcCcccccceEEEEecccccccccccccc--------CccceEEccccEEEecCCCCCccccCcchHHH-H Confidence 0 000000 0112223322222222111111111000 00011233357888988888788899999975 7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-cchhhhh--hhhhcccCCCCceEEEeccccHHHHHHHH Q lcl|NC_021297. 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-NTTLDIY--YGRILTLASEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 229 ~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~~l 305 (480) .+.+.++++++......+..+..+...+.|...-....... ...++.. ...++.++ .+.++-+++. 
++....+.+ T Consensus 226 ~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d-~~e~~e~~~~-~lsgl~~~l 303 (461) T protein:vir:80 226 YDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIK-GDEQLTKEST-NVSGMKDLL 303 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEEEEc-CCcceEEEec-CcCCHHHHH Confidence 88888898888777766655555443333321111011100 1112211 11122222 2234444432 344555667 Q ss_pred HHHHHHHHhccCCChHHh-ccccccchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCC---CCcccceee Q lcl|NC_021297. 306 EVFRKEAASITGLPPQYL-SSSSENPASAEAIIATDSRIVKMAERKG-RIFGGAWERAMRIAMQIMGR---EVTEEYTRL 380 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~-g~~~~n~~Sg~Al~~~~~~l~~k~~~~~-~~~~~~l~~~~~~~~~~~~~---~~~~~~~~i 380 (480) +.....|+..+++|..-| |...+..+||..= .......++.++ ..++..|++++++++...++ ...++..++ T Consensus 304 ~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D---~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~ 380 (461) T protein:vir:80 304 DYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYD---VMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEW 380 (461) T ss_pred HHHHHHHhhhhcCCeeeeecccCCccccchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccce Confidence 777889999999998766 4334444666542 233444444444 56788999999988765443 344455689 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC-CChhHHHHHHHHHHHHHHHHHHHHhc-ccccCcccccCC Q lcl|NC_021297. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG-YTATQREQMRDWDKQETEDMIDTLYS-TTKAQADATPKP 458 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 458 (480) ++.|.+-..++..+.|+...+.+++. .+++. .| ++++++.+.. ..+ ...... 
...+......+- T Consensus 381 ~i~f~~L~~~s~kekAe~~~~~a~a~------~~~~~-~g~is~~e~r~~l---~~~----~~~~~~~~~~~~~~~~~~~ 446 (461) T protein:vir:80 381 AIEFNPLWNLDSKTDAEVRKLTAEAD------QIYIV-NGVLDPDEVKETR---FGR----FGLENSSKFSGDSAEIDKL 446 (461) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHH------HHHHh-cCCCCHHHHHHHH---HHh----cCCCCCccCCCCCchhhhh Confidence 99999999999999998777665532 22222 34 3344322110 000 000000 000000000000 Q ss_pred CCCCCCcccccCCCC Q lcl|NC_021297. 459 TVTDTKTETQTSPSG 473 (480) Q Consensus 459 ~~~d~~~~~~~~p~~ 473 (480) ...+.+..+++.+.| T Consensus 447 ~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 447 AKLVYDAYAKKNADG 461 (461) T ss_pred hhhccccccccCCCC Confidence 001122233333333 No 85 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.68 E-value=1.1e-14 Score=97.02 Aligned_cols=450 Identities=12% Similarity=0.084 Sum_probs=211.6 Q ss_pred CCCHH-------------HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHH Q lcl|NC_021297. 1 MTTYH-------------EHVERLQGLLARD-------LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 1 ~~~~~-------------~~i~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iV 60 (480) |.|++ ++-+++...+... +....+-.+||+|.|--... ...-...+.-.+++|.++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~v 79 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEV-LQVLKDRGQPMTIHNLIAPTV 79 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEeccHHHHH Confidence 22221 2223333333222 23334566899998743221 011111223457899999999 Q ss_pred HHHHhhhccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 
61 RTLSDRLDIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 61 d~~~~~l~~~~~~---~~---~d~--~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) +..+++..-+... .| +++ +..+. +..++..|+++.....+..+++++|.||+-++... ...++.+ T Consensus 80 ~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i 156 (714) T protein:vir:27 80 DGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEF 156 (714) T ss_pred HHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCe Confidence 9998876543221 22 222 23333 44456678999999999999999999999887653 2345678 Q ss_pred EEEEEccceeEEEEeCCCCC----eeEEEE-EEEEEec------------------------------------------ Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------ 161 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------ 161 (480) ++..++|.+++ ||+.... ...+.+ +.|.+.+ T Consensus 157 ~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:27 157 KVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 99999999865 5553211 111111 1111100 Q ss_pred --------------CCceEEEEEEEeCC------------cEEEEEecCCcc--------------------ccc----- Q lcl|NC_021297. 162 --------------DVAVPDRATLYLPD------------ETVPLRRNGGLN--------------------DQW----- 190 (480) Q Consensus 162 --------------~~~~~~~~~~~~~~------------~~~~~~~~~~~~--------------------~~~----- 190 (480) +......+++|... ++++|....... 
..+ T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~ 314 (714) T protein:vir:27 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPH 314 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCc Confidence 01122334455422 222222110000 000 Q ss_pred ccccccccccccccceEEeecc--cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc Q lcl|NC_021297. 191 VVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG 268 (480) Q Consensus 191 ~~~~~~~~~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 268 (480) +....+.|-+.+++|+|||.-. ...+.++|. .+.+++.++.+|...|.....+ .++...+..|...+. +.. T Consensus 315 ~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~---vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~ 387 (714) T protein:vir:27 315 FIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGL---ISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DND 387 (714) T ss_pred ccccCCCCCCCCceeEEEEeeeeeeccCceeeh---hhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHH Confidence 0001122222345666665432 122446663 3568999999999999988765 344443333332221 100 Q ss_pred ccchhhhhhhhhccc-CC----C--CceEEEec-cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHH Q lcl|NC_021297. 269 ENTTLDIYYGRILTL-AS----E--AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATD 340 (480) Q Consensus 269 ~~~~~~~~~~~~~~~-~g----~--~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~ 340 (480) -...+ ..++.++.. ++ . ...+...+ ..-...+...++.....|-.++|+++..+|.. 
.|..||+|+..+- T Consensus 388 ~~e~~-arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~-~na~SGvAi~~rq 465 (714) T protein:vir:27 388 LMEQI-ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD-SGATSGVAISNLV 465 (714) T ss_pred HHHhc-cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC-ccchhHHHHHHHH Confidence 00001 111222211 11 1 12232222 22355677778888888888999999999965 4667999998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCC---------CCccc---c--------------------eeeeEEe Q lcl|NC_021297. 341 SRIVKMAERKGRIFGGAWERAMRIAMQ----IMGR---------EVTEE---Y--------------------TRLETVW 384 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~~~~~~----~~~~---------~~~~~---~--------------------~~i~v~f 384 (480) ..-..........+..+.+++.++++. +.+. ....+ . +++.+.= T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:27 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 665555555556666666666665443 3221 00000 0 1122221 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~---~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+..+....+.+..++++.+..... +...++++.+.+. . ++++.+..++. ....+ .+++..+ T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~--------~~~~~-----~~~~~~~ 610 (714) T protein:vir:27 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAA--------LGTPK-----SPDEMTP 610 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHH--------cCCCC-----Cccccch Confidence 2222222355667777777643211 2234555666652 1 23343333221 01000 0011111 Q ss_pred CCCccc-ccCC--CCCCCCCCC Q lcl|NC_021297. 
462 DTKTET-QTSP--SGFNRTKTR 480 (480) Q Consensus 462 d~~~~~-~~~p--~~~~~~~~~ 480 (480) ..+..+ +.+. ..-...+.| T Consensus 611 e~q~~~~~~q~~~~~q~~lq~~ 632 (714) T protein:vir:27 611 EEQEVAAQQQALQQQQAELQMR 632 (714) T ss_pred hhHHHHHHHHHHHHHHHHHHHH Confidence 100000 0000 000000001 No 86 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.68 E-value=1.1e-14 Score=97.02 Aligned_cols=450 Identities=12% Similarity=0.084 Sum_probs=211.6 Q ss_pred CCCHH-------------HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHH Q lcl|NC_021297. 1 MTTYH-------------EHVERLQGLLARD-------LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 1 ~~~~~-------------~~i~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iV 60 (480) |.|++ ++-+++...+... +....+-.+||+|.|--... ...-...+.-.+++|.++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~v 79 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEV-LQVLKDRGQPMTIHNLIAPTV 79 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEeccHHHHH Confidence 22221 2223333333222 23334566899998743221 011111223457899999999 Q ss_pred HHHHhhhccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 61 RTLSDRLDIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 61 d~~~~~l~~~~~~---~~---~d~--~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) +..+++..-+... .| +++ +..+. +..++..|+++.....+..+++++|.||+-++... 
...++.+ T Consensus 80 ~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i 156 (714) T protein:vir:10 80 DGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEF 156 (714) T ss_pred HHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCe Confidence 9998876543221 22 222 23333 44456678999999999999999999999887653 2345678 Q ss_pred EEEEEccceeEEEEeCCCCC----eeEEEE-EEEEEec------------------------------------------ Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------ 161 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------ 161 (480) ++..++|.+++ ||+.... ...+.+ +.|.+.+ T Consensus 157 ~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:10 157 KVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 99999999865 5553211 111111 1111100 Q ss_pred --------------CCceEEEEEEEeCC------------cEEEEEecCCcc--------------------ccc----- Q lcl|NC_021297. 162 --------------DVAVPDRATLYLPD------------ETVPLRRNGGLN--------------------DQW----- 190 (480) Q Consensus 162 --------------~~~~~~~~~~~~~~------------~~~~~~~~~~~~--------------------~~~----- 190 (480) +......+++|... ++++|....... ..+ T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~ 314 (714) T protein:vir:10 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPH 314 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCc Confidence 01122334455422 222222110000 000 Q ss_pred ccccccccccccccceEEeecc--cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc Q lcl|NC_021297. 
191 VVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG 268 (480) Q Consensus 191 ~~~~~~~~~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 268 (480) +....+.|-+.+++|+|||.-. ...+.++|. .+.+++.++.+|...|.....+ .++...+..|...+. +.. T Consensus 315 ~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~---vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~ 387 (714) T protein:vir:10 315 FIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGL---ISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DND 387 (714) T ss_pred ccccCCCCCCCCceeEEEEeeeeeeccCceeeh---hhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHH Confidence 0001122222345666665432 122446663 3568999999999999988765 344443333332221 100 Q ss_pred ccchhhhhhhhhccc-CC----C--CceEEEec-cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHH Q lcl|NC_021297. 269 ENTTLDIYYGRILTL-AS----E--AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATD 340 (480) Q Consensus 269 ~~~~~~~~~~~~~~~-~g----~--~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~ 340 (480) -...+ ..++.++.. ++ . ...+...+ ..-...+...++.....|-.++|+++..+|.. .|..||+|+..+- T Consensus 388 ~~e~~-arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~-~na~SGvAi~~rq 465 (714) T protein:vir:10 388 LMEQI-ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD-SGATSGVAISNLV 465 (714) T ss_pred HHHhc-cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC-ccchhHHHHHHHH Confidence 00001 111222211 11 1 12232222 22355677778888888888999999999965 4667999998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCC---------CCccc---c--------------------eeeeEEe Q lcl|NC_021297. 341 SRIVKMAERKGRIFGGAWERAMRIAMQ----IMGR---------EVTEE---Y--------------------TRLETVW 384 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~~~~~~----~~~~---------~~~~~---~--------------------~~i~v~f 384 (480) ..-..........+..+.+++.++++. +.+. ....+ . 
+++.+.= T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:10 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 665555555556666666666665443 3221 00000 0 1122221 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~---~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+..+....+.+..++++.+..... +...++++.+.+. . ++++.+..++. ....+ .+++..+ T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~--------~~~~~-----~~~~~~~ 610 (714) T protein:vir:10 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAA--------LGTPK-----SPDEMTP 610 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHH--------cCCCC-----Cccccch Confidence 2222222355667777777643211 2234555666652 1 23343333221 01000 0011111 Q ss_pred CCCccc-ccCC--CCCCCCCCC Q lcl|NC_021297. 462 DTKTET-QTSP--SGFNRTKTR 480 (480) Q Consensus 462 d~~~~~-~~~p--~~~~~~~~~ 480 (480) ..+..+ +.+. ..-...+.| T Consensus 611 e~q~~~~~~q~~~~~q~~lq~~ 632 (714) T protein:vir:10 611 EEQEVAAQQQALQQQQAELQMR 632 (714) T ss_pred hhHHHHHHHHHHHHHHHHHHHH Confidence 100000 0000 000000001 No 87 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.68 E-value=1.1e-14 Score=97.02 Aligned_cols=450 Identities=12% Similarity=0.084 Sum_probs=211.6 Q ss_pred CCCHH-------------HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHH Q lcl|NC_021297. 
1 MTTYH-------------EHVERLQGLLARD-------LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 1 ~~~~~-------------~~i~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iV 60 (480) |.|++ ++-+++...+... +....+-.+||+|.|--... ...-...+.-.+++|.++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~v 79 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEV-LQVLKDRGQPMTIHNLIAPTV 79 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEeccHHHHH Confidence 22221 2223333333222 23334566899998743221 011111223457899999999 Q ss_pred HHHHhhhccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 61 RTLSDRLDIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 61 d~~~~~l~~~~~~---~~---~d~--~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) +..+++..-+... .| +++ +..+. +..++..|+++.....+..+++++|.||+-++... ...++.+ T Consensus 80 ~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i 156 (714) T protein:vir:99 80 DGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEF 156 (714) T ss_pred HHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCe Confidence 9998876543221 22 222 23333 44456678999999999999999999999887653 2345678 Q ss_pred EEEEEccceeEEEEeCCCCC----eeEEEE-EEEEEec------------------------------------------ Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------ 161 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------ 161 (480) ++..++|.+++ ||+.... 
...+.+ +.|.+.+ T Consensus 157 ~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:99 157 KVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 99999999865 5553211 111111 1111100 Q ss_pred --------------CCceEEEEEEEeCC------------cEEEEEecCCcc--------------------ccc----- Q lcl|NC_021297. 162 --------------DVAVPDRATLYLPD------------ETVPLRRNGGLN--------------------DQW----- 190 (480) Q Consensus 162 --------------~~~~~~~~~~~~~~------------~~~~~~~~~~~~--------------------~~~----- 190 (480) +......+++|... ++++|....... ..+ T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~ 314 (714) T protein:vir:99 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPH 314 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCc Confidence 01122334455422 222222110000 000 Q ss_pred ccccccccccccccceEEeecc--cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc Q lcl|NC_021297. 191 VVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG 268 (480) Q Consensus 191 ~~~~~~~~~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 268 (480) +....+.|-+.+++|+|||.-. ...+.++|. .+.+++.++.+|...|.....+ .++...+..|...+. +.. T Consensus 315 ~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~---vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~ 387 (714) T protein:vir:99 315 FIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGL---ISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DND 387 (714) T ss_pred ccccCCCCCCCCceeEEEEeeeeeeccCceeeh---hhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHH Confidence 0001122222345666665432 122446663 3568999999999999988765 344443333332221 100 Q ss_pred ccchhhhhhhhhccc-CC----C--CceEEEec-cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHH Q lcl|NC_021297. 
269 ENTTLDIYYGRILTL-AS----E--AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATD 340 (480) Q Consensus 269 ~~~~~~~~~~~~~~~-~g----~--~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~ 340 (480) -...+ ..++.++.. ++ . ...+...+ ..-...+...++.....|-.++|+++..+|.. .|..||+|+..+- T Consensus 388 ~~e~~-arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~-~na~SGvAi~~rq 465 (714) T protein:vir:99 388 LMEQI-ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD-SGATSGVAISNLV 465 (714) T ss_pred HHHhc-cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC-ccchhHHHHHHHH Confidence 00001 111222211 11 1 12232222 22355677778888888888999999999965 4667999998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCC---------CCccc---c--------------------eeeeEEe Q lcl|NC_021297. 341 SRIVKMAERKGRIFGGAWERAMRIAMQ----IMGR---------EVTEE---Y--------------------TRLETVW 384 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~~~~~~----~~~~---------~~~~~---~--------------------~~i~v~f 384 (480) ..-..........+..+.+++.++++. +.+. ....+ . +++.+.= T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:99 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 665555555556666666666665443 3221 00000 0 1122221 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~---~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+..+....+.+..++++.+..... +...++++.+.+. . ++++.+..++. 
....+ .+++..+ T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~--------~~~~~-----~~~~~~~ 610 (714) T protein:vir:99 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAA--------LGTPK-----SPDEMTP 610 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHH--------cCCCC-----Cccccch Confidence 2222222355667777777643211 2234555666652 1 23343333221 01000 0011111 Q ss_pred CCCccc-ccCC--CCCCCCCCC Q lcl|NC_021297. 462 DTKTET-QTSP--SGFNRTKTR 480 (480) Q Consensus 462 d~~~~~-~~~p--~~~~~~~~~ 480 (480) ..+..+ +.+. ..-...+.| T Consensus 611 e~q~~~~~~q~~~~~q~~lq~~ 632 (714) T protein:vir:99 611 EEQEVAAQQQALQQQQAELQMR 632 (714) T ss_pred hhHHHHHHHHHHHHHHHHHHHH Confidence 100000 0000 000000001 No 88 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.68 E-value=1.1e-14 Score=97.02 Aligned_cols=450 Identities=12% Similarity=0.084 Sum_probs=211.6 Q ss_pred CCCHH-------------HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHH Q lcl|NC_021297. 1 MTTYH-------------EHVERLQGLLARD-------LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 1 ~~~~~-------------~~i~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iV 60 (480) |.|++ ++-+++...+... +....+-.+||+|.|--... ...-...+.-.+++|.++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~v 79 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEV-LQVLKDRGQPMTIHNLIAPTV 79 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEeccHHHHH Confidence 22221 2223333333222 23334566899998743221 011111223457899999999 Q ss_pred HHHHhhhccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 
61 RTLSDRLDIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 61 d~~~~~l~~~~~~---~~---~d~--~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) +..+++..-+... .| +++ +..+. +..++..|+++.....+..+++++|.||+-++... ...++.+ T Consensus 80 ~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i 156 (714) T protein:vir:32 80 DGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEF 156 (714) T ss_pred HHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCe Confidence 9998876543221 22 222 23333 44456678999999999999999999999887653 2345678 Q ss_pred EEEEEccceeEEEEeCCCCC----eeEEEE-EEEEEec------------------------------------------ Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------ 161 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------ 161 (480) ++..++|.+++ ||+.... ...+.+ +.|.+.+ T Consensus 157 ~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:32 157 KVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 99999999865 5553211 111111 1111100 Q ss_pred --------------CCceEEEEEEEeCC------------cEEEEEecCCcc--------------------ccc----- Q lcl|NC_021297. 162 --------------DVAVPDRATLYLPD------------ETVPLRRNGGLN--------------------DQW----- 190 (480) Q Consensus 162 --------------~~~~~~~~~~~~~~------------~~~~~~~~~~~~--------------------~~~----- 190 (480) +......+++|... ++++|....... 
..+ T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~ 314 (714) T protein:vir:32 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPH 314 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCc Confidence 01122334455422 222222110000 000 Q ss_pred ccccccccccccccceEEeecc--cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc Q lcl|NC_021297. 191 VVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG 268 (480) Q Consensus 191 ~~~~~~~~~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 268 (480) +....+.|-+.+++|+|||.-. ...+.++|. .+.+++.++.+|...|.....+ .++...+..|...+. +.. T Consensus 315 ~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~---vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~ 387 (714) T protein:vir:32 315 FIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGL---ISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DND 387 (714) T ss_pred ccccCCCCCCCCceeEEEEeeeeeeccCceeeh---hhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHH Confidence 0001122222345666665432 122446663 3568999999999999988765 344443333332221 100 Q ss_pred ccchhhhhhhhhccc-CC----C--CceEEEec-cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHH Q lcl|NC_021297. 269 ENTTLDIYYGRILTL-AS----E--AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATD 340 (480) Q Consensus 269 ~~~~~~~~~~~~~~~-~g----~--~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~ 340 (480) -...+ ..++.++.. ++ . ...+...+ ..-...+...++.....|-.++|+++..+|.. 
.|..||+|+..+- T Consensus 388 ~~e~~-arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~-~na~SGvAi~~rq 465 (714) T protein:vir:32 388 LMEQI-ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD-SGATSGVAISNLV 465 (714) T ss_pred HHHhc-cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC-ccchhHHHHHHHH Confidence 00001 111222211 11 1 12232222 22355677778888888888999999999965 4667999998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCC---------CCccc---c--------------------eeeeEEe Q lcl|NC_021297. 341 SRIVKMAERKGRIFGGAWERAMRIAMQ----IMGR---------EVTEE---Y--------------------TRLETVW 384 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~~~~~~----~~~~---------~~~~~---~--------------------~~i~v~f 384 (480) ..-..........+..+.+++.++++. +.+. ....+ . +++.+.= T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:32 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 665555555556666666666665443 3221 00000 0 1122221 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~---~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+..+....+.+..++++.+..... +...++++.+.+. . ++++.+..++. ....+ .+++..+ T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~--------~~~~~-----~~~~~~~ 610 (714) T protein:vir:32 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAA--------LGTPK-----SPDEMTP 610 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHH--------cCCCC-----Cccccch Confidence 2222222355667777777643211 2234555666652 1 23343333221 01000 0011111 Q ss_pred CCCccc-ccCC--CCCCCCCCC Q lcl|NC_021297. 
462 DTKTET-QTSP--SGFNRTKTR 480 (480) Q Consensus 462 d~~~~~-~~~p--~~~~~~~~~ 480 (480) ..+..+ +.+. ..-...+.| T Consensus 611 e~q~~~~~~q~~~~~q~~lq~~ 632 (714) T protein:vir:32 611 EEQEVAAQQQALQQQQAELQMR 632 (714) T ss_pred hhHHHHHHHHHHHHHHHHHHHH Confidence 100000 0000 000000001 No 89 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.68 E-value=1.1e-14 Score=97.02 Aligned_cols=450 Identities=12% Similarity=0.084 Sum_probs=211.6 Q ss_pred CCCHH-------------HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHH Q lcl|NC_021297. 1 MTTYH-------------EHVERLQGLLARD-------LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 1 ~~~~~-------------~~i~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iV 60 (480) |.|++ ++-+++...+... +....+-.+||+|.|--... ...-...+.-.+++|.++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~v 79 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEV-LQVLKDRGQPMTIHNLIAPTV 79 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEeccHHHHH Confidence 22221 2223333333222 23334566899998743221 011111223457899999999 Q ss_pred HHHHhhhccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 61 RTLSDRLDIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 61 d~~~~~l~~~~~~---~~---~d~--~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) +..+++..-+... .| +++ +..+. +..++..|+++.....+..+++++|.||+-++... 
...++.+ T Consensus 80 ~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i 156 (714) T protein:vir:81 80 DGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEF 156 (714) T ss_pred HHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCe Confidence 9998876543221 22 222 23333 44456678999999999999999999999887653 2345678 Q ss_pred EEEEEccceeEEEEeCCCCC----eeEEEE-EEEEEec------------------------------------------ Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------ 161 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------ 161 (480) ++..++|.+++ ||+.... ...+.+ +.|.+.+ T Consensus 157 ~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:81 157 KVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 99999999865 5553211 111111 1111100 Q ss_pred --------------CCceEEEEEEEeCC------------cEEEEEecCCcc--------------------ccc----- Q lcl|NC_021297. 162 --------------DVAVPDRATLYLPD------------ETVPLRRNGGLN--------------------DQW----- 190 (480) Q Consensus 162 --------------~~~~~~~~~~~~~~------------~~~~~~~~~~~~--------------------~~~----- 190 (480) +......+++|... ++++|....... ..+ T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~ 314 (714) T protein:vir:81 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPH 314 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCc Confidence 01122334455422 222222110000 000 Q ss_pred ccccccccccccccceEEeecc--cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc Q lcl|NC_021297. 
191 VVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG 268 (480) Q Consensus 191 ~~~~~~~~~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 268 (480) +....+.|-+.+++|+|||.-. ...+.++|. .+.+++.++.+|...|.....+ .++...+..|...+. +.. T Consensus 315 ~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~---vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~ 387 (714) T protein:vir:81 315 FIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGL---ISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DND 387 (714) T ss_pred ccccCCCCCCCCceeEEEEeeeeeeccCceeeh---hhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHH Confidence 0001122222345666665432 122446663 3568999999999999988765 344443333332221 100 Q ss_pred ccchhhhhhhhhccc-CC----C--CceEEEec-cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHH Q lcl|NC_021297. 269 ENTTLDIYYGRILTL-AS----E--AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATD 340 (480) Q Consensus 269 ~~~~~~~~~~~~~~~-~g----~--~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~ 340 (480) -...+ ..++.++.. ++ . ...+...+ ..-...+...++.....|-.++|+++..+|.. .|..||+|+..+- T Consensus 388 ~~e~~-arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~-~na~SGvAi~~rq 465 (714) T protein:vir:81 388 LMEQI-ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD-SGATSGVAISNLV 465 (714) T ss_pred HHHhc-cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC-ccchhHHHHHHHH Confidence 00001 111222211 11 1 12232222 22355677778888888888999999999965 4667999998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCC---------CCccc---c--------------------eeeeEEe Q lcl|NC_021297. 341 SRIVKMAERKGRIFGGAWERAMRIAMQ----IMGR---------EVTEE---Y--------------------TRLETVW 384 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~~~~~~----~~~~---------~~~~~---~--------------------~~i~v~f 384 (480) ..-..........+..+.+++.++++. +.+. ....+ . 
+++.+.= T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:81 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 665555555556666666666665443 3221 00000 0 1122221 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~---~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+..+....+.+..++++.+..... +...++++.+.+. . ++++.+..++. ....+ .+++..+ T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~--------~~~~~-----~~~~~~~ 610 (714) T protein:vir:81 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAA--------LGTPK-----SPDEMTP 610 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHH--------cCCCC-----Cccccch Confidence 2222222355667777777643211 2234555666652 1 23343333221 01000 0011111 Q ss_pred CCCccc-ccCC--CCCCCCCCC Q lcl|NC_021297. 462 DTKTET-QTSP--SGFNRTKTR 480 (480) Q Consensus 462 d~~~~~-~~~p--~~~~~~~~~ 480 (480) ..+..+ +.+. ..-...+.| T Consensus 611 e~q~~~~~~q~~~~~q~~lq~~ 632 (714) T protein:vir:81 611 EEQEVAAQQQALQQQQAELQMR 632 (714) T ss_pred hhHHHHHHHHHHHHHHHHHHHH Confidence 100000 0000 000000001 No 90 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.67 E-value=6.6e-14 Score=92.83 Aligned_cols=443 Identities=14% Similarity=0.067 Sum_probs=230.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc-----cc----hh---h----hhhccccchHHHHHHHHH Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG-----AP----PE---L----AYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~-----~~----~~---~----~~~~~~~n~~~~iVd~~~ 64 (480) |.=....|..+..+...++.+.....+-|+|-..-+..+.. .. .. + +++-....|++-+|+..+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 76666666666666666555554444557664322211100 00 00 0 122234568899999999 Q ss_pred hhhccC-ccccC-----C----CchhHHHHHHHHH----------hcCHHHHHHHHHHHHhhcCeEEEEEecCccccc-C Q lcl|NC_021297. 65 DRLDIE-GFRIS-----E----DSEGLEELWNWWQ----------ANDLDEESVLGHDDSLTFGRAYITVSHPDVESG-D 123 (480) Q Consensus 65 ~~l~~~-~~~~~-----~----d~~~~~~l~~i~~----------~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~-d 123 (480) ...++. |+.+. . +++..+.+...|+ ..+|...+..+++.....|.+|+.+........ + T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 999987 55321 1 2334455555554 236888888899999999999998754322111 1 Q ss_pred CCce-eEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccc Q lcl|NC_021297. 124 PAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG 202 (480) Q Consensus 124 ~~~~-~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (480) ..+. .++..++|+.+-.-++ ....+..+|.+ +..|.+..+.++..+- +. .....+. 
T Consensus 161 g~~~~l~lq~iepd~l~~~~~--~~~~i~~GVe~----d~~Gr~~aY~i~~~hP--------gd---------~~~~~~~ 217 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSD--ESNRLNQGVFV----DDWGRPEKYLVYKSRP--------VS---------GRQMETK 217 (502) T ss_pred CcccceEEEEecchhcCCCCC--CCCeeEeeeEE----CCCCceEEEEEeecCC--------CC---------Cccccee Confidence 1111 4789999988742222 23344444432 4444444333332211 00 0112234 Q ss_pred ccc---eEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc-----ccccccchhh Q lcl|NC_021297. 203 VVP---VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL-----TNDGENTTLD 274 (480) Q Consensus 203 ~iP---vv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~-----~~~~~~~~~~ 274 (480) +|| |+|+....+.+..-|.|.+.+ ++..+..++....-........+.--.+|+....+.. .+........ T Consensus 218 rvpA~~vlH~f~~~r~gQ~RGis~lap-vl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~ 296 (502) T protein:vir:79 218 EVDAERMLHLKFVRRLHQMRGTSLLSG-VLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELT 296 (502) T ss_pred EechhheEEeecccCCccccCCchHHH-HHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCcccccc Confidence 566 788888888888999999987 4444444544332222222222221223332221111 1111222334 Q ss_pred hhhhhhc-c-cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 275 IYYGRIL-T-LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGR 352 (480) Q Consensus 275 ~~~~~~~-~-~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~ 352 (480) +.++.++ . .+|.+.++.+.+. ...+|...++..++.|+...|+|-+.|.++. +. |-.|+++.+......+...+. 
T Consensus 297 l~pG~i~~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~-s~-nySs~R~~~~e~~r~~~~~q~ 373 (502) T protein:vir:79 297 IQPGIIYDDLKPGEEIGMVKSDR-PNPNLETFRNGQLRAVAAGSRLSFSSTARNY-NG-TYSAQRQELVESTDGYLILQD 373 (502) T ss_pred ccCCccccccCCCceeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-cc-hHHHHHHHHHHHHHHHHHHHH Confidence 5566543 2 3566666654332 2345666678889999999999999998774 32 556778888888777777777 Q ss_pred HHHHHHHH-HHHHH--HHHhCCCCc-----ccceeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCC Q lcl|NC_021297. 353 IFGGAWER-AMRIA--MQIMGREVT-----EEYTRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDLGYT 422 (480) Q Consensus 353 ~~~~~l~~-~~~~~--~~~~~~~~~-----~~~~~i~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~ 422 (480) .|...+.+ +++.. .++..+.+. ....-+.+.|..|.. .|....+.+..+.+.+| +.|.+.++...|.. T Consensus 374 ~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~G--l~t~~~~~a~~G~D 451 (502) T protein:vir:79 374 WFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGG--AATESDWVRAGGRN 451 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHcCCC Confidence 77766554 33322 233333222 112235677855443 46677788888888876 66777788888987 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCC-CCcccccCCCCCCCCC Q lcl|NC_021297. 423 ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTD-TKTETQTSPSGFNRTK 478 (480) Q Consensus 423 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~~~~~~p~~~~~~~ 478 (480) .++.-+. ++++.+.. +.. + ..-..++...+..+. ..+.++.++++.+.-. 
T Consensus 452 ~~~v~~q---~a~e~~~~-~~~-G-l~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 452 PDDVKRR---RKAEIDEN-RKL-D-LVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHHHH---HHHHHHHH-HHc-C-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 6653321 22221111 111 1 010111111111111 1111122222222222 No 91 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.66 E-value=9.5e-16 Score=102.95 Aligned_cols=472 Identities=10% Similarity=0.009 Sum_probs=205.2 Q ss_pred CCCHHHHHHHHHHHHHHHH-------HHHHHHHHHhcccCcchhcCcccchhhh-hhccccchHHHHHHHHHhhhccCcc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDL-------PNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~-------~r~~~~~~YY~g~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) |.|-+..+.++...+.... ....+-.+||+|.|--... ...++ .-+.++|.++.+|+..+++-.-+-. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~----~~~l~~q~rp~~N~i~~~v~~v~g~e~~nr~ 76 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWL----SQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHH----HHHHHhcCCCcccchHHHHHHHHhhHHhCCc Confidence 9998888877776554322 2334556899998753321 11111 2345789999999998887532211 Q ss_pred ---cc---CCCchhHHHHHH----HHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEE----cccee Q lcl|NC_021297. 73 ---RI---SEDSEGLEELWN----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE----SPLYM 138 (480) Q Consensus 73 ---~~---~~d~~~~~~l~~----i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~----~p~~~ 138 (480) .. .+|.+..+.++. 
+...|+.+.....+..+++++|.||+-|..+-......++..+|+.+ ++.++ T Consensus 77 d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v 156 (725) T protein:vir:10 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHV 156 (725) T ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhHc Confidence 11 234444444444 44567889999999999999999998774322111122333444433 34444 Q ss_pred EEEEeCCCCCe----eEEE-EEEEEEec------------------------------CCceEEEEEEEeCC-------- Q lcl|NC_021297. 139 YAELDPRNTRR----VTRA-VRLYTTRD------------------------------DVAVPDRATLYLPD-------- 175 (480) Q Consensus 139 ~~v~d~~~~~~----~~~~-~~~~~~~~------------------------------~~~~~~~~~~~~~~-------- 175 (480) + |||..... ..++ ++.|...+ .......+++|... T Consensus 157 ~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~ 234 (725) T protein:vir:10 157 I--WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) T ss_pred c--cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEeeEEEE Confidence 3 66533111 1111 11111100 01112223333321 Q ss_pred -------cEEEEEecC----------Ccc------------------cccccccccccccccccceEEeeccc--ccCcc Q lcl|NC_021297. 176 -------ETVPLRRNG----------GLN------------------DQWVVDGDVIKHGLGVVPVVPLTNDP--RLGNR 218 (480) Q Consensus 176 -------~~~~~~~~~----------~~~------------------~~~~~~~~~~~~~~~~iPvv~~~n~~--~~~~~ 218 (480) +++.|.... .+. .+..+..++.+.+-+.+|+|||.-.. ..+.+ T Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~ 314 (725) T protein:vir:10 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) T ss_pred eccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcc Confidence 111111100 000 00011112223333456776664321 22334 Q ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhhhhhcccCCC--CceEEEec Q lcl|NC_021297. 
219 YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLASE--AAKISEFK 294 (480) Q Consensus 219 ~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~--~~~~~~~~~~~~~~~~~g~--~~~~~~~~ 294 (480) ++.|-| +.+++.+|.+|..+|.+...+..........-....+..... ......-+..+.+...+|. ...+...+ T Consensus 315 ~~~G~v-r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~ 393 (725) T protein:vir:10 315 VYEGVV-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYE 393 (725) T ss_pred eeeeee-ccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcccccccCcccC Confidence 433334 568999999999999998887543322111110000000000 0000000000000001111 01111112 Q ss_pred -cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----h Q lcl|NC_021297. 295 -AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI----M 369 (480) Q Consensus 295 -~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~----~ 369 (480) ..-...+...++.....|-.++|+++..+|..+ |..||+|+.................+..+.+++.++++.+ . T Consensus 394 ~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~-n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~ 472 (725) T protein:vir:10 394 NPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIY 472 (725) T ss_pred CCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 223446667788888888899999999998654 5589999999877777666667777777777776655543 2 Q ss_pred CC---------CCcccc------------------------eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH--HH Q lcl|NC_021297. 370 GR---------EVTEEY------------------------TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPK--EQ 414 (480) Q Consensus 370 ~~---------~~~~~~------------------------~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~--et 414 (480) +. .....+ +++.|.=.+..+.-..+.+..++++.++.....+. 
.+ T Consensus 473 ~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~ 552 (725) T protein:vir:10 473 DVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPEYQLL 552 (725) T ss_pred CCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchhHHHH Confidence 21 000000 12222222222222345666777887765443332 22 Q ss_pred HHHhCCCChhH-HHHHHHHHHHHHHH--------------HHHHHhcc-cccCcccccC-----CCCCCC-CcccccCCC Q lcl|NC_021297. 415 ARIDLGYTATQ-REQMRDWDKQETED--------------MIDTLYST-TKAQADATPK-----PTVTDT-KTETQTSPS 472 (480) Q Consensus 415 ~~~~lg~~~~~-~~~~~~~~~~~~~~--------------~~~~~~~~-~~~~~~~~~~-----~~~~d~-~~~~~~~p~ 472 (480) ++..+...+-+ ++++.+..+.+... ........ .+..+..... ....+- ....+..-. T Consensus 553 l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~ 632 (725) T protein:vir:10 553 LLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSL 632 (725) T ss_pred HHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33323322111 22222222211000 00000000 0000000000 000000 000000000 Q ss_pred CCCCCCCC Q lcl|NC_021297. 473 GFNRTKTR 480 (480) Q Consensus 473 ~~~~~~~~ 480 (480) ..+-.++. T Consensus 633 ~~~a~~~~ 640 (725) T protein:vir:10 633 QIDAAKVE 640 (725) T ss_pred HHHHHHHH Confidence 00000000 No 92 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.65 E-value=2.5e-16 Score=106.08 Aligned_cols=453 Identities=11% Similarity=0.035 Sum_probs=196.1 Q ss_pred CC-------CHHHHHHHHHHHHH---HHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MT-------TYHEHVERLQGLLA---RDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~-------~~~~~i~~l~~~~~---~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) .+ +.+.+. ++...+. 
.-+..-.+-.+||+|.|--... ...-+..+.-.+++|.++.+|+..+++..-+ T Consensus 14 ~~~~~~~~~~~~~~~-~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~-~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~n 91 (772) T protein:vir:10 14 LPPAGDTPLTVDEYA-DINYEIEDQPAWRAVADKEMDYADGNQLDTEL-LRRQQALGIPPAVEDLIGPALLSLQGYEAVT 91 (772) T ss_pred CCcccccccCHHHHH-HHHHHHhccHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEEcchHHHHHHHHHHHHhc Confidence 11 112222 2222221 2223334567899998743221 0011112233578999999999998876544 Q ss_pred ccc---cC----CCchhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeE Q lcl|NC_021297. 71 GFR---IS----EDSEGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMY 139 (480) Q Consensus 71 ~~~---~~----~d~~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~ 139 (480) ... .+ +|.+..+. +..++..|+++.....+..+++++|.+|+-++.... ..++.++|+.++|.++ T Consensus 92 r~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d---~~~~~i~i~~v~p~~v- 167 (772) T protein:vir:10 92 RTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESD---PFKFPYRCRPIRRDEI- 167 (772) T ss_pred CcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccC---CCCCCeEEEeeCcccc- Confidence 321 22 22233333 344566789999999999999999999998876432 2345678999999884 Q ss_pred EEEeCCCCCeeEE---EE-EEEEEe------------------------------------------------------- Q lcl|NC_021297. 140 AELDPRNTRRVTR---AV-RLYTTR------------------------------------------------------- 160 (480) Q Consensus 140 ~v~d~~~~~~~~~---~~-~~~~~~------------------------------------------------------- 160 (480) +||+.......- .+ ..|.+. T Consensus 168 -~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (772) T protein:vir:10 168 -HWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQE 246 (772) T ss_pred -eecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhhcccccc Confidence 566644221111 10 000000 Q ss_pred -----cCCceEEEEEEEeCCc------------EEEEEecC---------Cc---------------ccc-ccccccccc Q lcl|NC_021297. 
161 -----DDVAVPDRATLYLPDE------------TVPLRRNG---------GL---------------NDQ-WVVDGDVIK 198 (480) Q Consensus 161 -----~~~~~~~~~~~~~~~~------------~~~~~~~~---------~~---------------~~~-~~~~~~~~~ 198 (480) .+.+..+.+++|.... ++.|.... +. ..+ .+......| T Consensus 247 ~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p 326 (772) T protein:vir:10 247 DHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTP 326 (772) T ss_pred ccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecceeeccCCCC Confidence 0012233455554321 12221100 00 000 001112233 Q ss_pred ccccccceEEeecc--cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhh Q lcl|NC_021297. 199 HGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIY 276 (480) Q Consensus 199 ~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~ 276 (480) -+.+.+|+|+|.-. ...+.+||. .+.+++.++.+|+..|.+...+.- .+...=.|.-.+. +......+. . T Consensus 327 ~~~~~fP~vP~~g~r~~~~g~~~G~---vr~~kd~Qr~~N~~~S~~~~~l~~--~~~~~~~gav~~~--d~~~~e~~a-r 398 (772) T protein:vir:10 327 YTHRHFPYVPFFGFREDATGIPYGY---VRGMKYAQDSLNSGVSKLRWGMSV--ARVERTKGAVAMT--DAQFRRQIA-R 398 (772) T ss_pred CCCCccceEEEeeeEeccCCcccch---hhhhhhHHHHHHHHHHHHHHHHhc--ccccccCCCccch--hHHHHHhcc-C Confidence 34456777777533 223346653 367899999999999999887743 3322222222111 000000111 1 Q ss_pred hhhhccc-CC----CCceEEEec-cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 277 YGRILTL-AS----EAAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 277 ~~~~~~~-~g----~~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~ 350 (480) ++.++.. ++ ....|...+ ..-...+...++.....|-.++|+.+..+|. ..|..||+|+..+-..-....... 
T Consensus 399 p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~-~~na~SGvAi~~rq~qg~~~l~~~ 477 (772) T protein:vir:10 399 PDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGR-KGTATSGIQEQQQIEQSNQSIGRI 477 (772) T ss_pred CCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCC-CcchhhHHHHHHHHHHHHHHHHHH Confidence 1222221 11 112332222 2234577788888888899999999999995 456689999987766655555666 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCC------CCHHHHHHHHHHHHhccccCCCH-------HHHHH Q lcl|NC_021297. 351 GRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST------PTVAAKADAVSKLYANGQGPIPK-------EQARI 417 (480) Q Consensus 351 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~------~~~~~~ad~~~kl~~~~~~~~s~-------et~~~ 417 (480) ...+..+.+++.++++.+.-..... ...+.|.-.+... -|.... +. ..|.+.+.. ...+. T Consensus 478 ~Dnl~~~~~~~g~~lL~li~~~y~~-er~~RI~~~d~~~~~~~v~in~~~~-d~-----~tg~~~~~NDi~~g~yDv~i~ 550 (772) T protein:vir:10 478 MDNFRAGRTLVGELLLAMIVEDIGQ-ERTEVVIEGDAVTADRVVVLNEPQR-DP-----QTGAAYLSNDLLRTRIKVALE 550 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCC-CcEEEEecCCCCCCCceEEecccee-cc-----cccccceeccceeeeEEEEee Confidence 6666777776666655443211110 0112222111100 000000 00 000000000 00111 Q ss_pred hCCCChhHHH----HHHHHHHH---H-HHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 418 DLGYTATQRE----QMRDWDKQ---E-TEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 418 ~lg~~~~~~~----~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .-|.+....+ .|.++... + ....++.+....+.. + ..+...--....++.+-....+.. 
T Consensus 551 ~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p---~-~~ei~~~ir~~~~~~~peq~~~~~ 617 (772) T protein:vir:10 551 DVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVP---F-KRDVVEAIRAVDQQQTPEQIQQQI 617 (772) T ss_pred ccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCC---C-hHHHHHHHHHHhccCChHHHHHHH Confidence 1121111111 11111000 0 000011100000000 0 000000000000110001111111 No 93 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.63 E-value=3.3e-15 Score=99.98 Aligned_cols=457 Identities=10% Similarity=-0.012 Sum_probs=201.1 Q ss_pred CCCHHHHHHHHHHHHHHHH-------HHHHHHHHHhcccCcchhcCcccchhh-hhhccccchHHHHHHHHHhhhccCcc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDL-------PNLLEAEAYRNGTRRLKTIGIGAPPEL-AYLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~-------~r~~~~~~YY~g~~~i~~~~~~~~~~~-~~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) |.|-+..+.++...+.... ....+-.+||+|.|--... ...+ ..-+.++|.++.+|+...++-.-+-. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~----~~~l~~q~rp~~N~i~~~i~~v~g~e~~nr~ 76 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWL----SQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHH----HHHHHhcCCCcccchHHHHHHHHhhHHhCCc Confidence 9998888877776554322 2334556899998753221 1111 13355789999999998886432211 Q ss_pred ---cc---CCCchhHHHHHH----HHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEc---cceeE Q lcl|NC_021297. 73 ---RI---SEDSEGLEELWN----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVES---PLYMY 139 (480) Q Consensus 73 ---~~---~~d~~~~~~l~~----i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~---p~~~~ 139 (480) .. .+|.+..+.++. +...|+.+.....+..+++++|.||+-|..........++.++|+..+ |.. 
- T Consensus 77 d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~-~ 155 (725) T protein:vir:92 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS-H 155 (725) T ss_pred ceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChh-h Confidence 11 234444444444 445688899999999999999999987753321112223444444332 221 1 Q ss_pred EEEeCCCCCe------eEEEEEEEEEec------------------------------CCceEEEEEEEeCC-------- Q lcl|NC_021297. 140 AELDPRNTRR------VTRAVRLYTTRD------------------------------DVAVPDRATLYLPD-------- 175 (480) Q Consensus 140 ~v~d~~~~~~------~~~~~~~~~~~~------------------------------~~~~~~~~~~~~~~-------- 175 (480) ++|||..... ..+ ++.|.+.+ .......+++|... T Consensus 156 V~~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~ 234 (725) T protein:vir:92 156 VIWDSNSKLMDKSDSRHCT-VIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) T ss_pred cccCchhhccChhhHHHHH-HHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEeeeEEe Confidence 2355533210 000 11111100 01122233333321 Q ss_pred -------cEEEEEecC----------Ccc------------------cccccccccccccccccceEEeeccc--ccCcc Q lcl|NC_021297. 176 -------ETVPLRRNG----------GLN------------------DQWVVDGDVIKHGLGVVPVVPLTNDP--RLGNR 218 (480) Q Consensus 176 -------~~~~~~~~~----------~~~------------------~~~~~~~~~~~~~~~~iPvv~~~n~~--~~~~~ 218 (480) +++.|.... .+. .+..+..++.+.+-+.+|+|||.-.. ..+.+ T Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~ 314 (725) T protein:vir:92 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) T ss_pred ecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCcc Confidence 111111100 000 00011112223333456777664321 23334 Q ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhh-------hcccCCC--Cce Q lcl|NC_021297. 
219 YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGR-------ILTLASE--AAK 289 (480) Q Consensus 219 ~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~-------~~~~~g~--~~~ 289 (480) ++.|-| +.+++.++.+|..+|.+...+.........+ .. ...... ...+...... +...+|. ... T Consensus 315 ~~~G~v-r~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~--~~-~~i~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) T protein:vir:92 315 VYEGVV-RLTKDGQRLRNMIMSFNADIVARTPKKKPFF--WP-EQIAGF--EHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) T ss_pred ccccee-ccchhHHHHHHHHHHHHHHHHHhccCccccc--ch-hhhhHH--HHHHhccCccceeeccccccccccccccC Confidence 433434 5689999999999999887775433321111 11 011000 0001100000 0001111 011 Q ss_pred EEEec-cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_021297. 290 ISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ- 367 (480) Q Consensus 290 ~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~- 367 (480) +...+ ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-..........|..+.+++.++++. T Consensus 389 i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~-n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:92 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21112 223456667788888888889999999998654 558999999887666666666666666676666665543 Q ss_pred ---HhCCC---------Ccccc------------------------eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCC Q lcl|NC_021297. 368 ---IMGRE---------VTEEY------------------------TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP 411 (480) Q Consensus 368 ---~~~~~---------~~~~~------------------------~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s 411 (480) +.+.. 
....+ +++.|.=.+..+.-..+.+..++++.++.....+ T Consensus 468 I~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~ 547 (725) T protein:vir:92 468 VNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) T ss_pred HHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchh Confidence 32210 00000 1222221222111133556667777766543332 Q ss_pred H--HHHHHhCCCChhH-HHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCC--------- Q lcl|NC_021297. 412 K--EQARIDLGYTATQ-REQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKT--------- 479 (480) Q Consensus 412 ~--et~~~~lg~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~--------- 479 (480) . -++...+...+-+ +.++.+..+.+.... .... +...+..... ...+.....--.... T Consensus 548 ~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~-------~~~~--~~~~e~~q~~-~~~qqa~~~q~~~e~~~~qa~~~~ 617 (725) T protein:vir:92 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM-------GVKK--PETPEEQQWL-VEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) T ss_pred HHHHHHHHHhhcccchHHHHHHHHHHhhhchh-------ccCC--ccchhhhHHH-HHHHHHHHhhhHHHHHHHHHHHHH Confidence 2 1222222222111 222222221110000 0000 0000000000 000000000000000 Q ss_pred ------C Q lcl|NC_021297. 480 ------R 480 (480) Q Consensus 480 ------~ 480 (480) | T Consensus 618 ~qae~~k 624 (725) T protein:vir:92 618 GQAELAK 624 (725) T ss_pred HHHHHHH Confidence 0 No 94 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.62 E-value=4.5e-15 Score=99.22 Aligned_cols=449 Identities=12% Similarity=0.075 Sum_probs=206.7 Q ss_pred CCCHH---HHHHHHHHHHH---HHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc- Q lcl|NC_021297. 
1 MTTYH---EHVERLQGLLA---RDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR- 73 (480) Q Consensus 1 ~~~~~---~~i~~l~~~~~---~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~- 73 (480) ..+.+ +.+.++...+. .-+....+-.+||+|.|--... ...-...+.-.+++|.++.+|+..+++..-+... T Consensus 15 ~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~ 93 (714) T protein:vir:10 15 GSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEV-IQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDL 93 (714) T ss_pred hhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEeccHHHHHHHHHHHHHhCCcce Confidence 11111 11122111111 1123334667899998743211 0011111233578999999999998876544321 Q ss_pred --cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEE Q lcl|NC_021297. 74 --IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL 142 (480) Q Consensus 74 --~~---~d~--~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~ 142 (480) .+ +++ +..+. +..++..|+.......+..+++++|.+|+-++... ...++.+++..++|.+++ | T Consensus 94 ~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~---d~~~~~i~i~~v~p~~v~--~ 168 (714) T protein:vir:10 94 IVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---EPFGPEFKVSTVSRNEVF--W 168 (714) T ss_pred EEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeecc---CCCCCCeEEEecChhhee--e Confidence 12 222 22333 44456678999999999999999999999887653 234678999999998865 5 Q ss_pred eCCCCCe----eEEEE-EEEEE--------------------------------------------------e------c Q lcl|NC_021297. 143 DPRNTRR----VTRAV-RLYTT--------------------------------------------------R------D 161 (480) Q Consensus 143 d~~~~~~----~~~~~-~~~~~--------------------------------------------------~------~ 161 (480) |+..... ..+++ +.|.+ . . 
T Consensus 169 Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (714) T protein:vir:10 169 DWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQR 248 (714) T ss_pred ccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhccccccccccccc Confidence 5432110 11110 00000 0 0 Q ss_pred CCceEEEEEEEeCCc------------EEEEEecCC--------cc------------ccc-----cccccccccccccc Q lcl|NC_021297. 162 DVAVPDRATLYLPDE------------TVPLRRNGG--------LN------------DQW-----VVDGDVIKHGLGVV 204 (480) Q Consensus 162 ~~~~~~~~~~~~~~~------------~~~~~~~~~--------~~------------~~~-----~~~~~~~~~~~~~i 204 (480) +...+..+++|.... ++.|..... +. ..+ +....+.|-+.+++ T Consensus 249 ~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~f 328 (714) T protein:vir:10 249 ERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMF 328 (714) T ss_pred CcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCce Confidence 112344556664422 222221100 00 000 00112223344457 Q ss_pred ceEEeeccc--ccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcc Q lcl|NC_021297. 205 PVVPLTNDP--RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT 282 (480) Q Consensus 205 Pvv~~~n~~--~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~ 282 (480) |+|||+-.. ..+.+||. .+.+++.++.+|...|.....+. ++...+..|...+. +......+. .++.++. T Consensus 329 p~vP~~g~~~~~~g~~~G~---vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~gav~~~--d~~~~e~~~-rp~~vi~ 400 (714) T protein:vir:10 329 PLVPFWGYRKDKTGEPYGL---ISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLS--DNDLMEQLE-RPDGIIK 400 (714) T ss_pred eeEEecceeeeccCcccee---hhhhhhHHHHHHHHHHHHHHHHh--CCceeecccccccc--HHHHHHhcc-CCCCeEE Confidence 777665332 23345653 35689999999999999887663 44333333332211 100000011 1111221 Q ss_pred c-C----CC--CceEEEeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
283 L-A----SE--AAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 283 ~-~----g~--~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) . + |. ...+...+.. -...+...++.....|-.++|+++..+|.. .|..||+|+..+...-..........| T Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~-~na~SGvAI~~r~~qg~~~l~~~~dnl 479 (714) T protein:vir:10 401 LNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD-SGATSGVAISNLVEQGATTLAEINDNY 479 (714) T ss_pred ecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCC-cchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1 11 1223322322 345677778888888888999999999965 456799999887666555555566666 Q ss_pred HHHHHHHHHHHHHH----hCCC---------Cccc-ceeeeEEec---------------------CCCCCC-HHHHHHH Q lcl|NC_021297. 355 GGAWERAMRIAMQI----MGRE---------VTEE-YTRLETVWR---------------------DPSTPT-VAAKADA 398 (480) Q Consensus 355 ~~~l~~~~~~~~~~----~~~~---------~~~~-~~~i~v~f~---------------------~~~~~~-~~~~ad~ 398 (480) ..+.+++.++++.+ .+.. .... ...+.+.+. -+..++ ..+.+.. T Consensus 480 ~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~ 559 (714) T protein:vir:10 480 QFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQR 559 (714) T ss_pred HHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHH Confidence 66666666655433 2211 0000 000111110 012223 3445666 Q ss_pred HHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCC Q lcl|NC_021297. 399 VSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFN 475 (480) Q Consensus 399 ~~kl~~~~~~~---~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~ 475 (480) ++++.++.... +....+++.+.+. . ++++.+..++. ..... .+++..+..+. .+....... 
T Consensus 560 l~ql~~~~~p~~~~~~~~~~le~~d~p-~-~~ei~~~ir~~--------~~~~~-----~~~~~~~e~q~-~q~~~~~~~ 623 (714) T protein:vir:10 560 MSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAA--------LGTPK-----SPDEMTPEEQE-VAAQQQALQ 623 (714) T ss_pred HHHHHhhcCchhhhhHHHHHHHhcCCc-C-HHHHHHHHHHH--------cCCCC-----CccccCcchhH-HHHHHHHHH Confidence 77776543211 1123344555442 1 12232222211 00000 00000000000 000000000 Q ss_pred CCCCC Q lcl|NC_021297. 476 RTKTR 480 (480) Q Consensus 476 ~~~~~ 480 (480) ..+.+ T Consensus 624 ~~q~~ 628 (714) T protein:vir:10 624 QQQAE 628 (714) T ss_pred HHHHH Confidence 00011 No 95 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.62 E-value=3.2e-15 Score=100.05 Aligned_cols=467 Identities=10% Similarity=-0.009 Sum_probs=207.6 Q ss_pred CCCHHHHHHHHHHHHHHHH-------HHHHHHHHHhcccCcchhcCcccchhh-hhhccccchHHHHHHHHHhhhccCcc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDL-------PNLLEAEAYRNGTRRLKTIGIGAPPEL-AYLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~-------~r~~~~~~YY~g~~~i~~~~~~~~~~~-~~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) |.|-+..+.++...+.... ....+-.+||+|.|-.... ...+ ..-+.++|.++.+|+...++-.-+-. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~----~~~l~~q~rp~~N~i~~~i~~v~g~~~~nr~ 76 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWL----SQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHH----HHHHHhcCCCccccHHHHHHHHHhhHHhCCc Confidence 9998888887776554322 2334456899998754321 1111 13355789999999998876532211 Q ss_pred ---cc---CCCchhHHHHHH----HHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEE----cccee Q lcl|NC_021297. 73 ---RI---SEDSEGLEELWN----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE----SPLYM 138 (480) Q Consensus 73 ---~~---~~d~~~~~~l~~----i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~----~p~~~ 138 (480) .. .+|.+..+.++. 
+...++.+.....+..+++++|.||+-|..+-......++..+|+.+ +|.++ T Consensus 77 d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~v 156 (725) T protein:vir:77 77 DVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHV 156 (725) T ss_pred ceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccChhhc Confidence 11 234444444444 44567889999999999999999998775332111222334444433 33333 Q ss_pred EEEEeCCCCCe-e---E-EEEEEEEEec------------------------------CCceEEEEEEEeCCcE------ Q lcl|NC_021297. 139 YAELDPRNTRR-V---T-RAVRLYTTRD------------------------------DVAVPDRATLYLPDET------ 177 (480) Q Consensus 139 ~~v~d~~~~~~-~---~-~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~~------ 177 (480) +||+..... + . ++++.|...+ .......+++|....+ T Consensus 157 --~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~~~~~~ 234 (725) T protein:vir:77 157 --IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) T ss_pred --eeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEeeEEEE Confidence 455533210 0 0 0011111110 1122233444442211 Q ss_pred ---------EEEEecC----------Ccc------------------cccccccccccccccccceEEeecc--cccCcc Q lcl|NC_021297. 178 ---------VPLRRNG----------GLN------------------DQWVVDGDVIKHGLGVVPVVPLTND--PRLGNR 218 (480) Q Consensus 178 ---------~~~~~~~----------~~~------------------~~~~~~~~~~~~~~~~iPvv~~~n~--~~~~~~ 218 (480) ..|.... .+. .+..+..++.+.+-+++|+|||.-. ...+.+ T Consensus 235 ~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~ 314 (725) T protein:vir:77 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) T ss_pred ecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCcc Confidence 1111000 000 0001112222333345677665432 122334 Q ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhh------hhhhccc-CCC--Cce Q lcl|NC_021297. 
219 YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIY------YGRILTL-ASE--AAK 289 (480) Q Consensus 219 ~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~------~~~~~~~-~g~--~~~ 289 (480) ++.|-| +.+++.++.+|..+|.+...+....... ..+.... .... ...+.-. ....... +|. ... T Consensus 315 ~~~G~v-r~~kd~Q~~~N~~~S~~~~~~~~~~~~~--~~~~~~~-i~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) T protein:vir:77 315 VYEGVV-RLTKDGQRLRNMIMSFNADIVARTPKKK--PFFWPEQ-IAGF--EHMYDGNDDYPYYLLNRTDENSGDLPTQP 388 (725) T ss_pred cccchh-hhhhhHHHHHHHHHHHHHHHHHhccccc--cccchhh-hhHH--HHHHHhccCCceecccccccCCCcccccC Confidence 433434 5789999999999999888775433221 1111110 0000 0001100 0001111 121 122 Q ss_pred EEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 290 ISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 290 ~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~ 368 (480) +..++... ...+...++.....|-.++|+.+..+|..+ |..||+|+...-.............|..+.+++.++++.+ T Consensus 389 i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~-n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:77 389 LAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNG-GQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred ccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32333322 345667778888888889999999999664 4579999999887777777777777888887776665543 Q ss_pred ----h---------CCCCcccc------------------------eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCC Q lcl|NC_021297. 369 ----M---------GREVTEEY------------------------TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP 411 (480) Q Consensus 369 ----~---------~~~~~~~~------------------------~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s 411 (480) . 
+.....++ .++.|.=.+..+.-..+.+..++++.++.....+ T Consensus 468 I~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~ 547 (725) T protein:vir:77 468 VNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) T ss_pred HHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccch Confidence 2 11111001 1222222222111134566777777776544333 Q ss_pred H--HHHHHhCCCChhH-HHHHHHHHHHHHHHH--------------HHHHhcc-cccCcccccCCC-CCCCCccccc--- Q lcl|NC_021297. 412 K--EQARIDLGYTATQ-REQMRDWDKQETEDM--------------IDTLYST-TKAQADATPKPT-VTDTKTETQT--- 469 (480) Q Consensus 412 ~--et~~~~lg~~~~~-~~~~~~~~~~~~~~~--------------~~~~~~~-~~~~~~~~~~~~-~~d~~~~~~~--- 469 (480) . .++...+...+-+ ++++.+..+.+.... ....... .+..+....... ......+.+. T Consensus 548 ~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~ 627 (725) T protein:vir:77 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQN 627 (725) T ss_pred hHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 1222222222211 222222211110000 0000000 000000000000 0000000000 Q ss_pred CC--CCCC---------CCCCC Q lcl|NC_021297. 470 SP--SGFN---------RTKTR 480 (480) Q Consensus 470 ~p--~~~~---------~~~~~ 480 (480) .- .... ..+.+ T Consensus 628 e~~k~q~~a~~~~~~a~~~aa~ 649 (725) T protein:vir:77 628 QTLSLQIDAAKVEAQNQLNAAR 649 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 00000 No 96 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.57 E-value=1.1e-13 Score=91.71 Aligned_cols=453 Identities=13% Similarity=0.065 Sum_probs=190.1 Q ss_pred CCCHHHHHHHHHHHH-------HHHHH-HHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc---c Q lcl|NC_021297. 
1 MTTYHEHVERLQGLL-------ARDLP-NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD---I 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-------~~~~~-r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~---~ 69 (480) |++. +++..+...+ ..++. ...+..+||.|+..-. .. ....+++.+.+...|+.....|. + T Consensus 10 ~~~~-~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~----~~---~~~s~~~~~~v~~~v~~~~~~l~~~~~ 81 (705) T protein:vir:88 10 MDDE-QVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN----ER---PGKSGIVSRDVQETVDWIMPSLMKVFT 81 (705) T ss_pred CCHH-HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc----cc---CCCCccccHHHHHHHHHHHHHHHHhhc Confidence 5553 3444433332 22222 2345568999984321 11 12345666777777777666552 1 Q ss_pred Cc-----cc--cCCCchhHHHHHHH-----HHhcCHHHHHHHHHHHHhhcCeEEEEEecCccccc--------------- Q lcl|NC_021297. 70 EG-----FR--ISEDSEGLEELWNW-----WQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG--------------- 122 (480) Q Consensus 70 ~~-----~~--~~~d~~~~~~l~~i-----~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~--------------- 122 (480) .+ |. ..+|.+..+.++++ .+.|+....+..++++++++|.|++.||....... T Consensus 82 ~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~ 161 (705) T protein:vir:88 82 SGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVAD 161 (705) T ss_pred CCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhh Confidence 11 11 13455544444443 44566667788899999999999998876321000 Q ss_pred ---C------------------------CCceeEEEEEccceeEEEEeCCCCCeeEEEE-EEEEEecC------------ Q lcl|NC_021297. 
123 ---D------------------------PAGIPLIRVESPLYMYAELDPRNTRRVTRAV-RLYTTRDD------------ 162 (480) Q Consensus 123 ---d------------------------~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~-~~~~~~~~------------ 162 (480) + ..|.+++..|+|.++++-.+...-....+.+ +++.+..+ T Consensus 162 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~ 241 (705) T protein:vir:88 162 ILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIE 241 (705) T ss_pred hhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhh Confidence 0 1167888999998876432211111122221 11111000 Q ss_pred ------------------------Cc--------------eEEEEEEEeCCcEEEEEecCCcccccccc---ccc--ccc Q lcl|NC_021297. 163 ------------------------VA--------------VPDRATLYLPDETVPLRRNGGLNDQWVVD---GDV--IKH 199 (480) Q Consensus 163 ------------------------~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~--~~~ 199 (480) .+ .++++++|. .+...+.+...+... .+. ..- T Consensus 242 ~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~-----~~d~~~d~~~~~~~~~~~g~~il~~~ 316 (705) T protein:vir:88 242 ELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYT-----LLDVDGDGISELRRILYVGDYIISNE 316 (705) T ss_pred hhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeee-----EecccCCcceeeEEEEEeCccccccc Confidence 00 000111111 011111111111000 000 012 Q ss_pred cccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccccccccccchhhhhhh Q lcl|NC_021297. 200 GLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDIYYG 278 (480) Q Consensus 200 ~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~-g~~~~~~~~~~~~~~~~~~~~ 278 (480) +++.+|++.++..+...+.||.|.+. .++++++.+|..++.+.+++...++|...+. |.-.. 
.+ .+...++ T Consensus 317 ~~~~~PF~~~~~~p~~~~~~G~g~~~-~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~--~d-----~~~~~pg 388 (705) T protein:vir:88 317 PWDCRPFADLNAYRIAHKFHGMSVYD-KIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNL--ED-----LLTNEAA 388 (705) T ss_pred cCCCCCEEEecceeecCccccCChHH-HHhHHHHHHHHHHHHHHHHHHhccCCceeccccccCc--cc-----ccccCCC Confidence 56889999998888889999999885 5899999999999999999988888865553 21110 01 1222333 Q ss_pred hhcccCC-CCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc---cchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 279 RILTLAS-EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE---NPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 279 ~~~~~~g-~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~---n~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) .++...+ +...+..++.. ...+...+..+...+-..+|+++...|.+.. +.-++.++..+............+.| T Consensus 389 ~vv~~~~~~~i~~~~~~~~-~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~ 467 (705) T protein:vir:88 389 GIVRVKSMNSITPLETPQL-SGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMF 467 (705) T ss_pred eeEEecCCCccccccCCcC-cHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 3333222 22223322221 1223333444455556678888887774421 12355566666666666666666666 Q ss_pred HH-HHHHHHHHHH----HHhCCCC---------cc------cceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHH Q lcl|NC_021297. 355 GG-AWERAMRIAM----QIMGREV---------TE------EYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQ 414 (480) Q Consensus 355 ~~-~l~~~~~~~~----~~~~~~~---------~~------~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et 414 (480) .. ++++++++++ .+.+... .. ...++.+.-. +...+..+....+..+.+....+... 
T Consensus 468 a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~-~~~~~~eq~~a~l~~ll~~~q~l~~~-- 544 (705) T protein:vir:88 468 AETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVG-IGNMNKDQQMLHLMRIWEMAQAVVGG-- 544 (705) T ss_pred HHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeec-cccchHHHHHHHHHHHHHHHHHhhcc-- Confidence 53 4555554443 3322110 00 0111222211 11112222222222221110000000 Q ss_pred HHHhCCCCh-hHHHHH-HHHHHHHHHHHHHHHhcccccCcccccCCCCC---CCC---------cccccCCCCCCCCCCC Q lcl|NC_021297. 415 ARIDLGYTA-TQREQM-RDWDKQETEDMIDTLYSTTKAQADATPKPTVT---DTK---------TETQTSPSGFNRTKTR 480 (480) Q Consensus 415 ~~~~lg~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---d~~---------~~~~~~p~~~~~~~~~ 480 (480) ....++.. ....++ .++.+............... ...+...+... ..+ .+.+..-...--.+.+ T Consensus 545 -~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~-~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e 622 (705) T protein:vir:88 545 -GGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPN-SPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAE 622 (705) T ss_pred -cchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhh-hHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00001111 111011 00000000000000000000 00000000000 000 0000000000000000 No 97 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.57 E-value=9.5e-14 Score=91.97 Aligned_cols=471 Identities=12% Similarity=0.053 Sum_probs=200.2 Q ss_pred CCCH-HHHHHHHHHHHHHH-------HHHH--HHHHHHhcccCcchhcCcccc-hh--hhhhccccchHHHHHHHHHhhh Q lcl|NC_021297. 1 MTTY-HEHVERLQGLLARD-------LPNL--LEAEAYRNGTRRLKTIGIGAP-PE--LAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~-------~~r~--~~~~~YY~g~~~i~~~~~~~~-~~--~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.+. .++++++...+..- +.+. ++--.||+|.|--...-.... 
.+ .+.-.+++|.++.+|+..+++- T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 8887 56666666654331 1111 121268999874322111100 00 0122468899999999988865 Q ss_pred ccCcc---ccCC----CchhHHHHH----HHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc---cCCCceeEEEEE Q lcl|NC_021297. 68 DIEGF---RISE----DSEGLEELW----NWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES---GDPAGIPLIRVE 133 (480) Q Consensus 68 ~~~~~---~~~~----d~~~~~~l~----~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~---~d~~~~~~i~~~ 133 (480) .-+-. ..+. |.+..+.++ .+...|+.+.....+..+++++|.||+-|..+-..+ .+.+..+.|..+ T Consensus 81 ~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:17 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred hhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccceEee Confidence 43321 1222 333334444 445568899999999999999999998764322111 112233334433 Q ss_pred --ccceeEEEEeCCCCCe----eEEE-EEEEEEec---------------------------CCceEEEEEEEeC----- Q lcl|NC_021297. 134 --SPLYMYAELDPRNTRR----VTRA-VRLYTTRD---------------------------DVAVPDRATLYLP----- 174 (480) Q Consensus 134 --~p~~~~~v~d~~~~~~----~~~~-~~~~~~~~---------------------------~~~~~~~~~~~~~----- 174 (480) ++.+++ ||+..... ..+. ++.|.+.+ +.+.+...++|.. 
T Consensus 161 ~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~ 238 (708) T protein:vir:17 161 YDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVRKESV 238 (708) T ss_pred ccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEEeeeee Confidence 334544 66643211 0000 01110000 0122223333321 Q ss_pred ----------CcEEEEEecCC----------c------------------ccccccccccccccccccceEEeecc---- Q lcl|NC_021297. 175 ----------DETVPLRRNGG----------L------------------NDQWVVDGDVIKHGLGVVPVVPLTND---- 212 (480) Q Consensus 175 ----------~~~~~~~~~~~----------~------------------~~~~~~~~~~~~~~~~~iPvv~~~n~---- 212 (480) +.++.|..... + ..+..+...+.|-+.+.+|+|||... T Consensus 239 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~ 318 (708) T protein:vir:17 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) T ss_pred EEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEecccccc Confidence 12222221100 0 00001122334445567888887532 Q ss_pred cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhh-----hcCCCcccccccccc---chhhhhhhhh-ccc Q lcl|NC_021297. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV-----ISGVTTDELTNDGEN---TTLDIYYGRI-LTL 283 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~-----i~g~~~~~~~~~~~~---~~~~~~~~~~-~~~ 283 (480) .....+||.- +.+++.++.||..+|.+...+......+.+ +.|............ ..+....+.+ +.. T Consensus 319 d~~~~~yG~v---r~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 395 (708) T protein:vir:17 319 DDIERVEGHI---AKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNII 395 (708) T ss_pred cCCCcccchh---hhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccc Confidence 2233346643 568999999999999998877554433221 122211100000000 0011101111 111 Q ss_pred CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
284 ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 284 ~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) ++..+....-+....+.+...+......|-.++|+++..+|. ..| .||+|+...-..-..........+..+.+++.+ T Consensus 396 ~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~-~sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~ 473 (708) T protein:vir:17 396 AGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) T ss_pred cccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccC-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222211111122234566777888888888899999998885 335 799999988766666666666666666666655 Q ss_pred HHHH----HhCC---------CCcccc-------------------------eeeeEEecCCCCCCHHHHHHHHHHHHhc Q lcl|NC_021297. 364 IAMQ----IMGR---------EVTEEY-------------------------TRLETVWRDPSTPTVAAKADAVSKLYAN 405 (480) Q Consensus 364 ~~~~----~~~~---------~~~~~~-------------------------~~i~v~f~~~~~~~~~~~ad~~~kl~~~ 405 (480) +++. +.+. ....++ +++.|.=.+..+.-..+..+.++++.++ T Consensus 474 ~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~ 553 (708) T protein:vir:17 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHh Confidence 5443 3221 000000 0111111111112123456667777765 Q ss_pred cccCCCH-H----HHHHhCCCChhHHHHHHHHHHHHHHH-------------HHHHHhcccccCcccccCCCCCC---CC Q lcl|NC_021297. 406 GQGPIPK-E----QARIDLGYTATQREQMRDWDKQETED-------------MIDTLYSTTKAQADATPKPTVTD---TK 464 (480) Q Consensus 406 ~~~~~s~-e----t~~~~lg~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~d---~~ 464 (480) +....+. . .+++.+.+.. ++++.+..+..... .........++..+....+.... 
.+ T Consensus 554 ~~~~~~~~~~~~~l~l~~~D~p~--~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~q 631 (708) T protein:vir:17 554 MLPADPMRPAIQGIILDNIDGEG--LDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) T ss_pred cCCccchhHHHHHHHHHhcCCCC--hHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5322111 1 1233333321 11222221111000 00000000000000000000000 00 Q ss_pred cccccCCC-----CCCCCCCC Q lcl|NC_021297. 465 TETQTSPS-----GFNRTKTR 480 (480) Q Consensus 465 ~~~~~~p~-----~~~~~~~~ 480 (480) ++.+..-. .....+.+ T Consensus 632 Ae~~ka~aea~~~q~~a~q~~ 652 (708) T protein:vir:17 632 AEAQKATNETAQTQIKAFTAQ 652 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHH Confidence 00000000 00000000 No 98 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.57 E-value=4.7e-14 Score=93.67 Aligned_cols=456 Identities=10% Similarity=0.022 Sum_probs=195.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHH----H-------------HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPN----L-------------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r----~-------------~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) |.+.+.+...+...|...... . .++.+||.|...-... .+..-...+++.+.+...|+.. T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~---~~~~~~rs~~~~~~v~~~ve~~ 91 (651) T protein:vir:80 15 YDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVG---DVNADWRHKITTGKAFEAIETI 91 (651) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccC---CCCCCCCccccChhHHHHHHHH Confidence 777777655555444332111 1 1334566664321111 1111235678899999999977 Q ss_pred HhhhccC---c---cc--cCCCchhH----HHHHHHH----HhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc------ Q lcl|NC_021297. 
64 SDRLDIE---G---FR--ISEDSEGL----EELWNWW----QANDLDEESVLGHDDSLTFGRAYITVSHPDVES------ 121 (480) Q Consensus 64 ~~~l~~~---~---~~--~~~d~~~~----~~l~~i~----~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~------ 121 (480) ...|... + |. ...+++.. +.+..++ .+++|......+..+++++|.|++.|+...... T Consensus 92 ~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~ 171 (651) T protein:vir:80 92 HAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKV 171 (651) T ss_pred HHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehhe Confidence 7665321 1 11 12233322 2344454 366888889999999999999999886532110 Q ss_pred -------------------cCCCceeEEEEEccceeEEEEeCCCCC--eeEEEEEEEEEecC------C----------- Q lcl|NC_021297. 122 -------------------GDPAGIPLIRVESPLYMYAELDPRNTR--RVTRAVRLYTTRDD------V----------- 163 (480) Q Consensus 122 -------------------~d~~~~~~i~~~~p~~~~~v~d~~~~~--~~~~~~~~~~~~~~------~----------- 163 (480) ....|.|+++.++|.++++ |+..+. ...+..+.+....+ . T Consensus 172 ~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~--dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~ 249 (651) T protein:vir:80 172 QVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY--DPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDV 249 (651) T ss_pred eccccccccccceeeeccceeeeceeEEEEecHHHeee--cCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHH Confidence 0013678999999998874 554332 12222222211000 0 Q ss_pred ------------------------------ceEEEEEEEeCCcEEEEEecCCccccccc------cccccccc-ccccce Q lcl|NC_021297. 164 ------------------------------AVPDRATLYLPDETVPLRRNGGLNDQWVV------DGDVIKHG-LGVVPV 206 (480) Q Consensus 164 ------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~-~~~iPv 206 (480) .....+++|.. 
+...+.......+ ......++ +..+|+ T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~-----~d~e~~~~~~~~v~~~g~~il~~~~~~~~~~~Pf 324 (651) T protein:vir:80 250 VEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGD-----IHLENKTYHDVVVTIMGNEVLRFEQNPYWCGRPF 324 (651) T ss_pred HhhhccccccCCccccccccCCCccccccccceEEEEEEEE-----eeccCCceEEEEEEEcCcEEecccccCCCCCCCe Confidence 00112223321 1111111000000 01111233 345799 Q ss_pred EEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC-C Q lcl|NC_021297. 207 VPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA-S 285 (480) Q Consensus 207 v~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~-g 285 (480) +++...+..++.||+|.+. .+.+.+..+|.+...+...+...++|...+..-..... ..+...++.++... . T Consensus 325 ~~~~~~~~~~~~yG~g~~~-~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~------~~l~~~pg~vi~~~~~ 397 (651) T protein:vir:80 325 VIGTYIPTARQPYAMGALQ-PNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQP------EDVYTEPGKVFLVSDH 397 (651) T ss_pred eeecceecCccccCCChHH-HHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccH------HHhhcCCCceEEecCC Confidence 9999988999999999886 48899999999999999999999999866631111110 01222344443322 2 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHH-H----H Q lcl|NC_021297. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE--NPASAEAIIATDSRIVKMAERKGRIFGG-A----W 358 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~--n~~Sg~Al~~~~~~l~~k~~~~~~~~~~-~----l 358 (480) ++....+....++......+..+...+...+++++...|.+.. ...++.++......+.......-+.|.. . 
+ T Consensus 398 ~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~ 477 (651) T protein:vir:80 398 GDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLL 477 (651) T ss_pred CCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2233333333344444444555555556667776655553221 1112334444444444444444444443 3 3 Q ss_pred HHHHHHHHHHhCCC---------Cc-cc-----ceeeeEEec--CCCCCCHHHHHHHHH---HHHhcc--ccCCCH---- Q lcl|NC_021297. 359 ERAMRIAMQIMGRE---------VT-EE-----YTRLETVWR--DPSTPTVAAKADAVS---KLYANG--QGPIPK---- 412 (480) Q Consensus 359 ~~~~~~~~~~~~~~---------~~-~~-----~~~i~v~f~--~~~~~~~~~~ad~~~---kl~~~~--~~~~s~---- 412 (480) +++++++....... .. .. -.++++.+. ..-+....+....+. .+.+.+ .+.+.. T Consensus 478 ~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~ 557 (651) T protein:vir:80 478 EKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDY 557 (651) T ss_pred HHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhH Confidence 44555544332110 00 00 012222221 111111222222222 222211 111111 Q ss_pred -HH---HHHhCCCChhHH-------HHHH---H--HHHHHH---HHHHHHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 413 -EQ---ARIDLGYTATQR-------EQMR---D--WDKQET---EDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 413 -et---~~~~lg~~~~~~-------~~~~---~--~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) .. +++.+|+-+... .... + +.+.+. +.....+....+ ... +..-..+....+.- T Consensus 558 ~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~----~~~---~~~~~~~~~~~~~~ 630 (651) T protein:vir:80 558 KRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQ----ADG---GTQMMSEMYGTPNA 630 (651) T ss_pred HHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHH----HHH---HHHHHHHHHHHHHH Confidence 11 233345421110 0000 0 000000 000000000000 000 00000000000000 Q ss_pred CCCCCCC Q lcl|NC_021297. 
474 FNRTKTR 480 (480) Q Consensus 474 ~~~~~~~ 480 (480) ....+.. T Consensus 631 ~~~~~~~ 637 (651) T protein:vir:80 631 DQMQQEL 637 (651) T ss_pred HHHHHHH Confidence 0000001 No 99 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.56 E-value=1.1e-13 Score=91.53 Aligned_cols=436 Identities=14% Similarity=0.107 Sum_probs=208.7 Q ss_pred CC----------CHHHHHHHHHHHHHH----HHHHH---HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHH Q lcl|NC_021297. 1 MT----------TYHEHVERLQGLLAR----DLPNL---LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~----------~~~~~i~~l~~~~~~----~~~r~---~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) |+ +..++-..++..|.. |.+.. .++.+||.+... ...+....+ ..+++.+|.+.-+++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~-~~~~~~~~~--~r~~~~~~k~~~~~~~i 77 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDT-TTTSNQGLP--WKNSTTLPKLCQIRDNL 77 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhh-hhhhhcccc--cccccchhHHHHHHHHH Confidence 33 222222333444322 22222 577888887543 222111111 14577888888888877 Q ss_pred Hhhhc----c-------CccccCCCchh--HHHHHHHH----HhcCHHHHHHHHHHHHhhcCeEEEEEecCccc------ Q lcl|NC_021297. 64 SDRLD----I-------EGFRISEDSEG--LEELWNWW----QANDLDEESVLGHDDSLTFGRAYITVSHPDVE------ 120 (480) Q Consensus 64 ~~~l~----~-------~~~~~~~d~~~--~~~l~~i~----~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~------ 120 (480) +.++. + .|+. ++|.+. .+.+.... ...+|...+.++..++.++|.|++.+...... 
T Consensus 78 ~~~l~~~~Fp~~~w~~~v~~~-~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~ 156 (584) T protein:vir:95 78 HSNYFSSLFPNDDWLRWVGYG-KGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDG 156 (584) T ss_pred HHHHHHhhcCccceeeeecCC-CchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecc Confidence 77663 1 1222 222222 34444444 55689999999999999999999998643321 Q ss_pred -ccCCCceeEEEEEccceeEEEEeCCCCC--eeEEEEEEEEEec-----------------------------------C Q lcl|NC_021297. 121 -SGDPAGIPLIRVESPLYMYAELDPRNTR--RVTRAVRLYTTRD-----------------------------------D 162 (480) Q Consensus 121 -~~d~~~~~~i~~~~p~~~~~v~d~~~~~--~~~~~~~~~~~~~-----------------------------------~ 162 (480) ...+-..|++..+||.++| ||+.... ...+.++.+.+.. + T Consensus 157 ~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~ 234 (584) T protein:vir:95 157 TLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVED 234 (584) T ss_pred ccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCcccc Confidence 1122346899999999887 6765422 2222222211000 0 Q ss_pred CceE---------EEEEEEeCCcEEEEEe-------cCCccc----------ccccccccccccccccceEEeecccccC Q lcl|NC_021297. 163 VAVP---------DRATLYLPDETVPLRR-------NGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 163 ~~~~---------~~~~~~~~~~~~~~~~-------~~~~~~----------~~~~~~~~~~~~~~~iPvv~~~n~~~~~ 216 (480) .... ...+.|..+.+.-+.. ..+... ......+..|.+.+.+|++.+...+... T Consensus 235 ~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~ 314 (584) T protein:vir:95 235 FDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPD 314 (584) T ss_pred cccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeec Confidence 0000 0111122221111110 000000 0011122345667999999999999999 Q ss_pred ccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCC-CCceEEEecc Q lcl|NC_021297. 
217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLAS-EAAKISEFKA 295 (480) Q Consensus 217 ~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~~~ 295 (480) +.||+|...- +.++|+.+|.+.-.+.+.+..+.+|.+...+... . +..+++..+.... ++..+.+.+. T Consensus 315 s~yG~gi~~l-l~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~--~--------~~~~pg~~~~~~~~~~~q~~~p~a 383 (584) T protein:vir:95 315 NLWAMGPLDN-LVGMQYRIDHLENAKADAVDLIIQPPLKIIGEVE--E--------FVWGPGAEIHLDQGGDVQEIAKNV 383 (584) T ss_pred cccCCCchhh-hhhHHHHHhHHHHHHHHHHHHhcCcceeeccccc--h--------hcccCCceeecCCCCCcceecCch Confidence 9999999864 8999999999999999999999999544433211 1 1223443333321 2222333332 Q ss_pred ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCC- Q lcl|NC_021297. 296 AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAW-ERAMRIAMQIMGREV- 373 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l-~~~~~~~~~~~~~~~- 373 (480) ..+-+....|..+-..+...+|+|...-|..+.....+..+......+..-...+.+.|..++ ++++.++.++.-... T Consensus 384 ~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd 463 (584) T protein:vir:95 384 NYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMD 463 (584) T ss_pred hhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 222222244555556677789999988886544333344456666677777777888888876 787877776521100 Q ss_pred cccce------------------eeeEEec--CCCCCCHHHHHH---HHHHHHh--ccc---cCCCHHHHHH----h--C Q lcl|NC_021297. 374 TEEYT------------------RLETVWR--DPSTPTVAAKAD---AVSKLYA--NGQ---GPIPKEQARI----D--L 419 (480) Q Consensus 374 ~~~~~------------------~i~v~f~--~~~~~~~~~~ad---~~~kl~~--~~~---~~~s~et~~~----~--l 419 (480) ..+.. +++-.|. ..-..-+..+++ .+..+.+ .|. +..+...+.. . 
+ T Consensus 464 ~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~ 543 (584) T protein:vir:95 464 GSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGL 543 (584) T ss_pred ccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCC Confidence 01111 1111110 000111122222 2222322 111 1122222211 1 1 Q ss_pred C---CC-h--hHHHHHHHHH-HHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 420 G---YT-A--TQREQMRDWD-KQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 420 g---~~-~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) | +. + ...++++... .++.+ ..+....++.+ .+.. T Consensus 544 p~~~~~~~~~~~~~Q~~~q~~~~~~q---~~~~~~~~~~~----~~~~ 584 (584) T protein:vir:95 544 QGYEIFRPNVAVAEQAETQSLVAQAQ---EDLQLQAQMPA----EGAI 584 (584) T ss_pred CcccccCCCcccchhHHHHhhhHHHH---HHHHHHHhhhh----ccCC Confidence 1 11 1 1111111000 00111 11111111100 0000 No 100 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.54 E-value=1.6e-13 Score=90.79 Aligned_cols=469 Identities=14% Similarity=0.089 Sum_probs=204.1 Q ss_pred CCCH-HHHHHHHHHHHHHHH-------HHHHHHHHHh--cccCcchhcCccc-chhh--hhhccccchHHHHHHHHHhhh Q lcl|NC_021297. 1 MTTY-HEHVERLQGLLARDL-------PNLLEAEAYR--NGTRRLKTIGIGA-PPEL--AYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~-------~r~~~~~~YY--~g~~~i~~~~~~~-~~~~--~~~~~~~n~~~~iVd~~~~~l 67 (480) |.+. ++++.++...+..-. .....-.+|| .|+|--....... .+.. 
+.-.+++|.++.+|+..+++- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 8887 667677666554321 1122223455 5776432210000 0100 122467899999999988876 Q ss_pred ccCccc---cCC----CchhHHHH----HHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc----cCCCceeEEEE Q lcl|NC_021297. 68 DIEGFR---ISE----DSEGLEEL----WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES----GDPAGIPLIRV 132 (480) Q Consensus 68 ~~~~~~---~~~----d~~~~~~l----~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~----~d~~~~~~i~~ 132 (480) .-+... .+. |.+..+.+ ..++..|+.+.....+..+++++|.||+-|......+ .+..+.+.... T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:10 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred HhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceEEe Confidence 443221 222 33333333 3455678999999999999999999998775432111 12223333334 Q ss_pred EccceeEEEEeCCCCCe----eEEE-EEEEEEec---------------------------CCceEEEEEEEeC------ Q lcl|NC_021297. 133 ESPLYMYAELDPRNTRR----VTRA-VRLYTTRD---------------------------DVAVPDRATLYLP------ 174 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~----~~~~-~~~~~~~~---------------------------~~~~~~~~~~~~~------ 174 (480) .+|.. -++||+..... ..++ ++.|.+.+ ........++|.. 
T Consensus 161 ~~p~~-~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~~~~~ 239 (708) T protein:vir:10 161 YDPSR-SVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVD 239 (708) T ss_pred ecchh-hcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEEEEEEE Confidence 44421 12355432110 0111 11111110 0011222222222 Q ss_pred ---------CcEEEEEecCC----------c------------------ccccccccccccccccccceEEeecc----c Q lcl|NC_021297. 175 ---------DETVPLRRNGG----------L------------------NDQWVVDGDVIKHGLGVVPVVPLTND----P 213 (480) Q Consensus 175 ---------~~~~~~~~~~~----------~------------------~~~~~~~~~~~~~~~~~iPvv~~~n~----~ 213 (480) +.++.|..... + ..+..+..+..+-+++++|+|||.-. . T Consensus 240 ~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d 319 (708) T protein:vir:10 240 VISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFID 319 (708) T ss_pred EEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccC Confidence 11111211100 0 00001112334456677888887532 2 Q ss_pred ccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC-ccccc----c-ccccch---hh---hhhhhhc Q lcl|NC_021297. 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVT-TDELT----N-DGENTT---LD---IYYGRIL 281 (480) Q Consensus 214 ~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~----~-~~~~~~---~~---~~~~~~~ 281 (480) ....+||. | +.+++.++.||...|.+...+....... .+.+.. ..... . ...+.. +. ...+.+. T Consensus 320 ~~~~~yG~--v-r~~kd~Q~~~N~~~S~~~~~~a~~~~~~-~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~ 395 (708) T protein:vir:10 320 DIERVEGH--I-AKAMDPQRLYNLQVSMLADTAAQDPGQI-PIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNII 395 (708) T ss_pred CCccccee--e-cccchhHHHHHHHHHHHHHHHHhcCCcc-cccChhhhhhHHHHHhhccccchhhhccccccccccccc Confidence 22334554 3 5689999999999999988876443322 222111 00000 0 000000 00 0001111 Q ss_pred ccCCCCc-eEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
282 TLASEAA-KISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 282 ~~~g~~~-~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) ++..+ ...+ +......+...++.....|-.++|+++..+|. ..| .||+|+..+-..-..........+..+.++ T Consensus 396 --~~~~~~~~~q-~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~-~sn-~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~ 470 (708) T protein:vir:10 396 --AGATPAGYTQ-PAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSN-IAQETVNNLMNRADMASFIYLDNMAKSLKR 470 (708) T ss_pred --cccCCccccC-CccchHHHHHHHHHHHHHHHHHhCcChhHccC-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 1112 12234557777888888888899999998884 445 799999988777666667777777777777 Q ss_pred HHHHHHHH----hCC---------CCccc-------------------------ceeeeEEecCCCCCCHHHHHHHHHHH Q lcl|NC_021297. 361 AMRIAMQI----MGR---------EVTEE-------------------------YTRLETVWRDPSTPTVAAKADAVSKL 402 (480) Q Consensus 361 ~~~~~~~~----~~~---------~~~~~-------------------------~~~i~v~f~~~~~~~~~~~ad~~~kl 402 (480) +.++++.+ .+. ....+ -++|.|.=.+..+.-..+.+..++++ T Consensus 471 ~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~ql 550 (708) T protein:vir:10 471 AGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHH Confidence 76665543 221 10000 00122221222223334667778888 Q ss_pred HhccccCCCHH-----HHHHhCCCC--hhHHHHHHHHH----------HHHHHHHHHHHhcccccCcccccCCCCCC--- Q lcl|NC_021297. 403 YANGQGPIPKE-----QARIDLGYT--ATQREQMRDWD----------KQETEDMIDTLYSTTKAQADATPKPTVTD--- 462 (480) Q Consensus 403 ~~~~~~~~s~e-----t~~~~lg~~--~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~d--- 462 (480) .+++....+.. .+++.+.+. ++-.+++.+.. .++.+...... ....+..+....+.... 
T Consensus 551 l~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q-~~~q~q~~~~~~e~qa~~~~ 629 (708) T protein:vir:10 551 LSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQ-MAAQSQPNPEMVLAQAQMVA 629 (708) T ss_pred HHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 77653321111 123333332 22222332110 00000000000 00000000000000000 Q ss_pred CCcccccCCC-----CCCCCCCC Q lcl|NC_021297. 463 TKTETQTSPS-----GFNRTKTR 480 (480) Q Consensus 463 ~~~~~~~~p~-----~~~~~~~~ 480 (480) .+++.+..-. .....+.+ T Consensus 630 ~qAe~~ka~a~a~~~~~~a~q~~ 652 (708) T protein:vir:10 630 AQAEAQKATNETAQTQIKAFTAQ 652 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000000 00000000 No 101 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.54 E-value=2e-12 Score=84.71 Aligned_cols=431 Identities=10% Similarity=0.004 Sum_probs=214.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh---c---Cccc--ch----h-------hhhhccccchHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT---I---GIGA--PP----E-------LAYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~---~---~~~~--~~----~-------~~~~~~~~n~~~~iVd 61 (480) +-.+. +-...... .... +.|+|-..-+. + +... .. . -+++-....|++-+|+ T Consensus 14 ~i~~~-~~~~~~~~-~~~~-------~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~ 84 (505) T protein:vir:96 14 MVNWA-WYRYVEPQ-KNAA-------RAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQ 84 (505) T ss_pred ccchh-hhhhHHHH-HHhh-------hhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 22211 11111111 1111 23443211110 0 0000 00 0 0122334568899999 Q ss_pred HHHhhhcc-CccccC---------CCchhHHHHHHHHHh------------cCHHHHHHHHHHHHhhcCeEEEEEecCcc Q lcl|NC_021297. 
62 TLSDRLDI-EGFRIS---------EDSEGLEELWNWWQA------------NDLDEESVLGHDDSLTFGRAYITVSHPDV 119 (480) Q Consensus 62 ~~~~~l~~-~~~~~~---------~d~~~~~~l~~i~~~------------n~~~~~~~~~~~~a~~~G~a~~~v~~~~~ 119 (480) ..+...++ .|+... .+++..+.+...|.. .+|...+..+++..+..|.+|+....... T Consensus 85 ~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 164 (505) T protein:vir:96 85 LLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYP 164 (505) T ss_pred HHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCC Confidence 99999998 587532 245555655555443 14677788899999999999987654321 Q ss_pred cccCCCceeEEEEEccceeEEEEeC--CCCCeeEEEEEEEEEecCCceEEEEEEEeCC--cEEEEEecCCcccccccccc Q lcl|NC_021297. 120 ESGDPAGIPLIRVESPLYMYAELDP--RNTRRVTRAVRLYTTRDDVAVPDRATLYLPD--ETVPLRRNGGLNDQWVVDGD 195 (480) Q Consensus 120 ~~~d~~~~~~i~~~~p~~~~~v~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 195 (480) ..--.++..++|+.+-.-++. ..+..+..+|.+ +..|.+..+-++..+ ..+. ... T Consensus 165 ----~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~----d~~Gr~~aY~i~~~hPgd~~~-------------~~~ 223 (505) T protein:vir:96 165 ----NKWGYALQILECDRLDLNYNADLQNGNRIRMSIEL----DAWERPVAYHLLVNHPGDNSY-------------CYH 223 (505) T ss_pred ----CCcceEEEEechhhcCCCCCcccCCcCeEEeceEE----CCCCceEEEEEeecCCCcccc-------------ccc Confidence 111247889999876432221 122334444432 444544444443321 1110 000 Q ss_pred cccccccccc---eEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc---cccccccc Q lcl|NC_021297. 196 VIKHGLGVVP---VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTT---DELTNDGE 269 (480) Q Consensus 196 ~~~~~~~~iP---vv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~---~~~~~~~~ 269 (480) .....+.+|| |+|+....+.+..-|.|.|.+ ++..+..++....-........+.--.+|+.... +...+... 
T Consensus 224 ~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lap-vl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~ 302 (505) T protein:vir:96 224 YAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHA-SMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQG 302 (505) T ss_pred cccccccccCHhHhhhhhcccCCccccCcchHHH-HHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccC Confidence 1112344566 577777777788899999987 4444444443322222222222211123442111 11112222 Q ss_pred cchhhhhhhhhccc-CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 270 NTTLDIYYGRILTL-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAE 348 (480) Q Consensus 270 ~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~ 348 (480) .....+.++.+..+ +|++.++.+.+. ...+|...++..++.|+...++|-+.+.++..+ +|-.|.++.+......+. T Consensus 303 ~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~-~nYSS~R~~~~e~~r~~~ 380 (505) T protein:vir:96 303 EIVEEVEAGTYQLLPYGIRFKEHKIDH-PHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEG-VNFSSLRSGELDERDLYK 380 (505) T ss_pred ccccccCCceeeecCCCCeeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc-ccHHHHHHHHHHHHHHHH Confidence 33445666776665 455566654332 234566667888999999999999988766433 344467777777777777 Q ss_pred HHHHHHHHHHHH-HHHHHH--HHhCCCCccc----ceeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhC Q lcl|NC_021297. 349 RKGRIFGGAWER-AMRIAM--QIMGREVTEE----YTRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDL 419 (480) Q Consensus 349 ~~~~~~~~~l~~-~~~~~~--~~~~~~~~~~----~~~i~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~et~~~~l 419 (480) ..+..|...+.+ +++..+ ++..+.++.. ..-+.+.|..|-. .|....+.+....+.+| +.|.+.++... 
T Consensus 381 ~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G--~~t~~~~~a~~ 458 (505) T protein:vir:96 381 LLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNR--TRSRSSIIRAA 458 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHc Confidence 777777665544 333222 2333332211 1224677865543 46677888888888876 56777778888 Q ss_pred CCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCC Q lcl|NC_021297. 420 GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFN 475 (480) Q Consensus 420 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~ 475 (480) |...++.-+. +. ++.+...+. +. ...++..+. ....++.+.++.++| T Consensus 459 G~D~~~v~~q--~a-~e~~~~~~~--Gl-~~~~~~~~~---~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 459 GDDPEDVFDE--IA-WEEQLMRDK--GV-NPTPPEQES---KDATTDEEDDSASDD 505 (505) T ss_pred CCCHHHHHHH--HH-HHHHHHHHc--CC-CCCCCCCCC---CCCCCCCCCCCCCCC Confidence 9876653222 11 111111111 11 111111111 111112222222333 No 102 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.53 E-value=5.5e-13 Score=87.79 Aligned_cols=410 Identities=10% Similarity=0.033 Sum_probs=184.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCC--- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED--- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d--- 77 (480) |-+..-+..-.. -.... |-.....|+.+.. 
.....+......+.+++++||..+.-+.-+|+.+..+ T Consensus 1 ~~~~D~~~~~~~-~~g~~--~~~~~~~~~~~~~-------~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~ 70 (437) T protein:vir:52 1 MKFFDGIKSLAL-KLGSK--QEQTYYSPSLSLT-------DDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLN 70 (437) T ss_pred CchhhhhHhHHh-cCCCc--cccceeecCcccc-------ccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCC Confidence 222111111100 00000 0000000111100 0001223345567788999999999998999887442 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccc---ccCCCcee-EEEEEccceeEEEEeCC-CCCeeEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE---SGDPAGIP-LIRVESPLYMYAELDPR-NTRRVTR 152 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~---~~d~~~~~-~i~~~~p~~~~~v~d~~-~~~~~~~ 152 (480) ++..+.+++.|++=++...+.++.+.+-.||.|++++...... ..+..|.+ .|+++++.++.|..-.. +... T Consensus 71 ~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s--- 147 (437) T protein:vir:52 71 SKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLS--- 147 (437) T ss_pred HHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccc--- Confidence 2333568888888788999999999999999999988653211 01111222 25555655554322111 1110 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecc---cccCccCCcccchhhHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND---PRLGNRYGRSEISPELR 229 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~---~~~~~~~G~s~i~~~v~ 229 (480) .+.+.+..+.+ .++... ... |+-- |++|.+. ......+|.|.+.. +. 
T Consensus 148 --------~~fg~p~~y~v-----------~~~~~~-~~i------H~SR---ii~~~~~~~~~~~~~~~G~s~le~-~~ 197 (437) T protein:vir:52 148 --------PNFGRYSEYSI-----------LGGSQS-ITV------HHSR---LIILNANDAPLSDNDIWGVSDLEK-II 197 (437) T ss_pred --------cccCcceEEEE-----------ecCCcc-eeE------ccce---eEEecCccCCCccccccCCchHHH-HH Confidence 11111111111 111000 000 1111 1111111 11234579998865 66 Q ss_pred HHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-----cchhhh--hhhhhcccC-CCCceEEEeccccHHHH Q lcl|NC_021297. 230 KVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-----NTTLDI--YYGRILTLA-SEAAKISEFKAAELRNF 301 (480) Q Consensus 230 ~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-----~~~~~~--~~~~~~~~~-g~~~~~~~~~~~~~~~~ 301 (480) .-+..++++.-.....+..+..+...+.|.........+. ...+.. ....++.++ +++.+....+-+.+. T Consensus 198 ~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sgl~-- 275 (437) T protein:vir:52 198 DVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTGLK-- 275 (437) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCCHH-- Confidence 6666777776666555544444433343321111000000 011111 112233333 223333333333444 Q ss_pred HHHHHHHHHHHHhccCCChHHhccc-cccchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCCCccccee Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSS-SENPASAEAIIATDSRIVKMAERKG-RIFGGAWERAMRIAMQIMGREVTEEYTR 379 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~-~~n~~Sg~Al~~~~~~l~~k~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 379 (480) +.+.....+|++.+++|...|.+. ...-+||..-..-|.. .++.++ ..+...+++++.+++.-..+..+. 
+ T Consensus 276 -~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yyd---~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~---~ 348 (437) T protein:vir:52 276 -DLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYHE---AIRRLQETRLRPIFEIIDPLICNELFGGLPA---D 348 (437) T ss_pred -HHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC---c Confidence 556666789999999999777443 3333466544433333 333333 567888999999877655444432 5 Q ss_pred eeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC-CChhHHHHHHHHHHHHHHHHHHHHhcccccC-cc--cc Q lcl|NC_021297. 380 LETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG-YTATQREQMRDWDKQETEDMIDTLYSTTKAQ-AD--AT 455 (480) Q Consensus 380 i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~ 455 (480) +++.|.+-...+..+.|+...+.+++. .+++ ..| ++++++.+.. ++. ..++..+.. .. .. T Consensus 349 ~~~~f~pL~~~s~kekae~~~~~a~a~------~~~~-~~g~i~~~e~r~~L---~~~------g~~~~i~~~~~~~~~~ 412 (437) T protein:vir:52 349 WWFEFVPLTTVKQEQQINMLNTFATAA------NTLI-QNGVLNEYQIANEL---RES------GLFANISAEHIEELKN 412 (437) T ss_pred ceEEeCCcCCcCHHHHHHHHHHHHHHH------HHHH-hcCCCCHHHHHHHH---Hhc------CCCCCCCccccccccC Confidence 888999988899888888765554321 1112 234 2333322211 110 111111100 00 00 Q ss_pred cCCCCCCC--CcccccCCCCCCCCC Q lcl|NC_021297. 456 PKPTVTDT--KTETQTSPSGFNRTK 478 (480) Q Consensus 456 ~~~~~~d~--~~~~~~~p~~~~~~~ 478 (480) ..+..++. +.+.++.+.+...++ T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 413 ADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred CCCCCCccCCCCCCCCCCCCCCCCC Confidence 00001110 011111111222222 No 103 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.50 E-value=4.2e-14 Score=93.92 Aligned_cols=434 Identities=12% Similarity=0.056 Sum_probs=190.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHH-----HHH-------HHhcccCcch--hc--Ccccch-hhhhhccccchHHHHHHHH Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLL-----EAE-------AYRNGTRRLK--TI--GIGAPP-ELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~-----~~~-------~YY~g~~~i~--~~--~~~~~~-~~~~~~~~~n~~~~iVd~~ 63 (480) =.+ .-...++..|.-.-..+. .+. .+.-|.+.-. .+ +...+. .+-.....+.+++++||.. T Consensus 27 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~ 104 (532) T protein:vir:94 27 RAT--HTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHETP 104 (532) T ss_pred hhh--hhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhccc Confidence 000 000113333322111100 000 0111110000 00 001110 1112333456679999999 Q ss_pred HhhhccCccccCCC------chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc---cC----------C Q lcl|NC_021297. 64 SDRLDIEGFRISED------SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES---GD----------P 124 (480) Q Consensus 64 ~~~l~~~~~~~~~d------~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~---~d----------~ 124 (480) ++-+.-+++.+..+ ++..+.+...|++-++...+.++.+.+..||.|++++....... .+ . T Consensus 105 aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~ 184 (532) T protein:vir:94 105 ADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQ 184 (532) T ss_pred hHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccc Confidence 99998888876432 22334566667766788889999999999999987764321110 00 1 Q ss_pred Ccee-EEEEEccceeEEE-EeCCCCCeeEEEEEEEEEecCCceEEEEEE-----EeCCcEEEEEecCCcccccccccccc Q lcl|NC_021297. 125 AGIP-LIRVESPLYMYAE-LDPRNTRRVTRAVRLYTTRDDVAVPDRATL-----YLPDETVPLRRNGGLNDQWVVDGDVI 197 (480) Q Consensus 125 ~~~~-~i~~~~p~~~~~v-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (480) .|.+ .|.+++|.++.|. |+..+.... +.+.+.++.+ +-+.++++|... 
T Consensus 185 ~g~~~~l~vld~~~v~p~~~~~~dp~sp-----------~fg~P~~y~v~~g~~iH~SRli~f~g~-------------- 239 (532) T protein:vir:94 185 RGCLIGFATIEPMWLSPNAYNATDPTLP-----------SFYKPDSWIATSGKKIHSSRIHTVVGR-------------- 239 (532) T ss_pred cceeeEEEeechheeccccccccccccc-----------ccCCceeEEEccCeeeccceEEEecCC-------------- Confidence 1222 3555666665543 211111111 1111111111 112222222100 Q ss_pred cccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-----cch Q lcl|NC_021297. 198 KHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-----NTT 272 (480) Q Consensus 198 ~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-----~~~ 272 (480) .+|-+. ......+|.|.+.+ +.+-+..++++.......+..+....+.+ +........... ... T Consensus 240 -----~~p~~~----~~~~~~~G~Svlq~-~~~~l~~~~~t~~~~~~l~~~~~~~v~k~-~~a~~ls~~~~~~~~~r~~~ 308 (532) T protein:vir:94 240 -----PVGDML----KAAYSFRGVSISQL-AMPYVDNWLRTRQSVSDTVKQFSMTNLAT-DMAQLLAPGGAQSLDARLQL 308 (532) T ss_pred -----Cchhhh----ccccccccccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCceeee-chHHhhcchhHHHHHHHHHH Confidence 011100 01123468888754 55666667776666655554444332221 211111000100 001 Q ss_pred hhhh--hhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHh-cccccc-chHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 273 LDIY--YGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMAE 348 (480) Q Consensus 273 ~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~~~n-~~Sg~Al~~~~~~l~~k~~ 348 (480) ++.. ...++.++++..++-+++ .++....+.+......|++.+++|..-| |..... +++|..-..-|...+. . 
T Consensus 309 ~~~~~~n~g~~~id~~~e~~e~~~-~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~--s 385 (532) T protein:vir:94 309 FNLYRDNRNIGALDKGTEEIQQTN-TPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIA--G 385 (532) T ss_pred HHhhcCCccceEEcCCCceeEEEe-cccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHH--H Confidence 1111 112344444334444443 2233445566777888999999998754 533222 2456544434444333 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHH-------HHhccccCCCHHHHHHhCCC Q lcl|NC_021297. 349 RKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSK-------LYANGQGPIPKEQARIDLGY 421 (480) Q Consensus 349 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~k-------l~~~~~~~~s~et~~~~lg~ 421 (480) +.+..+...+++++++++....+.... ++++.|.+-...+..+.|+...+ ++++ +.++.+.+++.++. T Consensus 386 ~Qe~~l~p~le~l~~~l~~s~~g~~~~---d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~--Gvi~~~Evr~~l~~ 460 (532) T protein:vir:94 386 YQATNLTPLMEWIIDLIQLSEYGQIDP---GLAWEWSPLMELDDKELAEVRQLNASTDSTLMEL--GVIDAKMVQQRLAA 460 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCC---CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhc Confidence 334566788999998887655444332 48889998888888887765533 4443 46788777777653 Q ss_pred Chh------HH--HHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 422 TAT------QR--EQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 422 ~~~------~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .+. .. 
+.+.+....+.+...+.......+..+..+.+++.+++++.+..++..-++.-| T Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 527 (532) T protein:vir:94 461 DPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQNDQ 527 (532) T ss_pred CCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccccCC Confidence 221 10 111111111111111111111111111223333333222222211111111112 No 104 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.48 E-value=4.5e-14 Score=93.76 Aligned_cols=427 Identities=10% Similarity=0.022 Sum_probs=196.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCC--- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED--- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d--- 77 (480) |+.-......+..... ..+-..+..||-... +. ..++-.....+.+++++||..+.-+.-+|+.+..+ T Consensus 70 ~d~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~------~~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~ 140 (537) T protein:vir:10 70 MDGLDVEGGTFSAYAN--PNLSEGLVLWYAQQA-FI------GHQMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGN 140 (537) T ss_pred ccccccchhhhhhhcc--ccccchhhhhccccC-Cc------cHHHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcc Confidence 2221111111110000 011111222222211 11 11222344556788999999999998888876432 Q ss_pred ---chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc--cC--------CCce-eEEEEEccceeEEEEe Q lcl|NC_021297. 78 ---SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES--GD--------PAGI-PLIRVESPLYMYAELD 143 (480) Q Consensus 78 ---~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~--~d--------~~~~-~~i~~~~p~~~~~v~d 143 (480) .+..+.+...|++-++...+.++.+.+..||.+++++....... .+ ..|. ..+.+++|.++.|... 
T Consensus 141 ~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~ 220 (537) T protein:vir:10 141 ELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLD 220 (537) T ss_pred cccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccc Confidence 13345677778888888999999999999999988775421110 00 0111 2345566655554321 Q ss_pred CCCCCeeEEEEEEEEEecCCceEEEEEE----EeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccC Q lcl|NC_021297. 144 PRNTRRVTRAVRLYTTRDDVAVPDRATL----YLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRY 219 (480) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~ 219 (480) ......+. ..+.+.+..+.+ +-+.++++|... . +|-+. ......+ T Consensus 221 ~~~~~dp~--------sp~fg~P~~y~v~g~~iH~SRli~f~g~------------~-------~p~~~----~~~~~~~ 269 (537) T protein:vir:10 221 AQASSNPV--------SMHFYEPTYWLINGKKYHRSHLAIYIND------------E-------VVDFL----KPSYIYG 269 (537) T ss_pred hhhhccCC--------ccccCCceeeeecCeEecceeEEEecCC------------C-------Cchhh----hcccCcc Confidence 10000000 001111111110 112222222100 0 01100 0112357 Q ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-c---chhhh--hhhhhcccCCCCceEEEe Q lcl|NC_021297. 220 GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-N---TTLDI--YYGRILTLASEAAKISEF 293 (480) Q Consensus 220 G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-~---~~~~~--~~~~~~~~~g~~~~~~~~ 293 (480) |.|.+.. +.+-+..++++....+..+..+....+.+.|...- ..+.. . ..+.. 
....++.++.+.-++-++ T Consensus 270 G~Svlq~-~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l--~~~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~ 346 (537) T protein:vir:10 270 GVPLPQQ-IMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVL--ANKQQFDETMSWWTATRDNYQVRVVDKDNEDVVQI 346 (537) T ss_pred cccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhh--cCHHHHHHHHHHHHhhcCCcceeEecCCCceeEEE Confidence 9998865 66666777777776666665555554444332211 11111 0 01111 112334444433444443 Q ss_pred ccccHHHHHHHHHHHHHHHHhccCCChHHh-cccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_021297. 294 KAAELRNFAEEMEVFRKEAASITGLPPQYL-SSSS-ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR 371 (480) Q Consensus 294 ~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~~-~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~ 371 (480) + .++....+.+......|++.+++|..-| |... +-++||..-..-|...+ +.+|..+...++++.++++....+ T Consensus 347 ~-~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I---~~~Qe~l~p~l~~l~~ll~~~~~~ 422 (537) T protein:vir:10 347 D-TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEEC---ESTQDDMRPLIDRHHQLVCRSHLR 422 (537) T ss_pred e-ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhcCC Confidence 3 2334444556666778999999998754 5332 22356665444444433 333445788999999988765543 Q ss_pred CCcccceeeeEEecCCCCCCHHHHHHHH-------HHHHhccccCCCHHHHHHhCCCChh-HHHHHH-HHHHHHHHHHHH Q lcl|NC_021297. 372 EVTEEYTRLETVWRDPSTPTVAAKADAV-------SKLYANGQGPIPKEQARIDLGYTAT-QREQMR-DWDKQETEDMID 442 (480) Q Consensus 372 ~~~~~~~~i~v~f~~~~~~~~~~~ad~~-------~kl~~~~~~~~s~et~~~~lg~~~~-~~~~~~-~~~~~~~~~~~~ 442 (480) . ..++++.|.+-...+..+.|+.. .+++++| .++...+++.|+-.++ ....+- ....+..+.... 
T Consensus 423 ~----~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G--~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~ 496 (537) T protein:vir:10 423 K----RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMG--AVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDV 496 (537) T ss_pred C----CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--CCCHHHHHHHHhccCccccccccCCCChhhhhcccC Confidence 3 23688999998889998887754 4444443 5677666666532211 000010 000000110000 Q ss_pred HHhcccccCc--ccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 443 TLYSTTKAQA--DATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 443 ~~~~~~~~~~--~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ...+...... ...+.+..+..+.+....+...++++++ T Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 536 (537) T protein:vir:10 497 DDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSGAAFE 536 (537) T ss_pred CccCCcCCCCCCCCCccccCCCCccccccCCCccCccccC Confidence 0000000000 1111111222222222233356666666 No 105 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.48 E-value=3.9e-12 Score=83.14 Aligned_cols=443 Identities=12% Similarity=0.064 Sum_probs=215.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC----cchhc-Cc--ccc----hhh-------hhhccccchHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR----RLKTI-GI--GAP----PEL-------AYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~----~i~~~-~~--~~~----~~~-------~~~~~~~n~~~~iVd~ 62 (480) |.++.--.... . .+.........||.|.. .+..+ +. ... ... +++-....|++-+|+. 
T Consensus 1 ~~~p~~~~~~~---~-~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~ 76 (533) T protein:vir:34 1 MKTPTIPTLLG---P-DGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQL 76 (533) T ss_pred CCCchhhhhhc---c-cccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 77763221111 1 11122245567776531 12111 00 000 011 1222345689999999 Q ss_pred HHhhhccCccccCC-------------CchhHHHHHHHHHh--------------cCHHHHHHHHHHHHhhcCeEEEEEe Q lcl|NC_021297. 63 LSDRLDIEGFRISE-------------DSEGLEELWNWWQA--------------NDLDEESVLGHDDSLTFGRAYITVS 115 (480) Q Consensus 63 ~~~~l~~~~~~~~~-------------d~~~~~~l~~i~~~--------------n~~~~~~~~~~~~a~~~G~a~~~v~ 115 (480) .+.++++.|++... +++..+.+...|.. .+|...+..+++..+..|.+|+... T Consensus 77 ~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~ 156 (533) T protein:vir:34 77 HQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQAT 156 (533) T ss_pred HHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEee Confidence 99999999987522 12223444444421 1577778889999999999999875 Q ss_pred cCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccc Q lcl|NC_021297. 116 HPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGD 195 (480) Q Consensus 116 ~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (480) ..... +..--.++..++|+.+-.-++......+..+|.+ +..|.+.-+-++..+.- +.....+.. . 
T Consensus 157 ~~~~~--g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~----d~~Gr~~aY~i~~~~~~------~~~~~~~~~--~ 222 (533) T protein:vir:34 157 WDTSS--SRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQI----NDSGAALGYYVSEDGYP------GWMPQKWTW--I 222 (533) T ss_pred eccCC--CCccceEEEEechhhcCCCCCCCCCCceEeeeEE----CCCCCeEEEEEeecCCC------Cccccccce--e Confidence 43321 1111357889999876543333333445555533 34444444444332110 000000000 0 Q ss_pred cccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH---HHHHHHHHHHhcchhhhhcCCCcccc-------- Q lcl|NC_021297. 196 VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT---LMNLQSASQILGTPLRVISGVTTDEL-------- 264 (480) Q Consensus 196 ~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~---~s~~~~~~~~~~~~~~~i~g~~~~~~-------- 264 (480) .....++.-=|+|+....+.+..-|.|.+.+-+ ..+..++.. ....+.....++ .+|+....... T Consensus 223 ~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl-~~l~~l~~y~dael~~a~i~A~~a---~fi~~~~~~~~~~~~~~~~ 298 (533) T protein:vir:34 223 PRELPGGRASFIHVFEPVEDGQTRGANVFYSVM-EQMKMLDTLQNTQLQSAIVKAMYA---ATIESELDTQSAMDFILGA 298 (533) T ss_pred eeeeccChhHeeeeccccCCCcccCCchHHHHH-HHHHHHHHHHHHHHHHHHHhhhhe---eeeecCCCcccccccccCC Confidence 000112222378888777788889999998744 333333332 222222222222 12221111000 Q ss_pred -----ccc-----------cccchhhhhhhhhccc-CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc Q lcl|NC_021297. 265 -----TND-----------GENTTLDIYYGRILTL-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS 327 (480) Q Consensus 265 -----~~~-----------~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~ 327 (480) .+. .....+.+.++.+..+ +|++.++.+.+. ...+|...++.+++.|+...|+|-+.|.++. 
T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~ 377 (533) T protein:vir:34 299 NSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQD-TDNGYSVFEQSLLRYIAAGLGVSYEQLSRNY 377 (533) T ss_pred CcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhhhc Confidence 000 0011123556666554 455666654332 2335666677779999999999999997764 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH--HHHhCCCCcc------cc-----eeeeEEecCCC--CCC Q lcl|NC_021297. 328 ENPASAEAIIATDSRIVKMAERKGRIFGGAWERA-MRIA--MQIMGREVTE------EY-----TRLETVWRDPS--TPT 391 (480) Q Consensus 328 ~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~-~~~~--~~~~~~~~~~------~~-----~~i~v~f~~~~--~~~ 391 (480) .+ +|-.+.+..+......+...+..|...+.+- ++.. .++..+.++. ++ .-+.+.|..|- ..| T Consensus 378 s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iD 456 (533) T protein:vir:34 378 AQ-MSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAID 456 (533) T ss_pred cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccC Confidence 33 2444667777777777777776666655432 2211 2333332221 11 12457775443 356 Q ss_pred HHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHH-HhcccccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 392 VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDT-LYSTTKAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) ....+.+....+.+| +.|.+.++...|...++..+ ++ +++.+.+.+. +.. .. ++......+. ..++ .. T Consensus 457 P~Ke~~a~~~~i~~G--~~s~~~~~a~~G~D~~ev~~--q~-a~e~~~~~~~gl~~--~~--~~~~~~~s~~-~~~~-~~ 525 (533) T protein:vir:34 457 GLKEVQEAVMLIEAG--LSTYEKECAKRGDDYQEIFA--QQ-VRETMERRAAGLKP--PA--WAAAAFESGL-RQST-EE 525 (533) T ss_pred hHHHHHHHHHHHHcC--CCCHHHHHHHcCCCHHHHHH--HH-HHHHHHHHhcCCCC--CC--CCCcCccCCC-CCCC-CC Confidence 677888888888876 66777788888987654322 21 1221111111 111 11 1111111000 0111 11 Q ss_pred CCCCCCCC Q lcl|NC_021297. 
471 PSGFNRTK 478 (480) Q Consensus 471 p~~~~~~~ 478 (480) |...+.++ T Consensus 526 ~~~~~~~~ 533 (533) T protein:vir:34 526 EKSDSRAA 533 (533) T ss_pred CcccCCCC Confidence 11111122 No 106 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.48 E-value=7.4e-12 Score=81.60 Aligned_cols=439 Identities=12% Similarity=0.055 Sum_probs=217.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC----cchhc-Ccc--cc----hhh-------hhhccccchHHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR----RLKTI-GIG--AP----PEL-------AYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~----~i~~~-~~~--~~----~~~-------~~~~~~~n~~~~iVd~ 62 (480) |-++.-+- .. .++-......||.|.- .+..+ +.. .. ... +++-....|++-+|+. T Consensus 1 ~~~~~~~~------~~-~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~ 73 (530) T protein:vir:38 1 MKIPSLVG------PD-GKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQL 73 (530) T ss_pred Cccceeec------Cc-cccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 44432111 00 1222344566775431 11110 000 00 000 1223345699999999 Q ss_pred HHhhhccCccccCC-------------CchhHHHHHHHHHh--------------cCHHHHHHHHHHHHhhcCeEEEEEe Q lcl|NC_021297. 63 LSDRLDIEGFRISE-------------DSEGLEELWNWWQA--------------NDLDEESVLGHDDSLTFGRAYITVS 115 (480) Q Consensus 63 ~~~~l~~~~~~~~~-------------d~~~~~~l~~i~~~--------------n~~~~~~~~~~~~a~~~G~a~~~v~ 115 (480) .+..+++.|+.... +++..+.+.+.|+. .+|...+..+++..+..|.+|+... 
T Consensus 74 ~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~ 153 (530) T protein:vir:38 74 HQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQAT 153 (530) T ss_pred HHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEee Confidence 99999999986421 22233445555532 2567788889999999999999875 Q ss_pred cCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccc Q lcl|NC_021297. 116 HPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGD 195 (480) Q Consensus 116 ~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (480) ..... ...--.++..++|+.+---++......+..+|.+ +..|.+.-+.++..+.- +.... ... T Consensus 154 ~~~~~--g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~----d~~Gr~~aY~i~~~~~~------~~~~~----~~~ 217 (530) T protein:vir:38 154 WDSDS--TRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKI----NDSGAALGYYVSDDGYP------GWMAQ----NWT 217 (530) T ss_pred eccCC--CCccceEEEEechhhcCCCCCCCCCCeeEeeeEE----CCCCceEEEEEeeccCC------Ccccc----ccc Confidence 43211 0111257889999876533332234445555532 34444443333322110 00000 011 Q ss_pred ccc--ccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH---HHHHHHHHHhcchhhhhcCCCccc------- Q lcl|NC_021297. 196 VIK--HGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL---MNLQSASQILGTPLRVISGVTTDE------- 263 (480) Q Consensus 196 ~~~--~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~---s~~~~~~~~~~~~~~~i~g~~~~~------- 263 (480) .++ ..++.--|+|+....+.+..-|.|.+.+.+ ..+..++... ...+.....++ .+|+...... 
T Consensus 218 ~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl-~~l~~l~~y~dael~~a~i~A~~a---~fi~~~~~~~~~~~~~~ 293 (530) T protein:vir:38 218 YIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVM-EQMKMLDTLQNTQLQSAIVKAMYA---ATIESELDTQSAMDFIL 293 (530) T ss_pred eeeeeeccChhHeEeeccccCCCcccCCchHHHHH-HHHHHHhHHHHHHHHHHHHhhhhe---eeeeccCCccccccccc Confidence 111 234444588888888888899999998744 3333343322 22222222222 1232111100 Q ss_pred ------cccc-----------cccchhhhhhhhhccc-CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcc Q lcl|NC_021297. 264 ------LTND-----------GENTTLDIYYGRILTL-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSS 325 (480) Q Consensus 264 ------~~~~-----------~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~ 325 (480) .... .....+.+.++.+..+ +|++.++.+.+. ...+|...++..++.|+...++|-+.|.+ T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~ye~lt~ 372 (530) T protein:vir:38 294 GADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQD-TDNGYSTFEQSLLRYIAAGLGVSYEQLSR 372 (530) T ss_pred cCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhc Confidence 0000 0011233556666554 455566655432 22356666788899999999999999976 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHhCCCCcc------cc-----eeeeEEecCCCC-- Q lcl|NC_021297. 326 SSENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRIA--MQIMGREVTE------EY-----TRLETVWRDPST-- 389 (480) Q Consensus 326 ~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~-~~~~~--~~~~~~~~~~------~~-----~~i~v~f~~~~~-- 389 (480) +..+ +|-.|.+..+......+...+..|...+.+ +++.. .++..+.+.. ++ .-+.+.|..|-. 
T Consensus 373 D~s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~ 451 (530) T protein:vir:38 373 NYSQ-MSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMA 451 (530) T ss_pred cccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccc Confidence 6433 244466777777777777777766665433 22221 2233332221 11 124567754433 Q ss_pred CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccccc Q lcl|NC_021297. 390 PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQT 469 (480) Q Consensus 390 ~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) .|....+.+....+.+| +.|.+.++...|...++..+ + ++++.+.. +..--..+..+...+. ......+.. T Consensus 452 iDP~Ke~~a~~~~i~~G--~~s~~~~~a~~G~D~~~v~~--q-~a~e~~~~-~~~Gl~~~~~~~~~~~---~~~~~~~~~ 522 (530) T protein:vir:38 452 IDGLKEVQEAVMLIEAG--LSTYEKECAKRGDDYQEIFA--Q-QVRESMER-RAAGLNPPAWAAAAFE---AGVKKSNEE 522 (530) T ss_pred cChHHHHHHHHHHHHcC--CCCHHHHHHHcCCCHHHHHH--H-HHHHHHHH-HHcCCCCCCCcccccC---CCCCCCCCC Confidence 46667788888888876 66777778888987664322 1 11221111 1110011111111111 111223333 Q ss_pred CCCCCCCC Q lcl|NC_021297. 470 SPSGFNRT 477 (480) Q Consensus 470 ~p~~~~~~ 477 (480) ++.|.+++ T Consensus 523 ~~d~~~~a 530 (530) T protein:vir:38 523 EQDGARAA 530 (530) T ss_pred CCCCCCCC Confidence 34455555 No 107 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.47 E-value=8.7e-12 Score=81.23 Aligned_cols=454 Identities=15% Similarity=0.062 Sum_probs=219.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh---c--Ccccchh-------h----hhhccccchHHHHHHHHH Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT---I--GIGAPPE-------L----AYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~---~--~~~~~~~-------~----~~~~~~~n~~~~iVd~~~ 64 (480) |.=+..+|..+......++.+-....+-|+|-..-+. . +...... + +++-..+.|++-+|+.++ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 5544444444444444444333344455665422111 0 0000000 0 122234568899999999 Q ss_pred hhhccC-cccc-----CCCch----hHHHHHHH---HHh-------cCHHHHHHHHHHHHhhcCeEEEEEecCcccccC- Q lcl|NC_021297. 65 DRLDIE-GFRI-----SEDSE----GLEELWNW---WQA-------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGD- 123 (480) Q Consensus 65 ~~l~~~-~~~~-----~~d~~----~~~~l~~i---~~~-------n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d- 123 (480) ..+++. |+.+ ..|++ ..+.+... |.. .+|...+..+++..+..|.+|+........... T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 888873 3321 22322 22333333 332 357888888999999999999876543221100 Q ss_pred CCc-eeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccc Q lcl|NC_021297. 124 PAG-IPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG 202 (480) Q Consensus 124 ~~~-~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (480) ... -.+|..++|+.+-.-++.. ...+...|. .+..|.+.-+-++..+---.+.... ... ... 
++ T Consensus 161 g~~~~~~lqliepd~l~~~~~~~-~~~i~~GIE----~D~~Grp~aY~i~~~hPgd~~~~~~--~~~----~~r----vp 225 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNL-SKGIVQGIE----RDTWRRKRAYHLLKDHPGNLQTLGG--SLA----VKR----VE 225 (548) T ss_pred CcccceEEEEechhhcCCCCCCC-CCceeeeeE----ECCCCceEEEEEeecCCCccccccc--ccc----eee----ec Confidence 001 1478899998764323222 233444443 2444545444444332110000000 000 010 11 Q ss_pred ccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc----ccccccchhhhhhh Q lcl|NC_021297. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL----TNDGENTTLDIYYG 278 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~----~~~~~~~~~~~~~~ 278 (480) .-=|+|+....+.+..-|.|.+.+-+.. +..++....-........+.--.+|+...++.. .+........+.++ T Consensus 226 A~~VlHif~~~r~gQ~RGvs~lapvl~~-l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG 304 (548) T protein:vir:95 226 AERIIHIAYRKRIGQNRGVPMLHAVLIR-LADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPGKDRKNRTIPIAPG 304 (548) T ss_pred hhHheecccccCCccccCcchHHHHHHH-HHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCCcccccccccccCC Confidence 1125677767777788899999874433 333433322222222211211123332222111 11122233455666 Q ss_pred hhc-c-cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 279 RIL-T-LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGG 356 (480) Q Consensus 279 ~~~-~-~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~ 356 (480) .++ . .+|++.++.+.+. ...+|...++.+++.|+...++|-+.|.++. + .|-.|.++.+...-......+..|.. 
T Consensus 305 ~iv~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~IAaglGipYe~ltgD~-s-~nYSS~R~~l~e~~r~~~~~q~~~i~ 381 (548) T protein:vir:95 305 MVFDDLEPGEDVGMIESNR-PNPFLEGFRNGQLRMIGAGTRSTYSSVSRAY-D-GTYSAQRQELVEGWLGYDLLQHEFID 381 (548) T ss_pred ccccccCCCceeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-c-hhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654 2 3577777755332 2345666677889999999999999998775 3 35667788777777777777777766 Q ss_pred HHHH-HHHHHH--HHhCCCCcc-----cceeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH Q lcl|NC_021297. 357 AWER-AMRIAM--QIMGREVTE-----EYTRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR 426 (480) Q Consensus 357 ~l~~-~~~~~~--~~~~~~~~~-----~~~~i~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~ 426 (480) .+.+ +++..+ ++..+.+.. ...-+.+.|..|-. .|....+.+....+.+| +.|.+.++...|...++. T Consensus 382 ~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~G--l~T~~~~~a~~G~D~~ev 459 (548) T protein:vir:95 382 YWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAG--FADEAEVARARGRDPREL 459 (548) T ss_pred HHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHhCCCHHHH Confidence 6554 333322 333332211 11235778855443 46677888888888876 667777888889876653 Q ss_pred H-HHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccC----------------------------------- Q lcl|NC_021297. 427 E-QMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTS----------------------------------- 470 (480) Q Consensus 427 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~----------------------------------- 470 (480) . ++.++.+...+ . .+.. +..+...+...+.+..+++++. T Consensus 460 ~~q~a~E~~~~~~--~-GL~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (548) T protein:vir:95 460 KKSRETEIKANRA--A-GLVF--SSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNES 534 (548) T ss_pred HHHHHHHHHHHHH--c-CCCC--CCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCccc Confidence 2 22222111111 0 1100 0000000001111111111111 Q ss_pred -CCCCCCCCCC Q lcl|NC_021297. 
471 -PSGFNRTKTR 480 (480) Q Consensus 471 -p~~~~~~~~~ 480 (480) ..|.++.-|- T Consensus 535 ~~~~~~~~~~~ 545 (548) T protein:vir:95 535 NNGGADGQPSN 545 (548) T ss_pred ccCCCCCCCCC Confidence 1122222222 No 108 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.47 E-value=6.6e-12 Score=81.87 Aligned_cols=463 Identities=13% Similarity=0.046 Sum_probs=203.4 Q ss_pred CCC-HHHHHHHHHHHHHHH-------HHHHHHHHHHh--cccCcchhcCcccc-hhh--hhhccccchHHHHHHHHHhhh Q lcl|NC_021297. 1 MTT-YHEHVERLQGLLARD-------LPNLLEAEAYR--NGTRRLKTIGIGAP-PEL--AYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~-~~~~i~~l~~~~~~~-------~~r~~~~~~YY--~g~~~i~~~~~~~~-~~~--~~~~~~~n~~~~iVd~~~~~l 67 (480) |++ -.+++.++...+... +.+...-.+|| .|.|-......... ... ..-.+++|.++.+|+..+++. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 988 466777766665432 22223334666 57764432111110 000 122578899999999988876 Q ss_pred ccCc--cc-cC----CCchhHHHHH----HHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccccc---CCCceeEEEEE Q lcl|NC_021297. 68 DIEG--FR-IS----EDSEGLEELW----NWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG---DPAGIPLIRVE 133 (480) Q Consensus 68 ~~~~--~~-~~----~d~~~~~~l~----~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~---d~~~~~~i~~~ 133 (480) .-+- +. .+ +|.+..+.++ .+...|+.+.....+..+++++|.||+-|......+. 
...+.+.+..+ T Consensus 81 ~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i~~v 160 (706) T protein:vir:10 81 RNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAVEPI 160 (706) T ss_pred HhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccceeeee Confidence 4332 11 12 2333344444 4456689999999999999999999988754322211 12334444433 Q ss_pred -ccceeEEEEeCCCCC----eeEEEEE-EEEEec--------------------------CCceEEEEEEEeCCc----E Q lcl|NC_021297. 134 -SPLYMYAELDPRNTR----RVTRAVR-LYTTRD--------------------------DVAVPDRATLYLPDE----T 177 (480) Q Consensus 134 -~p~~~~~v~d~~~~~----~~~~~~~-~~~~~~--------------------------~~~~~~~~~~~~~~~----~ 177 (480) +|.+ .++||+.... ...++++ .|.+.+ ........++|.... + T Consensus 161 ~~p~~-~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~~~ 239 (706) T protein:vir:10 161 YDPAR-SVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVRKESVDV 239 (706) T ss_pred ccchh-ceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccccceeEEE Confidence 4543 2346653211 1111111 111111 001111222232211 1 Q ss_pred EEEEecCCc---------------------c------------------cccccccccccccccccceEEeecc----cc Q lcl|NC_021297. 178 VPLRRNGGL---------------------N------------------DQWVVDGDVIKHGLGVVPVVPLTND----PR 214 (480) Q Consensus 178 ~~~~~~~~~---------------------~------------------~~~~~~~~~~~~~~~~iPvv~~~n~----~~ 214 (480) ++|...... . .+.....++.|-+.+++|+|||.-. .. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~ 319 (706) T protein:vir:10 240 ISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 319 (706) T ss_pred EEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeeccccccc Confidence 111111000 0 0000111222333377888877543 23 Q ss_pred cCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccc-ccccchhhhhhhhh-c----ccCCCC- Q lcl|NC_021297. 
215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTN-DGENTTLDIYYGRI-L----TLASEA- 287 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~-~~~~~~~~~~~~~~-~----~~~g~~- 287 (480) ...+||.- +.+++.++.+|..+|.+.+.+.... ...-.|.......- ..+..........+ + ..+|.- T Consensus 320 ~~~~~G~v---r~~~d~Q~~~N~~~s~~~~~~~~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~ 394 (706) T protein:vir:10 320 VERVEGHI---AKAMDPQRLYNLQVSMLADAAAQDP--GQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVV 394 (706) T ss_pred cCccccee---ccchhhHHHHHHHHHHHHHHHHhcC--CcccccchhHHHHHHHHhhhcccccccchhcccccCCCCccc Confidence 34456643 5689999999999999988874322 22222211100000 00000000000000 0 001110 Q ss_pred ---ceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 288 ---AKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 288 ---~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) ..+..++. .-.+.+...++.....|-.++|+++..+|.. .| .||+|+...-..-..........|..+.+++.+ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~-sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~ 472 (706) T protein:vir:10 395 APANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMP-SN-VARETVNSLLNRSDMASFIYLDNMAKSLKRAGE 472 (706) T ss_pred ccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111121 1234566677777788888999999998854 34 699999998777777777777888888887766 Q ss_pred HHHHH----hCC---------CCcccc-------------------------eeeeEEecCCCCCCHHHHHHHHHHHHhc Q lcl|NC_021297. 364 IAMQI----MGR---------EVTEEY-------------------------TRLETVWRDPSTPTVAAKADAVSKLYAN 405 (480) Q Consensus 364 ~~~~~----~~~---------~~~~~~-------------------------~~i~v~f~~~~~~~~~~~ad~~~kl~~~ 405 (480) +++.+ .+. 
....++ +++.|.=.+..+.-..+..+.++++.++ T Consensus 473 ~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~ 552 (706) T protein:vir:10 473 IWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQG 552 (706) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHh Confidence 65543 211 110000 1111111222223345667778888776 Q ss_pred cccCCCHH-----HHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcc-cccCCCCCCCCcccccCC-------- Q lcl|NC_021297. 406 GQGPIPKE-----QARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD-ATPKPTVTDTKTETQTSP-------- 471 (480) Q Consensus 406 ~~~~~s~e-----t~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~~~~~p-------- 471 (480) +....+.. .+++.+.+.- ++++.+..+.+.. ......+. +.+.+........++.++ T Consensus 553 ~~p~~~~~~~l~~~~~~~~d~p~--~~e~~e~irk~~~-------~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~ 623 (706) T protein:vir:10 553 MLPQDPMRPALMGIIIDNMEGEG--LDDFKAFNRRQLL-------TQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQ 623 (706) T ss_pred cCCcchhhHHHHHHHHhhcCccc--hHHHHHHHHHhhc-------ccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 53221111 1233333321 1222222111100 00000000 000000000000000000 Q ss_pred -----CCCCCCCCC Q lcl|NC_021297. 472 -----SGFNRTKTR 480 (480) Q Consensus 472 -----~~~~~~~~~ 480 (480) .+..-...+ T Consensus 624 aq~~~~qA~~~k~~ 637 (706) T protein:vir:10 624 AQMVVAQAEAQKSQ 637 (706) T ss_pred HHHHHHHHHHHHHH Confidence 000000000 No 109 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.45 E-value=1.2e-11 Score=80.51 Aligned_cols=455 Identities=13% Similarity=0.063 Sum_probs=213.3 Q ss_pred CCC-HHHHHHHHHHHHHHHHHHHHHHHHHhccc--Cc--chhc-Ccc------cchhh-------hhhccccchHHHHHH Q lcl|NC_021297. 
1 MTT-YHEHVERLQGLLARDLPNLLEAEAYRNGT--RR--LKTI-GIG------APPEL-------AYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~-~~~~i~~l~~~~~~~~~r~~~~~~YY~g~--~~--i~~~-~~~------~~~~~-------~~~~~~~n~~~~iVd 61 (480) |.+ ....+.......... +.......|+|- +. ...+ +.. +.... +++-....|++-+|+ T Consensus 1 m~~~~~r~~~~~a~~~~~~--~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQ--SASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred Ccchhhhhhcccccccchh--hhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 443 233333222222211 111112345442 11 1110 000 00011 122234568999999 Q ss_pred HHHhhhccCccccCC----------Cc----hhHHHHHHHHH---h-----------cCHHHHHHHHHHHHhhcCeEEEE Q lcl|NC_021297. 62 TLSDRLDIEGFRISE----------DS----EGLEELWNWWQ---A-----------NDLDEESVLGHDDSLTFGRAYIT 113 (480) Q Consensus 62 ~~~~~l~~~~~~~~~----------d~----~~~~~l~~i~~---~-----------n~~~~~~~~~~~~a~~~G~a~~~ 113 (480) .++..+++.|++... ++ +..+.+.+.|+ + .+|...+..+++.....|.+|+. T Consensus 79 ~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 158 (553) T protein:vir:63 79 YQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLAT 158 (553) T ss_pred HHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEE Confidence 999999999987421 12 22333433332 2 14777788899999999999987 Q ss_pred EecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccc Q lcl|NC_021297. 114 VSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVD 193 (480) Q Consensus 114 v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (480) ....... +..--+++..++|+.+-.-++......+..+|.+ +..|.+.-+-++..+---.|...... ..|.-. 
T Consensus 159 ~~~~~~~--~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~----d~~Gr~vaY~i~~~hPgd~~~~~~~~-~~~~r~ 231 (553) T protein:vir:63 159 AEWDRAA--NRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQY----DKRGRPQGYWIQVAHPGDLYQMAPDM-YKWKFV 231 (553) T ss_pred eeeccCC--CCcccceEEEechhhcCCCCCCCCCCeeEeeeEE----CCCCceEEEEeeccCCCccccccccc-cceeee Confidence 6543211 1112357889999877554443334445555533 44455554444443211111111110 001000 Q ss_pred cccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH---HHHHHHHHHHhcchhhhhcCCCcccc------ Q lcl|NC_021297. 194 GDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT---LMNLQSASQILGTPLRVISGVTTDEL------ 264 (480) Q Consensus 194 ~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~---~s~~~~~~~~~~~~~~~i~g~~~~~~------ 264 (480) .. ...++.-=|+|+....+.+..-|.|.+.+-+ ..+..++.. ....+.....++ .+|+...++.. T Consensus 232 ~~--~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl-~~l~~l~~y~daeL~~a~i~A~~a---~fi~~~~~~~~~~~~~~ 305 (553) T protein:vir:63 232 QQ--SKPWGRRQVIHILEPREPDQSRGIADIVSGL-KDMRMAKRFKEMSLQNAVINASYA---AAIESELPPEFIHSQMS 305 (553) T ss_pred cc--ccccChhHheecccccCCCcccCCchHHHHH-HHHHHHhHHHHHHHHHHHHhhhhe---eeeecCCChhhhhhhcc Confidence 00 1122333456777667777888999998744 333334332 222222222222 12321111000 Q ss_pred -------c-------------cccccchhhhhhhhhccc-CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHh Q lcl|NC_021297. 265 -------T-------------NDGENTTLDIYYGRILTL-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL 323 (480) Q Consensus 265 -------~-------------~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~ 323 (480) . .........+.++.+..+ +|++.++.+.+. 
...+|...++..++.|+...|+|-+.+ T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~-p~~~~~~F~~~~lr~iaaglGi~Ye~l 384 (553) T protein:vir:63 306 GGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGT-PGGVGSEFEASLNRHLASAFGMSYEEF 384 (553) T ss_pred cccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 0 001112234556666555 455666654432 233566667788999999999999988 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHhCCCCccc--------------ceeeeEEecC Q lcl|NC_021297. 324 SSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRIA--MQIMGREVTEE--------------YTRLETVWRD 386 (480) Q Consensus 324 g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~-~~~~~--~~~~~~~~~~~--------------~~~i~v~f~~ 386 (480) .++..+ +|-.|.++.+..........+..|...+.+ +++.. .++..+.+... ..-+.+.|.. T Consensus 385 t~D~s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~ 463 (553) T protein:vir:63 385 TRDFSK-ANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIG 463 (553) T ss_pred hhhccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeec Confidence 766433 234456666666666666666666665544 33322 23333322110 1124577866 Q ss_pred CCCC--CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHH-HhcccccCcccccCCCCCCC Q lcl|NC_021297. 387 PSTP--TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDT-LYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 387 ~~~~--~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~ 463 (480) |-.. |....+.+....+.+| +.|.+..+...|...++..+. ++++.+.+.+. +....... .+...+.+. 
T Consensus 464 p~~~~iDP~Ke~~A~~~~i~~G--~~t~~~~~a~~G~D~~~v~~q---~a~e~~~~~~~Gl~~~~~~~---~~~~~~~~~ 535 (553) T protein:vir:63 464 ASQGQIDQLKETQAAVMRIDAG--LSTYEREIARLGGDFRKSFAQ---RAREDALLKKYGLTFNLSAK---RSLGDGRDA 535 (553) T ss_pred CCccccChHHHHHHHHHHHHcC--CCCHHHHHHHhCCCHHHHHHH---HHHHHHHHHHcCCCCCCCCc---cccCCCccc Confidence 5543 6677788888888776 567777788889876643221 11121111111 11100000 010111111 Q ss_pred CcccccCCCCCCCCCCC Q lcl|NC_021297. 464 KTETQTSPSGFNRTKTR 480 (480) Q Consensus 464 ~~~~~~~p~~~~~~~~~ 480 (480) ++.....|...++.++- T Consensus 536 ~~~~~~~~~~~~~~~~~ 552 (553) T protein:vir:63 536 ATGIAEDPAAAQTSQQG 552 (553) T ss_pred CCCCCCCCCCCCccccc Confidence 11111111111111111 No 110 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.45 E-value=5.8e-13 Score=87.65 Aligned_cols=407 Identities=13% Similarity=0.115 Sum_probs=189.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHh---cccCcch----hcCccc-chhhhhhccccchHHHHHHHHHhhhccCcc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYR---NGTRRLK----TIGIGA-PPELAYLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY---~g~~~i~----~~~~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) |+. +..+..+-+.|. .+.-... +.+... -..+......+.+++++||..++-+.-+|+ T Consensus 5 m~~--------------~~~~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~ 70 (435) T protein:vir:79 5 MSD--------------KVKAITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGF 70 (435) T ss_pred ccc--------------ccccchhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCc Confidence 332 122223333332 2211110 011111 122334455677889999999999999999 Q ss_pred ccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc----cCCCcee-EEEEEccceeEEEEeCCCC Q lcl|NC_021297. 
73 RISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES----GDPAGIP-LIRVESPLYMYAELDPRNT 147 (480) Q Consensus 73 ~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~----~d~~~~~-~i~~~~p~~~~~v~d~~~~ 147 (480) .+..++ ..+.+...|++-++...+.++.+.+..||.|++++...+... .+.+|.+ .|.+++|.++.|-.-..+. T Consensus 71 ~i~g~~-~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp 149 (435) T protein:vir:79 71 KVDGVK-NEKSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNA 149 (435) T ss_pred eecCCC-hHHHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCC Confidence 887654 345677777777788899999999999999988876422111 1222332 3445555444431100000 Q ss_pred CeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 148 RRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) ... +.+.+..+.+ . ..+...+......+. ..|..-|+-... ......||.|.|.+. T Consensus 150 ~sp-----------~fg~P~~y~v---------~-~~~~~~~~~iH~SRl-i~~~g~~~p~~~--~~~~~~~G~S~l~e~ 205 (435) T protein:vir:79 150 RSV-----------RYGEPKLYKI---------S-PGGDIPEFFVHYSRI-CIIDGERVSNEK--RRQNDGWGASILNKR 205 (435) T ss_pred ccc-----------ccCcceEEEE---------e-cCCCCCceEEcceeE-EEecCCcchhhh--ccccCcccchHHHHH Confidence 000 1111111111 0 000000000000000 001111110111 122446788877555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch------hh--hhhhhhcccCCCCceEEEeccccHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT------LD--IYYGRILTLASEAAKISEFKAAELR 299 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~------~~--~~~~~~~~~~g~~~~~~~~~~~~~~ 299 (480) +.+-+..++++.......+..+....+.+.|.............. +. ......+.+.+++.++-+++ .++. 
T Consensus 206 ~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~-~~ls 284 (435) T protein:vir:79 206 LIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLN-SDVS 284 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEe-cccC Confidence 666566677776666666554444444444322111111111000 11 11112223334433443333 2334 Q ss_pred HHHHHHHHHHHHHHhccCCChHHh-cccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021297. 300 NFAEEMEVFRKEAASITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~-g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 377 (480) ...+.+......|++.+++|..-| |..... ++||..-...|...+.. ..+..+...|++++++++.- T Consensus 285 gl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~--~Qe~~l~p~l~~l~~li~~s--------- 353 (435) T protein:vir:79 285 GVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDR--KRVEDYKPILEFLLPFMISE--------- 353 (435) T ss_pred CHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhcC--------- Confidence 445556777889999999998665 433332 35666544444444442 33566788888888876521 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCC-ChhHHHHHHHHHHHHHHHHHHHHhcccccCccccc Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGY-TATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) .++.+.|.+-..++..+.|+...+.+++- .++.. .|. +++++.+.. +.... .-.+.. .....-+ T Consensus 354 ~d~~~~f~pL~~~sekEkAei~~~~a~a~------~~~~~-~g~i~~~e~r~~L---~~~~~--~~~~~~---~~~~~~~ 418 (435) T protein:vir:79 354 TEWSIEFEPLSVPSDKDKAEIMAKNVESV------VKLKA-EQAINLKETRDTL---RSICP--DLKIMD---NDNIELP 418 (435) T ss_pred CCCeEEeCCCCCCCHHHHHHHHHHHHHHH------HHHHh-cCCCCHHHHHHHH---HHhcc--ccCCCC---cccccCC Confidence 25788999999999999998776665432 22222 343 333332211 10000 000000 0000011 Q ss_pred CCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 
457 KPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 457 ~~~~~d~~~~~~~~p~~~~~ 476 (480) +..+ -...+..++|.|. T Consensus 419 --~~~d-~~~~~~~e~g~~~ 435 (435) T protein:vir:79 419 --EPED-LDPEPGQEGGLNK 435 (435) T ss_pred --cccc-CCCCCCCCCCCCC Confidence 1111 1122233345555 No 111 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.43 E-value=6.1e-13 Score=87.54 Aligned_cols=469 Identities=12% Similarity=0.053 Sum_probs=197.8 Q ss_pred CCCH-HHHHHHHHHHHHHHH-------HHHHHHHHHh--cccCcchhcCc---ccchhhhhhccccchHHHHHHHHHhhh Q lcl|NC_021297. 1 MTTY-HEHVERLQGLLARDL-------PNLLEAEAYR--NGTRRLKTIGI---GAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~-------~r~~~~~~YY--~g~~~i~~~~~---~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |++. ++++.++...+.... .....-.+|| .|.+--...-. ...+.-..-.+++|.++.+|+..+++. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 9887 677777777654432 1122234666 47653221100 000000112367799999999988876 Q ss_pred ccCc--c-ccCC----CchhHHHHH----HHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCC---CceeEEEEE Q lcl|NC_021297. 68 DIEG--F-RISE----DSEGLEELW----NWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP---AGIPLIRVE 133 (480) Q Consensus 68 ~~~~--~-~~~~----d~~~~~~l~----~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~---~~~~~i~~~ 133 (480) .-+- + ..|. |.+..+.++ .+...|+.+.....+..+++++|.||+-|..+...+.+. 
...+++..+ T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i~~v 160 (720) T protein:vir:35 81 RHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICLEPI 160 (720) T ss_pred HhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeEecc Confidence 4322 1 1222 333334444 345668899999999999999999999886532222221 233444432 Q ss_pred -cc-ceeEEEEeCCCCCe----eEEE-EEEEEEec--------------------------CCceEEEEEEEeCC----- Q lcl|NC_021297. 134 -SP-LYMYAELDPRNTRR----VTRA-VRLYTTRD--------------------------DVAVPDRATLYLPD----- 175 (480) Q Consensus 134 -~p-~~~~~v~d~~~~~~----~~~~-~~~~~~~~--------------------------~~~~~~~~~~~~~~----- 175 (480) +| .+ +.|||..... ..++ +..|.+.+ +......+++|.-. T Consensus 161 ~~~~~~--v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~~~~~~ 238 (720) T protein:vir:35 161 YDPARS--VWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVKKESVD 238 (720) T ss_pred cCchhh--eeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEEEEEEEE Confidence 22 23 3455433210 1111 11111110 01112223333221 Q ss_pred ----------cEEEEEecCC--------------------ccc--------ccccccccccccccccceEEeecc----c Q lcl|NC_021297. 176 ----------ETVPLRRNGG--------------------LND--------QWVVDGDVIKHGLGVVPVVPLTND----P 213 (480) Q Consensus 176 ----------~~~~~~~~~~--------------------~~~--------~~~~~~~~~~~~~~~iPvv~~~n~----~ 213 (480) .++.|..... ... +..+..++.+-+++++|+|||... . T Consensus 239 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d 318 (720) T protein:vir:35 239 VVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWFID 318 (720) T ss_pred EEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeeccC Confidence 1122211100 000 001112223445567888887632 2 Q ss_pred ccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-cchhhhh-----hhhhcccCCC- Q lcl|NC_021297. 
214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-NTTLDIY-----YGRILTLASE- 286 (480) Q Consensus 214 ~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~~~~~-----~~~~~~~~g~- 286 (480) ....+||. | +.+++.+|.||..+|.+...+. +.+...-.|........++. ...-... .+.+....|. T Consensus 319 ~~~~~~G~--v-r~~kd~Q~~~N~~~s~~~~~~~--~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~ 393 (720) T protein:vir:35 319 DIERVEGH--I-AKAMDAQRLYNLQVSMLADSAT--QDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNI 393 (720) T ss_pred CCccccee--e-ecchhHHHHHHHHHHHHHHHHH--cCCccccccCcchHHHHHHHhhccccccccccccccccccCccc Confidence 22223554 3 5689999999999999999985 34433333321111000000 0000000 0000001111 Q ss_pred ---CceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 287 ---AAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) Q Consensus 287 ---~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 362 (480) ...+...+. .-.+.+...+......|-.++|+++..+|.. +| .||+|+..+-..-..........+..+.+++. T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~-sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g 471 (720) T protein:vir:35 394 IAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMP-SN-IAKETVNHLMHRSDMSSFIYLDNMAKSLKRAG 471 (720) T ss_pred ccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112222222 2344566777777888888999999999964 45 69999988765555555556666666666666 Q ss_pred HHHHHH----hCC---------CCcccc-------------------------eeeeEEecCCCCCC-HHHHHHHHHHHH Q lcl|NC_021297. 363 RIAMQI----MGR---------EVTEEY-------------------------TRLETVWRDPSTPT-VAAKADAVSKLY 403 (480) Q Consensus 363 ~~~~~~----~~~---------~~~~~~-------------------------~~i~v~f~~~~~~~-~~~~ad~~~kl~ 403 (480) ++++.+ .+. ...... +++.+.=. |...+ ..+....++++. 
T Consensus 472 ~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~-p~~~s~req~~~~m~qll 550 (720) T protein:vir:35 472 EVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVG-PSYTARRDATVSVLTNLL 550 (720) T ss_pred HHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecc-cCcccHHHHHHHHHHHHH Confidence 655433 221 110000 11112111 22233 334566677766 Q ss_pred hccccCCC-----HHHHHHhCCCCh--hHHHHHHHHH----------HH----------HHHHH---HHHHhc---cccc Q lcl|NC_021297. 404 ANGQGPIP-----KEQARIDLGYTA--TQREQMRDWD----------KQ----------ETEDM---IDTLYS---TTKA 450 (480) Q Consensus 404 ~~~~~~~s-----~et~~~~lg~~~--~~~~~~~~~~----------~~----------~~~~~---~~~~~~---~~~~ 450 (480) .+...-.+ ...+++.+.+.. +-.+++.+.. .+ +.+.. .....+ ..+. T Consensus 551 ~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqa 630 (720) T protein:vir:35 551 AGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQGQA 630 (720) T ss_pred HhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHH Confidence 54321111 111233333321 1111221100 00 00000 000000 0000 Q ss_pred CcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 451 QADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 451 ~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .......... .........-+.....+.| T Consensus 631 e~~kaqa~~~-~~qa~a~~aqa~a~~~~a~ 659 (720) T protein:vir:35 631 EVQKAKNEEL-AIQVKAFQAQTEARVAEAK 659 (720) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 0000000000 0000000000000000001 No 112 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.41 E-value=1.2e-12 Score=85.85 Aligned_cols=402 Identities=12% Similarity=0.108 Sum_probs=180.0 Q ss_pred HHHHHHHHHHHHHHHHHhcccCcchhcC---cccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHH Q lcl|NC_021297. 
12 QGLLARDLPNLLEAEAYRNGTRRLKTIG---IGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWW 88 (480) Q Consensus 12 ~~~~~~~~~r~~~~~~YY~g~~~i~~~~---~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~i~ 88 (480) ++.+.. +-+....-|.++-...+ ....-.+-.....+.+++++||..++-+..+|+.+..+++ .+.+...| T Consensus 1 ~~~~~~-----d~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~-~~~~~~~~ 74 (427) T protein:vir:10 1 MKIVKH-----DGYNDIFNGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKD-EKEFKSLW 74 (427) T ss_pred CCcccc-----chHHHHhhcCCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccH-HHHHHHHH Confidence 111111 11112222211110010 0111122344556778899999999999999998876653 35677778 Q ss_pred HhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc----cCCCce-eEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCC Q lcl|NC_021297. 89 QANDLDEESVLGHDDSLTFGRAYITVSHPDVES----GDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDV 163 (480) Q Consensus 89 ~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~----~d~~~~-~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~ 163 (480) ++-++...+.++.+.+..||.|++++...+... ....|. ..|.++++.++.|-.-..+.. ..+. T Consensus 75 ~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~-----------s~~f 143 (427) T protein:vir:10 75 DSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNAR-----------SPRY 143 (427) T ss_pred HHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCcc-----------cccc Confidence 777888999999999999999998875432211 111122 234444444433311100000 0111 Q ss_pred ceEEEEEEEeCCcEEEEEecCCcc-cccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHH Q lcl|NC_021297. 164 AVPDRATLYLPDETVPLRRNGGLN-DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNL 242 (480) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~ 242 (480) +.+..+.+ .++.. ........+. ..|..-|+-.+. ......||.|.|.+.+.+-+..++++.... 
T Consensus 144 g~P~~y~v-----------~~~~~~~~~~iH~SRl-i~~~g~~~p~~~--~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~ 209 (427) T protein:vir:10 144 GEPEIYKV-----------SPGDNMQPYLIHHSRV-FIADGERVAQQA--RKQNQGWGASVLNKSLIDAICDYDYCESLA 209 (427) T ss_pred CcceEEEE-----------ecCCCCcceEEccccE-EEecCCCchhhh--cccCCcccchhhhHHHHHHHHHHHHHHHHH Confidence 12211111 11100 0000000000 001111110010 122446799887654555555666666655 Q ss_pred HHHHHHhcchhhhhcCCCccccccccccch------hhh--hhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHh Q lcl|NC_021297. 243 QSASQILGTPLRVISGVTTDELTNDGENTT------LDI--YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAAS 314 (480) Q Consensus 243 ~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~------~~~--~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~ 314 (480) ...+..+....+.+.|.............. +.. .....+.+.+++.++-+++ .++....+.+.....+|++ T Consensus 210 ~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~-~~lsgl~~~~~~~~~~iaa 288 (427) T protein:vir:10 210 TQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLN-SDISGVPEFLSSKMDRIVS 288 (427) T ss_pred HHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEe-cccCChHHHHHHHHHHHHh Confidence 555544444444444422111011111000 010 0111222333333443332 2233344556667788999 Q ss_pred ccCCChHHh-cccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCH Q lcl|NC_021297. 315 ITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTV 392 (480) Q Consensus 315 ~~~~p~~~~-g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~ 392 (480) .+++|..-| |..... +++|..=..-|...+. ...+..+...|++++.+++.- .++++.|.+-...+. 
T Consensus 289 a~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~--~~Qe~~l~p~l~~l~~~i~~s---------~~~~~~f~pL~~~s~ 357 (427) T protein:vir:10 289 LSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLLEFLLPFIVDE---------EEWSIEFEPLSVPSK 357 (427) T ss_pred hhCCCeeeeccCCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcC---------CCcEEEeCCCCCCCH Confidence 999998766 433322 3556654444444443 233456788889888876521 258899999999999 Q ss_pred HHHHHHHHHHHhccccCCCHHHHHHhCC-CChhHHHHHHHHHHHHHHHHHHHHhcccccCc-----ccccCCCCCCCCcc Q lcl|NC_021297. 393 AAKADAVSKLYANGQGPIPKEQARIDLG-YTATQREQMRDWDKQETEDMIDTLYSTTKAQA-----DATPKPTVTDTKTE 466 (480) Q Consensus 393 ~~~ad~~~kl~~~~~~~~s~et~~~~lg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~d~~~~ 466 (480) .+.|+...+.+++. .++. ..| ++++++.+.. +... ....+........ +..++|..++++ T Consensus 358 kEkaei~~~~a~a~------~~~~-~~gvi~~~e~r~~L---~~~~--~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~-- 423 (427) T protein:vir:10 358 KEESEITKNNVESV------TKAI-TEQIIDLEEARDTL---RSIA--PEFKLKDGNNINIREPEETTEPEPGLGEKL-- 423 (427) T ss_pred HHHHHHHHHHHHHH------HHHH-hcCCCCHHHHHHHH---Hhhh--ccccCCCCccccccccchhcCCCCCCCCCC-- Confidence 99988766554431 2222 234 3334322111 1110 0111111000000 000111111111 Q ss_pred cccCCCCCC Q lcl|NC_021297. 467 TQTSPSGFN 475 (480) Q Consensus 467 ~~~~p~~~~ 475 (480) ++.| T Consensus 424 -----~d~~ 427 (427) T protein:vir:10 424 -----EDEN 427 (427) T ss_pred -----CCCC Confidence 1111 No 113 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.41 E-value=7e-12 Score=81.73 Aligned_cols=437 Identities=13% Similarity=0.077 Sum_probs=208.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcc---hhcCcccch-h----h-------hhhccccchHHHHHHHHHh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRL---KTIGIGAPP-E----L-------AYLDVQPGWVATYLRTLSD 65 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i---~~~~~~~~~-~----~-------~~~~~~~n~~~~iVd~~~~ 65 (480) |++- -++ .+... 
.+-....+-|+|-..- ...+...+. . . +++-...+|++-+|+..+. T Consensus 3 ~~~~-~~~-a~~~~-----~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 75 (495) T protein:vir:10 3 MTPS-GYQ-SLASG-----LLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVA 75 (495) T ss_pred cccc-ccc-ccchh-----hhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 3331 111 11111 1111223345553211 111111110 0 0 1223345689999999999 Q ss_pred hhccCcccc---CCCchhHHHHHHHHH---h-------cCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEE Q lcl|NC_021297. 66 RLDIEGFRI---SEDSEGLEELWNWWQ---A-------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 66 ~l~~~~~~~---~~d~~~~~~l~~i~~---~-------n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~ 132 (480) .+++.||+. +++++..+.+...|+ + .+|...+..+++.....|.+|+.+....... ....-.++.. T Consensus 76 ~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~-g~~~~~~lql 154 (495) T protein:vir:10 76 AAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSE-GLSVPLQLQI 154 (495) T ss_pred hhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCC-CCccceEEEE Confidence 999999874 345555555555543 2 3678888889999999999998764322110 0112258899 Q ss_pred EccceeE-EEEeC--CCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccc---e Q lcl|NC_021297. 133 ESPLYMY-AELDP--RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP---V 206 (480) Q Consensus 133 ~~p~~~~-~v~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP---v 206 (480) ++|+.+- |.-+. .....+..+|.+ +..|.+..+-++..+---.+ ... 
....+.+|| | T Consensus 155 iepd~l~~~~~~~~~~~g~~i~~GIe~----d~~Gr~vaY~i~~~hpgd~~-----------~~~--~~~~~~rvpA~~v 217 (495) T protein:vir:10 155 IEPDMLASDIPDETLPSGGYVKGGIRF----SNGGKRKAYCFYRNHPAESS-----------LIG--DPVDTVWIKAEHV 217 (495) T ss_pred echhhcCCCCCCCCCCCCCEEEeceEE----CCCCceEEEEEeecCCCccc-----------ccc--cccceeeechhhe Confidence 9999874 33222 123345555543 33444433333332110000 000 011223455 5 Q ss_pred EEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc---------ccccccchhhhhh Q lcl|NC_021297. 207 VPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL---------TNDGENTTLDIYY 277 (480) Q Consensus 207 v~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~---------~~~~~~~~~~~~~ 277 (480) +|+. ..+.+..-|.|.+.. ++.|- .++....-........+.--.+|+...++.. .+........+.+ T Consensus 218 lH~f-~~r~gQ~RGis~la~-i~~l~-~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p 294 (495) T protein:vir:10 218 LHVT-VLTVRSDAGAPWFQL-LLRLN-ELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNP 294 (495) T ss_pred Eecc-ccCCCcccCcchhHH-HHHHH-HhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCC Confidence 6664 356677889998764 55543 2333222211222211211123332111111 1111122334566 Q ss_pred hhhccc-CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_021297. 278 GRILTL-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGR-IFG 355 (480) Q Consensus 278 ~~~~~~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~-~~~ 355 (480) +.+..+ +|.+.++.+.+. ...+|.+.++..++.|+...|+|-+.+.++..+ +|-.+++..+......+...+. .+. 
T Consensus 295 G~i~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~-~nYSS~R~~~~e~~r~~~~~q~~~~~ 372 (495) T protein:vir:10 295 GTLQYLQPGQEVKFSNPAD-VGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRG-VNYSSIRAGLLEFRRLCQQVQHHMII 372 (495) T ss_pred ceeeecCCCCeeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc-ccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666654 455666654332 234566667888999999999999988766433 2344567776666666665544 344 Q ss_pred HHHHH-HHHHHH--HHhCCCCc-ccc-----eeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChh Q lcl|NC_021297. 356 GAWER-AMRIAM--QIMGREVT-EEY-----TRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTAT 424 (480) Q Consensus 356 ~~l~~-~~~~~~--~~~~~~~~-~~~-----~~i~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~ 424 (480) ..+.+ +++..+ ++..+.+. .++ .-+.+.|..|.. .|....+.+....+.+| +.|.+..+...|...+ T Consensus 373 ~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G--~~s~~~~~a~~G~D~~ 450 (495) T protein:vir:10 373 HQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAG--FAPISDKQAERGYDME 450 (495) T ss_pred HHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHcCCCHH Confidence 44432 333222 23333222 111 124567755443 46677888888888876 5677777888898766 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 425 QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) +..+. ++++.+.+ +.. 
+ ..-..++......+ ...+....++..|+ T Consensus 451 ~v~~q---~a~e~~~~-~~~-G-l~~~~~p~~~~~~~-~~~~~~~~~~~~~e 495 (495) T protein:vir:10 451 ELFDM---ISDANQLI-DEY-D-LRLDSDPRYVNGSG-AEQKSVMEAALNNE 495 (495) T ss_pred HHHHH---HHHHHHHH-HHc-C-CCCCCCCCcCCCcc-CCCCCCCCCCCCCC Confidence 53322 11121111 111 1 10001111100000 00111112222222 No 114 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.34 E-value=1.3e-11 Score=80.25 Aligned_cols=402 Identities=13% Similarity=0.120 Sum_probs=182.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcc-hhcC---cccchhhhhhccccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRL-KTIG---IGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i-~~~~---~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |+-..- +...+.|-+.- ...+ ......+......+.+++++||..+.-+.-+|+.+.. T Consensus 1 ~~~~D~------------------~~n~~~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~ 62 (422) T protein:vir:10 1 MVKTDS------------------YANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (422) T ss_pred Cccchh------------------hHHHHcCCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccC Confidence 332222 22222232111 0010 0111222344556778899999999999999998876 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc----cCCCcee-EEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES----GDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 77 d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~----~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) +++ ...+..-|++-++...+.++.+.+..||.|++++...+... .+..|.+ .+.+++|.++.|..-..+.... 
T Consensus 63 ~~~-~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~- 140 (422) T protein:vir:10 63 IDD-EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNA- 140 (422) T ss_pred CCH-HHHHHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCcccc- Confidence 653 34566667777888899999999999999988876432211 1122322 3445555554432100011111 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHH Q lcl|NC_021297. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) +.+.+..+.+- ..++... ..+...+. ..|..-|+-.+. ......||.|.+.+.+.+- T Consensus 141 ----------~fg~P~~y~v~---------~~~~~~~-~~iH~SRl-i~~~g~~~p~~~--~~~~~~~G~S~l~~~~~~~ 197 (422) T protein:vir:10 141 ----------RFGEPLTYRIT---------TNESDMF-YDVHYSRI-HIIDGERIPNVM--RRQNDGWGRSVLSSDILDS 197 (422) T ss_pred ----------ccCcceEEEEe---------cCCCCcc-eeecccee-EEeCCCCchhhh--cccCCcccchhHHHHHHHH Confidence 11111111110 0000000 00000000 000001110000 1224467999886545555 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc------hhhhh--hhhhcccCCCCceEEEeccccHHHHHH Q lcl|NC_021297. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT------TLDIY--YGRILTLASEAAKISEFKAAELRNFAE 303 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~------~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~ 303 (480) +..++++.......+..+......+.|............. .+... 
....+.+.+.+.++-+++ .++....+ T Consensus 198 i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~-~~lsgl~~ 276 (422) T protein:vir:10 198 IKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN-SDIGGIDA 276 (422) T ss_pred HHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEe-cccCChHH Confidence 6666676666655554444444444332111000111000 01111 111122333333343332 22334445 Q ss_pred HHHHHHHHHHhccCCChHHhccc-ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSS-SEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~-~~n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~ 381 (480) .+......|++.+++|..-|.+. ... ++||..-..-|...+. ...+..+...+++++++++.- .+++ T Consensus 277 ~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~--~~Qe~~l~p~l~~l~~~i~~s---------~~~~ 345 (422) T protein:vir:10 277 FLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVD--RKRNAELLPILEFLIPFIVNA---------EEWS 345 (422) T ss_pred HHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhccc---------CCcE Confidence 56667889999999998766433 222 2455544434444333 233456788899888876631 2588 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC-CChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG-YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) +.|.+-..++..+.|+...+.+++- .++.. .| ++++++.+.. +.... ...+.+ .....+..+.+.. 
T Consensus 346 ~~f~pL~~~sekekaei~~~~a~a~------~~~~~-~g~i~~~e~r~~L---~~~~~--~~~~~~-~~~~~~~~~~~~~ 412 (422) T protein:vir:10 346 VEFNPLAQESSKDKAEILEKNVNSI------AALIA-AGAMDIDEARDTL---RTIAP--EVKIND-GSVETEVTISETS 412 (422) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHHH------HHHHh-cCCCCHHHHHHHh---hhhcc--cccCCC-CCCccccchhhcC Confidence 8999999999999988766655432 22323 34 3344332211 11100 000000 0000001111111 Q ss_pred CCCCcccccCCCCC Q lcl|NC_021297. 461 TDTKTETQTSPSGF 474 (480) Q Consensus 461 ~d~~~~~~~~p~~~ 474 (480) .+....|+++ T Consensus 413 ----~~~~~~~~~d 422 (422) T protein:vir:10 413 ----NDPLEVPTDD 422 (422) T ss_pred ----CCCCCCCCCC Confidence 1112223333 No 115 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.33 E-value=1.1e-12 Score=86.11 Aligned_cols=424 Identities=13% Similarity=0.037 Sum_probs=179.9 Q ss_pred CCCHHHHHHHHHHHHHHH-HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCC--- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARD-LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE--- 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~--- 76 (480) |+.+..++..+-..+... ......+..+|... ....-++-.+...+.++++|||..++-+.-+|+.+.. T Consensus 100 ~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~-------~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d 172 (862) T protein:vir:99 100 MDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQ-------GFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGE 172 (862) T ss_pred hhcchhhhhhccccccccccccchhcccccccc-------CcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCc Confidence 122222222221111100 00000001111000 0001112234455678899999999999888887653 Q ss_pred ----CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcc--cccC--------CCcee-EEEEEccceeEEE Q lcl|NC_021297. 
77 ----DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDV--ESGD--------PAGIP-LIRVESPLYMYAE 141 (480) Q Consensus 77 ----d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~--~~~d--------~~~~~-~i~~~~p~~~~~v 141 (480) +++..+.+.+.|++-++...+.++.+.+-.||.+++++..... ...+ ..|.+ -|.+++|..+.|. T Consensus 173 ~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~ 252 (862) T protein:vir:99 173 GEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPM 252 (862) T ss_pred ccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhccc Confidence 2234566777888878888999999999999988766532110 0000 11121 2445555544432 Q ss_pred Ee---CCCCCeeEEEEEEEEEecCCceEEEEEE----EeCCcEEEEEecCCcccccccccccccccccccceEEeecccc Q lcl|NC_021297. 142 LD---PRNTRRVTRAVRLYTTRDDVAVPDRATL----YLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) Q Consensus 142 ~d---~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~ 214 (480) -. ..+... .+.+.+..+.+ +-+.++++|. .-|+..+. .. T Consensus 253 ~v~~~~~Dp~s-----------p~yGkP~~y~I~g~~IH~SRliif~---------------------g~~vpd~l--k~ 298 (862) T protein:vir:99 253 LTAESTADPSS-----------QFFYEPEFWIISGQKYHRSHLIIAR---------------------GPQPADIL--KP 298 (862) T ss_pred ccccccccccc-----------cccCCceeeeecCeeeccceeEEec---------------------CCCchhhh--hc Confidence 10 000000 01111111111 0011222110 00110000 01 Q ss_pred cCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc---cchhhhhhh--hhcccC-CCCc Q lcl|NC_021297. 215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE---NTTLDIYYG--RILTLA-SEAA 288 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~---~~~~~~~~~--~~~~~~-g~~~ 288 (480) ....+|.|.+.. +.+.+..++++.......+..+....+.+.+...-.. ++.- ...++...+ .++.++ +++. 
T Consensus 299 ay~f~G~SvLe~-iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~-ed~l~~r~~~~~~~rdN~Gi~liD~eEe~ 376 (862) T protein:vir:99 299 TYIFGGIPLVQR-IYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIAN-EDKFIQRLMFWVRYRDNHAVKVLGTDETM 376 (862) T ss_pred cCCccCccHHHH-HHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhcc-HHHHHHHHHHHHhccCcceeEEecCCCce Confidence 123579998864 6666667777766665555544444433333321110 1100 011122111 123332 2333 Q ss_pred eEEEeccccHHHHHHHHHHHHHHHHhccCCChHHh-ccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 289 KISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL-SSS-SENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM 366 (480) Q Consensus 289 ~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~-~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~ 366 (480) .+.+.+-+.+. +.+......|++.+++|..-| |.. .+-++||..=..-|...+.- ..+..+...|++++.+++ T Consensus 377 e~ls~slSGL~---dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s--~QE~~L~P~LerL~~li~ 451 (862) T protein:vir:99 377 EQFDTSLADFD---AVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELES--IQEHVYMPFLQRHYLISR 451 (862) T ss_pred eEEecccCChH---HHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Confidence 34444444444 456666778999999998854 533 23346676433334443332 234567888888877664 Q ss_pred HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH-------HHHhccccCCCHHHHHHhC------CCChhHHHHHHHHH Q lcl|NC_021297. 367 QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS-------KLYANGQGPIPKEQARIDL------GYTATQREQMRDWD 433 (480) Q Consensus 367 ~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~-------kl~~~~~~~~s~et~~~~l------g~~~~~~~~~~~~~ 433 (480) .-.+ .+ .++++.|.+-...+..+.|+... +++++ +.++.+.++..| |+..-+.+.+++.. 
T Consensus 452 ~~lg--~~---~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~s--GvispdEvR~~L~~~~~~g~~~l~ded~E~d~ 524 (862) T protein:vir:99 452 LSLG--IQ---HEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDG--GVISPDEERNRIRDDKRSGYNRLTKEDAEETP 524 (862) T ss_pred HhcC--CC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHHhcCCcCCCCCCcccccccC Confidence 3333 22 35889999988899888887644 34443 356666665543 33211111111000 Q ss_pred HHHHHHHHHHHhcccc-----------cCcc-------cccCCCCCCCCcccccCCCCCCCCC-------------CC Q lcl|NC_021297. 434 KQETEDMIDTLYSTTK-----------AQAD-------ATPKPTVTDTKTETQTSPSGFNRTK-------------TR 480 (480) Q Consensus 434 ~~~~~~~~~~~~~~~~-----------~~~~-------~~~~~~~~d~~~~~~~~p~~~~~~~-------------~~ 480 (480) -...+. .+....... +.++ +.+.+.++....++...+++...+. |+ T Consensus 525 ~~~~e~-~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~~~~~~~~~~ 601 (862) T protein:vir:99 525 GASPEN-LAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPEDDAPVAGVVAKL 601 (862) T ss_pred CCCccc-ccccccCCcccccccccccccccCCccccCCcccccccCCCCCCCccccccccccCCCccccccCcccccc Confidence 000000 000000000 0000 0000000000000000000000000 11 No 116 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.28 E-value=5.2e-11 Score=76.98 Aligned_cols=447 Identities=10% Similarity=0.015 Sum_probs=208.7 Q ss_pred CCCHHHHHHHHHHHHHH----HHHHH---HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhcc---- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLAR----DLPNL---LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI---- 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~----~~~r~---~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~---- 69 (480) |++..+++.+++..|.. |.... +++.+|-.. ++.+..+...-+ -.+++.+|-.--+++.+..+++. 
T Consensus 15 ~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~-~~tr~t~~~~~~--w~~s~t~~k~~~~~~~l~a~~~~~~fp 91 (599) T protein:vir:31 15 RDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDA-TDTRKTSNSKLP--FKNSTTINKLAHLHLMITTSYMEHLLP 91 (599) T ss_pred cCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhh-hcccccccCCCC--cccccchHHHHHHHHHHHHHHHhhhcC Confidence 88999997777776643 22222 455566433 223322221111 13455566666667776665532 Q ss_pred -------CccccCCCc-hhHHHH----HHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc-------cCCCceeEE Q lcl|NC_021297. 70 -------EGFRISEDS-EGLEEL----WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES-------GDPAGIPLI 130 (480) Q Consensus 70 -------~~~~~~~d~-~~~~~l----~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~-------~d~~~~~~i 130 (480) .|+...++. +..+.+ ++=+.+.+|...+..+..+.+.+|.||..+....... ...--.|++ T Consensus 92 ~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~~~~~P~~ 171 (599) T protein:vir:31 92 NRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSGTVT 171 (599) T ss_pred CccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeecccccccccccceE Confidence 122222111 112222 3334556889999999999999999998875222111 011234889 Q ss_pred EEEccceeEEEEeCCCCCeeEEEEEEEEEecCC------ce---------------EE---------------------- Q lcl|NC_021297. 131 RVESPLYMYAELDPRNTRRVTRAVRLYTTRDDV------AV---------------PD---------------------- 167 (480) Q Consensus 131 ~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~------~~---------------~~---------------------- 167 (480) ..++|.++|+=-+-.......+.+|.+.++.+- +. +. T Consensus 172 ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~~~~d 251 (599) T protein:vir:31 172 ERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDSLHKK 251 (599) T ss_pred EeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhhccccccc Confidence 999998876533222233344444443321100 00 00 Q ss_pred ----EEEEEeCCcE--E-----EEEecCCccc----------ccccccccccccccccceEEeecccccCccCCcccchh Q lcl|NC_021297. 
168 ----RATLYLPDET--V-----PLRRNGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 168 ----~~~~~~~~~~--~-----~~~~~~~~~~----------~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) ..++|.++-+ . .|...++... +.+...+..|.+.|..|++.....+...+.||.+.+.. T Consensus 252 ~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~~ 331 (599) T protein:vir:31 252 GYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIGPLHR 331 (599) T ss_pred cccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccCCCCCchh Confidence 0001111100 0 1111111110 00111233456677899999999999999999998865 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccC-CCCceEEE--eccccHHHHHH Q lcl|NC_021297. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA-SEAAKISE--FKAAELRNFAE 303 (480) Q Consensus 227 ~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~--~~~~~~~~~~~ 303 (480) +..+++.+|.+.-.+.+.+..+..|++...|.-.+. + +...++.+|-.. .++..+.. .+..+++..+. T Consensus 332 -~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~e--D------~~~~P~~v~~~~d~~~vq~~~p~s~~~~a~~~is 402 (599) T protein:vir:31 332 -LTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREK--G------MRGGPNHVFEVEETGDVQYMTPPAEVLQPDNQLS 402 (599) T ss_pred -cchHHHHHHHHHHHhhhhhhhhhccccccccccccc--C------ccCCCCcceeecCCCccccccCchhhhhHHHHHH Confidence 889999999998888888888888877766542111 1 112244444332 22222211 11223444445 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH----hCCC------ Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE-RAMRIAMQI----MGRE------ 372 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~----~~~~------ 372 (480) .++.. +-..+|+|....|..+.....+..+..+....-....++.+.|...+- .+++.++.+ .+.. 
T Consensus 403 ~~e~~---mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~ 479 (599) T protein:vir:31 403 ITLQL---MEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTF 479 (599) T ss_pred HHHHH---HHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeee Confidence 54444 445778999888865543334555666777777777777777777654 366544322 2210 Q ss_pred ----Ccccceeee-------EEecCCCCCCHHHHHHHHHHHHhc-----cccCCC---HHHHHHh---------CCCChh Q lcl|NC_021297. 373 ----VTEEYTRLE-------TVWRDPSTPTVAAKADAVSKLYAN-----GQGPIP---KEQARID---------LGYTAT 424 (480) Q Consensus 373 ----~~~~~~~i~-------v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s---~et~~~~---------lg~~~~ 424 (480) ..+.+.+|. +.+.+.-..-+.++++.+.++.+- |..+.+ ++-+... .++-+. T Consensus 480 ~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~~ 559 (599) T protein:vir:31 480 NSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFTF 559 (599) T ss_pred cccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCCC Confidence 001122221 122222223344555555444332 222222 2111111 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccc--cCcccccCCCCCCCCcccccC Q lcl|NC_021297. 425 QREQMRDWDKQETEDMIDTLYSTTK--AQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~d~~~~~~~~ 470 (480) .+ .++++|.+. ..++...+ +..+-. ++..+. 
++.++++ T Consensus 560 ~v----a~~eqq~~~--~m~Q~~lq~~~~~~~~-~~~~~~-~~~~~~~ 599 (599) T protein:vir:31 560 GI----GVQEDQQLA--RMAQKSTQQTEETALT-QEEVGG-PTTDTGQ 599 (599) T ss_pred ch----hHHHHHHHH--HHHHHHHHHhHhhhhh-hhhcCC-CCcccCC Confidence 00 001111111 11111000 000000 111111 1111222 No 117 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.23 E-value=1.5e-11 Score=79.98 Aligned_cols=451 Identities=12% Similarity=0.022 Sum_probs=175.8 Q ss_pred CCCHHHH---HHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc-----cCcc Q lcl|NC_021297. 1 MTTYHEH---VERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD-----IEGF 72 (480) Q Consensus 1 ~~~~~~~---i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~-----~~~~ 72 (480) -+.+.+| +....+.|...+.+...+.+||.|.-+.. +. ....+-+++..-....|+.....|. ++-+ T Consensus 26 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~---~~~grs~vv~~~v~~~ve~~~~~l~~~f~~~~~~ 100 (763) T protein:vir:95 26 ELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK--PP---KVKGRSQVQPKLVRRQAEWRYSALTEPFLGSNKL 100 (763) T ss_pred hHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc--cc---ccCCCccccCHHHHHHHHHHHHHHHHhhcCCCcE Confidence 1222223 33333334444555555656544432221 11 1112334555656666665444331 1111 Q ss_pred -c----cCCCchhHHHHHH-----HHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccc---------------------- Q lcl|NC_021297. 73 -R----ISEDSEGLEELWN-----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE---------------------- 120 (480) Q Consensus 73 -~----~~~d~~~~~~l~~-----i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~---------------------- 120 (480) . ..+|.+..+..+. ++..|+-......++++++++|.+++.||..... 
T Consensus 101 ~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (763) T protein:vir:95 101 FKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQAD 180 (763) T ss_pred EEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHH Confidence 1 2345544444443 3445666667789999999999999998753100 Q ss_pred ----------------------c-------cC---------------------CCceeEEEEEccceeEEEEeCCCCC-- Q lcl|NC_021297. 121 ----------------------S-------GD---------------------PAGIPLIRVESPLYMYAELDPRNTR-- 148 (480) Q Consensus 121 ----------------------~-------~d---------------------~~~~~~i~~~~p~~~~~v~d~~~~~-- 148 (480) . .+ ..++|+|..++|.++++ |+.... T Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~i--Dp~a~sD~ 258 (763) T protein:vir:95 181 ALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIII--DPSCQGDI 258 (763) T ss_pred HHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHhee--cCCCCCch Confidence 0 00 12567888899999874 443221 Q ss_pred -eeEEEE-EEEEEecC----C-------------------------------------ceEEEEEEEeCCcEEEEEecCC Q lcl|NC_021297. 149 -RVTRAV-RLYTTRDD----V-------------------------------------AVPDRATLYLPDETVPLRRNGG 185 (480) Q Consensus 149 -~~~~~~-~~~~~~~~----~-------------------------------------~~~~~~~~~~~~~~~~~~~~~~ 185 (480) ...+++ +++.+..+ + ..+...++|.. +...+. T Consensus 259 ~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~-----~d~~gd 333 (763) T protein:vir:95 259 NKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGF-----WDIEGN 333 (763) T ss_pred hhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeee-----eccCCc Confidence 112221 22211100 0 00011122211 001111 Q ss_pred ccccc---------ccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhh Q lcl|NC_021297. 
186 LNDQW---------VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI 256 (480) Q Consensus 186 ~~~~~---------~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i 256 (480) +...+ +...+..|...+.+|+++|+..+...++||.|.+. .++++++.+|..++.+.+++...++|...+ T Consensus 334 g~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~-~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v 412 (763) T protein:vir:95 334 GVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAE-LLGDNQAVLGAVMRGMIDLLGRSANGQRGM 412 (763) T ss_pred ceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHH-HhhHHHHHHHHHHHHHHHHHHhhcCCcEEe Confidence 11010 11122334455789999999888889999999885 599999999999999999998888875433 Q ss_pred c-CCCccccccccccchhhhhhhhhcc-cCCCCce----EEEecc--ccHHHHHHHHHHHHHHHHhccCCChHHhccccc Q lcl|NC_021297. 257 S-GVTTDELTNDGENTTLDIYYGRILT-LASEAAK----ISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSE 328 (480) Q Consensus 257 ~-g~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~----~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~ 328 (480) - |.- . ..+. +...++.++. .+|..+. ...++. ......+.+++..+. ..+|++....|.+.. T Consensus 413 ~~gav-~-~~d~-----~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e---~~TGv~~~~~G~~~~ 482 (763) T protein:vir:95 413 PKGML-D-ALNS-----RRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAE---SLTGVKAFAGGVTGE 482 (763) T ss_pred ecccc-c-chhh-----hcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHH---HhhCcchhhcCcCcc Confidence 2 221 1 1110 1122222222 1222211 122222 123344455544444 455555555443221 Q ss_pred --c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhCCCCc-----cc-----------ceeeeEEec Q lcl|NC_021297. 329 --N-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM----QIMGREVT-----EE-----------YTRLETVWR 385 (480) Q Consensus 329 --n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~----~~~~~~~~-----~~-----------~~~i~v~f~ 385 (480) | .++|++.. ...-........+.|..+++.+++.++ .+.+.... .+ ..++.+.-. 
T Consensus 483 ~~~~tat~v~~l--~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~ 560 (763) T protein:vir:95 483 SYGDVAAGIRGV--LDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS 560 (763) T ss_pred cccchhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc Confidence 1 23444332 222222333344555556655555444 33322100 00 112222221 Q ss_pred CCCCCCHH-HHHHHHHHHHhccccCCCHHH----HHHhCCCC------h---------hHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_021297. 386 DPSTPTVA-AKADAVSKLYANGQGPIPKEQ----ARIDLGYT------A---------TQREQ-MRDWDKQETEDMIDTL 444 (480) Q Consensus 386 ~~~~~~~~-~~ad~~~kl~~~~~~~~s~et----~~~~lg~~------~---------~~~~~-~~~~~~~~~~~~~~~~ 444 (480) +.+.. +.+..+..+.+.....++... +.....+. . ++++. +.+....+.+...... T Consensus 561 ---~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~ 637 (763) T protein:vir:95 561 ---TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEEL 637 (763) T ss_pred ---cchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHH Confidence 22221 223333333332111112111 11110110 0 00000 0000000000000000 Q ss_pred hcccc-cCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 
445 YSTTK-AQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 445 ~~~~~-~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ....+ ..+.+.......+ ....+..-.-..-.+.+ T Consensus 638 ~akaq~~qaqa~~~~aq~e-~~~~d~~~~e~~~Q~~~ 673 (763) T protein:vir:95 638 RSKIRLNDAQAQKAMAERD-NKNLDYLEQESGTKHAR 673 (763) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 00000 0000000000000 00000000000000000 No 118 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.23 E-value=3.9e-11 Score=77.63 Aligned_cols=426 Identities=15% Similarity=0.114 Sum_probs=185.1 Q ss_pred CCCHH--HHHHHHHHHHHH---HHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC Q lcl|NC_021297. 1 MTTYH--EHVERLQGLLAR---DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS 75 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~---~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~ 75 (480) |+... -....+...... ...+.. +..||....-+- .++-.....+.++++|||..+.-+.-+|+.+. T Consensus 71 ~ds~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~f~g-------yql~alY~~~~l~rkiVd~pAeDa~R~g~~I~ 142 (765) T protein:vir:96 71 MDSAYGDGPTPAAKAAAGGQNPYVVPTM-LQDWYNSQGFIG-------YQACAIISQHWLVDKACSMSGEDAARNGWELK 142 (765) T ss_pred ccccccccccchHHHhhhccCccchhhH-HHhhhcccCCcc-------HHHHHHHHhCchhhhhhhcchHHhhcCCceee Confidence 65542 123333322211 111212 233433221111 12334455677889999999998888888764 Q ss_pred CCc-----hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccc--ccC--------CCcee-EEEEEccceeE Q lcl|NC_021297. 76 EDS-----EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE--SGD--------PAGIP-LIRVESPLYMY 139 (480) Q Consensus 76 ~d~-----~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~--~~d--------~~~~~-~i~~~~p~~~~ 139 (480) .++ +..+.+...|++-++...+.++.+.+-.||.+|+++...... ..+ ..|.+ .|.+++|..+. 
T Consensus 143 ~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~ 222 (765) T protein:vir:96 143 SDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAM 222 (765) T ss_pred cCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhcc Confidence 432 223567777777788899999999999999998876432110 000 01111 23344444333 Q ss_pred EEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecc------c Q lcl|NC_021297. 140 AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND------P 213 (480) Q Consensus 140 ~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~------~ 213 (480) |.-.. .+. .+ +..-.+|.+..+. + .+. .+ |+ .+ |+.|.+. . T Consensus 223 ~~~v~-----------e~~-~D----p~sp~fg~P~~y~-i--~g~----------~I-H~-SR--li~~~g~~lpd~lk 269 (765) T protein:vir:96 223 PQLTA-----------EST-AD----PSAEHFYEPDFWI-I--SGK----------KY-HR-SH--LVVVRGPQPPDILK 269 (765) T ss_pred cccch-----------hcc-cc----ccccccCcceeee-e--cCc----------ee-cc-ce--EEEecCCCchhhhc Confidence 31100 000 00 0000111111110 0 000 00 21 11 1112111 1 Q ss_pred ccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhh--hhhcccCCCCce Q lcl|NC_021297. 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYY--GRILTLASEAAK 289 (480) Q Consensus 214 ~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~--~~~~~~~~~~--~~~~~~~g~~~~ 289 (480) .....+|.|.+.. +..-+..++++.......+..+....+.+.+...-...+. .....+.... ..++.++. 
+.+ T Consensus 270 ~~~~~~G~Svlq~-~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~-ee~ 347 (765) T protein:vir:96 270 PTYIFGGIPLTQR-IYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGI-DET 347 (765) T ss_pred cccCccCccHHHH-HHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecC-Ccc Confidence 2233579998865 5666666777665555555444444333322211000000 0001111111 11222322 233 Q ss_pred EEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 290 ISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS--ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ 367 (480) Q Consensus 290 ~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~--~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~ 367 (480) +-+++. ++....+.+......|+..+++|..-|-+.+ +-++||..-..-|...+. ...+..+...|++++.+++. T Consensus 348 ~e~~s~-~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~--s~Qe~~l~p~le~L~~li~~ 424 (765) T protein:vir:96 348 MEQFDT-NLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELE--SIQEHIFDPLLERHYLLLAK 424 (765) T ss_pred eeEEec-ccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH Confidence 433332 2334445566678899999999985553332 223567643333444333 23346678899999998775 Q ss_pred HhCCCCcccceeeeEEecCCCCCCHHHHHHHHHH-------HHhccccCCCHHHHHHhC------CCC---hhHHHHHHH Q lcl|NC_021297. 368 IMGREVTEEYTRLETVWRDPSTPTVAAKADAVSK-------LYANGQGPIPKEQARIDL------GYT---ATQREQMRD 431 (480) Q Consensus 368 ~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~k-------l~~~~~~~~s~et~~~~l------g~~---~~~~~~~~~ 431 (480) ..+ .+ .++++.|.+-...+..+.|+...+ ++++ +.++...+++.| |+. +++++.... 
T Consensus 425 s~~--i~---~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~--Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~ 497 (765) T protein:vir:96 425 SES--ID---VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINS--GVVSPDEVRERLRDDPRSGYNRLTDDQAETEPG 497 (765) T ss_pred hcC--CC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhccccCCCCCCCccccccccC Confidence 422 22 258899999988888888775443 3433 356776666654 222 111111000 Q ss_pred HHHHHHHHHHHHHhc--------ccccCcccccCCCCCCCCc--ccc------cCCCCCCCCC---CC Q lcl|NC_021297. 432 WDKQETEDMIDTLYS--------TTKAQADATPKPTVTDTKT--ETQ------TSPSGFNRTK---TR 480 (480) Q Consensus 432 ~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~d~~~--~~~------~~p~~~~~~~---~~ 480 (480) +..+..++ .+.... .....++++..++.++..+ ... ....|++.++ +| T Consensus 498 ~~pe~~~~-~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~ 564 (765) T protein:vir:96 498 MSPENLAE-LEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSR 564 (765) T ss_pred CCcccccc-ccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCcccc Confidence 00000000 000000 0000000000000000000 000 0000111111 01 No 119 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.06 E-value=3.5e-09 Score=66.93 Aligned_cols=381 Identities=13% Similarity=0.123 Sum_probs=160.8 Q ss_pred hhhhccccchHHHHHHHHHhhhccCccccC-----C-Cc---hhHHHHHHHHHh---c-----------CHHHHHHHHHH Q lcl|NC_021297. 46 LAYLDVQPGWVATYLRTLSDRLDIEGFRIS-----E-DS---EGLEELWNWWQA---N-----------DLDEESVLGHD 102 (480) Q Consensus 46 ~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-----~-d~---~~~~~l~~i~~~---n-----------~~~~~~~~~~~ 102 (480) ++++--...+...+|+..++.+.+-|+.+- . .. ...+.+.+++.. | .+..+...+.. 
T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 333333346777788877777766665431 0 11 112233333321 1 23456677888 Q ss_pred HHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCc----- Q lcl|NC_021297. 103 DSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDE----- 176 (480) Q Consensus 103 ~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 176 (480) +...+|.||+.+-.+ ..|.+ .+..++|..+.+..|... +.... +.. ..++.+|.+.. T Consensus 81 ~l~l~Gn~~i~~~r~------~~G~~~~l~~l~~~~v~~~~d~~~---------~~~~~-~~~-~~~~~~~~~~~~~~~~ 143 (467) T protein:vir:31 81 DYEAIGWLTIEILTQ------TDGTPTGLAYVPGHTIRKRMDERG---------FVQLL-EEK-EKYFGVAGDRYQTNGN 143 (467) T ss_pred HHHhcCCeEEEEEEC------CCCcEEEEEEeCCceeEeeeecce---------eEeec-CCc-eeeEEeccccceeecc Confidence 999999999987653 23444 578889988887765421 10000 110 11111111111 Q ss_pred --EEE-EEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc--- Q lcl|NC_021297. 177 --TVP-LRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG--- 250 (480) Q Consensus 177 --~~~-~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~--- 250 (480) ..+ +...... .......++.==|+||++.....+.+|.|.+.- +.+++.....-......+|. 
T Consensus 144 ~~~~~~~~~~~~~-------~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~----~~~~i~~~~~~~~~~~~~f~ng~ 212 (467) T protein:vir:31 144 GDLDPVFVDADDG-------STGTSVSNPANELIFKRNHSPLYPHYGAPDIIP----AVKTIRGDSAAQDYNIDFFENDG 212 (467) T ss_pred cceeeeeeeeccc-------cccceeEeccccEEEecCCCCCCCcccccHHHH----HHHHHHHHHHHHHHHHHHHhccC Confidence 000 0000000 000001112223567776555567789887643 33343322222222223333 Q ss_pred chhhhh--cCCCccccccccccchhhhh-----------------hhhhcccC-CCC-----ceEEEecccc--HHHHHH Q lcl|NC_021297. 251 TPLRVI--SGVTTDELTNDGENTTLDIY-----------------YGRILTLA-SEA-----AKISEFKAAE--LRNFAE 303 (480) Q Consensus 251 ~~~~~i--~g~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~-g~~-----~~~~~~~~~~--~~~~~~ 303 (480) .|..++ +|........+.-...|+-. .+....+. |.+ .++..++... -..|++ T Consensus 213 ~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e 292 (467) T protein:vir:31 213 VPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLE 292 (467) T ss_pred CCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHH Confidence 233333 33222111110000111100 01111121 111 2222222211 235788 Q ss_pred HHHHHHHHHHhccCCChHHhccccc-cchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~~l~~k~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~ 381 (480) ..+..+.+|+...++|+..+|.... +..|. 
+......+...+ .=..+.+...|.+ .+...........++ T Consensus 293 ~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~--~e~~~~~f~~~~l~P~~~~ie~~ln~------~l~~~~~~~~~~~i~ 364 (467) T protein:vir:31 293 FRGRNEHDILKVHDVPPVIAGVVESGAFSTD--AEEQRKEFAEETIQPKQHDFGELLYE------LVHKQGLDAPDWTIE 364 (467) T ss_pred HHHHHHHHHHHHhCCCHHHcccCCCCCcccC--HHHHHHHHHHHHHHHHHHHHHHHHHH------hhcchhhccCCceEE Confidence 8888899999999999998874322 21121 111111111111 1111111111111 111111112223567 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......+..+.++++.+++.+| +++...+++.+|+.+-.-+.+.. .........+ +..+. T Consensus 365 f~~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~Gl~pi~d~~~~~-----~~~~~~~~~~------~~~~~---- 427 (467) T protein:vir:31 365 FELAKPDTKLQDVEIASQRVQAMQG--LLTVNELRDEFGFEPFPEEHVYG-----GETLVAEVTG------GSGPG---- 427 (467) T ss_pred EecchhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCcccccC-----Cccccccccc------ccCCC---- Confidence 7777788899999999998888765 67788888888875421110000 0000000000 00000 Q ss_pred CCCcccccCCCCCCCC-------CCC Q lcl|NC_021297. 462 DTKTETQTSPSGFNRT-------KTR 480 (480) Q Consensus 462 d~~~~~~~~p~~~~~~-------~~~ 480 (480) ++.+.+..+.+.+.+ +++ T Consensus 428 -~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (467) T protein:vir:31 428 -GGIGDQIEQLVEDRADEIIDSYQAD 452 (467) T ss_pred -CcccCcCCCCCCCcccchHhhhhhc Confidence 000001111111111 111 No 120 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.04 E-value=4e-09 Score=66.64 Aligned_cols=413 Identities=8% Similarity=0.045 Sum_probs=148.3 Q ss_pred CCCHHHHHHHHHHHHHHH-----HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc- Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARD-----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI- 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~- 74 (480) +..+..-....-..+..+ ...+..+.+-| +++++....+.. +++. +.-++..+-.. ..+.||.+ T Consensus 54 ~~~~~~~~~~~~~~~~~r~~~~~~~~l~~~~~~~-~~npiv~~~I~~---ia~~--IA~~~~~~~~~----~~g~~~~i~ 123 (551) T protein:vir:80 54 YSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKF-GGNIILNAIINT---RSNQ--VSMYCKPARHS----EKGVGFEVR 123 (551) T ss_pred eecccccceecCcccccCccccChhHHHHHHHHh-hcCHHHHHHHHH---HHHH--Hhhhhhhhhhh----cCCCCceEE Confidence 111110000000000000 00111111111 112332111100 0000 00011000000 00111211 Q ss_pred ---------CCCchhHHHHHHHHHhc---------CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEcc Q lcl|NC_021297. 75 ---------SEDSEGLEELWNWWQAN---------DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESP 135 (480) Q Consensus 75 ---------~~d~~~~~~l~~i~~~n---------~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p 135 (480) ..+....+.+.+++.+- .+..+...+..+.+.+|.+|+.+-.+ ..|+| .+..++| T Consensus 124 ~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd------~~G~~~~L~~l~p 197 (551) T protein:vir:80 124 LKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFN------RNQSMVRFVAKDP 197 (551) T ss_pred ecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEEC------CCCcEEEEEEeCC Confidence 01111122344443321 23456667888899999998877543 34555 4788999 Q ss_pred ceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccc- Q lcl|NC_021297. 136 LYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR- 214 (480) Q Consensus 136 ~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~- 214 (480) ..+.++.++.. ......++++... +++. ...|.+++ |+||..++. 
T Consensus 198 ~~V~v~~~~~g-~~~~~~~~y~~~~-~g~~---~~~~~~~e-----------------------------iiH~~~n~~~ 243 (551) T protein:vir:80 198 TTIFFATTADG-KIPDNGNRFVQVI-DQKI---VATFNARE-----------------------------MAFAVRNPRS 243 (551) T ss_pred ceeEEEECCcc-ccccCceEEEEEe-CCcE---EEEEcccc-----------------------------eEEecccCCC Confidence 99988776532 1111111121111 1111 01122223 334433221 Q ss_pred --cCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhh--cCCC-ccccccccccchhhh------hhhhh Q lcl|NC_021297. 215 --LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVI--SGVT-TDELTNDGENTTLDI------YYGRI 280 (480) Q Consensus 215 --~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i--~g~~-~~~~~~~~~~~~~~~------~~~~~ 280 (480) ..+.+|.|.|. .+.+++...+.-......+|. .|..+| .|.. ......+.-...|.. ..+++ T Consensus 244 ~~~~~~~G~spi~----~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~ 319 (551) T protein:vir:80 244 DIYATGYGYPELE----IALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQI 319 (551) T ss_pred CcccccccccHHH----HHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcc Confidence 22457887653 333444333333333333443 344333 2211 111100000111211 12334 Q ss_pred cccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHH--------HHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 281 LTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA--------EAIIATDSRIVKMAERKG 351 (480) Q Consensus 281 ~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg--------~Al~~~~~~l~~k~~~~~ 351 (480) ..+.+++.++.++.... -..|++..+..+.+|+...++|+..+|....+.+.+ ..+...... 
T Consensus 320 ~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~--------- 390 (551) T protein:vir:80 320 PVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQA--------- 390 (551) T ss_pred ccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHH--------- Confidence 44445667887775332 224889899999999999999999998533221111 011111111 Q ss_pred HHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChh-H-HHH Q lcl|NC_021297. 352 RIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTAT-Q-REQ 428 (480) Q Consensus 352 ~~~~~~l~~~~~~~~~~~~~~-~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~-~-~~~ 428 (480) .+...|.-++..+....... .......+.+.|......+..+.+. +.++..+ ++++...+++.+|+.+. + -+. T Consensus 391 -f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~f~f~~~~~~~~~~~~~-~~~~~~~--g~lT~NE~R~~~gl~P~~egGD~ 466 (551) T protein:vir:80 391 -SKNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVK-ILAEKAK--VAMTVNEVRKELNLPGDVIGGDI 466 (551) T ss_pred -HHHHHHHHHHHHHHHHHHhhhccccCCceEEEeeccChhhHHHHHH-HHHHHhc--CCcCHHHHHHHhCCCCCCCCCce Confidence 11111211111111000000 0001124667787666666666654 3445544 35778888888887541 1 011 Q ss_pred H---------H-HHHHH--HHHHHHHHHhcccc-cCcccccCCCCCCCCcccccC-------CCCCCCCCCC Q lcl|NC_021297. 429 M---------R-DWDKQ--ETEDMIDTLYSTTK-AQADATPKPTVTDTKTETQTS-------PSGFNRTKTR 480 (480) Q Consensus 429 ~---------~-~~~~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~d~~~~~~~~-------p~~~~~~~~~ 480 (480) + . ...++ +.+...+......+ .+.+..+++......+++++. 
.+..+.-+++ T Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 538 (551) T protein:vir:80 467 PLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGK 538 (551) T ss_pred eecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCccccchhh Confidence 0 0 00000 00000001000000 000011111100000011100 0011111111 No 121 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.02 E-value=5.1e-09 Score=66.02 Aligned_cols=404 Identities=9% Similarity=0.055 Sum_probs=149.0 Q ss_pred CCCHHHHHHHHHHHHHHH-----HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhh-------c Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARD-----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-------D 68 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l-------~ 68 (480) ++.+.--......-+..+ ...+..+.+-|. .+++....+. .+++..+.++ . T Consensus 50 ~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~~-~npiv~~~I~----------------~~a~~ia~~~~~~~~~~~ 112 (547) T protein:vir:63 50 YSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFG-GNIILNAIIN----------------TRSNQVSMYCKPARHSEK 112 (547) T ss_pred hhchhhheeecccccccCCccCChhHHHHHHHHhh-cCHHHHHHHH----------------HHHHHHhhhhhhhhhhcc Confidence 111110000000011100 000111111121 1223221110 1111111110 0 Q ss_pred cCccc--c--------CCCchhHHHHHHHHHhc---------CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee- Q lcl|NC_021297. 69 IEGFR--I--------SEDSEGLEELWNWWQAN---------DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP- 128 (480) Q Consensus 69 ~~~~~--~--------~~d~~~~~~l~~i~~~n---------~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~- 128 (480) +.||. 
+ +.+......+.+++.+- .+..+...+..+.+.+|.+|+.+..+ .+|++ T Consensus 113 ~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd------~~G~~~ 186 (547) T protein:vir:63 113 GVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFN------RNQSMV 186 (547) T ss_pred CCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEEEC------CCCcEE Confidence 11111 0 11111122334433321 23456667888899999999887643 34554 Q ss_pred EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEE Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVP 208 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~ 208 (480) .+..++|..+.++.++.. ......++++... +++. ...|.+++ |+| T Consensus 187 ~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~-~~~~---~~~~~~~e-----------------------------iih 232 (547) T protein:vir:63 187 RFVAKDPTTIFFATTADG-KIPDNGNRFVQVI-DQKI---VATFNARE-----------------------------MAF 232 (547) T ss_pred EEEEecCceeEEEECCcc-ccccCceEEEEEc-CCcE---EEEecccc-----------------------------EEE Confidence 478899998888765432 1111111111111 1110 01122222 334 Q ss_pred eecccc---cCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcc---hhhhh--cCCC-ccccccccccchhhh---- Q lcl|NC_021297. 209 LTNDPR---LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGT---PLRVI--SGVT-TDELTNDGENTTLDI---- 275 (480) Q Consensus 209 ~~n~~~---~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~---~~~~i--~g~~-~~~~~~~~~~~~~~~---- 275 (480) |+.++. ..+.+|.|.|. .+.+++...+.-......+|.+ |.-+| .|.. 
......+.-...|.- T Consensus 233 ~r~n~~~~~~~~~~G~Spi~----~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G 308 (547) T protein:vir:63 233 AVRNPRSDIYATGYGYPELE----IALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSG 308 (547) T ss_pred ecccCCCCcccccccccHHH----HHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcC Confidence 433322 22457887663 3344443333333333334443 33222 2311 111000000111211 Q ss_pred --hhhhhcccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHH--------HHHHHHHHHHH Q lcl|NC_021297. 276 --YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA--------EAIIATDSRIV 344 (480) Q Consensus 276 --~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg--------~Al~~~~~~l~ 344 (480) ..+++..+.+++.++.++.... --.|++..+..+.+|+...++|++.+|....+.+++ ..+......+ T Consensus 309 ~~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~- 387 (547) T protein:vir:63 309 INGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQAS- 387 (547) T ss_pred cccccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHH- Confidence 1223444445667887776432 224888899999999999999999998543321111 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCC-cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCh Q lcl|NC_021297. 345 KMAERKGRIFGGAWERAMRIAMQIMGREV-TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTA 423 (480) Q Consensus 345 ~k~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~ 423 (480) +...|.-++..+....+... ......+.+.|......+..+.+. 
+.++..+| +++...+++++|+.+ T Consensus 388 ---------~~~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~~~~-~~~~~~~g--~lT~NE~R~~~gl~P 455 (547) T protein:vir:63 388 ---------KNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVK-ILAEKAKV--AMTVNEVRKELNLPG 455 (547) T ss_pred ---------HHHHHHHHHHHHHHHHHhhcccccCCceEEEeeccccccHHHHHH-HHHHHhCC--CcCHHHHHHHhCCCC Confidence 11112211111111000000 000124667787776777666654 44555544 577777888887754 Q ss_pred h-H-HHHHH----------HHHHHH-----HHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 424 T-Q-REQMR----------DWDKQE-----TEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 424 ~-~-~~~~~----------~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) . + -+.+. ...+++ .++..+...... +..+..+.+..++ ..++++....++...++ T Consensus 456 ~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~~d~~~~~~ 527 (547) T protein:vir:63 456 DVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQT-GNRVSTDVEDIPD-GKDTTGDIGKDGQRKDK 527 (547) T ss_pred CCCCCceeecccccccccccccccCCccccchhhcccccccc-CCCCCCCCCCCCC-CcccCCCcCccccccCc Confidence 2 1 00000 000000 000011111100 0001111111111 01111111111111111 No 122 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.97 E-value=9.1e-09 Score=64.66 Aligned_cols=449 Identities=12% Similarity=0.036 Sum_probs=191.5 Q ss_pred CCCH-HHHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcCc----cc-chhhhhhccccchHHHHHHHHHhhhc----c Q lcl|NC_021297. 1 MTTY-HEHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGI----GA-PPELAYLDVQPGWVATYLRTLSDRLD----I 69 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~~----~~-~~~~~~~~~~~n~~~~iVd~~~~~l~----~ 69 (480) |.+. .+.+.+..+.+...+ +....++++|+ +.+++.+. .. ....++.++..+-+...|+++++.|. 
+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltp 78 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSD--YINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 7763 444455555554433 33444455553 22222211 11 11122345556667777887777663 2 Q ss_pred --Ccc-c--cCCCc-----hh-------HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEE Q lcl|NC_021297. 70 --EGF-R--ISEDS-----EG-------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 70 --~~~-~--~~~d~-----~~-------~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~ 132 (480) .+| + .++++ +. .+.+.+.+..++|.....++.++..++|.|-+++.. +..+-++++. T Consensus 79 p~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~------d~~~~~r~~~ 152 (559) T protein:vir:95 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLD------DDEDIIRTMP 152 (559) T ss_pred CCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeec------CCCceeEEEE Confidence 122 2 22211 11 123445667789999999999999999999877653 2234578889 Q ss_pred EccceeEEEEeCCCCCeeEEEEEEEEEe---------------------cCCceEEEEEEEe----CCcE---------- Q lcl|NC_021297. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTR---------------------DDVAVPDRATLYL----PDET---------- 177 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~----~~~~---------- 177 (480) ++..++++.-|.. +++...++.+.-. +.+....+++++. .... T Consensus 153 ~~l~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 230 (559) T protein:vir:95 153 FPIGSYYLANSPR--GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) T ss_pred eecCeEEEeeCCC--CCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccc Confidence 9999988877753 3444444332110 0001011233221 1000 Q ss_pred ----EEEEecCCcccccccccccccccccccceEEeecccccCccCCccc-chhhHHHHHHHHHHHHHHHHHHHHHhcch Q lcl|NC_021297. 
178 ----VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE-ISPELRKVTDAASRTLMNLQSASQILGTP 252 (480) Q Consensus 178 ----~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~-i~~~v~~liD~~~~~~s~~~~~~~~~~~~ 252 (480) ++|...+.. .. . ..+.+|..+|++++..+...++.||+|- .. +..+-+..+|.+.-..+..++...+| T Consensus 231 pf~s~~~e~~~~~--~~-~---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~-~al~d~k~L~~l~~~~l~~~~~~~~p 303 (559) T protein:vir:95 231 PFKSVYYEVGGDN--DK-L---LRESGFDEFPIMAPRWEVNGEDVYGSSCPGM-LALGPVKALQLLQKRKSQLIDKATNP 303 (559) T ss_pred eEEEEEEEecCCC--ce-e---eecCCcccCCccceeeeecCCccccccchHH-HhhHHHHHHHHHHHHHHHHHHHHhcC Confidence 111111110 00 0 1234567789998888888899999994 53 45677888888888888888888888 Q ss_pred hhhhcCCCccccccccccchhhhhhhhhcccC--CCCceE--EEeccccHHH---HHHHHHHHHHHHHhccCCChHHhcc Q lcl|NC_021297. 253 LRVISGVTTDELTNDGENTTLDIYYGRILTLA--SEAAKI--SEFKAAELRN---FAEEMEVFRKEAASITGLPPQYLSS 325 (480) Q Consensus 253 ~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~~--~~~~~~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~ 325 (480) .+.+..- .....++..++.+.... ++...+ ......++.. -++.++.-|...+.... ...+.. T Consensus 304 p~~v~~~--------~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~--~~~l~~ 373 (559) T protein:vir:95 304 PMVAPTS--------LKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDL--FMMLQN 373 (559) T ss_pred ceecccc--------ccccceeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhh--HHHhhc Confidence 6554321 11112333444333221 111112 1111122322 23334444433332211 011211 Q ss_pred -ccccchHHHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhCC--CC--cccceeeeEEecCCCCCCH--- Q lcl|NC_021297. 326 -SSENPASAEAIIATDSRIVKM----AERKGRI-FGGAWERAMRIAMQIMGR--EV--TEEYTRLETVWRDPSTPTV--- 392 (480) Q Consensus 326 -~~~n~~Sg~Al~~~~~~l~~k----~~~~~~~-~~~~l~~~~~~~~~~~~~--~~--~~~~~~i~v~f~~~~~~~~--- 392 (480) +..+. ++.-++.....+... ..+.... +..-+.+.+.++.+. |. .. ......++|.|..++..-. 
T Consensus 374 r~~~rv-TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~l~~~~i~v~~is~La~aqk~~ 451 (559) T protein:vir:95 374 INTRSM-PVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRK-NMLPPPPDVMEGMPLKVEYISVMAQAQKSI 451 (559) T ss_pred CCCCCC-CHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCcccccCcceEEEeecHHHHHHHHH Confidence 22221 333333322222111 2222222 333345555554432 11 01 1122457777755443211 Q ss_pred -----HHHHHHHHHHHhcccc---CCCHHHH----HHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhc--ccccCc Q lcl|NC_021297. 393 -----AAKADAVSKLYANGQG---PIPKEQA----RIDLGYT------ATQREQMRDWDKQETEDMIDTLYS--TTKAQA 452 (480) Q Consensus 393 -----~~~ad~~~kl~~~~~~---~~s~et~----~~~lg~~------~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 452 (480) ...++.+..+.+.++. .+....+ ...+|++ +++++++.+.+.++++........ +.+... T Consensus 452 ~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~ 531 (559) T protein:vir:95 452 GLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVK 531 (559) T ss_pred HHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 1111222333333221 1222222 2335653 444455443333322221111111 111100 Q ss_pred ccccCCCCCCCCcccccCC--CCCCCCCCC Q lcl|NC_021297. 453 DATPKPTVTDTKTETQTSP--SGFNRTKTR 480 (480) Q Consensus 453 ~~~~~~~~~d~~~~~~~~p--~~~~~~~~~ 480 (480) .-++ .....+...++.. .+-++++.- T Consensus 532 ~~~~--~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 532 TLSE--AKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred cccc--ccCCChhHHHHHHHhhcCccccCC Confidence 0001 1111111111111 111222222 No 123 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.96 E-value=9.3e-09 Score=64.61 Aligned_cols=462 Identities=12% Similarity=0.006 Sum_probs=175.7 Q ss_pred CCCHHHHHHHHHHHH---HHHH----HHHHHHHHHhcccCcchh-c------CcccchhhhhhccccchHHHHHHHHHhh Q lcl|NC_021297. 
1 MTTYHEHVERLQGLL---ARDL----PNLLEAEAYRNGTRRLKT-I------GIGAPPELAYLDVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~~~~~~i~~l~~~~---~~~~----~r~~~~~~YY~g~~~i~~-~------~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 66 (480) |++. ++...|.+.+ ...+ .+.+.+.+||........ + .......-.+.++..+.+.-.|+.++.. T Consensus 20 ~~~~-~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~ 98 (641) T protein:vir:94 20 LSTD-RIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAY 98 (641) T ss_pred CCch-hHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhH Confidence 5543 3444444433 3322 234555666654332221 0 0000011123477888888888877765 Q ss_pred hccC----c--ccc----CCCchhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcc------------- Q lcl|NC_021297. 67 LDIE----G--FRI----SEDSEGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDV------------- 119 (480) Q Consensus 67 l~~~----~--~~~----~~d~~~~~~----l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~------------- 119 (480) |... . |.+ .+|.+..+. +...+.++++.....+..++++.+|.+++.++.... T Consensus 99 Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~ 178 (641) T protein:vir:94 99 FKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVETG 178 (641) T ss_pred HhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhcccch Confidence 5321 1 211 233333333 334455678888888999999999999988863210 Q ss_pred --cc-------cCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe----------------------cCCc---- Q lcl|NC_021297. 120 --ES-------GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR----------------------DDVA---- 164 (480) Q Consensus 120 --~~-------~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~----------------------~~~~---- 164 (480) .. ......+++..++|.+++ +|+..+..-..++++.... .+.. 
T Consensus 179 ~~~~~~~~~~v~~~~~~~r~~~v~~~di~--~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~ 256 (641) T protein:vir:94 179 DIFGGWEDVAVNRQRSELRIEPLSPYDVW--LDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADP 256 (641) T ss_pred hhcccccccceecccceeeEEecchhhee--ecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhccccccccc Confidence 00 011344566677777654 4443322111111111000 0000 Q ss_pred ------------eEEEEEEEe----CCc-EEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhh Q lcl|NC_021297. 165 ------------VPDRATLYL----PDE-TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 165 ------------~~~~~~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) .+..+++|. ++. ...|.....+ . .......-+.|..+|++.+...+..++.||+|... . T Consensus 257 d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g--~-~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~-~ 332 (641) T protein:vir:94 257 DTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYG--K-QLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLH-P 332 (641) T ss_pred ccccccccccccccceeeeeeeeccCCCceeeEEEEEeC--C-EEeecccccccCcCCeEEecceecCCcccCCChHH-H Confidence 001111121 010 0000000000 0 01111111246678999999998999999999774 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCC-CceEEEeccccHH---HHHH Q lcl|NC_021297. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASE-AAKISEFKAAELR---NFAE 303 (480) Q Consensus 228 v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~---~~~~ 303 (480) +.+.+..+|.+.-.+.+.+....+|.+.+..-... ....+...++.++..... +.........+.. 
..++ T Consensus 333 ~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~------~~~~l~~~PG~ii~~~~~~~v~pl~~~~~~~~~~~~~~~ 406 (641) T protein:vir:94 333 NLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGIL------KREDVKAKPGAVFKVAQHGSLQPIDMGRQDFVVTYQEAQ 406 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccc------ccceeeccCCcceeeCCCCcceeecCCccccchhHHHHH Confidence 88999999999999999999888887654311000 011134445544433221 1111111122222 2233 Q ss_pred HHHHHHHHHHhccCCChHHhccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHh----------- Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSS--SENPASAEAIIATDSRIVKMAERKGRIFGG-AWERAMRIAMQIM----------- 369 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~--~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~~~~~~~----------- 369 (480) .++..+.. .+++.....+.. +....++..+..+......+.....+.|.. .+..+++.++.++ T Consensus 407 ~~~~~i~~---~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R 483 (641) T protein:vir:94 407 VQESSVYR---NTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIR 483 (641) T ss_pred HHHHHHHH---hhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhh Confidence 33333333 222221111111 111112333444444444444444455543 3444444332211 Q ss_pred --C------C--CCcccceeeeEEecCCCCCCHHHHHHHHHHH---HhccccCCC--------H---HHHHHhCCCC--- Q lcl|NC_021297. 370 --G------R--EVTEEYTRLETVWRDPSTPTVAAKADAVSKL---YANGQGPIP--------K---EQARIDLGYT--- 422 (480) Q Consensus 370 --~------~--~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl---~~~~~~~~s--------~---et~~~~lg~~--- 422 (480) + + ..+.+....++.+-.-......+.+..+..+ .+.. +..+ . +.+++.+|+. 
T Consensus 484 ~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~-a~~P~v~d~~d~~~~~~~~~~~~g~~~p~ 562 (641) T protein:vir:94 484 MYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDIS-GRVPQIGQSLDYALILEDLLRQMRFTDPM 562 (641) T ss_pred hhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHh-hcChhhhhcCCHHHHHHHHHHHhCCCCch Confidence 1 1 1111212222222111111112223333222 2211 1111 0 2223333431 Q ss_pred -----hh-HHHHHHHHHHHHHHHHHHHHhc-----------c-cccCcccccCCCCCC-CCccccc--CCC-CCCCCCC Q lcl|NC_021297. 423 -----AT-QREQMRDWDKQETEDMIDTLYS-----------T-TKAQADATPKPTVTD-TKTETQT--SPS-GFNRTKT 479 (480) Q Consensus 423 -----~~-~~~~~~~~~~~~~~~~~~~~~~-----------~-~~~~~~~~~~~~~~d-~~~~~~~--~p~-~~~~~~~ 479 (480) ++ +.+......++.++......+. . .+.+.+.....-+.+ ..--+++ .|| .....+- T Consensus 563 ~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 641 (641) T protein:vir:94 563 RYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSDVAPEAMAAATQQITSGAL 641 (641) T ss_pred hhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchhhhHHHHhcccccccccCC Confidence 11 0000000000000111111000 0 000000000000000 0000000 000 0000111 No 124 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.90 E-value=1.6e-08 Score=63.31 Aligned_cols=423 Identities=15% Similarity=0.093 Sum_probs=158.5 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHhcc-cCcchh------cCcccchhhhhhccccchHHHHHHHHHhhhccCccc--- Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLE-AEAYRNG-TRRLKT------IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR--- 73 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~-~~~YY~g-~~~i~~------~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~--- 73 (480) .-++..|...... +.... -.+.|.. ...+.. .+..+.+.. -++... ...+|+..++-+-.-++. 
T Consensus 1 Mg~~~~l~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~-al~~~~--v~~~i~~ia~~iA~lp~~~~~ 75 (457) T protein:vir:62 1 MGFWSALFGRGHS--PALDAAEGRAWEPYDPSIYNLGATASSGERVTPHD-ALQVSA--VFASVRLLSETIATLPLSTYS 75 (457) T ss_pred Cchhhhhhccccc--cccccccccccccchhhhhhccccccCCceechHH-hhccHH--HHHHHHHHHHhHhhCceEEEE Confidence 2222222111000 00000 0000000 000000 001111100 011100 111233332222222322 Q ss_pred cCCC---chhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCC Q lcl|NC_021297. 74 ISED---SEGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRN 146 (480) Q Consensus 74 ~~~d---~~~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~ 146 (480) ..++ ......+..++.. | ....+...+..+.+.+|.||+.+-.. ...-..+..++|..+.+..+... T Consensus 76 ~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~------~g~~~~l~~l~p~~v~v~~~~~~ 149 (457) T protein:vir:62 76 KRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA------GPNIAGLDVLDPTKIHVHMVMVD 149 (457) T ss_pred ecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC------CCcEEEEEEEcCcceEEEEeccC Confidence 1111 1112223334322 2 24566778888899999999887432 12223577788888877654332 Q ss_pred CCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchh Q lcl|NC_021297. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) .. .....+.|. 
....+.......|.+++++++ ++....+..+|.|.+.- T Consensus 150 ~~-~~~~~~~y~-~~~~g~~~~~~~~~~~eiih~-----------------------------r~~~~~~~~~G~sp~~~ 198 (457) T protein:vir:62 150 GL-RRKVFEAYD-IDADGNEVLLGWFTPRDVLHI-----------------------------PGMMLPGDFVGCSPISY 198 (457) T ss_pred Cc-cceeEEEEE-EccCCceeEEEeeCccceEEe-----------------------------cCCCCCCceecccHHHH Confidence 21 111111121 112222222333444444443 32222233457665532 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhhh------hhhhcccCCCCceEEEeccccH- Q lcl|NC_021297. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDIY------YGRILTLASEAAKISEFKAAEL- 298 (480) Q Consensus 227 ~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~~------~~~~~~~~g~~~~~~~~~~~~~- 298 (480) +...+.....+..-....+...+.|..+|.-. .......+.-...|+-. .++++.++ ++.++.++..... T Consensus 199 -~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~-~g~~~~~l~~~~~d 276 (457) T protein:vir:62 199 -ARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLT-EGAKFSKVAMSPDE 276 (457) T ss_pred -HHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEccCChhH Confidence 22223222222111222222223344334311 11111111111112111 13344454 3467777654322 Q ss_pred HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) ..|++..+..+.+|+...++|+..+|....+..++..+......+...+ -.-+-..|++.+.. .+..... .... 
T Consensus 277 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~---l~P~~~~ie~~ln~--~L~~~~~-~~~~ 350 (457) T protein:vir:62 277 AQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFS---LRPWLERIEAGFNR--LLFAETA-DRFR 350 (457) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHH---HHHHHHHHHHHHHh--hhcCccc-cCce Confidence 2477888888999999999999999754433322322332222222221 01111122222111 1111111 1223 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH---HHH-HHHHHHHHHHHHHHHhcccccCccc Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR---EQM-RDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~---~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) .+++.+......|..++++++.+++++| +++...+++.+|+.+-+- +++ .-..-..............+...++ T Consensus 351 ~i~fd~~~l~~~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~ 428 (457) T protein:vir:62 351 FVKFNLDEIKRGAPKERMELWSLGLQNG--IYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDP 428 (457) T ss_pred EEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccccccCCCccCCC Confidence 4555566666678889999999998875 677778888887754211 111 0000000000000000000000000 Q ss_pred cc-CCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 455 TP-KPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 455 ~~-~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .+ ++..+.++.+.++.| ..+..+++ T Consensus 429 ~~~~~~~~~~~~~~~~~~-d~~~~~~~ 454 (457) T protein:vir:62 429 PAEEPADDEEPDNAEGDP-DEGETEDD 454 (457) T ss_pred CccCCCCCCCCCCCCCCC-cccccccc Confidence 00 000011111112222 22222222 No 125 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.89 E-value=1.9e-08 Score=62.86 Aligned_cols=422 Identities=10% Similarity=0.082 Sum_probs=160.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC--CCc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS--EDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~--~d~ 78 (480) +++..+.-..++... -......-+ |.+++..-+... ..+.+..-...+...+|+..+..+...++.+. ++. T Consensus 51 ~~~~~d~~~~~~~r~-----g~~~~~~~~-g~~~~~epp~d~-~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~ 123 (648) T protein:vir:79 51 GSAKRDPKMSLVKRI-----GLAIMDGGG-GGRDFEEPEFDF-NEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPN 123 (648) T ss_pred ccccccchhHHHHHh-----HHHHHhhcC-CccccccCCcCH-HHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCc Confidence 222222111111111 011111222 222332111111 12223322345667777777766655554332 221 Q ss_pred hhHH-HHHH-HHHhc---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-E---------------EEEEccce Q lcl|NC_021297. 79 EGLE-ELWN-WWQAN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-L---------------IRVESPLY 137 (480) Q Consensus 79 ~~~~-~l~~-i~~~n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~---------------i~~~~p~~ 137 (480) .... .... +...| ....+...+..+.+.+|.||+.+-.+. +|.+ + +..++|.. T Consensus 124 ~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~------~G~~~~~l~~~~~~~~~~v~~l~pl~p~~ 197 (648) T protein:vir:79 124 AVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAK------DALPFQGMNVMGVGDSMPVAGYFPLNLAS 197 (648) T ss_pred cchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecC------CCccchhhhhhhhccccceeeeEeecCce Confidence 1111 1111 12222 345667788899999999999876532 2211 0 11122222 Q ss_pred eEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCc Q lcl|NC_021297. 138 MYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGN 217 (480) Q Consensus 138 ~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~ 217 (480) +.+..+ +.+.+.. |.|...++.. 
...+..=-|+||+......+ T Consensus 198 v~v~~d------------------~~g~~~~---------Y~y~~~g~~~----------~~~~~~~dIIHik~~~~~d~ 240 (648) T protein:vir:79 198 MKVKRD------------------KFGMIKG---------WQQEQEGQDK----------PQKFKPEDIVHIYYKREKGR 240 (648) T ss_pred eEEEEc------------------CCCceee---------eEEEecCCce----------eEEecCccEEEEccCCCCCC Confidence 222221 1111100 1111111100 00011113678876656677 Q ss_pred cCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc---hhhhhhhhhcccCCC-CceEEEe Q lcl|NC_021297. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT---TLDIYYGRILTLASE-AAKISEF 293 (480) Q Consensus 218 ~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~---~~~~~~~~~~~~~g~-~~~~~~~ 293 (480) .+|.|.|.- +...++..+.+.......+...+.|..++.- .......+.... .+.-....+...++. .++...+ T Consensus 241 ~~GlSpi~~-a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~-~~~~~~~e~~k~~~e~~~~~~~~~~i~gg~v~~~~~~i 318 (648) T protein:vir:79 241 AFGTPWLLP-ALDDIRALRQVEENVLRLVYRNLHPLWHVKV-GLEQEGFGAEEGEVDLVRGEVENMDVEGGMVTTERVNI 318 (648) T ss_pred ceeccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEe-CCCccchHHHHHHHHHHHHhcccccccccccccceeec Confidence 889987642 2333332222222222223333444443321 111111111111 111111112212221 2222222 Q ss_pred cc-cc--HHHHHHHHHHHHHHHHhccCCChHHhcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH-H Q lcl|NC_021297. 294 KA-AE--LRNFAEEMEVFRKEAASITGLPPQYLSSSS-ENPASAEAIIATDSRIVKMAERKGRIFGGAWERA-MRIAM-Q 367 (480) Q Consensus 294 ~~-~~--~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~-~~~~~-~ 367 (480) +. .+ .-.|++..+..+.+|+...++|+..+|... .+.+.+.+....+.. .+.-.+..+...+... .+.++ . 
T Consensus 319 ~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~---~i~~l~~~i~~~le~~~~~~ll~e 395 (648) T protein:vir:79 319 SSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKD---RIKALQKVMATFINEFMVKEILME 395 (648) T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhh Confidence 21 11 124778888889999999999999998532 222223333322221 1111222222222211 11111 0 Q ss_pred -HhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH--HH--HHH-HHHHHHHHHHH Q lcl|NC_021297. 368 -IMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ--RE--QMR-DWDKQETEDMI 441 (480) Q Consensus 368 -~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~--~~--~~~-~~~~~~~~~~~ 441 (480) ..+..... ...+++.|.+....+....++.+.++.++| +++...+++++|+.+-+ .. .+- .+.....+... T Consensus 396 ~~l~~~l~~-d~~ieF~~~~Llr~D~~~~a~~~~~l~~~G--ilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~ 472 (648) T protein:vir:79 396 GGFDPVLNP-DDKVEFRFNEIDMDSKIKLENQAVFLYEHN--AISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATAL 472 (648) T ss_pred hhccccccc-cceEEEeecccchhhHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCccccccccccchhcccc Confidence 00111111 234677888777788888888888888876 67888888888875432 10 110 00000000000 Q ss_pred HHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 
442 DTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) +.......+........++...+++...+|....+.+++ T Consensus 473 ~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~ 511 (648) T protein:vir:79 473 AALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTS 511 (648) T ss_pred ccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCC Confidence 000000100000011111111111111112111111211 No 126 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.88 E-value=2.2e-08 Score=62.62 Aligned_cols=445 Identities=13% Similarity=0.108 Sum_probs=176.8 Q ss_pred CCCHH-----HHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHHHHHHHhhhcc--- Q lcl|NC_021297. 1 MTTYH-----EHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~-----~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+++ +-++++.+.+..++ +....++++|+- .+++.. .......+..++..+-+...++++++.|.. T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQY--TIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALF 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhc Confidence 98854 24444555554433 223334444421 122211 111111112244455566667776665532 Q ss_pred -C-cc-ccC-CCch--------------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCC Q lcl|NC_021297. 70 -E-GF-RIS-EDSE--------------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPA 125 (480) Q Consensus 70 -~-~~-~~~-~d~~--------------------~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~ 125 (480) . +| ++. .|.+ ..+.+...+..++|.....++.++..++|.+-+++..+ ... 
T Consensus 79 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~-----~~~ 153 (536) T protein:vir:10 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-----EGS 153 (536) T ss_pred CCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC-----CCC Confidence 1 22 221 1110 11334556677899999999999999999988776422 223 Q ss_pred ceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-------c------C---CceEEEEEEEeC-------CcEEEEEe Q lcl|NC_021297. 126 GIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-------D------D---VAVPDRATLYLP-------DETVPLRR 182 (480) Q Consensus 126 ~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~------~---~~~~~~~~~~~~-------~~~~~~~~ 182 (480) +...++.++-.++++.-|. .+++...+|.+.-. . . ......+++|+. +.+..|.. T Consensus 154 ~~~~~~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e 231 (536) T protein:vir:10 154 NYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEE 231 (536) T ss_pred ceeeEEEEEcCeEEEeeCC--CCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEe Confidence 4445777777777766664 23444444333211 0 0 000112222221 11222211 Q ss_pred cCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcc Q lcl|NC_021297. 183 NGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTD 262 (480) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~ 262 (480) ..+. ....+....+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........+...+. 
++ T Consensus 232 ~~g~----~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~---p~ 303 (536) T protein:vir:10 232 VEGM----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVN---PA 303 (536) T ss_pred ecCc----cccccccccccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCcccC---cc Confidence 1110 111222334677899999999989999999997754 556667777776666665555555443331 00 Q ss_pred ccccccccchhhhhhhhhcccCCCCceEEEec-cccHH---HHHHHHHHHHHHHHhccCCChHHhc-cccccchHHHHHH Q lcl|NC_021297. 263 ELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELR---NFAEEMEVFRKEAASITGLPPQYLS-SSSENPASAEAII 337 (480) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g-~~~~n~~Sg~Al~ 337 (480) .. ..........++.+..-..++..+.++. ..+++ ..++.++.-|+..+.... +. -+... .++.-++ T Consensus 304 g~--~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----l~~~~~~r-~TAtEV~ 375 (536) T protein:vir:10 304 GI--TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGER-VTAEEIR 375 (536) T ss_pred cc--cchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-----cccCCCCC-ccHHHHH Confidence 00 0000111222333322112233344443 23333 334444444444332221 11 11112 1333333 Q ss_pred HHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCC-CHHHHHHHHHHHHh----c Q lcl|NC_021297. 338 ATDSRIVK----MAERKGRIF-GGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTP-TVAAKADAVSKLYA----N 405 (480) Q Consensus 338 ~~~~~l~~----k~~~~~~~~-~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~-~~~~~ad~~~kl~~----~ 405 (480) .....+.. -..++...| .+-+++++.++.+ .|. ..+.+. +++.+.-++.. ...+.++.+...++ . 
T Consensus 376 ~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~--v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~ 452 (536) T protein:vir:10 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKEA--VEPTISTGLEAIGRGQDLDKLERCVTAWAAL 452 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCChhh--ccceEEecHHHHHHHHHHHHHHHHHHHHHhh Confidence 32222222 112222222 2223333333322 111 122222 34444322221 11122222222222 1 Q ss_pred ccc----CCCHHHHH----HhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhcccc-cCcccccCCCCCCCCccccc Q lcl|NC_021297. 406 GQG----PIPKEQAR----IDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTK-AQADATPKPTVTDTKTETQT 469 (480) Q Consensus 406 ~~~----~~s~et~~----~~lg~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~d~~~~~~~ 469 (480) ++. .+....++ +.+|++ +++++++.+.+.++.+ ...+.....+ ..++....+.......+++| T Consensus 453 ~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~-~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g 531 (536) T protein:vir:10 453 APMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMG-MDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) T ss_pred chhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCchhHHhhhhccc Confidence 211 12332222 345662 3444444332222211 1111111010 11111111111111111111 Q ss_pred CCCCC Q lcl|NC_021297. 470 SPSGF 474 (480) Q Consensus 470 ~p~~~ 474 (480) .--|. T Consensus 532 ~~~~~ 536 (536) T protein:vir:10 532 LQPGI 536 (536) T ss_pred cCCCC Confidence 10011 No 127 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.87 E-value=2.2e-08 Score=62.53 Aligned_cols=438 Identities=13% Similarity=0.112 Sum_probs=181.7 Q ss_pred CCCHH------HHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhc--CcccchhhhhhccccchHHHHHHHHHhhhcc-- Q lcl|NC_021297. 
1 MTTYH------EHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTI--GIGAPPELAYLDVQPGWVATYLRTLSDRLDI-- 69 (480) Q Consensus 1 ~~~~~------~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~--~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~-- 69 (480) |.+.+ +.++.....+..++ +....++++|+- .+++. ...........++..+-+...++++++.|.. T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 78 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQY--TIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLAL 78 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhh Confidence 66543 23444444444432 223333444321 12221 1111111111233344456666666665532 Q ss_pred ---Ccc-cc--CC--------Cc-----------hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCC Q lcl|NC_021297. 70 ---EGF-RI--SE--------DS-----------EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 70 ---~~~-~~--~~--------d~-----------~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~ 124 (480) .+| +. .+ +. +..+.+...+..++|.....++.++..++|.+-+++.. +. T Consensus 79 tP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~------~~ 152 (535) T protein:vir:15 79 FPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPE------PE 152 (535) T ss_pred cCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeec------CC Confidence 122 21 11 11 11133445567789999999999999999998776642 23 Q ss_pred CceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-c---------------CCceEEEEEEEeC-----C--cEEEEE Q lcl|NC_021297. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D---------------DVAVPDRATLYLP-----D--ETVPLR 181 (480) Q Consensus 125 ~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-~---------------~~~~~~~~~~~~~-----~--~~~~~~ 181 (480) .+..+++.++-.++++..|.. .++...++.+.-. . +.+....+++|+. + .+..|. 
T Consensus 153 ~~~~~f~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~ 230 (535) T protein:vir:15 153 GSYNPMKLYRLSSYVVQRDAY--GNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYE 230 (535) T ss_pred CCceeeEEEEcCeeEEeeCCC--CCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEE Confidence 455778888877777766643 3344444433211 0 0000111222221 1 111111 Q ss_pred ecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CC Q lcl|NC_021297. 182 RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV 259 (480) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~ 259 (480) ...+.. ........+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-......+....|.+.+. |. T Consensus 231 e~~g~~----~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~ 305 (535) T protein:vir:15 231 EVEDVE----IDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEE-YLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGI 305 (535) T ss_pred EeeCcc----ccccccccccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeccccc Confidence 111100 11111234577899999998888899999998754 567788888888888888888888875542 11 Q ss_pred CccccccccccchhhhhhhhhcccCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHHH Q lcl|NC_021297. 260 TTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAII 337 (480) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al~ 337 (480) ... ......+.+.+.....++....++.. .+++.....++.+...|....-+ + .+.. +.... 
++.-++ T Consensus 306 ~~~-------~~l~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~-~~~~~~~~r~-TAtEV~ 375 (535) T protein:vir:15 306 TQP-------RRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFML-N-SAVQRTGERV-TAEEIR 375 (535) T ss_pred ccc-------hhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhh-h-hcccCCCccc-cHHHHH Confidence 100 00011122222222223344444442 34444444444433333322211 1 1211 11111 232222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHhCCCCc-ccceeeeEEecCCCCCCH-HHHHHHHHHHH Q lcl|NC_021297. 338 ATDSRIVKMAERKGRIFGGAWE------------RAMRIAMQIMGREVT-EEYTRLETVWRDPSTPTV-AAKADAVSKLY 403 (480) Q Consensus 338 ~~~~~l~~k~~~~~~~~~~~l~------------~~~~~~~~~~~~~~~-~~~~~i~v~f~~~~~~~~-~~~ad~~~kl~ 403 (480) . ++..+...++..+. +.+.++.+ .+..+ .....+++.|..++..-- .+.++.+...+ T Consensus 376 ~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r--~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~ 446 (535) T protein:vir:15 376 Y-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQA--TSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCI 446 (535) T ss_pred H-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHH Confidence 2 22333333333333 33333221 11111 112346777755543211 11222222222 Q ss_pred ----hcccc----CCCHHHH----HHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhcccc-cCcccccCCCCCCC Q lcl|NC_021297. 404 ----ANGQG----PIPKEQA----RIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTK-AQADATPKPTVTDT 463 (480) Q Consensus 404 ----~~~~~----~~s~et~----~~~lg~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~d~ 463 (480) +.++. .+....+ ...+|+. +++++++.+...++++ .........+ ..+.+...|++. T Consensus 447 ~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~-~~~~a~~~g~~~~~~~~~~p~~~-- 523 (535) T protein:vir:15 447 SAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTG-IENAAATGGAGVGALATSSPEAM-- 523 (535) T ss_pred HHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHH-HHHHHHHHHhhccchhccChHHH-- Confidence 22211 1222222 2334543 2333333222111111 1111111111 111111112211 Q ss_pred CcccccCCCCCCCCCC Q lcl|NC_021297. 
464 KTETQTSPSGFNRTKT 479 (480) Q Consensus 464 ~~~~~~~p~~~~~~~~ 479 (480) ++..+..|-.+| T Consensus 524 ----~~~~~~~g~~~~ 535 (535) T protein:vir:15 524 ----QGAAAQAGLDAT 535 (535) T ss_pred ----HHHHhccCCCCC Confidence 122222233333 No 128 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.86 E-value=2.5e-08 Score=62.26 Aligned_cols=433 Identities=14% Similarity=0.119 Sum_probs=194.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCc----ccch----hhhhhccccchHHHHHHHHHhhhc---- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGI----GAPP----ELAYLDVQPGWVATYLRTLSDRLD---- 68 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~----~~~~----~~~~~~~~~n~~~~iVd~~~~~l~---- 68 (480) |+ .+++++++-.-...|.+....+++||+- .+++... .... ..++.++..+-+...|+++++.|. T Consensus 1 ~~-~~~l~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~lt 77 (547) T protein:vir:10 1 ME-NSKIVKRLDFLKTDRKNVEQIWDCIRKY--IMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLT 77 (547) T ss_pred CC-HHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhc Confidence 54 4667766555555555555566666532 2332211 1111 123445666777788888777663 Q ss_pred c--Cccc-c--CCC-----chh-------HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE Q lcl|NC_021297. 69 I--EGFR-I--SED-----SEG-------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 69 ~--~~~~-~--~~d-----~~~-------~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~ 131 (480) + .+|+ . .+. .+. 
.+.+.+.+.+++|.....++.++..++|.|-+++..+ .+..+.++++ T Consensus 78 Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d----~~~~~~~r~~ 153 (547) T protein:vir:10 78 SPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEED----EDEEGSVVFQ 153 (547) T ss_pred CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccC----CCCCCceeEE Confidence 2 1222 1 111 111 2334556677889999999999999999998777532 2344668899 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEEe-----------------------cCCceEEEEEEEe----CCc-------- Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR-----------------------DDVAVPDRATLYL----PDE-------- 176 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~----~~~-------- 176 (480) .++..++++.-|.. +++...+|.+.-. +.+.....+++++ ... T Consensus 154 ~~pl~~~~v~~d~~--G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~ 231 (547) T protein:vir:10 154 SSPIQDSYFEEDSR--GQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAG 231 (547) T ss_pred EeecceEEEeeCCC--cCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCcccc Confidence 99999888877753 3344433322110 0011011122221 100 Q ss_pred -----------EEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 177 -----------TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSA 245 (480) Q Consensus 177 -----------~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~ 245 (480) .+++...+.. .. ..+.+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-..... 
T Consensus 232 ~~~~~~~~p~~s~~~e~~~~~----~~---l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~ 303 (547) T protein:vir:10 232 TVLAPTERPFGKKWILKEGAV----QL---GEEGGYYEMPAYAIRWRKSAGSQWGFGPSHL-ALPDVLTANRYVELVLRS 303 (547) T ss_pred ceeeccccceeEEEEEecCce----ee---eecCCcccCCeeeeeeeecCCcccccchHHH-HHHHHHHHHHHHHHHHHH Confidence 0111111100 00 1234567889999999888899999997754 567778888888888888 Q ss_pred HHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCCCceEEEecc-ccHH---HHHHHHHHHHHHHHhccCCChH Q lcl|NC_021297. 246 SQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQ 321 (480) Q Consensus 246 ~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~---~~~~~l~~~~~~i~~~~~~p~~ 321 (480) ++....|.+.+.- ++-...++..++.+... |+...+..++. .++. ..++.++.-|...+....+. T Consensus 304 ~~~~~~pp~~v~~--------~g~~~~~~~~pgg~~~~-~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~-- 372 (547) T protein:vir:10 304 SEKVIDPAIMVTE--------RGLISDIDLGASGLTVV-RDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQ-- 372 (547) T ss_pred HHHHhcCceeccc--------ccccccceecCCeeeec-CCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhh-- Confidence 8888888765421 00111233444444332 22223332322 2333 34444444444433332211 Q ss_pred HhccccccchHHHHHHHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHHHhCC--CC-----cccceeeeEEecCCCC Q lcl|NC_021297. 322 YLSSSSENPASAEAIIATDSRIVK----MAERKGRIF-GGAWERAMRIAMQIMGR--EV-----TEEYTRLETVWRDPST 389 (480) Q Consensus 322 ~~g~~~~n~~Sg~Al~~~~~~l~~----k~~~~~~~~-~~~l~~~~~~~~~~~~~--~~-----~~~~~~i~v~f~~~~~ 389 (480) .. +.... ++.-++.....+.. ...+.+..| ..-+.+.+.++.+. |. .. ..+...++|++..++. 
T Consensus 373 -~~-~~~~~-TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~l~~~~~~~~~v~~is~La 448 (547) T protein:vir:10 373 -MK-DSPAM-TATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRA-GKLGELPSKLLESGKAAMDIVYTGPLS 448 (547) T ss_pred -cC-CCccc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhhccCcceEEEEeccHHH Confidence 11 11121 23222222221111 122222222 22334445443331 11 11 1134567788865554 Q ss_pred CCHHH-HHHH-------HHHHHhcccc---CCCHHHH----HHhCCCC------hhHHHHHHHHHHHHHHHHHH-HHhcc Q lcl|NC_021297. 390 PTVAA-KADA-------VSKLYANGQG---PIPKEQA----RIDLGYT------ATQREQMRDWDKQETEDMID-TLYST 447 (480) Q Consensus 390 ~~~~~-~ad~-------~~kl~~~~~~---~~s~et~----~~~lg~~------~~~~~~~~~~~~~~~~~~~~-~~~~~ 447 (480) +.... .+.+ +..+.+.++. .+....+ ...+|++ +++++++.+.+.++++.... ++... T Consensus 449 raq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~ 528 (547) T protein:vir:10 449 RAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEA 528 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43211 1111 1222222221 1232222 2345653 44455444333222221111 11100 Q ss_pred cccCcccccCCCCCCCC-cccc Q lcl|NC_021297. 448 TKAQADATPKPTVTDTK-TETQ 468 (480) Q Consensus 448 ~~~~~~~~~~~~~~d~~-~~~~ 468 (480) . + +..+.-+.++.+ ...+ T Consensus 529 ~-g--~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 529 E-G--NAMEAQGKGQAALKENQ 547 (547) T ss_pred H-H--HHHHhhcCcccchhccC Confidence 0 0 001111111100 0001 No 129 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.86 E-value=2.5e-08 Score=62.21 Aligned_cols=446 Identities=11% Similarity=0.053 Sum_probs=193.9 Q ss_pred CCCH-HHHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcCc----cc-chhhhhhccccchHHHHHHHHHhhhc----c Q lcl|NC_021297. 
1 MTTY-HEHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGI----GA-PPELAYLDVQPGWVATYLRTLSDRLD----I 69 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~~----~~-~~~~~~~~~~~n~~~~iVd~~~~~l~----~ 69 (480) |.+. .+.|.+..+.+..++ +....++++|+- .+++... .. ....+..++..+-+...|+++++.|. + T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltp 78 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSDF--INPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITS 78 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHH--hccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcC Confidence 8773 444555555555433 334445555532 2222211 11 11222345566667777887777653 2 Q ss_pred --Ccc-c--cCCCc-----h-------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEE Q lcl|NC_021297. 70 --EGF-R--ISEDS-----E-------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 70 --~~~-~--~~~d~-----~-------~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~ 132 (480) .+| + .++.+ + ..+.+.+.+..++|.....++.++...+|.+-+++.. +..+-++++. T Consensus 79 p~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~------~~~~~~r~~~ 152 (556) T protein:vir:73 79 PARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVME------DDQDVIRTMP 152 (556) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeee------cCCceEEEEE Confidence 122 1 12211 1 1233455677789999999999999999999887753 2345578888 Q ss_pred EccceeEEEEeCCCCCeeEEEEEEEEEe---------------------cCCceEEEEEEE----eCCcE---------- Q lcl|NC_021297. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTR---------------------DDVAVPDRATLY----LPDET---------- 177 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~----~~~~~---------- 177 (480) ++..++++--|.. +++...+|.+.-. ..+.....++++ ..... 
T Consensus 153 ~~l~~~~~~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~ 230 (556) T protein:vir:73 153 FPIGSYYLANSPR--GSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNK 230 (556) T ss_pred eecceeEEeeCCC--CCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccc Confidence 9998888877753 3444444432111 001101122221 11000 Q ss_pred ----EEEEecCCcccccccccccccccccccceEEeecccccCccCCccc-chhhHHHHHHHHHHHHHHHHHHHHHhcch Q lcl|NC_021297. 178 ----VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE-ISPELRKVTDAASRTLMNLQSASQILGTP 252 (480) Q Consensus 178 ----~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~-i~~~v~~liD~~~~~~s~~~~~~~~~~~~ 252 (480) ++|...... . .. ..+-+|..+|++++..+...++.||+|- ..+ ..+-+..+|.+.-......+....| T Consensus 231 p~~s~~~~~~~~~--~-~v---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~-~lgD~k~L~~l~~~~l~~~~~~~~p 303 (556) T protein:vir:73 231 PYRSVYFESGGDS--D-KL---LRESGFDEFPILAPRWEVNGEDVYASSCPGML-ALGQVKALQVEQKRKAQLIDKATNP 303 (556) T ss_pred eEEEEEEEecCCC--c-ee---cccCCcccCCceeeeeeecCCcccccCccHHH-hHHHHHHHHHHHHHHHHHHHHHhcC Confidence 111111000 0 00 1123567789999998888899999994 653 5566788888888888888888888 Q ss_pred hhhhcCCCccccccccccchhhhhhhhhc--ccCCCCceEEEe--ccccHHHHHHHHHHHHHHHHhccCCCh-HHhcc-c Q lcl|NC_021297. 253 LRVISGVTTDELTNDGENTTLDIYYGRIL--TLASEAAKISEF--KAAELRNFAEEMEVFRKEAASITGLPP-QYLSS-S 326 (480) Q Consensus 253 ~~~i~g~~~~~~~~~~~~~~~~~~~~~~~--~~~g~~~~~~~~--~~~~~~~~~~~l~~~~~~i~~~~~~p~-~~~g~-~ 326 (480) .+.+..- .....++..++.+. ...++...+.-+ ...++....+.++.+-..|....-... ..++. 
+ T Consensus 304 p~~v~~~--------~~~~~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~ 375 (556) T protein:vir:73 304 PMVAPTS--------LKNQRVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNIN 375 (556) T ss_pred ceecccc--------ccccceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCC Confidence 6554321 11122344444322 122222222222 223344444444443333332221110 01221 1 Q ss_pred cccchHHHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhCC--CCc--ccceeeeEEecCCCCCCHH---- Q lcl|NC_021297. 327 SENPASAEAIIATDSRIVKM----AERKGRI-FGGAWERAMRIAMQIMGR--EVT--EEYTRLETVWRDPSTPTVA---- 393 (480) Q Consensus 327 ~~n~~Sg~Al~~~~~~l~~k----~~~~~~~-~~~~l~~~~~~~~~~~~~--~~~--~~~~~i~v~f~~~~~~~~~---- 393 (480) ..+. ++.-++.....+... ..+.... +..-+.+.+.++.+. |. ..+ -....++|.|..++..... T Consensus 376 ~~r~-TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~ 453 (556) T protein:vir:73 376 TRSM-PVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARK-NMLPEPPDVLQGMPLRIEYISVMAQAQKSIGL 453 (556) T ss_pred CCCc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhcCceeEEEeecHHHHHHHHHHH Confidence 2221 233233222211111 2222222 333344555544431 11 111 1224577777555432211 Q ss_pred ----HHHHHHHHHHhcccc---CCCHHHH----HHhCCCC------hhHHHHHHHHHHHHHHHHHH--HHhcccccCccc Q lcl|NC_021297. 394 ----AKADAVSKLYANGQG---PIPKEQA----RIDLGYT------ATQREQMRDWDKQETEDMID--TLYSTTKAQADA 454 (480) Q Consensus 394 ----~~ad~~~kl~~~~~~---~~s~et~----~~~lg~~------~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 454 (480) ..++.+..+.+.++. .+....+ ...+|+. +++++++.+.+.++++.... ......++...- T Consensus 454 ~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~ 533 (556) T protein:vir:73 454 TSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTL 533 (556) T ss_pred HHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 111222233332221 1222222 2335553 44555554443333222211 111111110001 Q ss_pred c----cCCCCCCCCcccccCCCC Q lcl|NC_021297. 
455 T----PKPTVTDTKTETQTSPSG 473 (480) Q Consensus 455 ~----~~~~~~d~~~~~~~~p~~ 473 (480) + +++.+-..-....++|-. T Consensus 534 ~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 534 SETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred hhccCCCHHHHHHHHHhhcCCCC Confidence 1 111111000011122212 No 130 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.80 E-value=1.6e-08 Score=63.26 Aligned_cols=399 Identities=10% Similarity=0.023 Sum_probs=158.6 Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---c Q lcl|NC_021297. 5 HEHVERLQGLLARDLPN-------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---I 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r-------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~ 74 (480) ..+++++.......... ...+..+..+.. .+..+... .-..+.-...+|+..+.-+-.-+|. . T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~----~~~~v~~~---~al~~~~v~~~i~~ia~~ia~l~~~~~~~ 73 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP----STISVKGK---NALKVATVFACIKILSESVSKLPLKIYQE 73 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCC----Ccceechh---hhhccHHHHHHHHHHHHhhccCceEEEEe Confidence 33444433322110000 000111111000 00001000 0001111223344333322222332 1 Q ss_pred CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCC Q lcl|NC_021297. 75 SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPR 145 (480) Q Consensus 75 ~~d---~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~ 145 (480) .++ +.....+..++.. | ....+...+..+.+.+|.||+++-.+ ..|++ .+.+++|..+.+..|.. 
T Consensus 74 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~ 147 (429) T protein:vir:10 74 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYIDDV 147 (429) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEcCc Confidence 111 1122335555532 2 33466778888999999999998653 34554 67888999888777643 Q ss_pred CCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccch Q lcl|NC_021297. 146 NTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .. +......|......+.. ..|.++ -|+||++.....+.+|.|.+. T Consensus 148 ~~--~~~~~~~~~~~~~~g~~---~~~~~~-----------------------------evih~~~~~~~~~~~G~s~i~ 193 (429) T protein:vir:10 148 GL--LNSKTKMWYVVNTGGQQ---RVLKPE-----------------------------EILHFKNGITLDGLVGVPTME 193 (429) T ss_pred cc--ccccceEEEEEccCCeE---EEEccc-----------------------------cEEEecCCCCCCCcccccHHH Confidence 21 11111111111111110 011122 256665544445567877653 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHh---cchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEecc Q lcl|NC_021297. 226 PELRKVTDAASRTLMNLQSASQIL---GTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKA 295 (480) Q Consensus 226 ~~v~~liD~~~~~~s~~~~~~~~~---~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~ 295 (480) .+.+.+.....-..-...++ +.|-.++... .......+.-...|+. ..+.++.++ ++.++.++.. T Consensus 194 ----~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~ 268 (429) T protein:vir:10 194 ----YLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPISL 268 (429) T ss_pred ----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhccccccCceeecC-CCceEEEccC Confidence 33333333222222222222 2344343321 1111111111111211 112344444 3456766654 Q ss_pred cc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021297. 
296 AE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) Q Consensus 296 ~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 374 (480) .. -..+++..+....+|+...++|+..+|....+.-|. +......+...+ -.-+-..+++.+.. .+...... T Consensus 269 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn--~e~~~~~f~~~~---l~P~~~~ie~~ln~--kl~~~~~~ 341 (429) T protein:vir:10 269 NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQQQQFYTDT---LQATLTMYEQEMTY--KLFLDSEL 341 (429) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHHHHHHHHH---HHHHHHHHHHHHHH--hhcChhhc Confidence 32 224678888889999999999999997533221121 111111111111 01111112221110 11111111 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccc Q lcl|NC_021297. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 375 ~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) .....+++.+..-...|..+.++++.+++.+| +++...+++.+|+.+-+- ..+...-..-..++...... T Consensus 342 ~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--gD~~~~~~n~~~~d~~~~~~------ 411 (429) T protein:vir:10 342 DKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEAG--GDRLLVNGNMLPIDMAGQAY------ 411 (429) T ss_pred CCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCeeeecccccchhhccccc------ Confidence 11123455555566678889999999998765 667777788888764321 10000000000011100000 Q ss_pred ccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 455 TPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~p~~~~~ 476 (480) . .+++.. 
++..+..+-|+ T Consensus 412 -~--k~g~~~-~~~~~~~~e~~ 429 (429) T protein:vir:10 412 -L--KGGDTN-GEVSKEGNEGN 429 (429) T ss_pred -c--CCCCCC-CCCCCCCCCCC Confidence 0 001100 00111111111 No 131 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.79 E-value=4.6e-08 Score=60.78 Aligned_cols=449 Identities=12% Similarity=0.046 Sum_probs=189.2 Q ss_pred CCCHHHH--HHHHHHHHHH-HHHHHHHHHHHhcccCcchhcC----ccc-chhhhhhccccchHHHHHHHHHhhhcc--- Q lcl|NC_021297. 1 MTTYHEH--VERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIG----IGA-PPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~----~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+..+. +.+-.+.+.. |.+....++++|+ +.++..+ ... ....+..++..+-+...++++++.|.. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhc Confidence 8887653 2222333333 3333444555553 2233321 111 112223445566677777777776532 Q ss_pred ---Ccc-ccC-CCch-------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE Q lcl|NC_021297. 70 ---EGF-RIS-EDSE-------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 70 ---~~~-~~~-~d~~-------------~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~ 131 (480) .+| ... .|.+ ..+.+...+..++|.....++.++...+|.|-+++.. 
+..+-++++ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~------d~~~~~rf~ 152 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP------DFDAVVYHH 152 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec------CCCceEEEE Confidence 122 111 1111 1233455677789999999999999999999887753 234557888 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEEe---------------------cCCceEEEEEEEeC----CcE--------- Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR---------------------DDVAVPDRATLYLP----DET--------- 177 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~----~~~--------- 177 (480) .++..++++.-|.. .++...+|.+.-. +.+....++++++. ... T Consensus 153 ~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~ 230 (555) T protein:vir:10 153 SLTAGEYAIAADNQ--GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRN 230 (555) T ss_pred EeecceeEEeeCCC--CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccc Confidence 89998888877653 3444444332100 00011112332211 100 Q ss_pred -----EEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcch Q lcl|NC_021297. 178 -----VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTP 252 (480) Q Consensus 178 -----~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~ 252 (480) ++|...... . .. 
..+-+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-.....++...+| T Consensus 231 ~p~~s~~~~~~~d~--~-~v---l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~-~lgD~k~L~~l~~~~l~~~~~~~~p 303 (555) T protein:vir:10 231 MAWKSVYFEPGADE--T-RT---LRESGYRSFRALCPRWALVGGDIYGNSPAME-ALGDVRQLQHEQLRKAQAIDYKSNP 303 (555) T ss_pred cceEEEEEEeccCC--c-cc---cccCCcccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcC Confidence 111111000 0 00 1123567789999998888899999997754 5566777787766677777777777 Q ss_pred hhhhcCCCccccccccccchhhhhhhhhccc-CC--CCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCCh-HHhcc-c Q lcl|NC_021297. 253 LRVISGVTTDELTNDGENTTLDIYYGRILTL-AS--EAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPP-QYLSS-S 326 (480) Q Consensus 253 ~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~-~g--~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~-~~~g~-~ 326 (480) .+.+.. +.....++..++.+... .| ++.-.-.++. .+++...+.|+.+...|....-.+. ..+.. + T Consensus 304 p~~v~~--------~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~ 375 (555) T protein:vir:10 304 PLQLPV--------SAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGT 375 (555) T ss_pred ceeecc--------ccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCC Confidence 555421 11111234444433221 11 1222222222 2344444444444333332221110 11211 1 Q ss_pred cccchHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHhCC--CC--cccceeeeEEecCCCCCCHH-H-- Q lcl|NC_021297. 327 SENPASAEAIIATDSRIVKM----AERKG-RIFGGAWERAMRIAMQIMGR--EV--TEEYTRLETVWRDPSTPTVA-A-- 394 (480) Q Consensus 327 ~~n~~Sg~Al~~~~~~l~~k----~~~~~-~~~~~~l~~~~~~~~~~~~~--~~--~~~~~~i~v~f~~~~~~~~~-~-- 394 (480) ... .++.-++.....+... ..+.. ..+.+-+.+.+.++.+- |. .. ......++|.|-.++.+... . 
T Consensus 376 ~~~-~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~ 453 (555) T protein:vir:10 376 NPQ-MTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEA-NILPPPPQEMQGVDLNVEFVSMLAQAQRAIAT 453 (555) T ss_pred CCc-ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhcCceeEEEeccHHHHHHHHHHH Confidence 112 2333332222111111 12221 22222334444443331 11 01 11224577777555433211 1 Q ss_pred -----HHHHHHHHHhcccc---CCCHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHH-HHHhcccccCcc-c Q lcl|NC_021297. 395 -----KADAVSKLYANGQG---PIPKE----QARIDLGYT------ATQREQMRDWDKQETEDMI-DTLYSTTKAQAD-A 454 (480) Q Consensus 395 -----~ad~~~kl~~~~~~---~~s~e----t~~~~lg~~------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~ 454 (480) .++.+..+.+.++. .+... .+...+|++ +++++++.+.+.++++... .++.....+.+. - T Consensus 454 ~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~ 533 (555) T protein:vir:10 454 NSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKL 533 (555) T ss_pred HHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11112222222221 12222 223445653 4455555443333222111 111111110000 0 Q ss_pred ccCCCCCCCCcccccCCCCCCCCCC Q lcl|NC_021297. 455 TPKPTVTDTKTETQTSPSGFNRTKT 479 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~p~~~~~~~~ 479 (480) +.... +..+...+- .+..++=| T Consensus 534 ~~~~~--~~~~~~~~~-~~~~~~~~ 555 (555) T protein:vir:10 534 GSVDT--SKQNALTDV-TRAFSGYT 555 (555) T ss_pred ccccc--CcchhHHHH-HhhhccCC Confidence 00111 111111111 23333334 No 132 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.79 E-value=4.6e-08 Score=60.78 Aligned_cols=449 Identities=12% Similarity=0.046 Sum_probs=189.2 Q ss_pred CCCHHHH--HHHHHHHHHH-HHHHHHHHHHHhcccCcchhcC----ccc-chhhhhhccccchHHHHHHHHHhhhcc--- Q lcl|NC_021297. 
1 MTTYHEH--VERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIG----IGA-PPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~----~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+..+. +.+-.+.+.. |.+....++++|+ +.++..+ ... ....+..++..+-+...++++++.|.. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhc Confidence 8887653 2222333333 3333444555553 2233321 111 112223445566677777777776532 Q ss_pred ---Ccc-ccC-CCch-------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE Q lcl|NC_021297. 70 ---EGF-RIS-EDSE-------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 70 ---~~~-~~~-~d~~-------------~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~ 131 (480) .+| ... .|.+ ..+.+...+..++|.....++.++...+|.|-+++.. +..+-++++ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~------d~~~~~rf~ 152 (555) T protein:vir:98 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP------DFDAVVYHH 152 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec------CCCceEEEE Confidence 122 111 1111 1233455677789999999999999999999887753 234557888 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEEe---------------------cCCceEEEEEEEeC----CcE--------- Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR---------------------DDVAVPDRATLYLP----DET--------- 177 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~----~~~--------- 177 (480) .++..++++.-|.. .++...+|.+.-. +.+....++++++. ... 
T Consensus 153 ~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~ 230 (555) T protein:vir:98 153 SLTAGEYAIAADNQ--GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRN 230 (555) T ss_pred EeecceeEEeeCCC--CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccc Confidence 89998888877653 3444444332100 00011112332211 100 Q ss_pred -----EEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcch Q lcl|NC_021297. 178 -----VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTP 252 (480) Q Consensus 178 -----~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~ 252 (480) ++|...... . .. ..+-+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-.....++...+| T Consensus 231 ~p~~s~~~~~~~d~--~-~v---l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~-~lgD~k~L~~l~~~~l~~~~~~~~p 303 (555) T protein:vir:98 231 MAWKSVYFEPGADE--T-RT---LRESGYRSFRALCPRWALVGGDIYGNSPAME-ALGDVRQLQHEQLRKAQAIDYKSNP 303 (555) T ss_pred cceEEEEEEeccCC--c-cc---cccCCcccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcC Confidence 111111000 0 00 1123567789999998888899999997754 5566777787766677777777777 Q ss_pred hhhhcCCCccccccccccchhhhhhhhhccc-CC--CCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCCh-HHhcc-c Q lcl|NC_021297. 253 LRVISGVTTDELTNDGENTTLDIYYGRILTL-AS--EAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPP-QYLSS-S 326 (480) Q Consensus 253 ~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~-~g--~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~-~~~g~-~ 326 (480) .+.+.. +.....++..++.+... .| ++.-.-.++. .+++...+.|+.+...|....-.+. ..+.. 
+ T Consensus 304 p~~v~~--------~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~ 375 (555) T protein:vir:98 304 PLQLPV--------SAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGT 375 (555) T ss_pred ceeecc--------ccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCC Confidence 555421 11111234444433221 11 1222222222 2344444444444333332221110 11211 1 Q ss_pred cccchHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHhCC--CC--cccceeeeEEecCCCCCCHH-H-- Q lcl|NC_021297. 327 SENPASAEAIIATDSRIVKM----AERKG-RIFGGAWERAMRIAMQIMGR--EV--TEEYTRLETVWRDPSTPTVA-A-- 394 (480) Q Consensus 327 ~~n~~Sg~Al~~~~~~l~~k----~~~~~-~~~~~~l~~~~~~~~~~~~~--~~--~~~~~~i~v~f~~~~~~~~~-~-- 394 (480) ... .++.-++.....+... ..+.. ..+.+-+.+.+.++.+- |. .. ......++|.|-.++.+... . T Consensus 376 ~~~-~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~ 453 (555) T protein:vir:98 376 NPQ-MTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEA-NILPPPPQEMQGVDLNVEFVSMLAQAQRAIAT 453 (555) T ss_pred CCc-ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhcCceeEEEeccHHHHHHHHHHH Confidence 112 2333332222111111 12221 22222334444443331 11 01 11224577777555433211 1 Q ss_pred -----HHHHHHHHHhcccc---CCCHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHH-HHHhcccccCcc-c Q lcl|NC_021297. 395 -----KADAVSKLYANGQG---PIPKE----QARIDLGYT------ATQREQMRDWDKQETEDMI-DTLYSTTKAQAD-A 454 (480) Q Consensus 395 -----~ad~~~kl~~~~~~---~~s~e----t~~~~lg~~------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~ 454 (480) .++.+..+.+.++. .+... .+...+|++ +++++++.+.+.++++... .++.....+.+. - T Consensus 454 ~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~ 533 (555) T protein:vir:98 454 NSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKL 533 (555) T ss_pred HHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11112222222221 12222 223445653 4455555443333222111 111111110000 0 Q ss_pred ccCCCCCCCCcccccCCCCCCCCCC Q lcl|NC_021297. 
455 TPKPTVTDTKTETQTSPSGFNRTKT 479 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~p~~~~~~~~ 479 (480) +.... +..+...+- .+..++=| T Consensus 534 ~~~~~--~~~~~~~~~-~~~~~~~~ 555 (555) T protein:vir:98 534 GSVDT--SKQNALTDV-TRAFSGYT 555 (555) T ss_pred ccccc--CcchhHHHH-HhhhccCC Confidence 00111 111111111 23333334 No 133 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.79 E-value=4.6e-08 Score=60.78 Aligned_cols=449 Identities=12% Similarity=0.046 Sum_probs=189.2 Q ss_pred CCCHHHH--HHHHHHHHHH-HHHHHHHHHHHhcccCcchhcC----ccc-chhhhhhccccchHHHHHHHHHhhhcc--- Q lcl|NC_021297. 1 MTTYHEH--VERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIG----IGA-PPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~----~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+..+. +.+-.+.+.. |.+....++++|+ +.++..+ ... ....+..++..+-+...++++++.|.. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhc Confidence 8887653 2222333333 3333444555553 2233321 111 112223445566677777777776532 Q ss_pred ---Ccc-ccC-CCch-------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE Q lcl|NC_021297. 70 ---EGF-RIS-EDSE-------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 70 ---~~~-~~~-~d~~-------------~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~ 131 (480) .+| ... .|.+ ..+.+...+..++|.....++.++...+|.|-+++.. 
+..+-++++ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~------d~~~~~rf~ 152 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP------DFDAVVYHH 152 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec------CCCceEEEE Confidence 122 111 1111 1233455677789999999999999999999887753 234557888 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEEe---------------------cCCceEEEEEEEeC----CcE--------- Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR---------------------DDVAVPDRATLYLP----DET--------- 177 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~----~~~--------- 177 (480) .++..++++.-|.. .++...+|.+.-. +.+....++++++. ... T Consensus 153 ~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~ 230 (555) T protein:vir:10 153 SLTAGEYAIAADNQ--GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRN 230 (555) T ss_pred EeecceeEEeeCCC--CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccc Confidence 89998888877653 3444444332100 00011112332211 100 Q ss_pred -----EEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcch Q lcl|NC_021297. 178 -----VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTP 252 (480) Q Consensus 178 -----~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~ 252 (480) ++|...... . .. 
..+-+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-.....++...+| T Consensus 231 ~p~~s~~~~~~~d~--~-~v---l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~-~lgD~k~L~~l~~~~l~~~~~~~~p 303 (555) T protein:vir:10 231 MAWKSVYFEPGADE--T-RT---LRESGYRSFRALCPRWALVGGDIYGNSPAME-ALGDVRQLQHEQLRKAQAIDYKSNP 303 (555) T ss_pred cceEEEEEEeccCC--c-cc---cccCCcccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcC Confidence 111111000 0 00 1123567789999998888899999997754 5566777787766677777777777 Q ss_pred hhhhcCCCccccccccccchhhhhhhhhccc-CC--CCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCCh-HHhcc-c Q lcl|NC_021297. 253 LRVISGVTTDELTNDGENTTLDIYYGRILTL-AS--EAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPP-QYLSS-S 326 (480) Q Consensus 253 ~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~-~g--~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~-~~~g~-~ 326 (480) .+.+.. +.....++..++.+... .| ++.-.-.++. .+++...+.|+.+...|....-.+. ..+.. + T Consensus 304 p~~v~~--------~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~ 375 (555) T protein:vir:10 304 PLQLPV--------SAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGT 375 (555) T ss_pred ceeecc--------ccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCC Confidence 555421 11111234444433221 11 1222222222 2344444444444333332221110 11211 1 Q ss_pred cccchHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHhCC--CC--cccceeeeEEecCCCCCCHH-H-- Q lcl|NC_021297. 327 SENPASAEAIIATDSRIVKM----AERKG-RIFGGAWERAMRIAMQIMGR--EV--TEEYTRLETVWRDPSTPTVA-A-- 394 (480) Q Consensus 327 ~~n~~Sg~Al~~~~~~l~~k----~~~~~-~~~~~~l~~~~~~~~~~~~~--~~--~~~~~~i~v~f~~~~~~~~~-~-- 394 (480) ... .++.-++.....+... ..+.. ..+.+-+.+.+.++.+- |. .. ......++|.|-.++.+... . 
T Consensus 376 ~~~-~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~ 453 (555) T protein:vir:10 376 NPQ-MTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEA-NILPPPPQEMQGVDLNVEFVSMLAQAQRAIAT 453 (555) T ss_pred CCc-ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhcCceeEEEeccHHHHHHHHHHH Confidence 112 2333332222111111 12221 22222334444443331 11 01 11224577777555433211 1 Q ss_pred -----HHHHHHHHHhcccc---CCCHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHH-HHHhcccccCcc-c Q lcl|NC_021297. 395 -----KADAVSKLYANGQG---PIPKE----QARIDLGYT------ATQREQMRDWDKQETEDMI-DTLYSTTKAQAD-A 454 (480) Q Consensus 395 -----~ad~~~kl~~~~~~---~~s~e----t~~~~lg~~------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~ 454 (480) .++.+..+.+.++. .+... .+...+|++ +++++++.+.+.++++... .++.....+.+. - T Consensus 454 ~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~ 533 (555) T protein:vir:10 454 NSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKL 533 (555) T ss_pred HHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11112222222221 12222 223445653 4455555443333222111 111111110000 0 Q ss_pred ccCCCCCCCCcccccCCCCCCCCCC Q lcl|NC_021297. 455 TPKPTVTDTKTETQTSPSGFNRTKT 479 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~p~~~~~~~~ 479 (480) +.... +..+...+- .+..++=| T Consensus 534 ~~~~~--~~~~~~~~~-~~~~~~~~ 555 (555) T protein:vir:10 534 GSVDT--SKQNALTDV-TRAFSGYT 555 (555) T ss_pred ccccc--CcchhHHHH-HhhhccCC Confidence 00111 111111111 23333334 No 134 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.77 E-value=5.8e-08 Score=60.27 Aligned_cols=435 Identities=13% Similarity=0.109 Sum_probs=178.3 Q ss_pred CCCHHH----HHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHHHHHHHhhhc----c Q lcl|NC_021297. 
1 MTTYHE----HVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATYLRTLSDRLD----I 69 (480) Q Consensus 1 ~~~~~~----~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~ 69 (480) |.--.- -+++..+.+..++ +....++++|+- .+++.. .......+..++..+-+...++++++.|. + T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y--~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP 78 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAV--TIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFP 78 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCC Confidence 543222 2344444444432 223334444321 122211 11111111223445556667777776553 2 Q ss_pred C-cc-ccC----------CCchh-----------HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCc Q lcl|NC_021297. 70 E-GF-RIS----------EDSEG-----------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG 126 (480) Q Consensus 70 ~-~~-~~~----------~d~~~-----------~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~ 126 (480) . +| ++. .+.+. .+.+...+..++|.....++.++...+|.|.+++..+ ..++ T Consensus 79 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~-----~~~~ 153 (522) T protein:vir:94 79 QSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEP-----EQGT 153 (522) T ss_pred CCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeecc-----CCCc Confidence 1 12 221 01111 1233445667899999999999999999998877532 1223 Q ss_pred eeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe--------------cCCceEEEEEEEe-----CCcEEEEEecCCcc Q lcl|NC_021297. 127 IPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR--------------DDVAVPDRATLYL-----PDETVPLRRNGGLN 187 (480) Q Consensus 127 ~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 187 (480) ...++.++-.+.++.-|. ..++...++.+.-. +.......+++|+ .+++..|....+. 
T Consensus 154 ~~~~~~~pl~~y~v~~d~--~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~- 230 (522) T protein:vir:94 154 YSPMRMYRLVSYVVQRDA--FGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGI- 230 (522) T ss_pred eeeEEEEEcceEEEeeCC--CcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCc- Confidence 345777777775555553 33454444433211 1111122334333 1222222221111 Q ss_pred cccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccc Q lcl|NC_021297. 188 DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELT 265 (480) Q Consensus 188 ~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~ 265 (480) .........+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-......+....|.+.+. |..... T Consensus 231 ---~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~-- 304 (522) T protein:vir:94 231 ---EVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEE-YLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPR-- 304 (522) T ss_pred ---eecccCCCCccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccch-- Confidence 111111223578899999999888899999997754 667788889888888888888888875553 111110 Q ss_pred cccccchhhhhhhhhcccCCCCceEEEecc-ccHHH---HHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHHHHHH Q lcl|NC_021297. 266 NDGENTTLDIYYGRILTLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAIIATD 340 (480) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al~~~~ 340 (480) ......++.+..-..++....++.. .+++. .++.++.-|+..+.... ++. +..+. ++.-++... 
T Consensus 305 -----~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~-TAtEV~~r~ 373 (522) T protein:vir:94 305 -----RLNKAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNS-----AVQRNAERV-TAEEIRYVA 373 (522) T ss_pred -----heeccCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----hccCCCccc-cHHHHHHHH Confidence 0111222222221223334444332 23433 34444444444333321 211 11122 232222221 Q ss_pred HHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCH-HHHHHHHHHHHh----cccc Q lcl|NC_021297. 341 SRIVKM----AERKGRI-FGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTV-AAKADAVSKLYA----NGQG 408 (480) Q Consensus 341 ~~l~~k----~~~~~~~-~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~~~-~~~ad~~~kl~~----~~~~ 408 (480) ..+... ..++... +.+-+.+.+.++.+- |. ..+.+ .+++.+..++..-- .+.++.+...++ .++. T Consensus 374 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~--~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~ 450 (522) T protein:vir:94 374 GELEATLGGVYSVQSQELQLPIVRVLMNQLQSA-GMIPDLPKE--AVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPL 450 (522) T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCcc--cEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccch Confidence 111111 1111111 112223333332221 21 22222 36666654433211 112222222222 1111 Q ss_pred ----CCCH----HHHHHhCCCCh-------hHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCC Q lcl|NC_021297. 409 ----PIPK----EQARIDLGYTA-------TQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSP 471 (480) Q Consensus 409 ----~~s~----et~~~~lg~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p 471 (480) .+.. ..+...+|+.+ ++++.+.+.+.++ +..........++.+...+.+.. ++-++. 
T Consensus 451 ~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~-~~~~~~~~~~~~~~~a~~~~~~~-----~~~~~~ 522 (522) T protein:vir:94 451 SQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQ-QAVVQGASAAGANMGAAVGQGAG-----EDMAQA 522 (522) T ss_pred hhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhhhcccc-----hhhhcC Confidence 1222 22234456532 2333332221111 11111111111111110111111 111111 No 135 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=98.76 E-value=6.2e-08 Score=60.11 Aligned_cols=444 Identities=14% Similarity=0.107 Sum_probs=176.4 Q ss_pred CCCHH-----HHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHHHHHHHhhhcc--- Q lcl|NC_021297. 1 MTTYH-----EHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~-----~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+++ +-++++.+.+..++ +....++++|+- .+++.. .......+..++..+-+...++++++.|.. T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQY--TIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALF 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhc Confidence 98854 24444555554433 223344444422 122211 111111112244455566667766665532 Q ss_pred -C-cc-ccC-CCch--------------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCC Q lcl|NC_021297. 70 -E-GF-RIS-EDSE--------------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPA 125 (480) Q Consensus 70 -~-~~-~~~-~d~~--------------------~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~ 125 (480) . +| ++. .|.+ ..+.+...+..++|.....++.++..++|.+-+++..+ ... 
T Consensus 79 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~-----~~~ 153 (536) T protein:vir:21 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-----EGS 153 (536) T ss_pred CCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC-----CCC Confidence 1 22 221 1110 11334556677899999999999999999988776422 223 Q ss_pred ceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEec-------------C---CceEEEEEEEe-----CC--cEEEEEe Q lcl|NC_021297. 126 GIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD-------------D---VAVPDRATLYL-----PD--ETVPLRR 182 (480) Q Consensus 126 ~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~-------------~---~~~~~~~~~~~-----~~--~~~~~~~ 182 (480) +...++.++-.++++..|.. +++...+|.+.-.. . ......+++|+ ++ .+..|.. T Consensus 154 ~~~~f~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e 231 (536) T protein:vir:21 154 NYNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEE 231 (536) T ss_pred ceeeEEEEEcCeEEEeeCCC--CCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEec Confidence 44457777777777666642 34444443322110 0 00011222221 11 1111111 Q ss_pred cCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcc Q lcl|NC_021297. 183 NGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTD 262 (480) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~ 262 (480) ..+. ....+....+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........+...+. 
++ T Consensus 232 ~~g~----~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~---p~ 303 (536) T protein:vir:21 232 VEGM----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVN---PA 303 (536) T ss_pred cCCe----eeccccCccccccCCeeeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCcccC---cc Confidence 1110 111222234578899999999999999999997754 556667777776666665555555443331 00 Q ss_pred ccccccccchhhhhhhhhcccCCCCceEEEec-cccHH---HHHHHHHHHHHHHHhccCCChHHhc-cccccchHHHHHH Q lcl|NC_021297. 263 ELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELR---NFAEEMEVFRKEAASITGLPPQYLS-SSSENPASAEAII 337 (480) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g-~~~~n~~Sg~Al~ 337 (480) .. ..........++.+..-..++..+.++. ..+++ ..++.++.-|+..+.... +. -+... .++.-++ T Consensus 304 g~--~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----l~~~~~~r-~TAtEV~ 375 (536) T protein:vir:21 304 GI--TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGER-VTAEEIR 375 (536) T ss_pred cc--cchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-----cccCCCCC-ccHHHHH Confidence 00 0000111222333322112233344443 23333 334444444444332221 11 11112 1333333 Q ss_pred HHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCC-CHHHHHHHHHHHHh----c Q lcl|NC_021297. 338 ATDSRIVK----MAERKGRIF-GGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTP-TVAAKADAVSKLYA----N 405 (480) Q Consensus 338 ~~~~~l~~----k~~~~~~~~-~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~-~~~~~ad~~~kl~~----~ 405 (480) .....+.. -..++...| .+-+++++.++.+ .|. ..+.+. +++.+.-++.. ...+.++.+....+ . 
T Consensus 376 ~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~--v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~ 452 (536) T protein:vir:21 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKEA--VEPTISTGLEAIGRGQDLDKLERCVTAWAAL 452 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCChhh--ccceEEecHHHHHHHHHHHHHHHHHHHHHhh Confidence 32222222 112222222 2223333333322 111 122222 34444323221 11122222222222 1 Q ss_pred ccc----CCCHHHHH----HhCCCC-------hhHHHHHHHHHHHHHHH--HHHHHhcccccCcccccCCCCCCCCcccc Q lcl|NC_021297. 406 GQG----PIPKEQAR----IDLGYT-------ATQREQMRDWDKQETED--MIDTLYSTTKAQADATPKPTVTDTKTETQ 468 (480) Q Consensus 406 ~~~----~~s~et~~----~~lg~~-------~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) ++. .+....++ +.+|++ +++++++.+.+.++.+. ...++.... .++....+.......+++ T Consensus 453 ~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~--~~~~~~~~~~~~~~~~~~ 530 (536) T protein:vir:21 453 APMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AAQATASPEAMAAAADSV 530 (536) T ss_pred chhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhcChhhHHhhhhcc Confidence 111 12332222 345662 34444443322222111 111111111 111111111111111111 Q ss_pred cCCCCC Q lcl|NC_021297. 469 TSPSGF 474 (480) Q Consensus 469 ~~p~~~ 474 (480) |.--|. T Consensus 531 g~~~~~ 536 (536) T protein:vir:21 531 GLQPGI 536 (536) T ss_pred ccCCCC Confidence 110011 No 136 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.74 E-value=6.9e-08 Score=59.83 Aligned_cols=436 Identities=11% Similarity=0.061 Sum_probs=188.0 Q ss_pred CCCHHH-HHHHHHHHHHH----HHHHHHHHHHHhcccCcchhcCc--------ccchhhhhhccccchHHHHHHHHHhhh Q lcl|NC_021297. 
1 MTTYHE-HVERLQGLLAR----DLPNLLEAEAYRNGTRRLKTIGI--------GAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~~~-~i~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~--------~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |....+ +++.|.+.|.. |.+....++++|+ +.+++.+. ..+-..++.++.-+-+...++++++.| T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l 78 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVID--YLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAM 78 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHH--HhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHH Confidence 888754 44444444432 2333344555553 22332211 111112233455556677777777665 Q ss_pred cc----C--cc-ccC-CCch------hHHH-------HHHHH--HhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCC Q lcl|NC_021297. 68 DI----E--GF-RIS-EDSE------GLEE-------LWNWW--QANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 68 ~~----~--~~-~~~-~d~~------~~~~-------l~~i~--~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~ 124 (480) .+ . +| ++. .|+. .... +..++ ...+|.....++.++...+|.|-+++.. +. T Consensus 79 ~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~------~~ 152 (549) T protein:vir:10 79 DSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEH------DV 152 (549) T ss_pred HhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEee------cC Confidence 32 1 22 221 1111 1111 22222 2468899999999999999999887753 33 Q ss_pred CceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-------c-------------CCceEEEEEEEeC---C--c--- Q lcl|NC_021297. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-------D-------------DVAVPDRATLYLP---D--E--- 176 (480) Q Consensus 125 ~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~-------------~~~~~~~~~~~~~---~--~--- 176 (480) .+.++++.++-.++++.-|.. +++...++.+.-. . +.+....+++|+. . 
+ T Consensus 153 ~~~~~f~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~ 230 (549) T protein:vir:10 153 GKGIVYRNVPMQRLWFAENNS--GLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPR 230 (549) T ss_pred CCeeEEEEEEcCeEEEeeCCC--CCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCcc Confidence 455788888888888877753 3444444332100 0 0011123444321 0 0 Q ss_pred ----------EEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 177 ----------TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSAS 246 (480) Q Consensus 177 ----------~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~ 246 (480) .+++...+ . .. ..+.+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...... T Consensus 231 ~~~~~~~pf~sv~~e~~~-~----~i---l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~ 301 (549) T protein:vir:10 231 KLDGRNMQFASYWLDEGR-D----RI---VQNSGFRTFPFAIGRFYVGTDDVYGGSPAYD-AMPDVRMANDMAKTNIRGA 301 (549) T ss_pred ccccccCceEEEEEEecC-C----Ee---eccCCcccCCcceeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHH Confidence 11111110 0 00 1134567789999988888899999997754 5677788888888888888 Q ss_pred HHhcchhhhhcCCCccccccccccchhhhhhhhh-c--ccCCCCceEEEecc-ccHHH---HHHHHHHHHHHHHhccCCC Q lcl|NC_021297. 247 QILGTPLRVISGVTTDELTNDGENTTLDIYYGRI-L--TLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLP 319 (480) Q Consensus 247 ~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~-~--~~~g~~~~~~~~~~-~~~~~---~~~~l~~~~~~i~~~~~~p 319 (480) +...+|.+.+.- +.....++..++.+ . ...+++..+.++.. .+++. .++.++.-|...+...-+. 
T Consensus 302 ~~~~~p~~~v~~--------~g~~~~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~ 373 (549) T protein:vir:10 302 QKLVDPPLLANE--------DGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQ 373 (549) T ss_pred HHHhcCceeecc--------ccccccceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhh Confidence 888888766531 01111112222221 1 11233444555433 24333 3444444444433322111 Q ss_pred hHHhccccccchHHHHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHhCC--C----CcccceeeeEEecCCC Q lcl|NC_021297. 320 PQYLSSSSENPASAEAIIATDSRIVKM----AERKGR-IFGGAWERAMRIAMQIMGR--E----VTEEYTRLETVWRDPS 388 (480) Q Consensus 320 ~~~~g~~~~n~~Sg~Al~~~~~~l~~k----~~~~~~-~~~~~l~~~~~~~~~~~~~--~----~~~~~~~i~v~f~~~~ 388 (480) . . .+.... ++.-++.....+... ..+... .+..-+.+.+.++.+ .|. . .......+++.|..++ T Consensus 374 ~--~-~~~~~~-TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r-~g~lP~~p~~l~~~~~~~~i~yis~L 448 (549) T protein:vir:10 374 I--L-VDSGDM-TATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAE-AGQLPDMPQELIDAGADVDVEYDSPL 448 (549) T ss_pred h--h-cCCCCc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCChhhhcCCceeEEEeecHH Confidence 1 1 122222 333333322222221 223322 223334555554443 121 1 1112345777775544 Q ss_pred CCCHH-HHH-------HHHHHHHhcccc---CCCHHHH----HHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_021297. 389 TPTVA-AKA-------DAVSKLYANGQG---PIPKEQA----RIDLGYT------ATQREQMRDWDKQETEDMIDTLYST 447 (480) Q Consensus 389 ~~~~~-~~a-------d~~~kl~~~~~~---~~s~et~----~~~lg~~------~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (480) .+... ..+ +.+..+.+.++. .+....+ ...+|+. +++++++.+.++++.+.. ++... T Consensus 449 a~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~--~~~~~ 526 (549) T protein:vir:10 449 NKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQ--QMLAA 526 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHH--HHHHH Confidence 33111 111 222222222221 1222222 2335643 444444433222221111 11111 Q ss_pred cccCcccccCCCCCCCCcccccCCCCCC Q lcl|NC_021297. 
448 TKAQADATPKPTVTDTKTETQTSPSGFN 475 (480) Q Consensus 448 ~~~~~~~~~~~~~~d~~~~~~~~p~~~~ 475 (480) ....++ .+.+....+++.++..- T Consensus 527 a~~a~~-----~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 527 APVAAG-----AIKDLSDAQTAAQTARV 549 (549) T ss_pred HHHHHH-----HHHhhhhhcCCCcccCC Confidence 110000 00011111111111111 No 137 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.73 E-value=7.8e-08 Score=59.56 Aligned_cols=438 Identities=13% Similarity=0.112 Sum_probs=179.2 Q ss_pred CCCHH------HHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHHHHHHHhhhcc-- Q lcl|NC_021297. 1 MTTYH------EHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATYLRTLSDRLDI-- 69 (480) Q Consensus 1 ~~~~~------~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~-- 69 (480) |+..+ +-+++....+..++ +....++++|+- .+++.. ..........++..+-+...++++++.|.. T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 78 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQY--TIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLAL 78 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhh Confidence 65433 23444444444433 223344444421 122211 111111111223334455666666665532 Q ss_pred ---Ccc-ccC-CC----------c----------hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCC Q lcl|NC_021297. 70 ---EGF-RIS-ED----------S----------EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 70 ---~~~-~~~-~d----------~----------~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~ 124 (480) .+| +.. .| . +..+.+.+.+..++|.....++.++..++|.+.+++.. +. 
T Consensus 79 tP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~------~~ 152 (535) T protein:vir:33 79 FPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPE------PE 152 (535) T ss_pred cCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeec------CC Confidence 122 211 11 0 11123445567789999999999999999999877743 23 Q ss_pred CceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-c----C-----------CceEEEEEEEe----C-C--cEEEEE Q lcl|NC_021297. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D----D-----------VAVPDRATLYL----P-D--ETVPLR 181 (480) Q Consensus 125 ~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-~----~-----------~~~~~~~~~~~----~-~--~~~~~~ 181 (480) .+..+++.++-.++++..|. .+++...++.+.-. . . ......+++|+ + + .+..|. T Consensus 153 ~~~~~f~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~ 230 (535) T protein:vir:33 153 GSYNPMKLYRLSSYVVQRDA--YGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYE 230 (535) T ss_pred CCceeeEEEEcCeeEEeeCC--CCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEE Confidence 45577888877776666654 23444444433211 0 0 00000111221 1 1 111111 Q ss_pred ecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CC Q lcl|NC_021297. 182 RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV 259 (480) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~ 259 (480) ...+. .........+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-......+....|.+.+. |. 
T Consensus 231 ~~~~~----~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~ 305 (535) T protein:vir:33 231 EVEDV----EIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEE-YLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGI 305 (535) T ss_pred EEeCc----cccccccccccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeccccc Confidence 11100 001111123577899999999888899999998754 567788888888888888888888875542 11 Q ss_pred CccccccccccchhhhhhhhhcccCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHHH Q lcl|NC_021297. 260 TTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAII 337 (480) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al~ 337 (480) ... ......+.+.+.....++....++.. .+++.....++.+...|....-+ + .+.. +.... ++.-++ T Consensus 306 ~~~-------~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~-~~~~~~~~r~-TAtEV~ 375 (535) T protein:vir:33 306 TQP-------RRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFML-N-SAVQRTGERV-TAEEIR 375 (535) T ss_pred cch-------hhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhh-h-hcccCCCccc-cHHHHH Confidence 100 00011122222222223344444442 34444444444433333322211 1 1211 11111 232222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHhCCCCc-ccceeeeEEecCCCCCCH-HHHHHHHHHHH Q lcl|NC_021297. 338 ATDSRIVKMAERKGRIFGGAWE------------RAMRIAMQIMGREVT-EEYTRLETVWRDPSTPTV-AAKADAVSKLY 403 (480) Q Consensus 338 ~~~~~l~~k~~~~~~~~~~~l~------------~~~~~~~~~~~~~~~-~~~~~i~v~f~~~~~~~~-~~~ad~~~kl~ 403 (480) . ++..+...++..+. 
+.+.++.+ .+..+ .....+++.|..++..-- .+.++.+...+ T Consensus 376 ~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r--~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~ 446 (535) T protein:vir:33 376 Y-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQA--TSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCI 446 (535) T ss_pred H-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHH Confidence 2 22333333333333 33333221 11111 112346777755543211 11222222222 Q ss_pred ----hcccc----CCCHHHH----HHhCCCCh-------hHHHHHHHHHHHHHHHHHHHHhcccccCc-ccccCCCCCCC Q lcl|NC_021297. 404 ----ANGQG----PIPKEQA----RIDLGYTA-------TQREQMRDWDKQETEDMIDTLYSTTKAQA-DATPKPTVTDT 463 (480) Q Consensus 404 ----~~~~~----~~s~et~----~~~lg~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~ 463 (480) +.++. .+....+ ...+|+.. ++++++.++..+ ++....++.....+.+ .+...++.. T Consensus 447 ~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~-~~~~~~~~~~~g~~~~~~~~~~~~~~-- 523 (535) T protein:vir:33 447 SAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAA-QTGVENAAAAGGAGVGALATSSPEAM-- 523 (535) T ss_pred HHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHH-HHHHHHHHHhhhhhhcchhhcCChhH-- Confidence 22111 1222222 23356542 222222211111 1111111111111111 111111111 Q ss_pred CcccccCCCCCCCC Q lcl|NC_021297. 464 KTETQTSPSGFNRT 477 (480) Q Consensus 464 ~~~~~~~p~~~~~~ 477 (480) ....+..|.+-+ T Consensus 524 --~~~~~~~g~~~~ 535 (535) T protein:vir:33 524 --QGAAAKAGLNAT 535 (535) T ss_pred --HHHHHhccCCCC Confidence 111222244433 No 138 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.68 E-value=7.4e-08 Score=59.66 Aligned_cols=385 Identities=11% Similarity=0.050 Sum_probs=149.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) |. +..++-.........-..+..+..+.. .+..+... .-+.+.-...+|+..+.-+-.-++.+.+ . T Consensus 1 M~----~f~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~v~~~---~al~~~~V~~~v~~ia~~ia~~p~~~~~-~-- 66 (397) T protein:vir:38 1 MP----LLKLNKSHSQGFSLNDPDWVNFLTGGE----AQKYVSAD---TALKNSDIFSLIMQLSGDLAMVRYTSES-D-- 66 (397) T ss_pred Cc----chhhhhcccCcccCCchhhhhhhcCCc----CCceechH---HhhccHHHHHHHHHHHHHHhhCcccccc-c-- Confidence 11 111100000000000001111111110 01111111 0011111222333333333233343322 1 Q ss_pred HHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 81 LEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 81 ~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) .+..+..+ | ....+...+..+.+.+|.||+.+-.+ ..|.+ .+..++|..+.++.+.... .+. T Consensus 67 --~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~l~~l~~~~v~i~~~~~~~-~~~---- 133 (397) T protein:vir:38 67 --RSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKN------TNGVDLSWEYLRPSQVQPMLLQDGS-GLI---- 133 (397) T ss_pred --HHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eEE---- Confidence 22233322 2 34566778888999999999887543 34444 6788999988887765422 111 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) |.......+ ......+.+++ |+||++.......+|.|.+.. +...++.. 
T Consensus 134 y~~~~~~~~-~~~~~~~~~~e-----------------------------iih~~~~~~~~~~~G~s~i~~-~~~~i~~~ 182 (397) T protein:vir:38 134 YNINFDEPA-IGYMENVPAAD-----------------------------VIHIRLLSKNGGKTGISPLSA-LINEQQIK 182 (397) T ss_pred EEEEecccc-ccceeEecCcc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHH Confidence 111111100 00001122222 445554444444578877643 33333322 Q ss_pred HHHHHHHHHHHHHhcchhhhhcCCCc-cccccccccchhhh-----hhhhhcccCCCCceEEEeccc-cHHHHHHHHHHH Q lcl|NC_021297. 236 SRTLMNLQSASQILGTPLRVISGVTT-DELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAA-ELRNFAEEMEVF 308 (480) Q Consensus 236 ~~~~s~~~~~~~~~~~~~~~i~g~~~-~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~ 308 (480) ..+..-..+.+...+.|..++.-... .....+.....++. ..+.++.++ ++.++.++... .-..|++..+.. T Consensus 183 ~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~ 261 (397) T protein:vir:38 183 DASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVID-ALEDYKPLEVKGNIASLLNQVDWT 261 (397) T ss_pred HHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecC-CCceEEecCCChhHHHHHHHHHHH Confidence 22222222222323444444431111 11111110111111 112233343 45677776643 233478888999 Q ss_pred HHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCC Q lcl|NC_021297. 309 RKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPS 388 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~ 388 (480) ..+|+...++|+..+|+...+.+|....+..+ ...|.-++..+....+.....+ ..+.+.| .. 
T Consensus 262 ~~~Ia~afgVp~~~lg~~~~~~~~~e~~~~~~--------------~~~l~P~~~~ie~~ln~~l~~~-~~~~~~~--~~ 324 (397) T protein:vir:38 262 RDQIAKVYGVPDSYLNGQGDQQSSITQISGQY--------------AKSLNRYVQAIVGELNDKLHAN-ISANIRF--AI 324 (397) T ss_pred HHHHHHHhCCCHHHhCCCCCcccHHHHHHHHH--------------HHHHHHHHHHHHHHHHHhccCh-hcccccc--cc Confidence 99999999999999987654433332222111 1122222221111101000000 1122222 23 Q ss_pred CCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccc Q lcl|NC_021297. 389 TPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQ 468 (480) Q Consensus 389 ~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) ..+..+.++.+.+++++| +++...+++.+|+.+-+-..+-. .+ .. ...... .....+++.+..+. T Consensus 325 ~~d~~~~~~~~~~~~~~G--~~t~nE~R~~lg~~p~~~~d~~~---~~------~~--~~~~~~--~~~~~~g~~~~~~~ 389 (397) T protein:vir:38 325 DAMGDQYASTISSSVKGG--TIAGNQARFILQNSGYLAKDLPD---PE------KE--PQQAIQ--LIQQEGGENDGNNS 389 (397) T ss_pred cCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcccc---cc------cc--cccccc--ccccccCCCCCCCC Confidence 456778888888888765 66777777777654321000000 00 00 000000 00001111111111 Q ss_pred cCCCCCCC Q lcl|NC_021297. 469 TSPSGFNR 476 (480) Q Consensus 469 ~~p~~~~~ 476 (480) .++++..+ T Consensus 390 ~e~~~~~~ 397 (397) T protein:vir:38 390 DERGSDPE 397 (397) T ss_pred CCCCCCCC Confidence 11111111 No 139 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=98.68 E-value=1.2e-07 Score=58.55 Aligned_cols=403 Identities=14% Similarity=0.155 Sum_probs=157.3 Q ss_pred CCCHH--HHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchh---hhhhccccchHHHHHHHHHhhhccCcc--- Q lcl|NC_021297. 
1 MTTYH--EHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPE---LAYLDVQPGWVATYLRTLSDRLDIEGF--- 72 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~---~~~~~~~~n~~~~iVd~~~~~l~~~~~--- 72 (480) .+++. +.. -.+...|.+.... +...... +......+.....+|+..++-+-.-++ T Consensus 9 ~~~p~~~e~~--------------~~~~~~~~~~~~~---~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~ 71 (518) T protein:vir:10 9 LSAPAMAELS--------------PQMQDSYYYAPAV---GMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred ecCchhhhhh--------------hhhhccccccccc---ceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEE Confidence 23321 110 0111111111000 0000000 001111122334445544443322222 Q ss_pred ccCCCc---hhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeC Q lcl|NC_021297. 73 RISEDS---EGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDP 144 (480) Q Consensus 73 ~~~~d~---~~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~ 144 (480) +...+. .....+..++.+ | ....+...+..+.+.+|.||+++-.+ .+|++ .+..++|..+.+..+. T Consensus 72 ~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~------~~G~~~~L~~l~p~~v~v~~~~ 145 (518) T protein:vir:10 72 FTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN------KSGTPEKLMPMHPSRVAIKRNS 145 (518) T ss_pred EEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEECCCceEEEEcC Confidence 211111 112334444433 2 22356677888899999999998653 34555 5788999999888875 Q ss_pred CCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccc Q lcl|NC_021297. 145 RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i 224 (480) ... .+. 
++|......+ .....|.++++++ |++....+...|.|.| T Consensus 146 ~~~-~~~---y~~~~~~~~~--~~~~~~~~~eViH-----------------------------ir~~s~dg~~~G~spi 190 (518) T protein:vir:10 146 RTG-RYE---YYFQAGAGVG--TQLVSFADDEVVP-----------------------------IRFFNPDGLERGLSLM 190 (518) T ss_pred CCC-EEE---EEEEecCCcc--ceEEEecCCcEEE-----------------------------ecCCCCCcccccccHH Confidence 432 111 1111111111 1112233334333 3322111223566655 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHh---cchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEec Q lcl|NC_021297. 225 SPELRKVTDAASRTLMNLQSASQIL---GTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFK 294 (480) Q Consensus 225 ~~~v~~liD~~~~~~s~~~~~~~~~---~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~ 294 (480) . .+...+...++-......+| +.|-.++.-- ...+...+.-...|+. ..++++.++ ++.++.++. T Consensus 191 ~----~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~-~G~~~~~l~ 265 (518) T protein:vir:10 191 E----SLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQ 265 (518) T ss_pred H----HHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcC-CCceEEEcc Confidence 3 22233332222222222222 3344444321 1111111111111221 112344443 456776665 Q ss_pred cccHH-HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021297. 295 AAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 295 ~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 373 (480) ....+ .|++..+..+.+|+...++|+..+|....+.-|. +......+...+ -.-+-..|+..+.. .+..... 
T Consensus 266 ~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn--~eq~~~~f~~~t---L~P~l~~ie~~ln~--~L~~~~~ 338 (518) T protein:vir:10 266 LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSN--ISAQMRAFYRDT---MAIPIARIQSAMDK--YVGQYWV 338 (518) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchh--HHHHHHHHHHHH---HHHHHHHHHHHHHH--hhccccc Confidence 43222 4788888999999999999999998543221121 111111111111 01111111111111 0111101 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH---HHHHHHHHHHHHHHHHHHH-hcccc Q lcl|NC_021297. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ---REQMRDWDKQETEDMIDTL-YSTTK 449 (480) Q Consensus 374 ~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~ 449 (480) ....+++........|..++++++.+++.+| +++...+++.+|+.+-+ -+++- ...... .+... .+... T Consensus 339 --~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G--~lT~NE~R~~~Gl~pie~~~gD~~~--~~~n~~-pl~~~~~~~~~ 411 (518) T protein:vir:10 339 --RKNRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELY--ANSALQ-PLGATPDGAVE 411 (518) T ss_pred --CCceEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeee--ecccce-ecccccccccC Confidence 1123555555666788889999999998875 67777788888875421 11110 000000 00000 00000 Q ss_pred cCcccccCCCCCC-----CCccccc----CCCCCCCCCCC Q lcl|NC_021297. 450 AQADATPKPTVTD-----TKTETQT----SPSGFNRTKTR 480 (480) Q Consensus 450 ~~~~~~~~~~~~d-----~~~~~~~----~p~~~~~~~~~ 480 (480) ++..+.+...+.. 
++..+++ .++.++....+ T Consensus 412 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (518) T protein:vir:10 412 GEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDS 451 (518) T ss_pred CCCCCCCCCCCccccccccccccccCCCCCcccccccccc Confidence 1100111000000 0000111 11111111111 No 140 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.66 E-value=1.3e-07 Score=58.28 Aligned_cols=401 Identities=11% Similarity=0.029 Sum_probs=161.2 Q ss_pred HHHHHHHHHHHHHHHHH----------HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 5 HEHVERLQGLLARDLPN----------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r----------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-+++++...+.-+.+. ...+..+..+.. .+..+... .-+.+.-...+|+..++-+-.-++.+ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~----~~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP----STISVKGK---NALKVATVFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCc----Cccccchh---hhhccHHHHHHHHHHHHhhccCceEE Confidence 34444443333211000 000111111000 00001000 00011112223333333222223322 Q ss_pred ---CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEE Q lcl|NC_021297. 75 ---SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAEL 142 (480) Q Consensus 75 ---~~d---~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~ 142 (480) .++ +.....+..++.. | ....+...+..+.+.+|.+|+++..+ ..|++ .+..++|..+.++. 
T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~ 147 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYI 147 (432) T ss_pred EEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEE Confidence 111 1122345555542 2 34567788889999999999998653 34554 67789999888877 Q ss_pred eCCCCCeeEEE-EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCc Q lcl|NC_021297. 143 DPRNTRRVTRA-VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGR 221 (480) Q Consensus 143 d~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~ 221 (480) |.... +... ..+|... .++.. ..+.+++ |+||++.....+.+|. T Consensus 148 d~~~~--~~~~~~~~y~~~-~~g~~---~~~~~~e-----------------------------iih~r~~~~~~~~~G~ 192 (432) T protein:vir:10 148 DDVGL--LNSKTKMWYVVN-TGGQQ---RVLKPEE-----------------------------ILHFKNGITLDGLVGV 192 (432) T ss_pred cCccc--ccccceEEEEEe-cCCeE---EEEcccc-----------------------------EEEecCCCCCCCcccc Confidence 64311 1111 1111111 11110 0111222 5566554344556787 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEec Q lcl|NC_021297. 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFK 294 (480) Q Consensus 222 s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~ 294 (480) |.+.. +...++....+..-....+...+.|..++.-. .......+.-...|+. ..++++.++ ++.++.++. 
T Consensus 193 s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~ 270 (432) T protein:vir:10 193 PTMEY-LKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPIS 270 (432) T ss_pred cHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecC-CCceEEEcc Confidence 76532 33333322222211222222223354444321 1111111111111221 112344444 346777665 Q ss_pred ccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021297. 295 AAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 295 ~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 373 (480) ... -..+++..+....+|+...++|+..+|....+.-| .+......+...+ -.-+-..|++.+.. .+..... T Consensus 271 ~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s--~~e~~~~~~~~~~---l~P~~~~ie~~ln~--kLl~~~~ 343 (432) T protein:vir:10 271 LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN--NIEQQQQQFYTDT---LQATLTMYEQEMTY--KLFLDSE 343 (432) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHHH---HHHHHHHHHHHHHH--hhcChhh Confidence 432 22467888888999999999999999753322111 1121111111111 11111112221111 1111111 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcc Q lcl|NC_021297. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 374 ~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) ......+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+- ..+..--..-..++... 
+ T Consensus 344 ~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~g--gD~~~~~~n~~~~~~~~-------~ 412 (432) T protein:vir:10 344 LDKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEAG--GDRLLVNGNMLPIDMAG-------Q 412 (432) T ss_pred cCCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeEeecccccchhhcc-------c Confidence 111223555555666788899999999998765 677777788888765321 00000000000001000 0 Q ss_pred cccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 454 ATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 454 ~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) ... .+++...+..+. .+-|+ T Consensus 413 ~~~--k~~~~~~~~~~~-~~~~~ 432 (432) T protein:vir:10 413 AYL--KGGDTNGEVSKE-GNEGN 432 (432) T ss_pred ccc--CCCCCCCCCCCC-CCCCC Confidence 000 001100011111 11111 No 141 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.66 E-value=1.3e-07 Score=58.28 Aligned_cols=401 Identities=11% Similarity=0.029 Sum_probs=161.2 Q ss_pred HHHHHHHHHHHHHHHHH----------HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 5 HEHVERLQGLLARDLPN----------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r----------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-+++++...+.-+.+. ...+..+..+.. .+..+... .-+.+.-...+|+..++-+-.-++.+ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~----~~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP----STISVKGK---NALKVATVFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCc----Cccccchh---hhhccHHHHHHHHHHHHhhccCceEE Confidence 34444443333211000 000111111000 00001000 00011112223333333222223322 Q ss_pred ---CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEE Q lcl|NC_021297. 
75 ---SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAEL 142 (480) Q Consensus 75 ---~~d---~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~ 142 (480) .++ +.....+..++.. | ....+...+..+.+.+|.+|+++..+ ..|++ .+..++|..+.++. T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~ 147 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYI 147 (432) T ss_pred EEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEE Confidence 111 1122345555542 2 34567788889999999999998653 34554 67789999888877 Q ss_pred eCCCCCeeEEE-EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCc Q lcl|NC_021297. 143 DPRNTRRVTRA-VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGR 221 (480) Q Consensus 143 d~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~ 221 (480) |.... +... ..+|... .++.. ..+.+++ |+||++.....+.+|. T Consensus 148 d~~~~--~~~~~~~~y~~~-~~g~~---~~~~~~e-----------------------------iih~r~~~~~~~~~G~ 192 (432) T protein:vir:10 148 DDVGL--LNSKTKMWYVVN-TGGQQ---RVLKPEE-----------------------------ILHFKNGITLDGLVGV 192 (432) T ss_pred cCccc--ccccceEEEEEe-cCCeE---EEEcccc-----------------------------EEEecCCCCCCCcccc Confidence 64311 1111 1111111 11110 0111222 5566554344556787 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEec Q lcl|NC_021297. 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFK 294 (480) Q Consensus 222 s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~ 294 (480) |.+.. +...++....+..-....+...+.|..++.-. .......+.-...|+. ..++++.++ ++.++.++. 
T Consensus 193 s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~ 270 (432) T protein:vir:10 193 PTMEY-LKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPIS 270 (432) T ss_pred cHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecC-CCceEEEcc Confidence 76532 33333322222211222222223354444321 1111111111111221 112344444 346777665 Q ss_pred ccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021297. 295 AAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 295 ~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 373 (480) ... -..+++..+....+|+...++|+..+|....+.-| .+......+...+ -.-+-..|++.+.. .+..... T Consensus 271 ~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s--~~e~~~~~~~~~~---l~P~~~~ie~~ln~--kLl~~~~ 343 (432) T protein:vir:10 271 LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN--NIEQQQQQFYTDT---LQATLTMYEQEMTY--KLFLDSE 343 (432) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHHH---HHHHHHHHHHHHHH--hhcChhh Confidence 432 22467888888999999999999999753322111 1121111111111 11111112221111 1111111 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcc Q lcl|NC_021297. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 374 ~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) ......+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+- ..+..--..-..++... 
+ T Consensus 344 ~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~g--gD~~~~~~n~~~~~~~~-------~ 412 (432) T protein:vir:10 344 LDKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEAG--GDRLLVNGNMLPIDMAG-------Q 412 (432) T ss_pred cCCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeEeecccccchhhcc-------c Confidence 111223555555666788899999999998765 677777788888765321 00000000000001000 0 Q ss_pred cccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 454 ATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 454 ~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) ... .+++...+..+. .+-|+ T Consensus 413 ~~~--k~~~~~~~~~~~-~~~~~ 432 (432) T protein:vir:10 413 AYL--KGGDTNGEVSKE-GNEGN 432 (432) T ss_pred ccc--CCCCCCCCCCCC-CCCCC Confidence 000 001100011111 11111 No 142 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.66 E-value=1.3e-07 Score=58.28 Aligned_cols=401 Identities=11% Similarity=0.029 Sum_probs=161.2 Q ss_pred HHHHHHHHHHHHHHHHH----------HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc Q lcl|NC_021297. 5 HEHVERLQGLLARDLPN----------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r----------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-+++++...+.-+.+. ...+..+..+.. .+..+... .-+.+.-...+|+..++-+-.-++.+ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~----~~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP----STISVKGK---NALKVATVFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCc----Cccccchh---hhhccHHHHHHHHHHHHhhccCceEE Confidence 34444443333211000 000111111000 00001000 00011112223333333222223322 Q ss_pred ---CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEE Q lcl|NC_021297. 
75 ---SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAEL 142 (480) Q Consensus 75 ---~~d---~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~ 142 (480) .++ +.....+..++.. | ....+...+..+.+.+|.+|+++..+ ..|++ .+..++|..+.++. T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~ 147 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYI 147 (432) T ss_pred EEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEE Confidence 111 1122345555542 2 34567788889999999999998653 34554 67789999888877 Q ss_pred eCCCCCeeEEE-EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCc Q lcl|NC_021297. 143 DPRNTRRVTRA-VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGR 221 (480) Q Consensus 143 d~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~ 221 (480) |.... +... ..+|... .++.. ..+.+++ |+||++.....+.+|. T Consensus 148 d~~~~--~~~~~~~~y~~~-~~g~~---~~~~~~e-----------------------------iih~r~~~~~~~~~G~ 192 (432) T protein:vir:10 148 DDVGL--LNSKTKMWYVVN-TGGQQ---RVLKPEE-----------------------------ILHFKNGITLDGLVGV 192 (432) T ss_pred cCccc--ccccceEEEEEe-cCCeE---EEEcccc-----------------------------EEEecCCCCCCCcccc Confidence 64311 1111 1111111 11110 0111222 5566554344556787 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEec Q lcl|NC_021297. 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFK 294 (480) Q Consensus 222 s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~ 294 (480) |.+.. +...++....+..-....+...+.|..++.-. .......+.-...|+. ..++++.++ ++.++.++. 
T Consensus 193 s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~ 270 (432) T protein:vir:10 193 PTMEY-LKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPIS 270 (432) T ss_pred cHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecC-CCceEEEcc Confidence 76532 33333322222211222222223354444321 1111111111111221 112344444 346777665 Q ss_pred ccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021297. 295 AAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 295 ~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 373 (480) ... -..+++..+....+|+...++|+..+|....+.-| .+......+...+ -.-+-..|++.+.. .+..... T Consensus 271 ~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s--~~e~~~~~~~~~~---l~P~~~~ie~~ln~--kLl~~~~ 343 (432) T protein:vir:10 271 LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN--NIEQQQQQFYTDT---LQATLTMYEQEMTY--KLFLDSE 343 (432) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHHH---HHHHHHHHHHHHHH--hhcChhh Confidence 432 22467888888999999999999999753322111 1121111111111 11111112221111 1111111 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcc Q lcl|NC_021297. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 374 ~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) ......+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+- ..+..--..-..++... 
+ T Consensus 344 ~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~g--gD~~~~~~n~~~~~~~~-------~ 412 (432) T protein:vir:10 344 LDKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEAG--GDRLLVNGNMLPIDMAG-------Q 412 (432) T ss_pred cCCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeEeecccccchhhcc-------c Confidence 111223555555666788899999999998765 677777788888765321 00000000000001000 0 Q ss_pred cccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 454 ATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 454 ~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) ... .+++...+..+. .+-|+ T Consensus 413 ~~~--k~~~~~~~~~~~-~~~~~ 432 (432) T protein:vir:10 413 AYL--KGGDTNGEVSKE-GNEGN 432 (432) T ss_pred ccc--CCCCCCCCCCCC-CCCCC Confidence 000 001100011111 11111 No 143 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.66 E-value=1.4e-07 Score=58.20 Aligned_cols=423 Identities=11% Similarity=0.097 Sum_probs=155.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---------------cchh------cCcccchhhhhhccccchHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---------------RLKT------IGIGAPPELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~---------------~i~~------~~~~~~~~~~~~~~~~n~~~~i 59 (480) =+++..-+++.-.. -..+..+.+--++++ .+.. -+...+.-++.+ ........+ T Consensus 26 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~-~~n~i~~~~ 100 (563) T protein:vir:99 26 DEGLQANIKKIEQD----NKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKF-GNNPILNAI 100 (563) T ss_pred cCChhhhHhhhhcc----chhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHh-hcchHHHHH Confidence 23333333332111 000111111111111 1110 001111111111 112344555 Q ss_pred HHHHHhhhcc-----------Ccccc-----CC---Cc--hhHHHHHHHHH-----hc----CHHHHHHHHHHHHhhcCe Q lcl|NC_021297. 
60 LRTLSDRLDI-----------EGFRI-----SE---DS--EGLEELWNWWQ-----AN----DLDEESVLGHDDSLTFGR 109 (480) Q Consensus 60 Vd~~~~~l~~-----------~~~~~-----~~---d~--~~~~~l~~i~~-----~n----~~~~~~~~~~~~a~~~G~ 109 (480) |++.++.+-. -++.+ .. ++ .....+..++. .+ .+..+...+..+.+.+|. T Consensus 101 I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn 180 (563) T protein:vir:99 101 ILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ 180 (563) T ss_pred HHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCC Confidence 5544433210 01111 00 11 11122333321 11 345677788899999999 Q ss_pred EEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc Q lcl|NC_021297. 110 AYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 110 a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) +|+++.... +..|++ .+..++|..+.+..+.... ......+++...+ ++. ...+.++.++++... T Consensus 181 ~~~~~~~~r----d~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~-g~~---~~~~~~~evI~~~~~----- 246 (563) T protein:vir:99 181 VNFEKVFNK----NNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD-KRV---VASFTSRELAMGIRN----- 246 (563) T ss_pred eEEEEEEEe----cCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC-Cce---eEEecCcceEEEecc----- Confidence 988754221 234554 5788999999888765421 1111111111111 110 011222222211110 Q ss_pred ccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHh---cchhhhhc--CC-Ccc Q lcl|NC_021297. 189 QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLRVIS--GV-TTD 262 (480) Q Consensus 189 ~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~---~~~~~~i~--g~-~~~ 262 (480) +......+.+|.|.|. .+.+.+...+.--.....+| +.|..+|. |. .+. 
T Consensus 247 ---------------------~~~d~~~~~~G~Spi~----~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls 301 (563) T protein:vir:99 247 ---------------------PRTELSSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQS 301 (563) T ss_pred ---------------------CCCCcccCcccchHHH----HHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCC Confidence 0011123467887664 33333333222222222333 33443332 21 111 Q ss_pred ccccccccchhhh------hhhhhcccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccch---- Q lcl|NC_021297. 263 ELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA---- 331 (480) Q Consensus 263 ~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~---- 331 (480) ....+.-...|+. ..+++..+-+++.++..+.... -..|++..+....+|+...++|+..+|....+.. T Consensus 302 ~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~ 381 (563) T protein:vir:99 302 QHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSK 381 (563) T ss_pred HHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccc Confidence 1111111112221 1122222224557887776432 2248888899999999999999999984321110 Q ss_pred --HH---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc Q lcl|NC_021297. 332 --SA---EAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN 405 (480) Q Consensus 332 --Sg---~Al~~~~~~l~~k~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~ 405 (480) |+ ..+......+...+ .-....+...|.+ .+.. .....+.+.|.+....+..+.. 
++.++..+ T Consensus 382 ~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~------~L~~----~~~~~~~~~f~r~D~~~~~e~~-~~~~~~~~ 450 (563) T protein:vir:99 382 GGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR------HIIS----EYGDKYTFQFVGGDTKSATDKL-NILKLETQ 450 (563) T ss_pred cccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh------hhch----hcccccEEEeccCCHHHHHHHH-HHHHHhcC Confidence 10 11111111111111 1111111111111 0110 0112355667655444443332 23444443 Q ss_pred cccCCCHHHHHHhCCCChhHH-HHH------------HHH---HHHHHHHHHHHHhcccccCcccccC-----CCCCCCC Q lcl|NC_021297. 406 GQGPIPKEQARIDLGYTATQR-EQM------------RDW---DKQETEDMIDTLYSTTKAQADATPK-----PTVTDTK 464 (480) Q Consensus 406 ~~~~~s~et~~~~lg~~~~~~-~~~------------~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~d~~ 464 (480) ++++...+++.+|+.+-+- +.+ ... ..+..++..+.+........+..+. +...+.+ T Consensus 451 --G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (563) T protein:vir:99 451 --IFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKE 528 (563) T ss_pred --CccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccc Confidence 4677777788877653210 000 000 0001111122222211111111111 1111222 Q ss_pred cccccCCCCCCCCCCC Q lcl|NC_021297. 465 TETQTSPSGFNRTKTR 480 (480) Q Consensus 465 ~~~~~~p~~~~~~~~~ 480 (480) ++.+++|.+.+...+. T Consensus 529 ~~~~~~~~~~~~~~~~ 544 (563) T protein:vir:99 529 IGTDAQIKGDDNVYRT 544 (563) T ss_pred cccccccccccccccc Confidence 2333444333333332 No 144 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.66 E-value=1.4e-07 Score=58.20 Aligned_cols=423 Identities=11% Similarity=0.097 Sum_probs=155.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---------------cchh------cCcccchhhhhhccccchHHHH Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---------------RLKT------IGIGAPPELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~---------------~i~~------~~~~~~~~~~~~~~~~n~~~~i 59 (480) =+++..-+++.-.. -..+..+.+--++++ .+.. -+...+.-++.+ ........+ T Consensus 26 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~-~~n~i~~~~ 100 (563) T protein:vir:95 26 DEGLQANIKKIEQD----NKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKF-GNNPILNAI 100 (563) T ss_pred cCChhhhHhhhhcc----chhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHh-hcchHHHHH Confidence 23333333332111 000111111111111 1110 001111111111 112344555 Q ss_pred HHHHHhhhcc-----------Ccccc-----CC---Cc--hhHHHHHHHHH-----hc----CHHHHHHHHHHHHhhcCe Q lcl|NC_021297. 60 LRTLSDRLDI-----------EGFRI-----SE---DS--EGLEELWNWWQ-----AN----DLDEESVLGHDDSLTFGR 109 (480) Q Consensus 60 Vd~~~~~l~~-----------~~~~~-----~~---d~--~~~~~l~~i~~-----~n----~~~~~~~~~~~~a~~~G~ 109 (480) |++.++.+-. -++.+ .. ++ .....+..++. .+ .+..+...+..+.+.+|. T Consensus 101 I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn 180 (563) T protein:vir:95 101 ILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ 180 (563) T ss_pred HHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCC Confidence 5544433210 01111 00 11 11122333321 11 345677788899999999 Q ss_pred EEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc Q lcl|NC_021297. 110 AYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 110 a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) +|+++.... +..|++ .+..++|..+.+..+.... ......+++...+ ++. ...+.++.++++... 
T Consensus 181 ~~~~~~~~r----d~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~-g~~---~~~~~~~evI~~~~~----- 246 (563) T protein:vir:95 181 VNFEKVFNK----NNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD-KRV---VASFTSRELAMGIRN----- 246 (563) T ss_pred eEEEEEEEe----cCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC-Cce---eEEecCcceEEEecc----- Confidence 988754221 234554 5788999999888765421 1111111111111 110 011222222211110 Q ss_pred ccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHh---cchhhhhc--CC-Ccc Q lcl|NC_021297. 189 QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLRVIS--GV-TTD 262 (480) Q Consensus 189 ~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~---~~~~~~i~--g~-~~~ 262 (480) +......+.+|.|.|. .+.+.+...+.--.....+| +.|..+|. |. .+. T Consensus 247 ---------------------~~~d~~~~~~G~Spi~----~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls 301 (563) T protein:vir:95 247 ---------------------PRTELSSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQS 301 (563) T ss_pred ---------------------CCCCcccCcccchHHH----HHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCC Confidence 0011123467887664 33333333222222222333 33443332 21 111 Q ss_pred ccccccccchhhh------hhhhhcccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccch---- Q lcl|NC_021297. 263 ELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA---- 331 (480) Q Consensus 263 ~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~---- 331 (480) ....+.-...|+. ..+++..+-+++.++..+.... -..|++..+....+|+...++|+..+|....+.. 
T Consensus 302 ~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~ 381 (563) T protein:vir:95 302 QHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSK 381 (563) T ss_pred HHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccc Confidence 1111111112221 1122222224557887776432 2248888899999999999999999984321110 Q ss_pred --HH---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc Q lcl|NC_021297. 332 --SA---EAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN 405 (480) Q Consensus 332 --Sg---~Al~~~~~~l~~k~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~ 405 (480) |+ ..+......+...+ .-....+...|.+ .+.. .....+.+.|.+....+..+.. ++.++..+ T Consensus 382 ~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~------~L~~----~~~~~~~~~f~r~D~~~~~e~~-~~~~~~~~ 450 (563) T protein:vir:95 382 GGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR------HIIS----EYGDKYTFQFVGGDTKSATDKL-NILKLETQ 450 (563) T ss_pred cccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh------hhch----hcccccEEEeccCCHHHHHHHH-HHHHHhcC Confidence 10 11111111111111 1111111111111 0110 0112355667655444443332 23444443 Q ss_pred cccCCCHHHHHHhCCCChhHH-HHH------------HHH---HHHHHHHHHHHHhcccccCcccccC-----CCCCCCC Q lcl|NC_021297. 406 GQGPIPKEQARIDLGYTATQR-EQM------------RDW---DKQETEDMIDTLYSTTKAQADATPK-----PTVTDTK 464 (480) Q Consensus 406 ~~~~~s~et~~~~lg~~~~~~-~~~------------~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~d~~ 464 (480) ++++...+++.+|+.+-+- +.+ ... ..+..++..+.+........+..+. +...+.+ T Consensus 451 --G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (563) T protein:vir:95 451 --IFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKE 528 (563) T ss_pred --CccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccc Confidence 4677777788877653210 000 000 0001111122222211111111111 1111222 Q ss_pred cccccCCCCCCCCCCC Q lcl|NC_021297. 
465 TETQTSPSGFNRTKTR 480 (480) Q Consensus 465 ~~~~~~p~~~~~~~~~ 480 (480) ++.+++|.+.+...+. T Consensus 529 ~~~~~~~~~~~~~~~~ 544 (563) T protein:vir:95 529 IGTDAQIKGDDNVYRT 544 (563) T ss_pred cccccccccccccccc Confidence 2333444333333332 No 145 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.64 E-value=1.6e-07 Score=57.81 Aligned_cols=443 Identities=14% Similarity=0.114 Sum_probs=177.1 Q ss_pred CCC------HHHHHHHHHHHHHHHHHH-HHHHHHHhcccCcchhcCccc--chhhhhhccccchHHHHHHHHHhhhcc-- Q lcl|NC_021297. 1 MTT------YHEHVERLQGLLARDLPN-LLEAEAYRNGTRRLKTIGIGA--PPELAYLDVQPGWVATYLRTLSDRLDI-- 69 (480) Q Consensus 1 ~~~------~~~~i~~l~~~~~~~~~r-~~~~~~YY~g~~~i~~~~~~~--~~~~~~~~~~~n~~~~iVd~~~~~l~~-- 69 (480) |.. .++.++++.+.+..++.. ...++++|+- .+++..... .......++..+-+...++++++.|.+ T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y--~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 78 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKV--TIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLAL 78 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHH--hccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhh Confidence 544 355666666666554333 3344444421 222211100 000111133444556666666665532 Q ss_pred ---Ccc-cc--CC--------Cchh-----------HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCC Q lcl|NC_021297. 70 ---EGF-RI--SE--------DSEG-----------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 70 ---~~~-~~--~~--------d~~~-----------~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~ 124 (480) .+| ++ ++ +... .+.+...+..++|.....++.++...+|.+-+++..+... . 
T Consensus 79 tP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~---~ 155 (543) T protein:vir:88 79 FPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDAS---S 155 (543) T ss_pred cCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccc---c Confidence 122 11 11 1111 1234445666899999999999999999998776532210 0 Q ss_pred CceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEec---------------CCceEEEEEEEeCCcEEEEEecCCcccc Q lcl|NC_021297. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD---------------DVAVPDRATLYLPDETVPLRRNGGLNDQ 189 (480) Q Consensus 125 ~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (480) .+...++.++-.++++-.|. .+++...++.+.-.- +.+....+++|+. .|.+..+.... T Consensus 156 ~~~~~~~~~pl~~y~v~~d~--~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~----V~pr~~~~~~~ 229 (543) T protein:vir:88 156 NSYNPMKLYTLHNHVVQRDA--FGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTH----IYIDDESGDFL 229 (543) T ss_pred ceecceEEeEcceEEEeeCC--CCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEE----EEeecCCCccc Confidence 11112444444444443443 334554444332110 0111112333321 01111111100 Q ss_pred -------cccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCC Q lcl|NC_021297. 190 -------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVT 260 (480) Q Consensus 190 -------~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~ 260 (480) ..+..+....++..+|++++..+...++.||+|-..+ ..+-+..+|.+.-......+....|.+.+. |.. 
T Consensus 230 ~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~ 308 (543) T protein:vir:88 230 SYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEE-YLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGIT 308 (543) T ss_pred ccccccCeeeecCCCccccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc Confidence 0111222223467789999998888899999998754 567788888888888888888888875542 111 Q ss_pred ccccccccccchhhhhhhhhcccCCCCceEEEecc-ccHHHH---HHHHHHHHHHHHhccCCChHHhcc-ccccchHHHH Q lcl|NC_021297. 261 TDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNF---AEEMEVFRKEAASITGLPPQYLSS-SSENPASAEA 335 (480) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~---~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~A 335 (480) .. ......+.+.+..-..++....++.. .+++.. ++.++.-|+..+...+ +.. +.... ++.- T Consensus 309 ~~-------~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~-TAtE 375 (543) T protein:vir:88 309 QV-------RRLVKAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNS-----AVQRSGERV-TAEE 375 (543) T ss_pred ch-------hhcccCCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----hccCCCCcc-cHHH Confidence 00 00011122222222223444444442 334433 3444444443332221 211 11121 2322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHhCC--CCcccceeeeEEecCCC-CCCHHHHHHHHH Q lcl|NC_021297. 336 IIATDSRIVKMAERKGRIFGGA------------WERAMRIAMQIMGR--EVTEEYTRLETVWRDPS-TPTVAAKADAVS 400 (480) Q Consensus 336 l~~~~~~l~~k~~~~~~~~~~~------------l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~-~~~~~~~ad~~~ 400 (480) ++.. +..+...++.. +.+.+.++.+- |. ..+.+ .+++.+.-++ +-...+.++.+. 
T Consensus 376 V~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~--~v~~~~vs~l~~l~r~~~~~~l~ 445 (543) T protein:vir:88 376 IRYV-------ASELEDTLGGVYSILSQELQLPIVRVLLNQLQAT-QQIPNLPQE--AVEPTVTTGAEALGRGQDLDKLT 445 (543) T ss_pred HHHH-------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchh--ceeeeEEecHHHHHHHHHHHHHH Confidence 2222 22333333333 23333333221 11 12222 3445553221 112223333333 Q ss_pred HHHhccccC--------CCHHHHH----HhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 401 KLYANGQGP--------IPKEQAR----IDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 401 kl~~~~~~~--------~s~et~~----~~lg~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) ...+....+ +....++ ..+|+. +++++++.+.+.++++.............++.+..+... T Consensus 446 ~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (543) T protein:vir:88 446 QFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAM 525 (543) T ss_pred HHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHH Confidence 332211111 2222222 334652 334444433222221111111111111111222222221 Q ss_pred CCCcccccCCCCCCCCCCC Q lcl|NC_021297. 462 DTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 462 d~~~~~~~~p~~~~~~~~~ 480 (480) ++...+. ....++.++- T Consensus 526 ~~~~~~~--~~~~~p~~~~ 542 (543) T protein:vir:88 526 ESAMDTA--GVQPGPIATQ 542 (543) T ss_pred HHHhhhc--CCCCCCCCCC Confidence 1111111 1111122222 No 146 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.64 E-value=1.6e-07 Score=57.80 Aligned_cols=422 Identities=11% Similarity=0.088 Sum_probs=153.7 Q ss_pred CCCHHHHHHHHHHH-HHH---HH------HHHHHHHHHhcccCcch--------hc-------Ccccc-hhhhh-hc--c Q lcl|NC_021297. 
1 MTTYHEHVERLQGL-LAR---DL------PNLLEAEAYRNGTRRLK--------TI-------GIGAP-PELAY-LD--V 51 (480) Q Consensus 1 ~~~~~~~i~~l~~~-~~~---~~------~r~~~~~~YY~g~~~i~--------~~-------~~~~~-~~~~~-~~--~ 51 (480) -++.++.-+.+... |.. .+ ...+.+.+.-++++..- .. +...+ ..+.. ++ . T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 92 (574) T protein:vir:80 13 KSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFG 92 (574) T ss_pred hhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhc Confidence 22333332222221 100 00 11122334433332110 00 00000 01111 01 1 Q ss_pred ccchHHHHHHHHHhhhc-----------cCcccc---CCC-------chhHHHHHHHHHhc---------CHHHHHHHHH Q lcl|NC_021297. 52 QPGWVATYLRTLSDRLD-----------IEGFRI---SED-------SEGLEELWNWWQAN---------DLDEESVLGH 101 (480) Q Consensus 52 ~~n~~~~iVd~~~~~l~-----------~~~~~~---~~d-------~~~~~~l~~i~~~n---------~~~~~~~~~~ 101 (480) .......++++.++.+. +-+|.+ ..+ ......+.+++... .+..+...+. T Consensus 93 ~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv 172 (574) T protein:vir:80 93 NNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLV 172 (574) T ss_pred cChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHH Confidence 11122333333222111 122321 011 11123455555321 2345677788 Q ss_pred HHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEE Q lcl|NC_021297. 102 DDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPL 180 (480) Q Consensus 102 ~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (480) .+.+.+|.+|+.+-.+ .+|++ .+..++|..+.+..+.... ......++|...++.. 
...+.++++ T Consensus 173 ~~lll~Gnayi~i~r~------~~G~~~~L~pl~p~~V~v~~d~~~~-~~~~~~~y~~~~~g~~----~~~~~~~ei--- 238 (574) T protein:vir:80 173 RATYMYDQVNFEKVFD------KDGNFIKFDTVDPTTIFLATNGEGK-LIKNGERFVQVIDNRI----VAKFNEREL--- 238 (574) T ss_pred HHHHhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEcCccc-cccCceEEEEEeCCce----EEEEccccE--- Confidence 8899999999876543 34554 5788999999888765321 0111112221111110 111222232 Q ss_pred EecCCcccccccccccccccccccceEEeecccc---cCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHh---cchhh Q lcl|NC_021297. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR---LGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLR 254 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~---~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~---~~~~~ 254 (480) +||+.++. ..+.+|.|.|. .+. +.+...+.-......+| +.|.. T Consensus 239 --------------------------ih~~~~~~~~~~~~~~G~spi~-~a~---~~i~~~~~a~~~~~~~f~ng~~p~g 288 (574) T protein:vir:80 239 --------------------------AFAVRNPRADIEVGQYGYPELE-IAL---KQFIAHENTEVFNDRFFSHGGTTRG 288 (574) T ss_pred --------------------------EEEeccCCCCcccccccccHHH-HHH---HHHHHHHHHHHHHHHHHhccCCCce Confidence 33332221 23457887663 233 33333322222222333 33443 Q ss_pred hhc--CCC-ccccccccccchhhh------hhhhhcccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhc Q lcl|NC_021297. 255 VIS--GVT-TDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLS 324 (480) Q Consensus 255 ~i~--g~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g 324 (480) +|. +.. ......+.-...|.- ..+++..+.+++.++.++.... 
-..|++..+..+.+|+...++|+..+| T Consensus 289 il~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG 368 (574) T protein:vir:80 289 ILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEIN 368 (574) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhc Confidence 332 211 111100000111211 1123334445667887776432 224788889999999999999999998 Q ss_pred cccccchHH--------HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021297. 325 SSSENPASA--------EAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 325 ~~~~n~~Sg--------~Al~~~~~~l~~k~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ 395 (480) ....+...| ..+..+...+...+ .=....+...|.+. ++ .. . ...+.+.|......+..+. T Consensus 369 ~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~---Ll--~~--~---~~~~~~~f~~~d~~~~~~~ 438 (574) T protein:vir:80 369 FPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTY---IV--AE--F---GEKYQFQFRGGDLSAQLDK 438 (574) T ss_pred ccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh---hh--hh--c---CCceEEEecccchhhHHHH Confidence 543221100 11111111111111 11111111111110 11 11 1 1235566765554444433 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhH-HHHH---------HHH-HH-----HHHHHHHHHHhcccccCcccccCCC Q lcl|NC_021297. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQ-REQM---------RDW-DK-----QETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~et~~~~lg~~~~~-~~~~---------~~~-~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +. +.++..+ ++++..-+++.+|+.+-+ -+.+ .+. .. +..++..+...... +..++.+.+. 
T Consensus 439 ~~-~~~~~~~--G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 514 (574) T protein:vir:80 439 LK-IIEQEGK--VFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELS-GGDVEQPEPE 514 (574) T ss_pred HH-HHHHHhC--CccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhcccccccccc-CCCCCCCCCC Confidence 32 3344443 477787788887764321 0000 000 00 00000011000000 0000111000 Q ss_pred CC-----CCCcccccCCCCCCCCCCC Q lcl|NC_021297. 460 VT-----DTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 460 ~~-----d~~~~~~~~p~~~~~~~~~ 480 (480) .+ |+....+..+.+.++..++ T Consensus 515 ~p~~~~~d~~~~~~~~~~~~~~~~~~ 540 (574) T protein:vir:80 515 EPKDSQNDTDVSFQDEQQGLNGKSKK 540 (574) T ss_pred CCCCccccccchhhhhhhhhccchhh Confidence 10 0000001111122222222 No 147 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.63 E-value=1.7e-07 Score=57.70 Aligned_cols=438 Identities=12% Similarity=0.056 Sum_probs=171.9 Q ss_pred CCCHH------HHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHHHHHHHhhhc--- Q lcl|NC_021297. 1 MTTYH------EHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATYLRTLSDRLD--- 68 (480) Q Consensus 1 ~~~~~------~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~iVd~~~~~l~--- 68 (480) |+..+ +-+++....+..++ +....++++|+- .+++.. ...+...+..++..+-+...++++++.|. T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 78 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATY--TIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLAL 78 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHH--hhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhh Confidence 66533 34444555554433 223334444421 122211 11111122334445556667777776553 Q ss_pred -c--Cccc-cC-CCch-------------h-------HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccC Q lcl|NC_021297. 
69 -I--EGFR-IS-EDSE-------------G-------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD 123 (480) Q Consensus 69 -~--~~~~-~~-~d~~-------------~-------~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d 123 (480) + .+|+ +. .|.. . .+.+...+..++|.....++.++...+|.|-+++..++ .+ T Consensus 79 tpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~---~~ 155 (532) T protein:vir:99 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE---QV 155 (532) T ss_pred cCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccc---cc Confidence 2 2232 11 1110 1 12334556678999999999999999999988776432 12 Q ss_pred CCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-c---------------CCceEEEEEEEeC-----Cc--EEEE Q lcl|NC_021297. 124 PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D---------------DVAVPDRATLYLP-----DE--TVPL 180 (480) Q Consensus 124 ~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-~---------------~~~~~~~~~~~~~-----~~--~~~~ 180 (480) ..+...++.++-.+.++.-|. ..++...++.+... + +......+++|+. +. ...| T Consensus 156 ~~~~~~f~~~pl~~y~v~~d~--~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~ 233 (532) T protein:vir:99 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) T ss_pred cCcccceEEEEcCeEEEeeCC--CCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCCCCeeEEE Confidence 234556777777775555553 33444444322111 0 0011122344331 11 1111 Q ss_pred EecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC Q lcl|NC_021297. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVT 260 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~ 260 (480) ....+. .........+|..+|++.+..+...++.||+|-..+ ..+-+..+|.+.-...........|...+. 
T Consensus 234 ~~~~g~----~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~--- 305 (532) T protein:vir:99 234 QEIDGE----IVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVN--- 305 (532) T ss_pred EeecCc----eecccccccccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHcCCCceec--- Confidence 111110 011111123466789999998888899999997754 556677777776666666666666654432 Q ss_pred ccccccccccchhhhhhhhhcccCCCCceEEEecc-ccHHHH---HHHHHHHHHHHHhccCCChHHhc-cccccchHHHH Q lcl|NC_021297. 261 TDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNF---AEEMEVFRKEAASITGLPPQYLS-SSSENPASAEA 335 (480) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~---~~~l~~~~~~i~~~~~~p~~~~g-~~~~n~~Sg~A 335 (480) ++. ...........++.+..-..++....++.. .+++.. ++.++.-|+..+...+ +. -+.... ++.- T Consensus 306 p~g--~~~~~~~~~~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~d~~r~-TAtE 377 (532) T protein:vir:99 306 PNG--VTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNS-----AVQRGGDRV-TAEE 377 (532) T ss_pred ccc--ccchhhhccCCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCcc-cHHH Confidence 000 000001112222322211112233444332 344433 3444444444332221 11 111111 2322 Q ss_pred HHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHhCC--CCcccceeee-EEecCCCCCCHHHHHHHHHHHHhcc- Q lcl|NC_021297. 336 IIATDSRIVKM----AERKGR-IFGGAWERAMRIAMQIMGR--EVTEEYTRLE-TVWRDPSTPTVAAKADAVSKLYANG- 406 (480) Q Consensus 336 l~~~~~~l~~k----~~~~~~-~~~~~l~~~~~~~~~~~~~--~~~~~~~~i~-v~f~~~~~~~~~~~ad~~~kl~~~~- 406 (480) ++.....+... ..++.. .+.+-+.+.+.++.+- |. ..+.+..... +++-.++ ...+..+.++...+.. 
T Consensus 378 V~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~~~~~~iv~~is~L--araq~~~~l~~~~~~la 454 (532) T protein:vir:99 378 IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIATGLEAL--GRGHDLNKLNVFIDYMI 454 (532) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCChhhcccceeecchHH--HHHHHHHHHHHHHHHHH Confidence 22221111111 111111 1222223334433321 11 1122222222 2232222 1222233332222211 Q ss_pred ---c---cCCCHH----HHHHhCCCC-------hhHHHHHHHHHHHHHHHHHHH-Hhcc--cccCcccccCCCCCCCC Q lcl|NC_021297. 407 ---Q---GPIPKE----QARIDLGYT-------ATQREQMRDWDKQETEDMIDT-LYST--TKAQADATPKPTVTDTK 464 (480) Q Consensus 407 ---~---~~~s~e----t~~~~lg~~-------~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~~d~~ 464 (480) + ..+... .+...+|+. +++++.+.+.+.+++...... ..+. .++.+.......+.+.+ T Consensus 455 q~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 455 KLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred hhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhHHhhcCCCCC Confidence 1 112222 223345652 233333322211111111111 1111 11111111111221211 No 148 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=98.61 E-value=2e-07 Score=57.27 Aligned_cols=430 Identities=13% Similarity=0.071 Sum_probs=171.1 Q ss_pred CCCHHHHHHHHHHHHHHHHH-------------------HHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLP-------------------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~-------------------r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd 61 (480) .-.+.. --+|.+.|..+-. -++.+ .+|.|..-|=+ . .+..+ .-+.-.++++. 
T Consensus 60 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~F~Gy---~---~la~l-aQ~~eyr~~~~ 130 (695) T protein:vir:36 60 VVEPSP-SLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL-SFVTSSGFPGF---P---TLVLL-AQLPEYRAMHE 130 (695) T ss_pred ccCCCc-ccccceeceecccccCccccchhhhhhcccccccccc-hhhhccCcchH---H---HHHHH-hhccchhhHHH Confidence 100000 0011111111100 00111 12222111100 0 00000 01122233444 Q ss_pred HHHhhhccCc---------------ccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccc Q lcl|NC_021297. 62 TLSDRLDIEG---------------FRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE 120 (480) Q Consensus 62 ~~~~~l~~~~---------------~~~------~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~ 120 (480) +.+..+.-.. +.+ ..+.+..+.|..-+++-++...+.++.+++-.||.+.+++...... T Consensus 131 ~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd 210 (695) T protein:vir:36 131 VLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDD 210 (695) T ss_pred HHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCc Confidence 4444432221 111 1222445677777777788899999999999999987655431100 Q ss_pred -----------ccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc Q lcl|NC_021297. 121 -----------SGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 121 -----------~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) +.-..|.+ -+.+++|..+.|-.-...+ +. ..+.+.+++..+- +..++..+- T Consensus 211 ~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~d--P~--------spdfgkP~~y~V~--G~kIH~SRL----- 273 (695) T protein:vir:36 211 QIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSIN--PV--------ADDFYKPSTWWMI--GTEVHATRL----- 273 (695) T ss_pred cccccccccccccccCcceeeeEeecccccccchhhhcc--ch--------hhccCCCceEEEe--ceEEeeeeE----- Confidence 00112222 2566677766663211000 00 0111112221111 111111110 Q ss_pred ccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC-----ccc Q lcl|NC_021297. 
189 QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVT-----TDE 263 (480) Q Consensus 189 ~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~-----~~~ 263 (480) +.|...|+-.+. ......+|.|... -+.+-+++.+++.-.....+..++...+. +++. ... T Consensus 274 ----------~~f~g~plPd~L--Kp~y~~~GiSv~q-~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~dla~aL~~g~~ 339 (695) T protein:vir:36 274 ----------HTIVSRPVGDML--KPTYSFAGISMTQ-LAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQALMPGAN 339 (695) T ss_pred ----------EEecCCCchhhh--hcccccCcccHHH-HHHHHHHHHHHHHhHHHHHHHhhhHHHHH-HHHHHhhcChhH Confidence 000001110000 0011245666543 35556666666655555554333322211 1100 000 Q ss_pred cccccccchhhh--hhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc-cc-chHHHHHHHH Q lcl|NC_021297. 264 LTNDGENTTLDI--YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIAT 339 (480) Q Consensus 264 ~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~n-~~Sg~Al~~~ 339 (480) ..-...-..++. ....++.+++++-+|-+++. ++..+-+.+-.....+|..++||..-|-+.+ .. ++||++=..- T Consensus 340 ~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st-slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rn 418 (695) T protein:vir:36 340 VDLSMRAELINRYRDNRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRV 418 (695) T ss_pred HHHHHHHHHHHHhcCccceEEEecCCcceEEEec-ccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHH Confidence 000000011111 11223344544556666653 4555666677778899999999986664332 22 4678876666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCHHH Q lcl|NC_021297. 340 DSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPKEQ 414 (480) Q Consensus 340 ~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~et 414 (480) |...+. ...+..+...|++++.++.+-.-+..+. .+...|.+-...+..+.|+.-.|-+++ ..+.|+... 
T Consensus 419 YYD~I~--s~Qe~~L~p~L~rl~~ii~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~e 493 (695) T protein:vir:36 419 WYDYVR--AYQRNALQQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQ 493 (695) T ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHH Confidence 666555 5568889999999999876544333332 478899988889999998765544332 112333333 Q ss_pred HHHhC------CCChhHHHHHHH--HHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCC----------C Q lcl|NC_021297. 415 ARIDL------GYTATQREQMRD--WDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFN----------R 476 (480) Q Consensus 415 ~~~~l------g~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~----------~ 476 (480) ++.+| +|... ++.-.+ +..+.+.+....+.......++.++.++. ..+...+|+..+ + T Consensus 494 vr~rL~~d~~s~Y~~~-~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~g~~~~~~v~~~~~~~~~~~ag 569 (695) T protein:vir:36 494 VAARLNTEPDGPYAGK-LDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA---RAGATAPPTVANVNANVNPREAG 569 (695) T ss_pred HHHHHhcCCCcccccc-cccccCCCcCccchhhhhHhhhcCcccccccCCCCcc---cccccCCCcccccccccCccccC Confidence 33332 11100 000000 00000000000011101000011111100 011111121110 0 Q ss_pred CCCC Q lcl|NC_021297. 477 TKTR 480 (480) Q Consensus 477 ~~~~ 480 (480) +++. T Consensus 570 ~~~~ 573 (695) T protein:vir:36 570 AQDA 573 (695) T ss_pred CCCc Confidence 0111 No 149 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=98.60 E-value=2.1e-07 Score=57.25 Aligned_cols=418 Identities=14% Similarity=0.074 Sum_probs=172.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHH----HHHH--------------HHhcccCcchhcCcccchhhhhhc--cccchHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNL----LEAE--------------AYRNGTRRLKTIGIGAPPELAYLD--VQPGWVATYL 60 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~----~~~~--------------~YY~g~~~i~~~~~~~~~~~~~~~--~~~n~~~~iV 60 (480) ...+..- -+|.+.|..+..-+ +.-. 
.+|.|..-| .+.-.- .-+.-.++++ T Consensus 60 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~---------Gy~~la~laQ~~eyr~~~ 129 (695) T protein:vir:78 60 VAEPSPS-LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFP---------GFPTLVLLAQLPEYRAMH 129 (695) T ss_pred ccCCCcc-cccceeceeccccCCccccchhhhhhcccccccccchhhhccCcc---------hHHHHHHHhhccchhhHH Confidence 2211111 11222222111000 0001 122221111 010000 0112223444 Q ss_pred HHHHhhhccCc---------------ccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcc Q lcl|NC_021297. 61 RTLSDRLDIEG---------------FRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDV 119 (480) Q Consensus 61 d~~~~~l~~~~---------------~~~------~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~ 119 (480) .+.+..+.-.. +.+ ..|.+..+.|..-+++-++...+.++.+++-.||.+.+++..... T Consensus 130 ~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gd 209 (695) T protein:vir:78 130 EVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGD 209 (695) T ss_pred HHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccC Confidence 44444442221 111 112244556777777778888999999999999998765543110 Q ss_pred c-----------ccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcc Q lcl|NC_021297. 120 E-----------SGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN 187 (480) Q Consensus 120 ~-----------~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (480) . +.-..|.+ -+.+++|..+.|-.-...+ +. 
..+.+.+++..+- +..++..+- T Consensus 210 d~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~d--P~--------spdfgkP~~y~V~--G~kIH~SRL---- 273 (695) T protein:vir:78 210 DQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSIN--PV--------ADDFYKPSTWWMI--GTEVHATRL---- 273 (695) T ss_pred ccccccccccccccccCcceeeeEeecccccccchhhhcc--ch--------hhccCCCceEEEe--ceEEeeeeE---- Confidence 0 00112222 2566677766663211000 00 0111112221111 111111110 Q ss_pred cccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-----Ccc Q lcl|NC_021297. 188 DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-----TTD 262 (480) Q Consensus 188 ~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-----~~~ 262 (480) +.|...|+-.+. ......+|.|... -+.+-+++.+++.-.....+..++..-+. +++ ... T Consensus 274 -----------~~f~g~plPd~L--Kp~y~~~GiSv~q-~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~dla~~L~~g~ 338 (695) T protein:vir:78 274 -----------HTIVSRPVGDML--KPTYSFAGISMTQ-LAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQALMPGA 338 (695) T ss_pred -----------EEecCCCchhhh--hcccccCcccHHH-HHHHHHHHHHHHHhHHHHHHHhhhhHHHH-HHHHHhhcChh Confidence 000001110000 0011245666543 35556666666655555555333322111 110 000 Q ss_pred ccccccccchhhh--hhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc-cc-chHHHHHHH Q lcl|NC_021297. 263 ELTNDGENTTLDI--YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIA 338 (480) Q Consensus 263 ~~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~n-~~Sg~Al~~ 338 (480) ...-...-..++. ....++.+++++-+|-+++. ++..+-+.+-.....+|..++||..-|-+.+ .. ++||++=.. 
T Consensus 339 ~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st-slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~r 417 (695) T protein:vir:78 339 NVDLSMRAELINRYRDNRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIR 417 (695) T ss_pred HHHHHHHHHHHHHhcCccceEEEecCCcceEEEec-ccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHH Confidence 0000000011111 11223344544556666653 4555666677778899999999986664332 22 467887666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCHH Q lcl|NC_021297. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPKE 413 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~e 413 (480) -|...+. ...+..+...|++++.++.+-.-+..+. .+...|.+-...+..+.|+.-.|-+++ ..+.|+.. T Consensus 418 nYYD~I~--s~Qe~~L~p~L~rl~~ii~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~ 492 (695) T protein:vir:78 418 VWYDYVR--AYQRNALQQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPD 492 (695) T ss_pred HHHHHHH--HHHHHHHHHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHH Confidence 6666555 5568889999999999876544333332 478899988889999998765554332 11223332 Q ss_pred HHHHhC------CCChhHHHHHHH--HHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 414 QARIDL------GYTATQREQMRD--WDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 414 t~~~~l------g~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .++.+| +|... ++.-.+ +..+.+.+....+.. 
+..++++.++|.+...++|- T Consensus 493 evr~rL~~d~~s~Y~~~-~D~~d~p~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~g~~~ 552 (695) T protein:vir:78 493 QVAARLNTEPDGPYAGK-LDANDDPGVPADDDIDGVLTYVQ--------------RLAEGGDTGAPGGARAGATA 552 (695) T ss_pred HHHHHHhcCCCcccccc-cccccCCCcCccchhhhhHhhhc--------------CcccccccCCCCCCCCCCCC Confidence 222221 11000 000000 000000000000000 11122222222222222221 No 150 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.59 E-value=2.3e-07 Score=57.03 Aligned_cols=386 Identities=11% Similarity=0.026 Sum_probs=158.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCCch-h Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISEDSE-G 80 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~~-~ 80 (480) .-++.++.+.....+ .......+...-......+..+.. +.-+.+.....+|+..++-+-.-+|. ..++.+ . T Consensus 1 Mgl~~~~f~~~~~~~-~~~~~~~~~~~~~~~~~~g~~v~~---~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~ 76 (409) T protein:vir:84 1 MSLFTRIFSGPSEER-TLTKISGIPSPAEDWAMHGDRPGA---NSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIP 76 (409) T ss_pred CchhhhhhcCCCccc-ccccccccccccchhhccCcccch---hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccc Confidence 222222222111000 000000000000000000011110 00111122344455544443333332 222221 1 Q ss_pred HHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 81 LEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 81 ~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ...+.+++.. | ....+...+..+.+.+|.+|+++-.. +..|++ .+..++|..+.+.......... 
T Consensus 77 ~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~-----~~~g~~~~L~~l~p~~v~v~~~~~~~~~~---- 147 (409) T protein:vir:84 77 VSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISAR-----DEANRPTAIMPIHPDCIHVTDAKDEDGDW---- 147 (409) T ss_pred cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEE-----CCCCceEEEEEEcCceeEEEEcCCCcceE---- Confidence 2234444432 2 34566778888999999999876421 234554 5778888888766543222111 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHH Q lcl|NC_021297. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~ 234 (480) ++......+ ..|.+++ |+||++....+..+|.|.+. .+.+. T Consensus 148 -~~~~~~~~g-----~~~~~~d-----------------------------vih~~~~~~~~~~~G~s~i~----~~~~~ 188 (409) T protein:vir:84 148 -IEPVYRIDG-----KVVPNHR-----------------------------IMHIKRYPVAGCALGMSPIE----KAASA 188 (409) T ss_pred -EEEEecCCc-----eEEchhh-----------------------------EEEecCCCCCcccccccHHH----HHHHH Confidence 110000111 0112222 34555444334457887653 23333 Q ss_pred HHHHHHHHHHHHHHh---cchhhhhcCC-Cccccccccccchhh---hhhhhhcccCCCCceEEEeccccH-HHHHHHHH Q lcl|NC_021297. 235 ASRTLMNLQSASQIL---GTPLRVISGV-TTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAEL-RNFAEEME 306 (480) Q Consensus 235 ~~~~~s~~~~~~~~~---~~~~~~i~g~-~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~~-~~~~~~l~ 306 (480) +...+.-..-...+| +.|-.+|+.. .......+.-...|. ...++++.++ ++.++.++..... 
..|++..+ T Consensus 189 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~~~~~~~d~q~~e~~~ 267 (409) T protein:vir:84 189 IGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHHNRRLPAVMS-AGIKWQSVSITPNESQFLETRS 267 (409) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhccCCCeeecC-CCceEEEccCChhHHHHHHHHH Confidence 333332222222233 3444444321 111110010011111 1123344443 4567777764322 24778888 Q ss_pred HHHHHHHhccCCChHHhccccccchHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021297. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDS-----RIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~-----~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~ 381 (480) ....+|+...++|+.-+|....+..++..++.+.. .|.-.+...+..|. +. ... ...++ T Consensus 268 ~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~----~~-------L~~-----g~~i~ 331 (409) T protein:vir:84 268 FQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALD----TF-------LPR-----GQFVK 331 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHH----Hh-------ccC-----CCeEE Confidence 88999999999999988754333222222322222 22222222222221 11 111 12356 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+..-...|..+.++++.+++.+| +++...+++.+|+.+-+- -.+ .....+- ...+ +..+ T Consensus 332 fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~p~~g--gD~--------~~~~~n~-~~~~-~~~~----- 392 (409) T protein:vir:84 332 FNVDGLMRGDVTARFTAYQMGLQNG--IWSVNEVRAWEDAPPIPE--GDI--------HLQPMNF-VPLG-YVPP----- 392 (409) T ss_pred EechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cce--------eeecccc-cccc-cCCc----- Confidence 6666666778899999999998876 667777788887765321 000 0000000 0000 0000 Q ss_pred CCCcccccCCCCCCCCC Q lcl|NC_021297. 
462 DTKTETQTSPSGFNRTK 478 (480) Q Consensus 462 d~~~~~~~~p~~~~~~~ 478 (480) .++++...+++..++.. T Consensus 393 ~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 393 EEPAQEPQPNSATEGNK 409 (409) T ss_pred cccCcCCCCCCccCCCC Confidence 00111111222222222 No 151 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.58 E-value=2.4e-07 Score=56.85 Aligned_cols=417 Identities=16% Similarity=0.090 Sum_probs=159.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccC----cch------hcCcccchhhhhhccccchHHHHHHHHHhhhccCccc- Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTR----RLK------TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR- 73 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g~~----~i~------~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~- 73 (480) .=++..|....... ....... .+.+ .+- ..+..+.... -++... .-.+|+..++-+-.-++. T Consensus 1 Mg~~~~l~~r~~~~--~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~V~~~~-al~~~~--V~~~v~~Ia~~iA~lp~~~ 73 (457) T protein:vir:13 1 MGFWSALFGRGHSP--ALDGIEA--RAWEPYDPSIYNLGAVAASGETVTPHD-ALQVSA--VFASVRLLSETIATLPLST 73 (457) T ss_pred Cchhhhhhcccccc--ccccccc--ccccccchHHHhhcccccCCceechHH-hhccHH--HHHHHHHHHHhhccCceEE Confidence 22222222211110 0000000 0000 000 0011111110 011100 112233333322222332 Q ss_pred --cCCC---chhHHHHHHHHHh--c--CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce-eEEEEEccceeEEEEe Q lcl|NC_021297. 74 --ISED---SEGLEELWNWWQA--N--DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELD 143 (480) Q Consensus 74 --~~~d---~~~~~~l~~i~~~--n--~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~-~~i~~~~p~~~~~v~d 143 (480) ..++ +.....+..+++. | ....+...+..+.+.+|.||+.+-.. 
.|+ ..+..++|..+.+..+ T Consensus 74 ~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-------~g~~~~l~~l~p~~v~v~~~ 146 (457) T protein:vir:13 74 YSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-------GPNIVGLDVLDPTKIHVHMV 146 (457) T ss_pred EEecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEccCceEEEEe Confidence 2211 1112234444432 2 23456777888899999999887432 233 3577888888887665 Q ss_pred CCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCccc Q lcl|NC_021297. 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~ 223 (480) ......... .+.|. ....+.......|.++.++++ ++....+..+|.|. T Consensus 147 ~~~~~~~~~-~~~y~-~~~~~~~~~~~~~~~~diih~-----------------------------~~~~~~~~~~G~s~ 195 (457) T protein:vir:13 147 MVDGLRRKV-FEAYD-IDADGNEVLLGWFTPRDVLHI-----------------------------PGMMLPGDFVGCSP 195 (457) T ss_pred cCCCcccee-EEEEE-EecCCceeeEEeeCccceEEe-----------------------------cCCCCCCccccccH Confidence 433211111 11221 112222223333445554443 22222233467776 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEeccc Q lcl|NC_021297. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAA 296 (480) Q Consensus 224 i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~ 296 (480) +.- +...++....+..-....+...+.|..+|.-. .......+.-...|+. ..++++.++ ++.++.++... 
T Consensus 196 i~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~ 273 (457) T protein:vir:13 196 ISY-ARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLT-EGAKFSKVAMS 273 (457) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEccCC Confidence 532 22333322222111222222233444444311 1111000000111211 113344454 34677776543 Q ss_pred c-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc Q lcl|NC_021297. 297 E-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTE 375 (480) Q Consensus 297 ~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 375 (480) . -..|++..+....+|+...++|+..+|....+..++..+..+...+...+- .-+-..|++.+.. .+..... . T Consensus 274 ~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl---~P~~~~ie~~ln~--~L~~~~~-~ 347 (457) T protein:vir:13 274 PDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSL---RPWLERIEAGFNR--LLFAETA-D 347 (457) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHH---HHHHHHHHHHHHH--hhcCccc-c Confidence 2 224778888889999999999999997544333222223322222222110 1111122221111 1111111 1 Q ss_pred cceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH---HHH-HHHHHHHHHHHHHHHhcccc-- Q lcl|NC_021297. 376 EYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR---EQM-RDWDKQETEDMIDTLYSTTK-- 449 (480) Q Consensus 376 ~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~---~~~-~~~~~~~~~~~~~~~~~~~~-- 449 (480) ....+++.+......|..++++++.+++++| +++...+++.+|+.+-+- +++ .-..-.. .+....... 
T Consensus 348 ~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~----~~~~~~~~~~~ 421 (457) T protein:vir:13 348 RFRFVKFNLDEIKRGAPKERMELWSLGLQNG--IYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGE----VGEEPEPEPAP 421 (457) T ss_pred CceeEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcccceeecccccc----ccccccccccC Confidence 2234666666777788899999999998876 667767788887754210 111 0000000 000000000 Q ss_pred cCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 450 AQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 450 ~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ...+..+..+.++...+..+.+.+.+...+= T Consensus 422 ~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~ 452 (457) T protein:vir:13 422 APPAIEPPAEEPDEEPEPEGKPDDEGATEED 452 (457) T ss_pred CCCCCCCCccccCCCCCCCCCCccccCCCCc Confidence 0000011111111111111222111111111 No 152 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=98.57 E-value=4.4e-08 Score=60.90 Aligned_cols=446 Identities=14% Similarity=0.081 Sum_probs=172.6 Q ss_pred CCCHHHHHHH-HHHH---------HHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHEHVER-LQGL---------LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~-l~~~---------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |+|-+--++. .+.. 
......+-..-.++|.++.-|+ +.--+..++.+--...++..+|+.++..+.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~ 78 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVN--PPYNPDRLAAFLELNETLATGIRKKSRYEVGF 78 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccCCCCC--CCCCHHHHHHHHhcChHHHHHHHHHhhhhhcc Confidence 7775522221 0000 0000011112235555554332 22244556666666789999999999999888 Q ss_pred cccc------CCC---chhHHHHHHHHHh---------------cCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCc Q lcl|NC_021297. 71 GFRI------SED---SEGLEELWNWWQA---------------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG 126 (480) Q Consensus 71 ~~~~------~~d---~~~~~~l~~i~~~---------------n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~ 126 (480) ||.+ ..+ ++-.+.++++|.. ..+......+..+...+|.+|+-+..+..+. T Consensus 79 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~----- 153 (651) T protein:vir:99 79 GFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGR----- 153 (651) T ss_pred CceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccc----- Confidence 8753 111 1223345555432 1234555666777788888887665432211 Q ss_pred eeEEEEEccceeEEEEeCCC-------------CCeeEE-------------EEEEEEEecCCceEEEEE---------E Q lcl|NC_021297. 127 IPLIRVESPLYMYAELDPRN-------------TRRVTR-------------AVRLYTTRDDVAVPDRAT---------L 171 (480) Q Consensus 127 ~~~i~~~~p~~~~~v~d~~~-------------~~~~~~-------------~~~~~~~~~~~~~~~~~~---------~ 171 (480) -+.+..+++..+-..-+... ...... ...++...+..+...... . 
T Consensus 154 pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~ 233 (651) T protein:vir:99 154 PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIR 233 (651) T ss_pred hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEE Confidence 01222233321111000000 000000 000000011111110000 0 Q ss_pred EeCCcEEEEEe-cCCcccccccccccccccccccc---eEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 172 YLPDETVPLRR-NGGLNDQWVVDGDVIKHGLGVVP---VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQ 247 (480) Q Consensus 172 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~iP---vv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~ 247 (480) +..+....+.. ........+.... ..+...+| |+||++.....+.+|.|.+. .+.+.+....+-..-... T Consensus 234 ~~~d~~~~~~~~~~~~~~g~~~~~~--~~~~~~~~~~eViHir~~~~~~g~~G~spl~----~a~~~i~~a~~a~~~~~~ 307 (651) T protein:vir:99 234 YREDEESEREPIFVDRETGDVTTGD--ANGLENRPANELIFIPNPSILEDDYGVPDWV----SAIRTISADEAAKDYNRD 307 (651) T ss_pred eccCcceeeeeecccceeeeEEEcC--CCceeEecccceEEecCCCCCCCcccccHHH----HHHHHHHHHHHHHHHHHH Confidence 11110000000 0000000000000 01111233 67887665567789998764 334444333222222333 Q ss_pred Hhc---chhhhhc--CCCccccccccccchhh---hhhhhhcccCC----------CCceEEEecccc--HHHHHHHHHH Q lcl|NC_021297. 248 ILG---TPLRVIS--GVTTDELTNDGENTTLD---IYYGRILTLAS----------EAAKISEFKAAE--LRNFAEEMEV 307 (480) Q Consensus 248 ~~~---~~~~~i~--g~~~~~~~~~~~~~~~~---~~~~~~~~~~g----------~~~~~~~~~~~~--~~~~~~~l~~ 307 (480) +|. .|..+|. |..+.....+.-...|+ -..++++.+++ .+.++..++... -..|++..+. 
T Consensus 308 ~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~ 387 (651) T protein:vir:99 308 FFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREK 387 (651) T ss_pred HHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHH Confidence 333 3444432 32111111110011111 12223333322 245666665432 2357888888 Q ss_pred HHHHHHhccCCChHHhccccc-cchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 308 FRKEAASITGLPPQYLSSSSE-NPASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 308 ~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) .+.+|+...++|+..+|.... |-++..... +....|.-.+ + .|++.+.. .+...........+.+.| T Consensus 388 ~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~----~----~ie~eln~--kLl~~~e~~~~~~i~~ef 457 (651) T protein:vir:99 388 NEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQ----H----TFAEWLYQ--IIHQQALGVTDWTIEYEL 457 (651) T ss_pred HHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHH----H----HHHHHHHH--hhcCccccccCceEEEEe Confidence 899999999999999985432 212222111 1111111111 1 11111111 111111111122455555 Q ss_pred c--CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCC Q lcl|NC_021297. 385 R--DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTD 462 (480) Q Consensus 385 ~--~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) . .....+....++.+.+++++| +++..-+++.+|+.+-+-... ...+...+....+++..+++..... T Consensus 458 ~~~~llr~D~~~~~e~~~~~i~~G--~~T~NE~R~~lglppi~~~~g--------d~~l~~~~~~~~g~~~~gge~~~~~ 527 (651) T protein:vir:99 458 RGADQPKQEAQLAEQRVRAMRLAG--VGLVDEAREELGLDPLGEPYG--------EMTLSEFEAEVAGDVAGGGETEAVH 527 (651) T ss_pred ccchhhhccHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCccc--------cccccccccccccccccCCCCcccc Confidence 4 344467778888888888765 667777888887754210000 0000000000000000000000000 Q ss_pred CCcccccCCCCCCCCC-----CC Q lcl|NC_021297. 
463 TKTETQTSPSGFNRTK-----TR 480 (480) Q Consensus 463 ~~~~~~~~p~~~~~~~-----~~ 480 (480) .+.++.+. ++....+ |+ T Consensus 528 ~~~~~~~~-~~~e~~~~~~~~~~ 549 (651) T protein:vir:99 528 EPPEENKI-GEREWDTVKSELTT 549 (651) T ss_pred cCcccccc-ccchhhhhhhhhcc Confidence 00000000 0000001 11 No 153 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=98.57 E-value=1.1e-07 Score=58.75 Aligned_cols=445 Identities=12% Similarity=0.077 Sum_probs=200.8 Q ss_pred CCCHHHHHHH-----------HHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhcc Q lcl|NC_021297. 1 MTTYHEHVER-----------LQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~~~~i~~-----------l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~ 69 (480) ++....+..- -++.-..-..+|..+..+++-+..+..+-.. -++.+-...+|.-- |-. T Consensus 27 ~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVne--------aiv~d~~~~pV~i~---Ld~ 95 (533) T protein:vir:10 27 LDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNE--------TICGNFDDVPVSVE---LSN 95 (533) T ss_pred ccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhcc--------eeeecCCCceEEEE---ecc Confidence 1111111110 0000000111233333333333222211000 00000000000000 000 Q ss_pred CccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCe Q lcl|NC_021297. 70 EGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 70 ~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~ 149 (480) ..++-+-.+...+.+..|++--+|+....+..+.-.+.|+.|.+.-.+. ....+|-..++.+||+.+-++.--. .. 
T Consensus 96 ~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~--~~pk~GI~ELr~lDPr~i~~vr~i~--~~ 171 (533) T protein:vir:10 96 LKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDP--DNPQGGLIELRYIDPRKIRKINETE--QK 171 (533) T ss_pred cccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecC--CCccccceeeeeccccceeeeeeee--cc Confidence 0011111222344556666666888899999999999999998854321 1235688889999999887765211 00 Q ss_pred eEEEEEEE-EEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccc--eEEeecccccCccCC--cccc Q lcl|NC_021297. 150 VTRAVRLY-TTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP--VVPLTNDPRLGNRYG--RSEI 224 (480) Q Consensus 150 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP--vv~~~n~~~~~~~~G--~s~i 224 (480) ....++.+ ......+....+-+|.+.... +. +. + ++ +|| .+.|...-...+.-| .|-| T Consensus 172 ~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~-~~--~~---~----------~v-kI~~dAI~y~hSGl~d~~~~~i~syL 234 (533) T protein:vir:10 172 RPEQLRGLPLNQQLSPKSAEYFLYDPKGLK-NS--TT---Q----------GL-KIAPDSICYVHSGIMDLNKNMTLSHL 234 (533) T ss_pred CCCccceeecchhhhccceeeeeecccccc-cc--CC---C----------ce-ecchhheeeeeccceeCCCCceeccc Confidence 11111110 000011111122334432211 10 00 0 00 111 122322211111111 1334 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhh Q lcl|NC_021297. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYG 278 (480) Q Consensus 225 ~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~ 278 (480) .+.++++-. -+++.|.+.+.+....|-+=++-.+.-..+..+... .+-.... 
T Consensus 235 hkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlE 312 (533) T protein:vir:10 235 HKAIKAVNQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLE 312 (533) T ss_pred hHhHHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHh Confidence 444444322 257778888877777776655422222222211100 0111222 Q ss_pred hhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc-cchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 279 RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 279 ~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ..|. .+|-+.++..++...--.-++-++-....++...++|.+-+...++ |..-|..|-.-+.....-+.+++..| T Consensus 313 DyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF 392 (533) T protein:vir:10 313 DFWLPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRF 392 (533) T ss_pred hhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 2332 1344566777776653344566677777888888999887764433 21123357777788888899999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCCC Q lcl|NC_021297. 355 GGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGYT 422 (480) Q Consensus 355 ~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~~ 422 (480) ..-+.++++.=+-+.|.--..+| ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. 
|..+ T Consensus 393 s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~t 472 (533) T protein:vir:10 393 SELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQT 472 (533) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC Confidence 99999999876555554333333 356777754333333333332221 22222223 499998765 7999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 423 ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 423 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) +++++++.+.++++..+..=.............+.|+.++.+ ..+.+|.+.++.--| T Consensus 473 Deei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 529 (533) T protein:vir:10 473 DVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAP-AEEVAPEGPDPSDER 529 (533) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcc-cccCCCCCCCcchhh Confidence 999988877766664321111000000001112233332222 223344444444444 No 154 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=98.56 E-value=2.9e-07 Score=56.45 Aligned_cols=435 Identities=14% Similarity=0.083 Sum_probs=173.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHH----HHHHHHhcccCcchh-----c-Ccccchhhhhhc--cccchHHHHHHHHHhhhc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNL----LEAEAYRNGTRRLKT-----I-GIGAPPELAYLD--VQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~----~~~~~YY~g~~~i~~-----~-~~~~~~~~~~~~--~~~n~~~~iVd~~~~~l~ 68 (480) =..+. .+|...|..+..-+ ..-..|--+-+.+.. . +... ..+.-.- .-+.-.++++.+.+..+. 
T Consensus 61 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F-~Gy~~la~laQ~~eyr~~~~~ia~e~~ 136 (694) T protein:vir:10 61 EPSPS---LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGF-PGFPTLVLLAQLPEYRAMHEVLADECI 136 (694) T ss_pred CCCcc---hhhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCc-chHHHHHHHhhccchhhHHHHHHHHhh Confidence 22221 12233332211111 111112111111100 0 0000 0111000 011223444555554443 Q ss_pred cCc---------------ccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCccc------- Q lcl|NC_021297. 69 IEG---------------FRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE------- 120 (480) Q Consensus 69 ~~~---------------~~~------~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~------- 120 (480) -.. +.+ ..|.+..+.|..-+++-++...+.++.+++-.||.+.+++...... T Consensus 137 R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL 216 (694) T protein:vir:10 137 RTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPL 216 (694) T ss_pred cccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCcccccccc Confidence 221 111 1122445567777777788889999999999999987655431100 Q ss_pred ----ccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccc Q lcl|NC_021297. 121 ----SGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGD 195 (480) Q Consensus 121 ----~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (480) +.-..|.+ -+.+++|..+.|-.-...+ +. ..+.+.+++..+- +..++..+- T Consensus 217 ~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~d--P~--------spdfgkP~~y~V~--G~~IH~SRL------------ 272 (694) T protein:vir:10 217 VPRPYTVPKGSFQGLRVVEPYWVTPNNYNSIN--PV--------ADDFYKPSTWWMI--GTEVHATRL------------ 272 (694) T ss_pred ccccccccCcceeeeEeecccccccchhhhcc--ch--------hhccCCCceEEEe--ceEEeeeeE------------ Confidence 00112222 2566677766663211000 00 0111112221111 111111110 Q ss_pred cccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-----Ccccccccccc Q lcl|NC_021297. 
196 VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-----TTDELTNDGEN 270 (480) Q Consensus 196 ~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-----~~~~~~~~~~~ 270 (480) +.|...|+-.+. ......+|.|... -+.+-+++.+++.-.....+..++..-+. +++ ......-...- T Consensus 273 ---~~f~g~plPd~L--Kp~y~~~G~Sv~q-~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~dla~~L~~g~~~~l~~R~ 345 (694) T protein:vir:10 273 ---HTIVSRPVGDML--KPTYSFAGISMTQ-LAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQALMPGANVDLSMRA 345 (694) T ss_pred ---EEecCCCchhhh--hcccccCcccHHH-HHHHHHHHHHHHHhHHHHHHHhhhhHHHH-HHHHHhhcChhHHHHHHHH Confidence 000001110000 0011245666543 35556666666655555554333322111 110 00000000000 Q ss_pred chhhh--hhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc-cc-chHHHHHHHHHHHHHHH Q lcl|NC_021297. 271 TTLDI--YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKM 346 (480) Q Consensus 271 ~~~~~--~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~n-~~Sg~Al~~~~~~l~~k 346 (480) ..++. ....++.+++++-+|-+++. ++..+-+.+-.....+|..++||..-|-+.+ .. ++||++=..-|...+. T Consensus 346 eli~~~Rsn~G~~llDk~~Eefeq~st-slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~- 423 (694) T protein:vir:10 346 ELINRYRDNRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR- 423 (694) T ss_pred HHHHHhcCccceEEEecCCcceEEEec-ccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHH- Confidence 11111 11223344544556666653 4555666677778899999999986664332 22 4678876666666555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCHHHHHHhC-- Q lcl|NC_021297. 347 AERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPKEQARIDL-- 419 (480) Q Consensus 347 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~et~~~~l-- 419 (480) ...+..+...|++++.++.+-.-+..+. 
.+...|.+-...+..+.|+.-.|-+++ ..+.|+...++.+| T Consensus 424 -s~Qe~~L~p~L~rl~~ii~rS~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~ 499 (694) T protein:vir:10 424 -AYQRNALQQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNT 499 (694) T ss_pred -HHHHHHHHHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhc Confidence 5568889999999999876544333332 478899988889999998765554332 11233333333332 Q ss_pred ----CCChhHHHHHHH--HHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCC----------CCCCC Q lcl|NC_021297. 420 ----GYTATQREQMRD--WDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFN----------RTKTR 480 (480) Q Consensus 420 ----g~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~----------~~~~~ 480 (480) +|... ++.-.+ +..+.+.+....+.......++.++.++. ..+..++|+.++ ++++. T Consensus 500 d~~s~Y~~~-~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~g~~~~~~v~~~~~~~~~~~ag~~~~ 572 (694) T protein:vir:10 500 EPDGPYAGK-LDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA---RAGATAPPTVANVNANVNPREAGAQDA 572 (694) T ss_pred CCCcccccc-cccccCCCcCccchhhhhHhhhcCcccccccCCCCcc---cccccCCCcccccccccCccccCCCCc Confidence 11100 000000 00000000000011101000011111100 011111111110 00111 No 155 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.55 E-value=3e-07 Score=56.34 Aligned_cols=441 Identities=14% Similarity=0.088 Sum_probs=167.1 Q ss_pred CCCHHHH-------HHHHHHHHHHHH-HHHHHHHHHhcccCcchhcCcc--cchhhhhhccccchHHHHHHHHHhhhcc- Q lcl|NC_021297. 1 MTTYHEH-------VERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGIG--APPELAYLDVQPGWVATYLRTLSDRLDI- 69 (480) Q Consensus 1 ~~~~~~~-------i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~~~--~~~~~~~~~~~~n~~~~iVd~~~~~l~~- 69 (480) |.++... .++..+.+..++ +....++++|+ +.+++.... .....+..++..+-+...++++++.|.. 
T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~--y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 78 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAK--YTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLA 78 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhh Confidence 7765432 222333333322 22233333332 122221110 0111112234445566666666665532 Q ss_pred ---C-cc-ccC-CC---------ch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccC Q lcl|NC_021297. 70 ---E-GF-RIS-ED---------SE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD 123 (480) Q Consensus 70 ---~-~~-~~~-~d---------~~----~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d 123 (480) . +| ++. .| .. .. +.+...+..++|.....++.++...+|.|-+++... T Consensus 79 ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~------ 152 (535) T protein:vir:94 79 LFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEP------ 152 (535) T ss_pred hcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccC------ Confidence 1 22 211 11 01 11 223344566899999999999999999997776432 Q ss_pred CCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEec---------------CCceEEEEEEEeCCcEEEEEecCCccc Q lcl|NC_021297. 124 PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD---------------DVAVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 124 ~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) .+...+++.++-.+.++.-|. ..++...++.+.-.. +......+++|+. .|....++.. T Consensus 153 ~~~~~~f~~~pl~~y~v~~d~--~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~~~~~~~ 226 (535) T protein:vir:94 153 EGTYNPMKLYRLSSYVVQRDA--FGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTH----IYLDEESGEY 226 (535) T ss_pred cCcccceEEEEcCeEEEeeCC--CCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEE----EEeeCCCCcE Confidence 122245777776665555553 334544443332110 0111122333321 0111111111 Q ss_pred ccc-------cccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CC Q lcl|NC_021297. 
189 QWV-------VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV 259 (480) Q Consensus 189 ~~~-------~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~ 259 (480) .+. +.......+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........|...+. |. T Consensus 227 ~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~ 305 (535) T protein:vir:94 227 LKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEE-YLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGI 305 (535) T ss_pred EEEEEecCeeeccccccCccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhccCCcccccccc Confidence 111 11111235788899999999989999999997654 456666777665555555555555543331 11 Q ss_pred CccccccccccchhhhhhhhhcccCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhc-cccccchHHHHHH Q lcl|NC_021297. 260 TTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLS-SSSENPASAEAII 337 (480) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g-~~~~n~~Sg~Al~ 337 (480) ... .......++.+..-..++..+.++.. .+++.....++.+...|....-+ . .+. .+.... ++.-++ T Consensus 306 ~~~-------~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~-~~~~~d~~rv-TAtEV~ 375 (535) T protein:vir:94 306 TQV-------RRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFML-N-SAVQRTGERV-TAEEIR 375 (535) T ss_pred cch-------hhcccCCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhH-h-hhccCCCCCc-cHHHHH Confidence 100 00111222332221223334444443 34444334343333333222211 0 111 111111 333232 Q ss_pred HHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCHHHHH---HHHHHHH---- Q lcl|NC_021297. 338 ATDSRIVKM----AERKGRI-FGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTVAAKA---DAVSKLY---- 403 (480) Q Consensus 338 ~~~~~l~~k----~~~~~~~-~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~~~~~~a---d~~~kl~---- 403 (480) .....+... ..++... +.+-+.+.+.++.+- |. ..+.+. +++.+..++. 
.++.+ +.+...+ T Consensus 376 ~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~-g~lP~~p~~~--v~~~~vs~la--~l~r~~~~~~l~~~~~~la 450 (535) T protein:vir:94 376 YVASELEDTLGGVYSILSQELQLPMVRVLLKQLQAT-NQIPELPKEA--VEPTISTGME--ALGRGQDLDKLERCIAAWS 450 (535) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhC-CCCCCCChhh--ccceEeehHH--HHHHHHHHHHHHHHHHHHH Confidence 222111111 1111111 122223333333221 11 122232 4444432221 12222 2222222 Q ss_pred hcccc----CCCHHHH----HHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhccccc-CcccccCCCCCCCCccc Q lcl|NC_021297. 404 ANGQG----PIPKEQA----RIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKA-QADATPKPTVTDTKTET 467 (480) Q Consensus 404 ~~~~~----~~s~et~----~~~lg~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~ 467 (480) +.++. .+....+ .+.+|+. +++++.+.+..+++.+ ....+....++ ++..+..+......+++ T Consensus 451 q~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~-~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 529 (535) T protein:vir:94 451 ALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTA-MQNAAASAGAGAGTMATASPENMKAAAAQ 529 (535) T ss_pred hhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhcccccChHHHHHHHHH Confidence 22211 1222222 2334543 3333333222222211 11111111111 11111111111111111 Q ss_pred c-cCCC Q lcl|NC_021297. 468 Q-TSPS 472 (480) Q Consensus 468 ~-~~p~ 472 (480) . =.|+ T Consensus 530 ~g~~~~ 535 (535) T protein:vir:94 530 AGMAPN 535 (535) T ss_pred hccCCC Confidence 1 1233 No 156 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.55 E-value=3e-07 Score=56.33 Aligned_cols=406 Identities=15% Similarity=0.153 Sum_probs=156.2 Q ss_pred CCCHH--HHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchh---hhhhccccchHHHHHHHHHhhhccCccc-- Q lcl|NC_021297. 
1 MTTYH--EHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPE---LAYLDVQPGWVATYLRTLSDRLDIEGFR-- 73 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~---~~~~~~~~n~~~~iVd~~~~~l~~~~~~-- 73 (480) .+++. ++.. .+..-|-+... .+..+... .......+.....+|+..+.-+-.-++. T Consensus 9 ~~~p~~~~~~~--------------~~~~~~~~~~~---~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~ 71 (518) T protein:vir:78 9 LSAPAMAELSP--------------QMQDSYYYAPA---VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred eccchhhhhhh--------------hhhhcccccce---eceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEE Confidence 22221 1111 11111111100 00000000 0011111223344455444433222322 Q ss_pred -cCCCc---hhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeC Q lcl|NC_021297. 74 -ISEDS---EGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDP 144 (480) Q Consensus 74 -~~~d~---~~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~ 144 (480) ...+. .....+..++.+ | ....+...+..+.+.+|.+|+++-++ ..|.+ .+..++|..+.+..+. T Consensus 72 ~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~------~~G~~~~L~~l~p~~Vtv~~~~ 145 (518) T protein:vir:78 72 FTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN------KSGTPEKLMPMHPSRVAIKRNS 145 (518) T ss_pred EEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEECCCceEEEEcC Confidence 12111 112233444433 3 23456677888899999999998653 34555 5788999999888875 Q ss_pred CCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccc Q lcl|NC_021297. 145 RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i 224 (480) ... 
.+ .|+....+ +.......+.++++++ |++....+...|.|.+ T Consensus 146 ~~~-~~----~y~~~~~~-~~~~~~~~~~~~eIiH-----------------------------ir~~~~dg~~~G~Spi 190 (518) T protein:vir:78 146 RTG-RY----EYYFQAGA-GVGTQLVSFADDEVVP-----------------------------IRFFNPDGLERGLSLM 190 (518) T ss_pred CCC-EE----EEEEEecC-CccceeEEecCCcEEE-----------------------------ecCCCCCcccccccHH Confidence 432 11 11111111 1111112233333333 3322111223466655 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEecccc Q lcl|NC_021297. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE 297 (480) Q Consensus 225 ~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~ 297 (480) .- +...+.....+..-..+.+...+.|..+|..- .......+.-...|+- ..++++.++ ++.++..+.... T Consensus 191 ~~-~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~-~G~~~~~l~~~~ 268 (518) T protein:vir:78 191 ES-LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVE-EGMEPIPLQLTA 268 (518) T ss_pred HH-HHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcC-CCceEEeccCCh Confidence 32 22222222222222222223233454444321 1111100100111211 113344444 346776665432 Q ss_pred -HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 298 -LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 298 -~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) -..|++..+..+.+|+...++|+..+|....+.-|. +......+...+ -.-+-..++..+.. .+..... . 
T Consensus 269 ~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn--~e~~~~~f~~~t---L~P~~~~ie~eln~--~L~~~~~--~ 339 (518) T protein:vir:78 269 VEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSN--ISAQMRAFYRDT---MAIPIARIQSAMDK--YVGQYWV--R 339 (518) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchh--HHHHHHHHHHHH---HHHHHHHHHHHHHH--hhccccc--C Confidence 224788888889999999999999998543221121 121111111111 01111111111111 1111101 1 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH---HHHHHHHHHHHHHHHHHHHh-cccccCc Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ---REQMRDWDKQETEDMIDTLY-STTKAQA 452 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~---~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 452 (480) ...+++........|..++++++.+++++| +++...+++.+|+.+-+ -+++- ..... ..++... +...++. T Consensus 340 ~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G--~lT~NE~R~~~gl~pie~~~gD~~~--v~~n~-~pl~~~~~~~~~g~~ 414 (518) T protein:vir:78 340 KNRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELY--ANSAL-QPLGATPDGAVEGEE 414 (518) T ss_pred cceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceee--ecccc-eecccccccccCCCC Confidence 123455555666788889999999998876 66777788888875421 11110 00000 0000000 0000110 Q ss_pred cccc-CCCCCCC----Cccccc----CCCCCCCCCCC Q lcl|NC_021297. 453 DATP-KPTVTDT----KTETQT----SPSGFNRTKTR 480 (480) Q Consensus 453 ~~~~-~~~~~d~----~~~~~~----~p~~~~~~~~~ 480 (480) .+.+ .+..... 
+..+++ +++.++....+ T Consensus 415 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (518) T protein:vir:78 415 APAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDS 451 (518) T ss_pred CCCCCCCCcccccccccCccccCCCCCcccccccccc Confidence 0110 0100000 000111 11111111111 No 157 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=98.54 E-value=3.4e-07 Score=56.08 Aligned_cols=431 Identities=13% Similarity=0.088 Sum_probs=173.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc----c--Ccc-c Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD----I--EGF-R 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~--~~~-~ 73 (480) |+ .++..+.|-.+......+-+.+.+|..-.- ....+.......+..++.-+-+...++++++.|. + .+| + T Consensus 1 m~-~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~-~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (522) T protein:vir:10 1 MK-ARERYNQLTTARQMFLDKAVECSELTLPYL-IDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFK 78 (522) T ss_pred Cc-hHHHHHHHHHHhhHHHHHHHHHHHHhhhcc-cCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 76 455555555544444445555555542110 1111111111111234445566667777766553 2 222 2 Q ss_pred c--CC-------Cch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEE Q lcl|NC_021297. 74 I--SE-------DSE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) Q Consensus 74 ~--~~-------d~~----~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~ 133 (480) . .+ +.+ .. +.+...+..++|.....++.++...+|.|-+++.. 
++ ++.+ T Consensus 79 l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~--------~~---~~~~ 147 (522) T protein:vir:10 79 LQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGK--------DG---LKTF 147 (522) T ss_pred ccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcC--------CC---ceEE Confidence 2 11 100 11 12344566789999999999999999999876642 22 4556 Q ss_pred ccceeEEEEeCCCCCeeEEEEEEEEEe------------------cCCceEEEEEEEeCCcEEEEEecCCccccccc--- Q lcl|NC_021297. 134 SPLYMYAELDPRNTRRVTRAVRLYTTR------------------DDVAVPDRATLYLPDETVPLRRNGGLNDQWVV--- 192 (480) Q Consensus 134 ~p~~~~~v~d~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 192 (480) +-.++++--|. .+++...++.+.-. ...+....+++|+. + |.+.......|.. T Consensus 148 pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~---v-~p~~~~~~~~~~~~~~ 221 (522) T protein:vir:10 148 PLTRYVINRDG--DGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTY---V-KLDKSSGRWVWHQEAF 221 (522) T ss_pred EcceEEEeeCC--CCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEE---E-EeeccCCceEEEEccC Confidence 55665555543 33444444332211 00011112333321 0 1111111111110 Q ss_pred ----ccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCcccccc Q lcl|NC_021297. 193 ----DGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTN 266 (480) Q Consensus 193 ----~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~ 266 (480) ......-++..+|++.+..+...++.||+|-..+ ..+-+..+|.+.-......+....|.+.+. |...... 
T Consensus 222 ~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~-- 298 (522) T protein:vir:10 222 DKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEE-FLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPAT-- 298 (522) T ss_pred CccccccccccccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccc-- Confidence 0111234778899999998888899999998754 556677788877777777777777765552 1111110 Q ss_pred ccccchhhhhhhhhcccCCCCceEEEecc-ccHHH---HHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHH Q lcl|NC_021297. 267 DGENTTLDIYYGRILTLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSR 342 (480) Q Consensus 267 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~ 342 (480) ....+.+.+..-..++....++.. .+++. .++.++.-|+..+...+ ..+.... ++.-++..... T Consensus 299 -----l~~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~------~~d~~rv-TAtEV~~r~~E 366 (522) T protein:vir:10 299 -----IAKAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMN------VRNAERV-TAEEVRLTQLE 366 (522) T ss_pred -----ccCCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhcc------CCCCCCC-CHHHHHHHHHH Confidence 011111222111122333344332 34433 34444444444333221 1122222 33333332222 Q ss_pred HHH----HHHHHHH-HHHHHHHHHHHHHHHHhCC--CCcccc-eeeeEEecCCCCCCHHHHHHHHHHHHhcc-----cc- Q lcl|NC_021297. 343 IVK----MAERKGR-IFGGAWERAMRIAMQIMGR--EVTEEY-TRLETVWRDPSTPTVAAKADAVSKLYANG-----QG- 408 (480) Q Consensus 343 l~~----k~~~~~~-~~~~~l~~~~~~~~~~~~~--~~~~~~-~~i~v~f~~~~~~~~~~~ad~~~kl~~~~-----~~- 408 (480) +.. -..++.. .+..-+.+.+.++.+ .|. ..+.+. ....+++-.++.+ .+.++.+....+.. +. 
T Consensus 367 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~~~i~~~~~p~~ 443 (522) T protein:vir:10 367 LEQQLGGIFSLLVIEFLIPYLNRTLLVLQR-SNQIPKLPKDIVRPTIVAGVNALGR--GQDRESLTAFVGTIAQTLGPEA 443 (522) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCccccccccccchhHHHH--HHHHHHHHHHHHHHHHhhCchh Confidence 221 1222222 222334444544332 121 122221 1223445433332 23333333322211 11 Q ss_pred ---CCCHHH----HHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhc-cccc----CcccccCCCCCCCCccccc Q lcl|NC_021297. 409 ---PIPKEQ----ARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYS-TTKA----QADATPKPTVTDTKTETQT 469 (480) Q Consensus 409 ---~~s~et----~~~~lg~~-------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~~~~d~~~~~~~ 469 (480) .+.... +...+|++ +++++++.+..+++. ...++.. +.+. .+++...+++-+.- . T Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~--~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~----~ 517 (522) T protein:vir:10 444 LMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQA--AQQSLVDQAGQMTGSPLMDPTKNPQLMDEE----Q 517 (522) T ss_pred hhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHH--HHHHHHHHHHHHhcccccCccccHHHHHHh----C Confidence 111111 22334543 233333332222111 1111111 1111 11111112111111 1 Q ss_pred CCCCCC Q lcl|NC_021297. 470 SPSGFN 475 (480) Q Consensus 470 ~p~~~~ 475 (480) +| +-. T Consensus 518 ~~-~~~ 522 (522) T protein:vir:10 518 PP-MEE 522 (522) T ss_pred CC-CCC Confidence 11 111 No 158 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=98.52 E-value=3.7e-07 Score=55.82 Aligned_cols=371 Identities=12% Similarity=0.072 Sum_probs=146.9 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHHhccc--Ccchh-----cCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCC Q lcl|NC_021297. 
7 HVERLQGLLARDLPN--LLEAEAYRNGT--RRLKT-----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) Q Consensus 7 ~i~~l~~~~~~~~~r--~~~~~~YY~g~--~~i~~-----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d 77 (480) ++=.+.+.+...... -.....+...- ..+.. .+..+... .-+.+.=...+|+..++-+-.-++.+-.. T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~~v~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSAR---AALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchh---hhhcchHHHHHHHHHHHhhccCceeeccc Confidence 111111111100000 00000000000 00000 00011110 00111112334554444443334443321 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) . ....+.+-...-.-..+...+..+.+.+|.||+++-.+ .+|++ .+.+++|..+.+..+.... .+. T Consensus 78 ~-~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~~~-~~~----- 144 (392) T protein:vir:74 78 K-NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN------ANGADMKWEYLRPSQVNTYYFEYEN-GMY----- 144 (392) T ss_pred h-hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eEE----- Confidence 1 11111111111123556677888999999999887653 34554 6788899988877765322 111 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHH Q lcl|NC_021297. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~ 236 (480) |....+.+.......+.+++ |+||++....+..+|.|.|. .+...++... 
T Consensus 145 y~~~~~~~~~~~~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~i~-~~~~~i~~~~ 194 (392) T protein:vir:74 145 YNITFDDPKIEPILQAPQSD-----------------------------LIHMKLLSIDGGKTGISPLY-SLRRESKIQR 194 (392) T ss_pred EEEEecCCccceeEEEcCcc-----------------------------EEEecCCCCCCccccccHHH-HHHHHHHHHH Confidence 11111111111111222233 34444333334456887653 2333333222 Q ss_pred HHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hhh--hhhhhcccCCCCceEEEeccc-cHHHHHHHHHHHH Q lcl|NC_021297. 237 RTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LDI--YYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFR 309 (480) Q Consensus 237 ~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~----~~~--~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~ 309 (480) .+..-....+...+.|..+++ .......+++.... +.- ..+.++.++ ++.++.++... ....+++..+... T Consensus 195 ~~~~~~~~~f~ng~~p~~il~-~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~ 272 (392) T protein:vir:74 195 ASDRLTISSLNSSLNVPGVLT-VKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-DLEEFTALEIKSNVAQLLSQTDWTS 272 (392) T ss_pred HHHHHHHHHHhccCCCceEEE-eCCCCCchHHHHHHHHHHHhccccCCCeeecC-CCceEEEccCChhHHHHHHHHHHHH Confidence 222222222232333333332 11111111111111 111 112344444 44677777643 2335788889999 Q ss_pred HHHHhccCCChHHhccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCC Q lcl|NC_021297. 310 KEAASITGLPPQYLSSSSENPASAEAIIATDS-RIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPS 388 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~-~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~ 388 (480) .+|+...++|+..+|....+.++..+++..+. .|.--+...+. .+.+ .... .+++.+..-. 
T Consensus 273 ~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~~~~~~l~p~~~~ie~----~l~~-------~l~~-------~~~~~~~~~~ 334 (392) T protein:vir:74 273 KQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAIS----ELEY-------KLSD-------HISVNMRPAI 334 (392) T ss_pred HHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHH----HHHH-------hccc-------hhcccchhhh Confidence 99999999999999875544433333322111 11111111111 1111 1111 1222222223 Q ss_pred CCCHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 389 TPTVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 389 ~~~~~~~ad~~~kl~~~~~~~~s~et~~~~---lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) ..+..+.++.+.+++.+| +++...++++ .|+.++++.+.+ . ....+ ++| T Consensus 335 ~~d~~~~~~~~~~l~~~g--~~t~near~~~~~~g~~pne~r~~e------------n-l~~~~----------~Gd--- 386 (392) T protein:vir:74 335 DPLGDNYLSTISTATRWG--ALAENQATFVLQEAGYIPKDLPAPE------------N-TNKKT----------TGQ--- 386 (392) T ss_pred cCCHHHHHHHHHHHHhCC--CcCHHHHHHHHHhCCCCccccchhc------------C-CCCCC----------CCC--- Confidence 355667788888888765 5666555544 366655432210 0 00001 111 Q ss_pred ccccCC Q lcl|NC_021297. 466 ETQTSP 471 (480) Q Consensus 466 ~~~~~p 471 (480) +++..| T Consensus 387 ~~~p~p 392 (392) T protein:vir:74 387 SNEPVP 392 (392) T ss_pred CCCCCC Confidence 111112 No 159 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.50 E-value=4.2e-07 Score=55.56 Aligned_cols=431 Identities=12% Similarity=0.069 Sum_probs=176.5 Q ss_pred CCC-HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc--cchhhhhhccccchHHHHHHHHHhhhc----c--Cc Q lcl|NC_021297. 1 MTT-YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG--APPELAYLDVQPGWVATYLRTLSDRLD----I--EG 71 (480) Q Consensus 1 ~~~-~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~--~~~~~~~~~~~~n~~~~iVd~~~~~l~----~--~~ 71 (480) |-+ -+...+.|-.+......+.+.+.+|..- +.... 
.....+..++.-+-+...++++++.|. + .+ T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP-----~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 75 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLP-----YLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTS 75 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----ccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 443 3444555544444444555555666422 11100 001111123444556677777776553 2 22 Q ss_pred cc-c--CC---------Cchh-----------HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 72 FR-I--SE---------DSEG-----------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 72 ~~-~--~~---------d~~~-----------~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) |+ + ++ +++. .+.+...+..++|.....++.++...+|.|.+++.. + T Consensus 76 WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~--------~--- 144 (542) T protein:vir:78 76 FFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGK--------K--- 144 (542) T ss_pred cccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecC--------C--- Confidence 22 2 11 1111 123445667789999999999999999999776632 1 Q ss_pred EEEEEccceeEEEEeCCCCCeeEEEEEEEEEec-----------------------CCceEEEEE-EEeCCcEEEEEec- Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD-----------------------DVAVPDRAT-LYLPDETVPLRRN- 183 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~-~~~~~~~~~~~~~- 183 (480) .+++++-.+.++--|. ..++...++.+...- .+...+.+. ++.....-.|... 
T Consensus 145 ~~~~~pl~~y~v~~d~--~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~ 222 (542) T protein:vir:78 145 TLKVYPLDRYVIERDG--DGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCK 222 (542) T ss_pred CceEEecceeEEeeCC--CCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccc Confidence 2555655554444443 233444444332110 000111111 1111111011100 Q ss_pred -CCcccccccc-------cccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhh Q lcl|NC_021297. 184 -GGLNDQWVVD-------GDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV 255 (480) Q Consensus 184 -~~~~~~~~~~-------~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~ 255 (480) ......+..+ ....+.+|..+|++.+..+...++.||+|-..+ ..+-+..+|.+.-......+....|.+. T Consensus 223 ~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~pp~l 301 (542) T protein:vir:78 223 LVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEE-FFGDLSSLDALTRSLIEGSAAAAKVVFM 301 (542) T ss_pred cCCCeEEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 0000001111 111234778899999998888899999998754 5677788888888888888887887655 Q ss_pred hc--CCCccccccccccchhhhhhhhhcccCCCCceEEEec-cccHH---HHHHHHHHHHHHHHhccCCChHHhcccccc Q lcl|NC_021297. 256 IS--GVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELR---NFAEEMEVFRKEAASITGLPPQYLSSSSEN 329 (480) Q Consensus 256 i~--g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n 329 (480) +. |... ........++.+..-..++....++. ..+++ ..++.++.-|+..+...+. .++.. 
T Consensus 302 v~~~g~~~-------~~~~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~~------~d~~r 368 (542) T protein:vir:78 302 VSPSATTK-------PQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILNV------RQSER 368 (542) T ss_pred eccccccc-------hhhcccCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhccccc------CCccc Confidence 42 1110 00111222333322222333344433 22443 3344444444443332211 01111 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCH-HH Q lcl|NC_021297. 330 PASAEAIIATDSRIVKMAERKGRIFGGAW------------ERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTV-AA 394 (480) Q Consensus 330 ~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l------------~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~~~-~~ 394 (480) . ++.-++ ..++.+...++..+ .+.+.++.+ .|. ..+.+ -+++.+..++..-- .+ T Consensus 369 v-TAtEV~-------~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r-~g~lP~~p~~--lv~~~~~s~La~~~r~~ 437 (542) T protein:vir:78 369 T-TATEVR-------EVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQR-SKQLPSLPKG--LVMPTVVAGLGGVGRGE 437 (542) T ss_pred c-cHHHHH-------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchh--ceeeeeechHHHHHHHH Confidence 1 222222 22233333344333 333333222 111 12222 25666654442211 11 Q ss_pred HHHHHHHHHhc-----ccc----CCCHHHH----HHhCCCCh-------hHHHHHHHHHHHHHHHHHHHHhc-ccccCc- Q lcl|NC_021297. 395 KADAVSKLYAN-----GQG----PIPKEQA----RIDLGYTA-------TQREQMRDWDKQETEDMIDTLYS-TTKAQA- 452 (480) Q Consensus 395 ~ad~~~kl~~~-----~~~----~~s~et~----~~~lg~~~-------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~- 452 (480) .++.+....+. ++. .+....+ ...+|... ++++.+.+..++ +.....+.. +....+ T Consensus 438 ~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~--~~~~~al~~~a~~~a~~ 515 (542) T protein:vir:78 438 DRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQ--QQMTASLMGQAGQLAKS 515 (542) T ss_pred HHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHH--HHHHHHHHHhhhhcccc Confidence 22222221111 111 1222222 22345532 333322222111 111121111 111010 Q ss_pred ---ccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 
453 ---DATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 453 ---~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) +.......+++++...++-+|-+- T Consensus 516 ~~~~~~~~~~~a~~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 516 PIGEKMMQQINAPGQEAPAGPQTGEDL 542 (542) T ss_pred ccccchhhhcCCCCcCCCCCCcccccC Confidence 111111122222223333344444 No 160 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.50 E-value=4.4e-07 Score=55.44 Aligned_cols=435 Identities=13% Similarity=0.094 Sum_probs=170.9 Q ss_pred HHHHHHHHHHHHHHH----HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc----c--Ccc- Q lcl|NC_021297. 4 YHEHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD----I--EGF- 72 (480) Q Consensus 4 ~~~~i~~l~~~~~~~----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~--~~~- 72 (480) -++.+++..+.+..+ ..+.+.+.+|..-. ...........+..++..+-+...++++++.|. + .+| T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~---~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF 77 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPY---ILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFF 77 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 233344444444433 33334444443221 111111111122234455666777777776653 2 122 Q ss_pred ccC-CC---------chhH-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE Q lcl|NC_021297. 73 RIS-ED---------SEGL-----------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 73 ~~~-~d---------~~~~-----------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~ 131 (480) ++. .| ++.. +.+...+..++|.....++.++...+|.+-+++.. 
++ ++ T Consensus 78 ~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~--------~~---~~ 146 (555) T protein:vir:17 78 KLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGK--------KN---LK 146 (555) T ss_pred ccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecC--------Cc---ee Confidence 221 11 1111 22344456689999999999999999998765532 21 45 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEE-Eec------C------------------------------CceEEEEEEEeC Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYT-TRD------D------------------------------VAVPDRATLYLP 174 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~-~~~------~------------------------------~~~~~~~~~~~~ 174 (480) +++-.+.++-.|. .+++...++.+. ... . ......+++|+. T Consensus 147 ~~pl~~y~v~~d~--~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~ 224 (555) T protein:vir:17 147 LYPLDRFVVSRDG--EGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTY 224 (555) T ss_pred EEEcCeEEEeeCC--CcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeec Confidence 5555555444553 234444443322 100 0 000011223321 Q ss_pred ----C-cEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021297. 175 ----D-ETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL 249 (480) Q Consensus 175 ----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~ 249 (480) . .+++|....+. .......+-+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-......+.. T Consensus 225 ~~~~~~~~~~~~e~~~~----~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~ 299 (555) T protein:vir:17 225 VCRKDGQVKWHQECDGK----VIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEE-FMGDLKSLEALSQAMVEGSAAS 299 (555) T ss_pred ccccCCeeEEEEecCce----eccccccccCcccCCeeeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 1 11111111111 011112245777899999999888899999997754 6677888888888888888888 Q ss_pred cchhhhhc--CCCccccccccccchhhhhh-hhhcccCCCCceEEEecc-ccH---HHHHHHHHHHHHHHHhccCCChHH Q lcl|NC_021297. 
250 GTPLRVIS--GVTTDELTNDGENTTLDIYY-GRILTLASEAAKISEFKA-AEL---RNFAEEMEVFRKEAASITGLPPQY 322 (480) Q Consensus 250 ~~~~~~i~--g~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~-~~~---~~~~~~l~~~~~~i~~~~~~p~~~ 322 (480) ..|.+.+. |..... .+..+. +.+..-..++....++.. .++ +..++.++.-|++.+...+. T Consensus 300 ~~pp~lv~~~g~~~~~--------~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~~~---- 367 (555) T protein:vir:17 300 AKVVFMVSPSATTKPQ--------NLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLMLQV---- 367 (555) T ss_pred hCCceeeccccccCcc--------eeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhcCC---- Confidence 88875552 111110 111111 122111112222333332 223 34455555555554433221 Q ss_pred hccccccchHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCC-CHHH Q lcl|NC_021297. 323 LSSSSENPASAEAIIATDSRIVKM----AERKG-RIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTP-TVAA 394 (480) Q Consensus 323 ~g~~~~n~~Sg~Al~~~~~~l~~k----~~~~~-~~~~~~l~~~~~~~~~~~~~--~~~~~~~~i~v~f~~~~~~-~~~~ 394 (480) .+.... ++.-++.....+... ..+.. ..+..-+.+.+.++.+- |. ..+.+...+++. -+... ...+ T Consensus 368 --~d~~r~-TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~-g~lP~~p~~~v~~~i~--~~l~~l~r~~ 441 (555) T protein:vir:17 368 --RQSERT-TATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQ-RKLPQLPKDLVQPTVV--AGLWGVGRGQ 441 (555) T ss_pred --CCcccc-hHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhC-CCCCCCCHhhhcccee--ehHHHHHHHH Confidence 111221 232222222111111 12222 22222333444433321 21 222232233222 11111 0111 Q ss_pred HHHHHHHHH----hcc-c----cCCCH----HHHHHhCCC-------ChhHHHHHHHHHHHHHHHH--HH---HHhcccc Q lcl|NC_021297. 395 KADAVSKLY----ANG-Q----GPIPK----EQARIDLGY-------TATQREQMRDWDKQETEDM--ID---TLYSTTK 449 (480) Q Consensus 395 ~ad~~~kl~----~~~-~----~~~s~----et~~~~lg~-------~~~~~~~~~~~~~~~~~~~--~~---~~~~~~~ 449 (480) ..+.+...+ +.. . ..+.. +.+...+|+ ++++++++.+.++++.+.. .. ++..... 
T Consensus 442 ~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~ 521 (555) T protein:vir:17 442 DKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPM 521 (555) T ss_pred HHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 222222222 211 0 01222 223445666 3455555543322221111 00 1111000 Q ss_pred c-C--cccccCCCCCCCCc--cc-ccCCCCCCCC Q lcl|NC_021297. 450 A-Q--ADATPKPTVTDTKT--ET-QTSPSGFNRT 477 (480) Q Consensus 450 ~-~--~~~~~~~~~~d~~~--~~-~~~p~~~~~~ 477 (480) + + ....+.+....+.+ .+ ..+|-|.-++ T Consensus 522 ~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 522 AEQAMQLIQQQQEGAQDAGAAESETSSAEAQAGA 555 (555) T ss_pred hhhHHhccccchhhhhHHHHHHhhcCCcccccCC Confidence 0 0 00011111111110 01 1112222222 No 161 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.47 E-value=5.3e-07 Score=54.99 Aligned_cols=412 Identities=13% Similarity=0.102 Sum_probs=156.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..+++.+.+. ....+.-......+||+- ++ -+..+++..-...+...+|+..+..+...++.+..+. 
T Consensus 11 ~~~~~~i~~~---~~~s~~~~~~~~~~~~~p--p~------~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~-- 77 (542) T protein:vir:41 11 LEKYKAIKRE---EVESQALGETRFEEYVEP--KV------NPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDD-- 77 (542) T ss_pred cccchhhhhc---cccccccccccCCccccC--CC------CHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeeccc-- Confidence 1111111111 000000000011112210 00 1122333333335667788888877777777664322 Q ss_pred HHHHHHHHHhc--CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 81 LEELWNWWQAN--DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 81 ~~~l~~i~~~n--~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ...+..++-.. ....+...+..+.+.+|.||+.+-.+ ..|++ .+.+++|..+.+..|... . ..+ T Consensus 78 ~~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd------~~G~~~~L~~l~~~~v~v~~d~~~---~---~~~- 144 (542) T protein:vir:41 78 EGVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRD------DRGDPIRFEYIPSHTIRVHKDGSR---Y---RQT- 144 (542) T ss_pred chhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEEcCcceEEEEcCCe---e---Eee- Confidence 22333333221 34566778888999999999988653 33444 478888988877665321 0 111 Q ss_pred EEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHH Q lcl|NC_021297. 158 TTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASR 237 (480) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~ 237 (480) .+..+ .+++..|.....+.. ..+.. ...++.=-|+||++.....+.+|.|.+.. +...+.. 
T Consensus 145 --~~~~~-~~~~~~y~~~~~~~~--~~g~~----------~~~~~~~eIiHir~~~~~~~~~Glspi~~----~~~~i~~ 205 (542) T protein:vir:41 145 --WDGVN-ITHFKDYRYEGEINP--ETGED----------QDSVGANELVFIHIPSPVCSYYGVPRYVS----AAPAILA 205 (542) T ss_pred --ecCCc-ceeEEeecccccccc--ccccc----------ccccCcccEEEecCCCCCCCcccccHHHH----HHHHHHH Confidence 11111 122222222211110 00000 00111113677877666677899987643 3333333 Q ss_pred HHHHHHHHHHHhcc---hh--hhhcCCCccccccccc---------cchhhh-------hhhhhcccC-----CCCceEE Q lcl|NC_021297. 238 TLMNLQSASQILGT---PL--RVISGVTTDELTNDGE---------NTTLDI-------YYGRILTLA-----SEAAKIS 291 (480) Q Consensus 238 ~~s~~~~~~~~~~~---~~--~~i~g~~~~~~~~~~~---------~~~~~~-------~~~~~~~~~-----g~~~~~~ 291 (480) ...-......+|.+ |. +.+.|...++...+.. ...|+. ..++++.++ +++.++. T Consensus 206 ~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~ 285 (542) T protein:vir:41 206 MQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFT 285 (542) T ss_pred HHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEE Confidence 22222222233333 32 3333432221111100 001110 112232321 2345666 Q ss_pred Eeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021297. 292 EFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKM-AERKGRIFGGAWERAMRIAMQIM 369 (480) Q Consensus 292 ~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k-~~~~~~~~~~~l~~~~~~~~~~~ 369 (480) .+... .-..|++..+....+|+...++|+..+|....+..++..+......+... ..-..+.+...|.+ .. 
T Consensus 286 pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~-------~L 358 (542) T protein:vir:41 286 PLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTD-------FF 358 (542) T ss_pred EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHh-------hc Confidence 65432 22347788888899999999999999986432211111112111111111 11111222222221 11 Q ss_pred CCCCcccceeeeEEecC--CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 370 GREVTEEYTRLETVWRD--PSTPTVAAKADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMRDWDKQETEDMIDTLYS 446 (480) Q Consensus 370 ~~~~~~~~~~i~v~f~~--~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l-g~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (480) .... ...+.+.|.. ....+ ....+.+++++| +++...+++.| |+.+-+...+.-. .. ....+.. T Consensus 359 ~~~~---~~~~~~~f~~~~ll~~d---~~~~~~~~v~~G--ilT~NE~Re~L~g~~pgdd~~l~p~---~~--~~~~~~~ 425 (542) T protein:vir:41 359 QVKF---NPKTRFKFNDETLLESD---SVRNCALLVQSG--VLTPAEARERLFGLDGGPDIFMVPS---KG--AAKSVKR 425 (542) T ss_pred cccc---CCceEEEecchhhcchH---HHHHHHHHHhCC--CCCHHHHHHhhCCCCCCCccccccc---cc--ccccccc Confidence 1011 1234455543 22222 223344555554 55665566644 5432110000000 00 0000000 Q ss_pred ccccCcccccCCCCCCCCcccccCC------CCCCC---CCCC Q lcl|NC_021297. 447 TTKAQADATPKPTVTDTKTETQTSP------SGFNR---TKTR 480 (480) Q Consensus 447 ~~~~~~~~~~~~~~~d~~~~~~~~p------~~~~~---~~~~ 480 (480) ++....+++..........+.| +..+. +++. 
T Consensus 426 ---~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~ 465 (542) T protein:vir:41 426 ---QERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKK 465 (542) T ss_pred ---CCcCCCCCchhhhhhcccccCccccccccccccchhhccc Confidence 0000000000000000000000 00000 0000 No 162 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=98.45 E-value=6.2e-07 Score=54.61 Aligned_cols=449 Identities=10% Similarity=0.088 Sum_probs=203.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) +-+.+.- ++.-..-..+|+.+..+++-+..+..+-.. -++.+-....|.-- |-...++-+-.+.. T Consensus 43 ~~~~~~~----~~~~~eLI~~YR~ma~~pEvd~Av~eIVne--------aiv~d~~~~pV~i~---Ld~~~~s~~iK~kI 107 (558) T protein:vir:10 43 YVDIEGA----YRSEYDLIRRYREMALHPEADGAIEDVVNE--------AIVSDLYDSPVEVE---LSNLNASNTLKKKI 107 (558) T ss_pred eecccch----hhhHHHHHHHHHHHhhccchhhHHHHhhcc--------eeEecCCCceEEEE---ecccCcchHHHHHH Confidence 1111000 000011112333444444433333221000 00000000000000 00001111112233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+.-.+.|+.|++...... ...+|-..++.++|+.+-.|..-... ..-........ 
T Consensus 108 ~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k--~pk~GI~ELr~lDPr~i~~Vr~i~~~--~~~~~~~~~~~ 183 (558) T protein:vir:10 108 REEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTK--NPQEGIQDLRYIDPLKIKFIRQEKRK--PGNQDPAIRVR 183 (558) T ss_pred HHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC--CccccceeeeeeCcccceeeeeeccc--cccccceeeee Confidence 445556666668889999999999999999998765321 23568888999999988766542111 11001111111 Q ss_pred cCCc-----eEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCcc--CCcccchhhHHHH Q lcl|NC_021297. 161 DDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNR--YGRSEISPELRKV 231 (480) Q Consensus 161 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~--~G~s~i~~~v~~l 231 (480) +..+ ....+-+|.+.........+. ... .+++ +||- +.|........+ .=.|-|.+.++++ T Consensus 184 ~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~-----~~~----~~~v-kI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~ 253 (558) T protein:vir:10 184 SEQDVVPNPEFEEFYIYTPKVQHPTGMVGQ-----MGG----KNSI-KIAKDSITMCTSGLVDRNKNRVLSYLHKAIKAL 253 (558) T ss_pred cccceeeccceeEeeeecCCccccccccee-----ecC----CCce-eechhheeeecccceecCCCeeeecchHhhHhH Confidence 1111 111222344432222111110 000 0111 2221 333332111111 1123344444443 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhhhhcc--- Q lcl|NC_021297. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYGRILT--- 282 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~~~~~--- 282 (480) -. -+++.|.+.+.+....|-+=++-.+.-..+..+... .+-......|. 
T Consensus 254 NQ--LkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRR 331 (558) T protein:vir:10 254 NQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRR 331 (558) T ss_pred Hh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhccccc Confidence 22 257788888877777777655422222222211100 01111222332 Q ss_pred cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 283 LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) .+|-+.++..++...--.-++-++-....++...++|.+-++..++- ..-+..|-.-+.....-+.+++..|..-+.++ T Consensus 332 eGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~ 411 (558) T protein:vir:10 332 EGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDM 411 (558) T ss_pred CCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 13445667777765433445566777778888889988877644322 11233566677788888999999999999999 Q ss_pred HHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCCChhHHHHH Q lcl|NC_021297. 362 MRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGYTATQREQM 429 (480) Q Consensus 362 ~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~~~~~~~~~ 429 (480) ++.=+-+.|.-...+| ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. 
|.+++++++++ T Consensus 412 Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~ 491 (558) T protein:vir:10 412 LKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEI 491 (558) T ss_pred HHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHH Confidence 9876555554333333 356777754333333333332221 22222233 499998765 79999999988 Q ss_pred HHHHHHHHHHHH-------HHHhccc-ccCcccc-cCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 430 RDWDKQETEDMI-------DTLYSTT-KAQADAT-PKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 430 ~~~~~~~~~~~~-------~~~~~~~-~~~~~~~-~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .+.++++..+.+ +.+.+.. +.+++.. +....+-...+.+++++..+-..+| T Consensus 492 ~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (558) T protein:vir:10 492 DTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSK 551 (558) T ss_pred HHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhhhhhhhhh Confidence 776666543211 1111100 0000000 0111110111223444444444455 No 163 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=98.44 E-value=4.4e-07 Score=55.43 Aligned_cols=453 Identities=10% Similarity=0.065 Sum_probs=201.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhh-cc----Ccccc- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-DI----EGFRI- 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l-~~----~~~~~- 74 (480) +++....+.. .-+-.+.++|-| +..+.....+ ..|+.+ ..+.=+...|+..++-. +. 
+++.+ T Consensus 25 ~~~~~~~i~~---------g~~g~~v~~~g~-~~~~n~~eLI-~~YR~m-a~~pEVd~Av~eIVneaIv~d~~~~pV~vd 92 (564) T protein:vir:10 25 DEASVSTVAG---------GYFGTYVDTSGG-QNSRNEYELI-RRYRDM-SLHPEVDSAIDEIVNEFVVNDGDDKPVEVD 92 (564) T ss_pred cCCChhhhhc---------cccceeeecccc-cchhhHHHHH-HHHHHH-hhccchhhHHHHhhcceeEecCCCceEEEE Confidence 3333322210 000001111111 1110000000 111111 01111111222111111 00 11111 Q ss_pred --------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCC Q lcl|NC_021297. 75 --------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRN 146 (480) Q Consensus 75 --------~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~ 146 (480) +-.++..+.+..|++--+|+....+..+.-.+.|+.|.+.-.+. ....+|-..++.+||+.+-.+|.... T Consensus 93 L~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~--~~pk~GI~eLr~lDPr~i~~vr~i~~ 170 (564) T protein:vir:10 93 LQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDL--DNPKKGILELRYIDSLKIRKVRQKLK 170 (564) T ss_pred ecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeC--CChhhhhhhhhhhcccceeeeeeecc Confidence 11122344556666666888899999999999999998854321 12356777889999998887774322 Q ss_pred CC--eeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccc-----ccce--EEeecccccCc Q lcl|NC_021297. 147 TR--RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG-----VVPV--VPLTNDPRLGN 217 (480) Q Consensus 147 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~iPv--v~~~n~~~~~~ 217 (480) +. .....++-+......+....+.+|.+.. | .+..+ .....+.+. +||. 
|.|.+.-...+ T Consensus 171 ~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~---~--~g~~~------~~~~~~~~~~~~~ikI~~daI~y~hSGL~d~ 239 (564) T protein:vir:10 171 DVDPNRKEIEKGTALQYDYGDFIEYYIYNPKG---F--AGNIP------MVTGSMDWSNQEGIKIASDAIAQSTSGLMDL 239 (564) T ss_pred ccccccceeeeeeeeeccccccccceeecccc---c--cCccc------ccccccccccccceeechhhcceecccceeC Confidence 11 1111111111111111111112222211 0 00000 000000000 1222 11222111111 Q ss_pred cCC--cccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc------------------------ Q lcl|NC_021297. 218 RYG--RSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT------------------------ 271 (480) Q Consensus 218 ~~G--~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~------------------------ 271 (480) .-| .|-|.+.++++-. -+++.|.+.+.+....|-+=++-.+.-..+..+... T Consensus 240 ~~~~i~gyLhkAIKp~NQ--LkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrd 317 (564) T protein:vir:10 240 NKKMTLSFLHKAIKSLNQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRD 317 (564) T ss_pred CCCceeccchhhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecc Confidence 111 1224444444322 256778888877777776655422222222111100 Q ss_pred --hhhhhhhhhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc--cchHHHHHHHHHHHHH Q lcl|NC_021297. 272 --TLDIYYGRILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE--NPASAEAIIATDSRIV 344 (480) Q Consensus 272 --~~~~~~~~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~--n~~Sg~Al~~~~~~l~ 344 (480) .+-......|. .+|-..++..++...--.-++-++-....++...++|.+-+..... +..-+..|-.-+.... 
T Consensus 318 drk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~ 397 (564) T protein:vir:10 318 DKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFT 397 (564) T ss_pred cchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHH Confidence 01112222332 1344566777776543334556677777888888998887764421 2222335666777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHH Q lcl|NC_021297. 345 KMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKE 413 (480) Q Consensus 345 ~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~e 413 (480) .-+.+++..|..-+.++++.=+-+.|.--..+| ..|.+.|..-..-.....++.+.. +.+...+. +|.+ T Consensus 398 KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~d 477 (564) T protein:vir:10 398 KFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTE 477 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 889999999999999999876555554333333 356777754333333333332221 22222233 4999 Q ss_pred HHHHh-CCCChhHHHHHHHHHHHHHHHHHHH-------Hhccc-ccC----------cccccCCCCCCCCcccccCCCCC Q lcl|NC_021297. 414 QARID-LGYTATQREQMRDWDKQETEDMIDT-------LYSTT-KAQ----------ADATPKPTVTDTKTETQTSPSGF 474 (480) Q Consensus 414 t~~~~-lg~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~-~~~----------~~~~~~~~~~d~~~~~~~~p~~~ 474 (480) ++++. |..++++++++.+.++++..+..-. +.... ++. ++..+.++.....+.++.+|... T Consensus 478 yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 557 (564) T protein:vir:10 478 YIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQ 557 (564) T ss_pred HHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCC Confidence 98765 7999999998877766664432100 00000 000 00011111111122334555666 Q ss_pred CCCCCC Q lcl|NC_021297. 
475 NRTKTR 480 (480) Q Consensus 475 ~~~~~~ 480 (480) +..++| T Consensus 558 ~~~~~~ 563 (564) T protein:vir:10 558 SKSQSN 563 (564) T ss_pred CcCcCC Confidence 777777 No 164 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=98.40 E-value=8.3e-07 Score=53.91 Aligned_cols=430 Identities=14% Similarity=0.079 Sum_probs=173.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHH----HHHH--------------HHhcccCcchhcCcccchhhhhhc--cccchHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNL----LEAE--------------AYRNGTRRLKTIGIGAPPELAYLD--VQPGWVATYL 60 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~----~~~~--------------~YY~g~~~i~~~~~~~~~~~~~~~--~~~n~~~~iV 60 (480) ...+..- -+|.+.|..+..-+ +.-. .+|.|..-| .+.-.- .-+.-.++++ T Consensus 60 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~---------Gy~~la~laQ~~eyr~~~ 129 (698) T protein:vir:10 60 VAEPSPS-LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFP---------GFPTLVLLAQLPEYRAMH 129 (698) T ss_pred ccCCCcc-ccccccceeccccCCccccchhhhhhcccccccccchhhhccCcc---------hHHHHHHHhhccchhhHH Confidence 2222111 11222222111100 0001 122221111 110000 0112233444 Q ss_pred HHHHhhhccCc---------------ccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcc Q lcl|NC_021297. 61 RTLSDRLDIEG---------------FRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDV 119 (480) Q Consensus 61 d~~~~~l~~~~---------------~~~------~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~ 119 (480) .+.+..+.-.. +.+ ..|.+..+.|..-+++-++...+.++.+.+-.||.+.+++..... 
T Consensus 130 ~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gd 209 (698) T protein:vir:10 130 EVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGD 209 (698) T ss_pred HHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecC Confidence 44444442221 111 112244556777777778888999999999999998655432110 Q ss_pred c-----------ccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcc Q lcl|NC_021297. 120 E-----------SGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN 187 (480) Q Consensus 120 ~-----------~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (480) . ..-..|.. -+.+++|..+.|-.-...+ +. ..+.+.+++..+- +..++..+- T Consensus 210 d~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~d--P~--------spdfgkP~~y~V~--G~~IH~SRL---- 273 (698) T protein:vir:10 210 DQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSIN--PV--------ADDFYKPSTWWMI--GSEVHATRL---- 273 (698) T ss_pred ccccccccccccccccCccceeeeeecccccccchhhhcc--ch--------hhccCCCceEEEe--cceecceeE---- Confidence 0 00011121 2555666666552110000 00 0011111111111 111110000 Q ss_pred cccccccccccccccc--cceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc Q lcl|NC_021297. 188 DQWVVDGDVIKHGLGV--VPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT 265 (480) Q Consensus 188 ~~~~~~~~~~~~~~~~--iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~ 265 (480) +.|.. +|-. ++ .....+|.|... -+.+-+++++++.-.....+..+....+. +++... .. 
T Consensus 274 -----------~~~vg~pvpd~-LK---p~y~f~G~Sv~q-~~~e~V~~~~rT~~~v~~Li~~~~~~~l~-~dla~a-L~ 335 (698) T protein:vir:10 274 -----------HTIVSRPVGDM-LK---PTYSFAGISMTQ-LAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQA-LT 335 (698) T ss_pred -----------EEecCCCchhh-hc---chhccCCccHHH-HHHHHHHHHHHHhhhHHHHHHHhhHHHHH-HHHHHh-cC Confidence 00000 1110 11 012245766553 35566666666655555554333332221 111100 00 Q ss_pred cccc------cchhhh--hhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc-cc-chHHHH Q lcl|NC_021297. 266 NDGE------NTTLDI--YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEA 335 (480) Q Consensus 266 ~~~~------~~~~~~--~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~n-~~Sg~A 335 (480) .... -..+.. ....++.+++++-+|.+++. ++..+-+.+......+|..++||..-|-+.+ .. ++||++ T Consensus 336 ~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st-~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~ 414 (698) T protein:vir:10 336 PGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEG 414 (698) T ss_pred ChhhHHHHHHHHHHHHhcCccceEEEecCCcceEEEec-CcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchh Confidence 0000 011111 11223344544556666653 4555666677778899999999986664332 22 467887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCC Q lcl|NC_021297. 336 IIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPI 410 (480) Q Consensus 336 l~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~ 410 (480) =..-|...+. ...+..+...|++++.++.+-.-+..+. 
.+.+.|.+-...+..+.|+.-.|-+++ ..+.| T Consensus 415 D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~rS~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI 489 (698) T protein:vir:10 415 EIRVWYDYVR--AYQRNALQQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVI 489 (698) T ss_pred hHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCC Confidence 6666666555 5568889999999999876544333433 478899988889999998865554332 12344 Q ss_pred CHHHHHHhC------CCCh--hHHHHHHHHHHHHHHHHHHHHhcccccCc--ccccCCCCCCCCcccccCC-CCCCCCCC Q lcl|NC_021297. 411 PKEQARIDL------GYTA--TQREQMRDWDKQETEDMIDTLYSTTKAQA--DATPKPTVTDTKTETQTSP-SGFNRTKT 479 (480) Q Consensus 411 s~et~~~~l------g~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~d~~~~~~~~p-~~~~~~~~ 479 (480) +...++.+| ||.. |..++--.-.+-..+.....+++...+.. .++.....-.+.+.+.+.. -..+.+.| T Consensus 490 ~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 569 (698) T protein:vir:10 490 RPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPR 569 (698) T ss_pred CHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCcc Confidence 444433332 2210 00000000000001111111111111110 0000000001111111000 01111111 Q ss_pred C Q lcl|NC_021297. 480 R 480 (480) Q Consensus 480 ~ 480 (480) + T Consensus 570 ~ 570 (698) T protein:vir:10 570 E 570 (698) T ss_pred c Confidence 1 No 165 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=98.39 E-value=8.8e-07 Score=53.79 Aligned_cols=445 Identities=12% Similarity=0.081 Sum_probs=199.6 Q ss_pred CCCHHHHHHH-----------HHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhcc Q lcl|NC_021297. 1 MTTYHEHVER-----------LQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~~~~i~~-----------l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~ 69 (480) ++....++.. 
-++.-..-..+|..+..+++-+..+..+-.. -++.+-....|.-- |-. T Consensus 28 ~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVne--------aiv~d~~~~pV~i~---Ld~ 96 (537) T protein:vir:10 28 LDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNE--------TICGNFDDVPISID---LHN 96 (537) T ss_pred ccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcc--------eeEecCCCceEEEE---ecc Confidence 1100000000 0000001111233333333333322211000 00000000000000 000 Q ss_pred CccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCe Q lcl|NC_021297. 70 EGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 70 ~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~ 149 (480) ..++-+-.+...+.+..|++--+|+....+..+.-.+.|+.|++...... ...+|-..++.++|+.+..+.--... T Consensus 97 ~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k--~pk~GI~ELr~lDPr~i~~vR~i~~~-- 172 (537) T protein:vir:10 97 LKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPK--KPRQGLVELRYVDPRKIRKVTEYEAK-- 172 (537) T ss_pred cccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC--CccccceeeeeeCCccceeeEeeccc-- Confidence 01111112233455566666668899999999999999999988765321 23568888999999888666532111 Q ss_pred eEEEEEEEEEecC-CceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccc--eEEeeccccc--CccCCcccc Q lcl|NC_021297. 150 VTRAVRLYTTRDD-VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP--VVPLTNDPRL--GNRYGRSEI 224 (480) Q Consensus 150 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP--vv~~~n~~~~--~~~~G~s~i 224 (480) ....++....... ......+.+|.+...+ +. ++ .+ =+|| .+.|...... 
......|-| T Consensus 173 ~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~-~~--~~-------------~~-vkI~~dAI~y~hSGl~d~n~~~i~syL 235 (537) T protein:vir:10 173 RPEALRTQDLNQQLTQQSASYFLYNPKGLK-NS--TN-------------QG-MKIAPDSIAYCHSGIQDLNKNMVLSHL 235 (537) T ss_pred CCccceEEecceeeeecccceeeecccccc-cc--CC-------------Cc-eeccHhheeeecccceeCCCCeeeeee Confidence 1111111100000 0000111223332211 10 00 00 0122 1233332111 122334556 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhh Q lcl|NC_021297. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYG 278 (480) Q Consensus 225 ~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~ 278 (480) .+.++++-. -+++.|.+.+.+....|-+=++-.+.-..+..+... .+-.... T Consensus 236 hkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlE 313 (537) T protein:vir:10 236 HKAIKAVNQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLE 313 (537) T ss_pred hhhhHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhh Confidence 565555432 257788888877777777655422222222211110 0111222 Q ss_pred hhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc-cchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 279 RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 279 ~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ..|. 
.+|-..++..++...--.-++-++-....++...++|.+-+...++ |..-|..|-.-+.....-+.+++..| T Consensus 314 DyWLPRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF 393 (537) T protein:vir:10 314 DFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRF 393 (537) T ss_pred hhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 2332 1344566777776643344566677777888888999887764433 21123357777788888899999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCCC Q lcl|NC_021297. 355 GGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGYT 422 (480) Q Consensus 355 ~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~~ 422 (480) ..-+.++++.=+-+.|.-...+| ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. |.++ T Consensus 394 s~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~t 473 (537) T protein:vir:10 394 SELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQT 473 (537) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccC Confidence 99999998876555554333333 346777754333333333332221 22222233 489998765 7999 Q ss_pred hhHHHHHHHHHHHHHHHHHH----HHhcccccCcccccCCCCC-CCC--cccccCCCCCCCCCC Q lcl|NC_021297. 
423 ATQREQMRDWDKQETEDMID----TLYSTTKAQADATPKPTVT-DTK--TETQTSPSGFNRTKT 479 (480) Q Consensus 423 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~-d~~--~~~~~~p~~~~~~~~ 479 (480) +++++++.+.++++..+..- +......+.++..+-+.++ +.+ +...++|.+--++-- T Consensus 474 DeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 474 ESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred HHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCCC Confidence 99999887776666442111 1111111112222111111 100 001112221111111 No 166 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.37 E-value=1e-06 Score=53.41 Aligned_cols=371 Identities=12% Similarity=0.081 Sum_probs=146.7 Q ss_pred HHHHHHHHHHHHH--HHHHHHHHHhcccCcch--h-----cCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCC Q lcl|NC_021297. 7 HVERLQGLLARDL--PNLLEAEAYRNGTRRLK--T-----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) Q Consensus 7 ~i~~l~~~~~~~~--~r~~~~~~YY~g~~~i~--~-----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d 77 (480) ++-.+.+.+.... +.......+........ . .+..+.... -+.+.-...+|+..++-+-.-++.+-.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---al~~~~v~~~i~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARA---ALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHH---hhccHHHHHHHHHHHHhhccCceeeccc Confidence 1111221111100 00000000100000000 0 001111100 0111222334444444433344544322 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEE Q lcl|NC_021297. 
78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) . ....+.+-...-....+...+..+.+.+|.||+++-.+ ..|++ .+..++|..+.+..+.... .+. T Consensus 78 ~-~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~~~~~~~~~-~~~----- 144 (392) T protein:vir:39 78 K-NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN------ANGADMKWEYLRPSQVNTYYFEYEN-GMY----- 144 (392) T ss_pred h-hhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eEE----- Confidence 1 11111111111123566677888999999999987653 34554 6788889888877764322 111 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHH Q lcl|NC_021297. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~ 236 (480) |....+.+.......+.+++ |+|+++....+..+|.|.+.- +...++... T Consensus 145 y~~~~~~~~~~~~~~~~~~e-----------------------------iih~~~~~~~~~~~G~s~i~~-~~~~i~~~~ 194 (392) T protein:vir:39 145 YNITFDDPKIEPILQAPQSD-----------------------------LIHMKLLSIDGGKTGISPLYS-LRRESKIQR 194 (392) T ss_pred EEEEecCcccceeEEEcccc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHH Confidence 11011111000111122222 444544433344568776532 333333222 Q ss_pred HHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hh--hhhhhhcccCCCCceEEEecccc-HHHHHHHHHHHH Q lcl|NC_021297. 237 RTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LD--IYYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFR 309 (480) Q Consensus 237 ~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~----~~--~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~ 309 (480) .+..-....+...+.|..++. ........+..... +. ...+.++.++ ++.++.++.... ...+++..+... 
T Consensus 195 ~~~~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~~ 272 (392) T protein:vir:39 195 ASDRLTISSLNSSLNVPGVLT-VKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-DLEEFTALEIKSNVAQLLSQTDWTS 272 (392) T ss_pred HHHHHHHHHHhccCCCceEEE-eCCCCCchHHHHHHHHHHHhccccCCCeeecC-CCceEEEccCChhHHHHHHHHHHHH Confidence 221112222222233333332 11111111111111 11 1112344444 456777776432 235788888899 Q ss_pred HHHHhccCCChHHhccccccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCC Q lcl|NC_021297. 310 KEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPS 388 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~ 388 (480) .+|+..-++|+..+|....+.++..+.+. ....|.-.+...+..+... .... +++...... T Consensus 273 ~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~-----------L~~~-------~~~d~~~~~ 334 (392) T protein:vir:39 273 KQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYK-----------LSDH-------ISVNMRPAI 334 (392) T ss_pred HHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh-----------cccc-------ccccchhhh Confidence 99999999999999865544333322222 1112222222222211111 1111 111112222 Q ss_pred CCCHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 389 TPTVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 389 ~~~~~~~ad~~~kl~~~~~~~~s~et~~~~---lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) -.+..+.+..+.++..+| +++...+++. .|+.++++.+++. + ...+ ++| T Consensus 335 ~~d~~~~~~~~~~l~~~g--~~t~nE~r~~l~~~g~~p~e~r~~e~------------l-~~~~----------~Gd--- 386 (392) T protein:vir:39 335 DPLGDNYLSTISTATRWG--ALAENQATFVLQEAGYIPKDLPAPEN------------T-NKKT----------TGQ--- 386 (392) T ss_pred ccCHHHHHHHHHHHHhCC--CcCHHHHHHHHHhcCCCccccchhcC------------C-CCCC----------CCC--- Confidence 345567777788887765 5566555544 4777664322100 0 0011 111 Q ss_pred ccccCC Q lcl|NC_021297. 
466 ETQTSP 471 (480) Q Consensus 466 ~~~~~p 471 (480) +++..| T Consensus 387 ~~~p~p 392 (392) T protein:vir:39 387 SNEPVP 392 (392) T ss_pred CCCCCC Confidence 111112 No 167 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.37 E-value=1e-06 Score=53.41 Aligned_cols=371 Identities=12% Similarity=0.081 Sum_probs=146.7 Q ss_pred HHHHHHHHHHHHH--HHHHHHHHHhcccCcch--h-----cCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCC Q lcl|NC_021297. 7 HVERLQGLLARDL--PNLLEAEAYRNGTRRLK--T-----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) Q Consensus 7 ~i~~l~~~~~~~~--~r~~~~~~YY~g~~~i~--~-----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d 77 (480) ++-.+.+.+.... +.......+........ . .+..+.... -+.+.-...+|+..++-+-.-++.+-.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---al~~~~v~~~i~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARA---ALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHH---hhccHHHHHHHHHHHHhhccCceeeccc Confidence 1111221111100 00000000100000000 0 001111100 0111222334444444433344544322 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) . ....+.+-...-....+...+..+.+.+|.||+++-.+ ..|++ .+..++|..+.+..+.... .+. 
T Consensus 78 ~-~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~~~~~~~~~-~~~----- 144 (392) T protein:vir:10 78 K-NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN------ANGADMKWEYLRPSQVNTYYFEYEN-GMY----- 144 (392) T ss_pred h-hhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eEE----- Confidence 1 11111111111123566677888999999999987653 34554 6788889888877764322 111 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHH Q lcl|NC_021297. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~ 236 (480) |....+.+.......+.+++ |+|+++....+..+|.|.+.- +...++... T Consensus 145 y~~~~~~~~~~~~~~~~~~e-----------------------------iih~~~~~~~~~~~G~s~i~~-~~~~i~~~~ 194 (392) T protein:vir:10 145 YNITFDDPKIEPILQAPQSD-----------------------------LIHMKLLSIDGGKTGISPLYS-LRRESKIQR 194 (392) T ss_pred EEEEecCcccceeEEEcccc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHH Confidence 11011111000111122222 444544433344568776532 333333222 Q ss_pred HHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hh--hhhhhhcccCCCCceEEEecccc-HHHHHHHHHHHH Q lcl|NC_021297. 237 RTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LD--IYYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFR 309 (480) Q Consensus 237 ~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~----~~--~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~ 309 (480) .+..-....+...+.|..++. ........+..... +. ...+.++.++ ++.++.++.... ...+++..+... 
T Consensus 195 ~~~~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~~ 272 (392) T protein:vir:10 195 ASDRLTISSLNSSLNVPGVLT-VKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-DLEEFTALEIKSNVAQLLSQTDWTS 272 (392) T ss_pred HHHHHHHHHHhccCCCceEEE-eCCCCCchHHHHHHHHHHHhccccCCCeeecC-CCceEEEccCChhHHHHHHHHHHHH Confidence 221112222222233333332 11111111111111 11 1112344444 456777776432 235788888899 Q ss_pred HHHHhccCCChHHhccccccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCC Q lcl|NC_021297. 310 KEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPS 388 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~ 388 (480) .+|+..-++|+..+|....+.++..+.+. ....|.-.+...+..+... .... +++...... T Consensus 273 ~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~-----------L~~~-------~~~d~~~~~ 334 (392) T protein:vir:10 273 KQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYK-----------LSDH-------ISVNMRPAI 334 (392) T ss_pred HHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh-----------cccc-------ccccchhhh Confidence 99999999999999865544333322222 1112222222222211111 1111 111112222 Q ss_pred CCCHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 389 TPTVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 389 ~~~~~~~ad~~~kl~~~~~~~~s~et~~~~---lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) -.+..+.+..+.++..+| +++...+++. .|+.++++.+++. + ...+ ++| T Consensus 335 ~~d~~~~~~~~~~l~~~g--~~t~nE~r~~l~~~g~~p~e~r~~e~------------l-~~~~----------~Gd--- 386 (392) T protein:vir:10 335 DPLGDNYLSTISTATRWG--ALAENQATFVLQEAGYIPKDLPAPEN------------T-NKKT----------TGQ--- 386 (392) T ss_pred ccCHHHHHHHHHHHHhCC--CcCHHHHHHHHHhcCCCccccchhcC------------C-CCCC----------CCC--- Confidence 345567777788887765 5566555544 4777664322100 0 0011 111 Q ss_pred ccccCC Q lcl|NC_021297. 
466 ETQTSP 471 (480) Q Consensus 466 ~~~~~p 471 (480) +++..| T Consensus 387 ~~~p~p 392 (392) T protein:vir:10 387 SNEPVP 392 (392) T ss_pred CCCCCC Confidence 111112 No 168 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.34 E-value=1.2e-06 Score=52.96 Aligned_cols=402 Identities=13% Similarity=0.073 Sum_probs=153.9 Q ss_pred HHHHHHHHHHHH--------HHHHHHhc----ccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cC Q lcl|NC_021297. 11 LQGLLARDLPNL--------LEAEAYRN----GTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---IS 75 (480) Q Consensus 11 l~~~~~~~~~r~--------~~~~~YY~----g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~ 75 (480) |.+.+...++.- .-+...+. +-......+..+.+.. -++. .=...+|+..++-+-.-++. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~-al~~--~~V~~~v~~Ia~~iA~lp~~~~~~~ 77 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEA-VLSF--HAVFACISLISQDIAKMRLRLMQTD 77 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHH-hhcc--HHHHHHHHHHHHhhccCceEEEEec Confidence 111111000000 00000000 0000000111111110 0111 00122333333322222322 12 Q ss_pred CCc----hhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCC Q lcl|NC_021297. 76 EDS----EGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRN 146 (480) Q Consensus 76 ~d~----~~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~ 146 (480) .+. .....+..++.+ | ....+...+..+.+.+|.||+++-.+ ..|++ -+..++|..+.+++++. 
T Consensus 78 ~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~- 150 (454) T protein:vir:93 78 AQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRN------ARGQIKELRILDWNRVEPLVADD- 150 (454) T ss_pred cCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEEC------CCCcEEEEEEEcCcceEEEEcCC- Confidence 111 112223344433 3 23466777888999999999987653 23554 57889999998887643 Q ss_pred CCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchh Q lcl|NC_021297. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) .++.+. +......+.. ....+..++ |+||+......+.+|.|.+. T Consensus 151 -g~~~y~---~~~~~~~~~~-~~~~~~~~e-----------------------------ViH~k~~~~~~~~~G~sp~~- 195 (454) T protein:vir:93 151 -GEVFYR---ITPDRNCGIT-EAVTVPARE-----------------------------VIHDRFNCFFHPLIGLPPVY- 195 (454) T ss_pred -CcEEEE---EEeccccccc-eeEEecCcc-----------------------------eEEeccCCCCCCceeccHHH- Confidence 222211 1111111100 011122222 44554433344566777653 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcC-CCccccccccccchhhhh-----hhhhcccCCCCceEEEecccc Q lcl|NC_021297. 227 ELRKVTDAASRTLMNLQSASQILG---TPLRVISG-VTTDELTNDGENTTLDIY-----YGRILTLASEAAKISEFKAAE 297 (480) Q Consensus 227 ~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g-~~~~~~~~~~~~~~~~~~-----~~~~~~~~g~~~~~~~~~~~~ 297 (480) .+.+.+.....--.....+|. .|..++.- ........+.-...|+.. .+.++.++ ++.++.++.... 
T Consensus 196 ---~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~-~g~~~~~l~~~~ 271 (454) T protein:vir:93 196 ---AAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILS-NGAKYNPTTFSP 271 (454) T ss_pred ---HHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceecc-CCceEEEcccCh Confidence 223333322222222222333 34333321 111111111111112111 22344443 345676665432 Q ss_pred -HHHHHHHHHHHHHHHHhccCCChHHhccccccchHH-HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021297. 298 -LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA-EAI--IATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 298 -~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg-~Al--~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 373 (480) -..|++..+..+.+|+...++|+..+|....+.-|. ... .+....|.-.+...+ ..+.+. + +.+ T Consensus 272 ~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie----~~ln~~----L-~~~--- 339 (454) T protein:vir:93 272 VDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIE----LLLDEA----L-ETG--- 339 (454) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHH----HHHHHh----h-cCC--- Confidence 234788888889999999999999998543221111 111 111111221121111 111110 0 111 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HHHHHHHHHHHHHHHHHHh---cccc Q lcl|NC_021297. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQMRDWDKQETEDMIDTLY---STTK 449 (480) Q Consensus 374 ~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~-~~~~~~~~~~~~~~~~~~~---~~~~ 449 (480) ....+++.+......|..++++++.+++++| +++...+++.+|+.+-+- +++- .... --.++.+. 
...+ T Consensus 340 --~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~ggD~~~--~~~~-~~~~~~~~~~~~~~~ 412 (454) T protein:vir:93 340 --ENESTEFDVTTLLRMDSERRMKTLGDAVKNT--LLTPNEARKRENLPPLAGGDALY--LQQQ-NYSLEALSRRDARED 412 (454) T ss_pred --CCcEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeee--eccC-ccchHhhhccCcccC Confidence 1134566666667788899999999998876 667777788887754210 0000 0000 00000000 0000 Q ss_pred -----cCccccc-CCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 450 -----AQADATP-KPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 450 -----~~~~~~~-~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) +.....+ .+...|+.........+...+..| T Consensus 413 ~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~~ 449 (454) T protein:vir:93 413 PFASSGKTASVPQAVAASDGNKAITETEHDAVKAMFR 449 (454) T ss_pred CCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhhh Confidence 0000000 000000000001111122222222 No 169 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.33 E-value=1.3e-06 Score=52.93 Aligned_cols=372 Identities=10% Similarity=0.012 Sum_probs=144.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccC-cch---hcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTR-RLK---TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g~~-~i~---~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) .-+..++...... +......+..... ... ..+..+... ..+.+.-...+|+..+.-+-..++.+... .. 
T Consensus 1 M~~f~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~v~~~---~al~~~~v~~~i~~ia~~ia~~p~~~~~~-~~ 73 (386) T protein:vir:49 1 MPIFNITNLATES---PPINQESFFDIADSDFLASLNSSEWVSAE---NALKNSDLFSIISQLSNDLATAKITTSRK-QL 73 (386) T ss_pred CchhhhhccCCCC---cccchhhhhhhhhccccccccCCceechh---hhhccHHHHHHHHHHHHHhhhCceeeccc-hh Confidence 1222221111100 0000000100000 000 001111111 00111112233343333333334443221 11 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ...+.+-........+...+..+.+.+|.||+.+-.+ .+|++ .+.+++|..+.++.++... .+.+ .|.. T Consensus 74 ~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~l~~i~~~~v~v~~~~~~~-~~~y---~~~~ 143 (386) T protein:vir:49 74 QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------DNGRDMKWEYLRPSQVSFNRLDNQN-GLYY---NITF 143 (386) T ss_pred hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEecCceeEEEEcCCCc-eEEE---EEEE Confidence 1111111111233566778888999999999987643 34554 5788899888877654321 1111 1111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) .+..+.. ...+..++ |+||++....+..+|.|.+.. +...++....+. 
T Consensus 144 ~~~~~~~--~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~~ 191 (386) T protein:vir:49 144 DDPHIAP--KQHVPQND-----------------------------ILHFRLLSVDGGLTSVSPLMA-LGREFNIQKASD 191 (386) T ss_pred cCccccc--eeEEcccc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHHHHH Confidence 1111000 01112222 455554333344578876532 333333222222 Q ss_pred HHHHHHHHHhcchhhhhc--CCCccccccccccchh---hhhhhhhcccCCCCceEEEeccc-cHHHHHHHHHHHHHHHH Q lcl|NC_021297. 240 MNLQSASQILGTPLRVIS--GVTTDELTNDGENTTL---DIYYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAA 313 (480) Q Consensus 240 s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~---~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~ 313 (480) .-..+.....+.|..++. +...++ ........+ ....++++.++ ++.++.++... .-..+++..+....+|+ T Consensus 192 ~~~~~~~~ng~~~~~il~~~~~~~~~-~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 269 (386) T protein:vir:49 192 KLTISALKNALNANGILKIKGGGLLD-FKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLSQADWTTGQFA 269 (386) T ss_pred HHHHHHHHccCCccEEEEeCCCCChH-HHHHHHHHHHHhccCCCCceecC-CCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 222222232334544443 111110 000000011 11223444444 34677777543 22357888899999999 Q ss_pred hccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHH Q lcl|NC_021297. 314 SITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 314 ~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~ 393 (480) ...++|+..+|....+.+++..++..+...+... -+.+...|.+ ++ +. .+++........+.. 
T Consensus 270 ~~fgVPp~~lg~~~~~~~~~~~~~~~~~~~i~~~---l~~i~~~~~~------~l-~~-------~~~~~~~~~~~~d~~ 332 (386) T protein:vir:49 270 KVYGIPESIVGGDGDQQSSLEMIYNIYFKSVSRY---LRPFVSEMSK------KL-SC-------EVDVDISPAVDPTGS 332 (386) T ss_pred HHhCCCHHHhCCCCCccchHHHHHHHHHHHHHHH---HHHHHHHHHH------Hh-cc-------hhcccchhhhccCHH Confidence 9999999999876555444544443332222111 1111111111 11 11 122222233334556 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccc Q lcl|NC_021297. 394 AKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQ 468 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) +.+..+.++..+| +++.-.+++++ |+.++++.+. ..-..+...++|.. +++ T Consensus 333 ~~~~~~~~l~~~g--~~t~nE~r~~l~~~~~~~~~~~~~---------------------~~~~~~~~~gGd~~-~~~ 386 (386) T protein:vir:49 333 NYISLINSMVKSG--TLAQNQGLYILQQAEILPKELPDG---------------------KNPNRTSLKGGEIN-EQD 386 (386) T ss_pred HHHHHHHHHHhCC--CcCHHHHHHHHhhCCCCCCcCcch---------------------hccCCCCCCCCCCC-CCC Confidence 6677777777665 44444444443 3333221110 00000011111111 111 No 170 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.31 E-value=1.4e-06 Score=52.65 Aligned_cols=392 Identities=13% Similarity=0.045 Sum_probs=159.2 Q ss_pred HHHHHHHHHHHHHH----HHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCCc Q lcl|NC_021297. 6 EHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISEDS 78 (480) Q Consensus 6 ~~i~~l~~~~~~~~----~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~ 78 (480) =+++++..+..... .-...+..++-|.... .+..+.. +.-+.......+|+..++-+-.-++. ..++. 
T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~---~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~ 75 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTA--SGERVSE---SNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGG 75 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccc--cCceech---hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCc Confidence 12222221111000 0012233344332111 1111111 11111222334455444433333332 12111 Q ss_pred --h-hHHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCe Q lcl|NC_021297. 79 --E-GLEELWNWWQ-A-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 79 --~-~~~~l~~i~~-~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~ 149 (480) . ....+..++. + | ....+...++.+.+.+|.||+++..+ ..|.+ .+..++|..+.++.++... . T Consensus 76 ~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~~-~ 148 (416) T protein:vir:12 76 IERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFG------SHGYPEALFPLRPDYTNAYVHPTTG-M 148 (416) T ss_pred cccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEECCcceEEEEeCCCc-E Confidence 1 1112333332 2 3 33466778888999999999988653 34554 5778999988887765432 1 Q ss_pred eEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHH Q lcl|NC_021297. 150 VTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELR 229 (480) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~ 229 (480) + +|....+++. ..+.+++ |+||++.. ..+.+|.|.+.. +. T Consensus 149 ~-----~~~~~~~g~~----~~~~~~e-----------------------------iih~~~~~-~~~~~G~s~i~~-~~ 188 (416) T protein:vir:12 149 L-----WYQTVLNGKA----IELYDYE-----------------------------VLHFKGLS-TDGIHGKSPIGV-VR 188 (416) T ss_pred E-----EEEEecCCeE----EEecCcc-----------------------------EEEecCcC-CCCcccccHHHH-HH Confidence 1 1111111111 0112222 33343221 234677776532 33 Q ss_pred HHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh--hhhhhcccCCCCceEEEecccc-HHHHHHHH Q lcl|NC_021297. 
230 KVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI--YYGRILTLASEAAKISEFKAAE-LRNFAEEM 305 (480) Q Consensus 230 ~liD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l 305 (480) ..++.......-..+.....+.|..++.-. ...+...+.-...|+- ..+.++.++ ++.++.++.... -..+++.. T Consensus 189 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~ 267 (416) T protein:vir:12 189 EHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVNKVENIAIID-YGLEYQSISMPLQEAQFVESM 267 (416) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhcCCCeeecC-CCceEEEccCChhhHHHHHHH Confidence 333322222222222233333444444311 1111101111111211 123344444 345777665432 22477888 Q ss_pred HHHHHHHHhccCCChHHhccccccc-hHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 306 EVFRKEAASITGLPPQYLSSSSENP-ASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~n~-~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v 382 (480) +....+|+...++|++.+|....+. ++..... +....|.-.+...+.. |.+ .+...........+++ T Consensus 268 ~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~----l~~------~l~~~~~~~~g~~i~f 337 (416) T protein:vir:12 268 KFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQE----LNV------KLFLDHDQKSGHYVKF 337 (416) T ss_pred HHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHH----HHH------hhcCchhhcCCceEEe Confidence 8889999999999999998543221 2211111 1112222222221111 110 1111111111234555 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+..-...|..++++++.+++.+| +++...+++.+|+.+-+- +++. ..... ...+.+ ........ 
++ T Consensus 338 d~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~Pi~ggd~~~--~~~n~-~~~~~~------~~~~~~~~-~~ 405 (416) T protein:vir:12 338 NIDSELRGDSKTQAEYLKTLHETG--VLNKDEIRELLERNPIENGDKYI--SSLNY-VFLDFL------EEYQRLKA-GG 405 (416) T ss_pred echhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCcceee--ecccc-cccccc------chhhcccc-cc Confidence 666667788899999999998876 667777788887755321 1100 00000 000000 00000000 00 Q ss_pred CCCcccccCCCCCC Q lcl|NC_021297. 462 DTKTETQTSPSGFN 475 (480) Q Consensus 462 d~~~~~~~~p~~~~ 475 (480) +.. .|.+.+.| T Consensus 406 ~~~---gge~~~~g 416 (416) T protein:vir:12 406 AMK---GGDNKNEG 416 (416) T ss_pred ccC---CCCCcCCC Confidence 000 01111111 No 171 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=98.31 E-value=1.5e-06 Score=52.58 Aligned_cols=412 Identities=8% Similarity=-0.025 Sum_probs=145.3 Q ss_pred CCCHHHHHHHHHHHHHHHH----HHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcc---c Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGF---R 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~----~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~---~ 73 (480) |++-..- .+.... .-+..+.+.|.+ +++..... ..+.+. +.-|+ ++........+-++ . T Consensus 53 ~~~~~~g------~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~i---~t~~~~--va~~~--~i~~~s~~~~~~~i~l~~ 118 (535) T protein:vir:10 53 ADGNVAG------QYSVASISDVLSTKKLLKAYAD-NDIVQAII---RTRTNQ--VLTYS--NPSRYNRNGVGFKVELKD 118 (535) T ss_pred ccCCccc------ccccCccccccCHHHHHHHhcc-ChhHHHHH---HHHHHH--HHHHH--HHHHHhcccCcceeEEEe Confidence 2221000 000000 001111111111 11211100 000010 01111 11111111100011 0 Q ss_pred c---CCCc--hhHHHHHHHHHh--cC-------HHHHHHHHHHHHhhcC-eEEEEEecCcccccCCCcee-EEEEEccce Q lcl|NC_021297. 
74 I---SEDS--EGLEELWNWWQA--ND-------LDEESVLGHDDSLTFG-RAYITVSHPDVESGDPAGIP-LIRVESPLY 137 (480) Q Consensus 74 ~---~~d~--~~~~~l~~i~~~--n~-------~~~~~~~~~~~a~~~G-~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~ 137 (480) . +..+ .....+..++.. |. +..+...+..+++.+| .+|+++..+ ..|+| .+..++|.. T Consensus 119 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~------~~G~~~~L~~l~p~~ 192 (535) T protein:vir:10 119 ATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKN------DSNELDHFNAVDASK 192 (535) T ss_pred ccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEEC------CCCcEEEEEEeCCce Confidence 0 0000 111223333321 21 1234556666767665 678887643 34555 488999999 Q ss_pred eEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeeccc---c Q lcl|NC_021297. 138 MYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP---R 214 (480) Q Consensus 138 ~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~---~ 214 (480) +.+..++....... ++|.... ++.. ..|.++++ +||+.++ . T Consensus 193 V~v~~d~~~~~~~~---~~~~~~~-~~~~---~~~~~~ei-----------------------------ih~~~~~~~~~ 236 (535) T protein:vir:10 193 VVISYSPRSKDQPR---KFEQFVS-ETKS---VKFSERNL-----------------------------TFINYWNLSDT 236 (535) T ss_pred eEEEEcCccccCce---EEEEEec-Ccee---EEECcccE-----------------------------EEEeccCCCCc Confidence 98888764322111 1111111 1110 11223333 3333222 2 Q ss_pred cCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhc--CCCccccccccc---cchhhh------hhhhh Q lcl|NC_021297. 215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVIS--GVTTDELTNDGE---NTTLDI------YYGRI 280 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~--g~~~~~~~~~~~---~~~~~~------~~~~~ 280 (480) ..+.+|.|.|. .+.+++...+.-......+|. .|..+|. +.......++.. 
...|+- ..+++ T Consensus 237 ~~~~~G~Spi~----~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~ 312 (535) T protein:vir:10 237 DRRGYGYSPVE----ASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKI 312 (535) T ss_pred ccccccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccccc Confidence 23467887663 333444433333333333333 3433332 211111111110 011211 11233 Q ss_pred cccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHH--HHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_021297. 281 LTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAE--AIIATD-SRIVKMAERKGRIFGG 356 (480) Q Consensus 281 ~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~--Al~~~~-~~l~~k~~~~~~~~~~ 356 (480) ..+.+++.++..+.... -..|++..+..+.+|+...++|+..+|....+.-|.. +-...+ ..+.. ........ T Consensus 313 ~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~---~~~~~~~~ 389 (535) T protein:vir:10 313 PILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKA---KLESSKDK 389 (535) T ss_pred ccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHH---HHHHHHHH Confidence 44555667887766432 2248888899999999999999999985332111110 001011 11111 11111112 Q ss_pred HHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HHHHHHHH Q lcl|NC_021297. 357 AWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQMRDWDK 434 (480) Q Consensus 357 ~l~~~~~~~~~~~~~~-~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~-~~~~~~~~ 434 (480) .|.-+++.+....+.. ...-...+++.|......+..+.++.. ++..++ +++...+++++|+.+-+- +...-.. 
T Consensus 390 ~L~P~l~~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r~~~~-~~~~~g--~lT~NE~R~~~gl~piegGD~~~~~~- 465 (535) T protein:vir:10 390 GLTPLLSFIEQVINDKIMRYVDTDYRFSFTLGDAQDKLQEEQVW-KLKLAN--GYFINEYRKDHGLKTVDGLDVPGFIG- 465 (535) T ss_pred HHHHHHHHHHHHHhhhcccccCCeEEEEeccccccCHHHHHHHH-HHHHcC--CCCHHHHHHHhCCCCCCCcccccccc- Confidence 2322222221111100 011112467788777777777766544 444443 467777888887754210 0000000 Q ss_pred HHHHHHHHHHhcccccCccccc-------CCCCCC---------CCcccccCCC---CCCCCCCC Q lcl|NC_021297. 435 QETEDMIDTLYSTTKAQADATP-------KPTVTD---------TKTETQTSPS---GFNRTKTR 480 (480) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~-------~~~~~d---------~~~~~~~~p~---~~~~~~~~ 480 (480) .............+..++..+ .....+ ...++.++|. ..+.+-|- T Consensus 466 -~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 529 (535) T protein:vir:10 466 -SAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSN 529 (535) T ss_pred -chhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCcccc Confidence 000000000000000000000 000000 0001101110 00111111 No 172 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.29 E-value=1.6e-06 Score=52.39 Aligned_cols=399 Identities=12% Similarity=-0.001 Sum_probs=146.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHH-Hh---c--cc-Ccchh----cCcccc-hhhhhhccccchHHHHHHHHHhhhc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEA-YR---N--GT-RRLKT----IGIGAP-PELAYLDVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~-YY---~--g~-~~i~~----~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~ 68 (480) |++--+ |.-.|..+..+.....+ |. - |. ++-.. ++.... 
..+...+-....++.|||..++.+- T Consensus 1 ~~~~~~----~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~ 76 (449) T protein:vir:10 1 MTDKLT----LAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCW 76 (449) T ss_pred CchhhH----HHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhh Confidence 887532 33334433333322222 22 1 11 11111 111111 1122223333456899999888763 Q ss_pred cCccc-cCCCc----hhHHHHHHHHH---hcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccc---- Q lcl|NC_021297. 69 IEGFR-ISEDS----EGLEELWNWWQ---ANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPL---- 136 (480) Q Consensus 69 ~~~~~-~~~d~----~~~~~l~~i~~---~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~---- 136 (480) -.... +.+++ +....+...++ .+++...+.++.+++..+|.|.+++...+ ++..-..+.+. T Consensus 77 ~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d-------~~~l~~Pl~~~~~i~ 149 (449) T protein:vir:10 77 QTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIRD-------EKDWNLPATKGRGLQ 149 (449) T ss_pred hcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEecC-------CCCCCcccccCccee Confidence 33221 11111 11112222222 23555677888999999999887765322 11110001111 Q ss_pred eeEEEEeCCCCCeeEEEEEEEEE--ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccc Q lcl|NC_021297. 137 YMYAELDPRNTRRVTRAVRLYTT--RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~ 214 (480) ++.|+|-.. +... .++.+ ..+.+.+.++.+-+. ..++...+. .-|+--.+. 
|.-.+ T Consensus 150 ~i~v~~~~~----i~~~-~~~~dp~sp~yg~P~~y~v~~~-------~~g~~~~~~------~iH~SRl~~---~~~~~- 207 (449) T protein:vir:10 150 KVSVSWAGS----LKVA-EWDTGINSKTYGQPKLWKYTER-------LPNGSSRRV------DIHPDRVFI---LGDYS- 207 (449) T ss_pred eEEeecccc----CChh-hhhcCCCCCCCCCceEEEEeee-------ccCCCccce------eeccceeEe---ecCCC- Confidence 111111100 0000 00110 011122211111000 001000000 012221111 21101 Q ss_pred cCccCCcccchhhHHHHHHHH---HHHHHH-----HHHHHHHhcchh---hhhcCC------Cccccccc--cccchhhh Q lcl|NC_021297. 215 LGNRYGRSEISPELRKVTDAA---SRTLMN-----LQSASQILGTPL---RVISGV------TTDELTND--GENTTLDI 275 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~liD~~---~~~~s~-----~~~~~~~~~~~~---~~i~g~------~~~~~~~~--~~~~~~~~ 275 (480) .-|.|.+.+ ..+++ +++.-. +.+......... .-+.|. ..+...+. .....++. T Consensus 208 ---~~g~~~L~~----~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~ 280 (449) T protein:vir:10 208 ---EDAIGFLEP----AYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINR 280 (449) T ss_pred ---CCChhHHHH----HHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhc Confidence 114433321 11111 111000 001111000000 001111 10000000 00001111 Q ss_pred hhhhhcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHh-cccccc-chHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 276 YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMAERKGRI 353 (480) Q Consensus 276 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~ 353 (480) +.+.++...+++..+.+.+-+.+.. .+......+++.++||..-| |...+. ++++. ++. | ...+..++.. 
T Consensus 281 ~~~~~~i~~~~d~~~~~~~~sgl~d---~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D-~~n-y---yd~i~~~Q~~ 352 (449) T protein:vir:10 281 GNDVLMTTQGATVTPLVTSVADPTA---TYNVNLQTAAAGVDIPTRILIGNQQAERSSTED-QKY-F---NARCQSRRVD 352 (449) T ss_pred cchheeecCCcceEEEecccCChhH---HHHHHHHHHHHHhCCCeeeeeccCccccccchh-HHH-H---HHHHHHHHHh Confidence 2222233344444444444445554 46666788999999997666 433332 23332 332 3 3333334456 Q ss_pred HHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC--C-CChhHHHHHH Q lcl|NC_021297. 354 FGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL--G-YTATQREQMR 430 (480) Q Consensus 354 ~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l--g-~~~~~~~~~~ 430 (480) +...|++++.+++...-+..+ .++++.|.+-..++..+.|+...+.+++. .++.... | ++.+++.+. T Consensus 353 l~p~le~l~~~l~~s~~g~~~---~d~~i~f~pL~~~t~kEkAei~k~~A~a~------~~~~~ag~~~~~~~~EiR~~- 422 (449) T protein:vir:10 353 LSFEIEDFCDKLIELKIIDAV---AKKAVIWDDLNEQTGTEKLTNAKTMGEIN------QTMLGSGDNPAFSREEIRTA- 422 (449) T ss_pred hhHHHHHHHHHHHHhhcCCCC---CceeEEeCCCCCCCHHHHHHHHHHHHHHH------HHHHHccccCCcCHHHHHHH- Confidence 899999999887654332222 26999999999999999988776665432 1222211 1 223322110 Q ss_pred HHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 431 DWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) . 
+-.+......+.++.+ +..+ +.+.++ T Consensus 423 ----------~----~~~~~~~~~~~~e~~d------e~~~-~~d~~a 449 (449) T protein:vir:10 423 ----------A----GYDNDDEEPLGEEDGD------EEDK-ATDSAA 449 (449) T ss_pred ----------h----cccCCCCCCCCCCCCc------cccc-cCCcCC Confidence 0 0011111111111111 1111 112222 No 173 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=98.25 E-value=2e-06 Score=51.83 Aligned_cols=397 Identities=11% Similarity=0.052 Sum_probs=150.1 Q ss_pred CCCH--HHH-----HHHHHHHHHHHH---HH--HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc Q lcl|NC_021297. 1 MTTY--HEH-----VERLQGLLARDL---PN--LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~--~~~-----i~~l~~~~~~~~---~r--~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~ 68 (480) |+=- ++- +--|......|- +. ...+-....|-.... ...+... .-++... .-.+|+..++-+- T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~al~~~~--V~~cv~~Ia~~iA 85 (441) T protein:vir:79 11 VDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK--LRQYKDI-EAIRHSD--IFTAVMMIASDLA 85 (441) T ss_pred ccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCccc--ccccchh-hhhccHH--HHHHHHHHHHhhc Confidence 1100 000 000000000000 00 001111111100000 0001000 0011110 1112333332222 Q ss_pred cCccccCCCc--hhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEE Q lcl|NC_021297. 69 IEGFRISEDS--EGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYA 140 (480) Q Consensus 69 ~~~~~~~~d~--~~~~~l~~i~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~ 140 (480) .-++.+-.+. .....+..++.. |. 
-..+...+..+.+.+|.||+.+..+ ..|++ .+..++|..+.+ T Consensus 86 ~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v 159 (441) T protein:vir:79 86 RMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTFRKTSEIEL 159 (441) T ss_pred cCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEE Confidence 2233322111 112233444432 32 2456677888899999999988643 34665 578899999998 Q ss_pred EEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCC Q lcl|NC_021297. 141 ELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYG 220 (480) Q Consensus 141 v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G 220 (480) ..|+.. ++.+.. ...+..+ ......|.++. |+||++.. ..+.+| T Consensus 160 ~~d~~g--~~~~~~---~~~~~~~-~~~~~~~~~~d-----------------------------vih~k~~~-~dg~~G 203 (441) T protein:vir:79 160 KSDARG--RLYYFH---QRIDSNG-NNIERNVKFED-----------------------------MLDIKFYS-LDGING 203 (441) T ss_pred EECCCc--cEEEEE---EEeccCC-ceeEEEEcccc-----------------------------EEEeccCC-CCCccc Confidence 887532 222111 1111111 01111222222 34444332 234567 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHH---hcchhhhhc--CCCccccccccccchhhhh------hhhhcccCCCCce Q lcl|NC_021297. 221 RSEISPELRKVTDAASRTLMNLQSASQI---LGTPLRVIS--GVTTDELTNDGENTTLDIY------YGRILTLASEAAK 289 (480) Q Consensus 221 ~s~i~~~v~~liD~~~~~~s~~~~~~~~---~~~~~~~i~--g~~~~~~~~~~~~~~~~~~------~~~~~~~~g~~~~ 289 (480) .|.|. .+.+.+.....-..-...+ .+.|..++. |...+....+.-...|+.. 
.++++.++ ++.+ T Consensus 204 ~spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~-~G~~ 278 (441) T protein:vir:79 204 LSLLD----TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMT 278 (441) T ss_pred cCHHH----HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCce Confidence 77653 3333333222222222222 233444432 2111110000001112111 12344444 3457 Q ss_pred EEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 290 ISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 290 ~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~ 368 (480) +.+++.+. -..|++..+....+|+...++|+..+|....+. |.......+. .-.. -+-..++..+.. .+ T Consensus 279 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~q~~~~~~---~tl~----P~~~~ie~eln~--kl 348 (441) T protein:vir:79 279 FDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDYL---STLK----PYITCVCAELNF--KF 348 (441) T ss_pred EEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHHH---HHHH----HHHHHHHHHHhh--hc Confidence 77665432 224788888899999999999999998654432 2111111111 1011 111111111111 11 Q ss_pred hCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HH-HHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 369 MGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQ-MRDWDKQETEDMIDTLYS 446 (480) Q Consensus 369 ~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~-~~-~~~~~~~~~~~~~~~~~~ 446 (480) ... .....+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+- .. +..... ... ..+.... 
T Consensus 349 ~~~---~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G--~~T~NE~R~~~gl~Pi~ggd~~~~~~~~-n~~-~~~~~~~ 421 (441) T protein:vir:79 349 NDE---YVNREFKFDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDL-NHV-NIELVDE 421 (441) T ss_pred ccc---ccCceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeecc-ccc-ccccccc Confidence 111 11234555555556678888898888888765 667777788887754210 00 000000 000 0000000 Q ss_pred ccccCcccccCCCCCCCCcccccCCCCCCCC Q lcl|NC_021297. 447 TTKAQADATPKPTVTDTKTETQTSPSGFNRT 477 (480) Q Consensus 447 ~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~ 477 (480) .....+..++....+ |.++- T Consensus 422 ~~~~~~~~~~~~~kg-----------Ge~~e 441 (441) T protein:vir:79 422 YQMNKSRATDKKLKG-----------GEENE 441 (441) T ss_pred cccccccccccccCC-----------CCCCC Confidence 000000001111000 00000 No 174 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=98.25 E-value=2e-06 Score=51.83 Aligned_cols=397 Identities=11% Similarity=0.052 Sum_probs=150.1 Q ss_pred CCCH--HHH-----HHHHHHHHHHHH---HH--HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc Q lcl|NC_021297. 1 MTTY--HEH-----VERLQGLLARDL---PN--LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~--~~~-----i~~l~~~~~~~~---~r--~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~ 68 (480) |+=- ++- +--|......|- +. ...+-....|-.... ...+... .-++... 
.-.+|+..++-+- T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~al~~~~--V~~cv~~Ia~~iA 85 (441) T protein:vir:94 11 VDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK--LRQYKDI-EAIRHSD--IFTAVMMIASDLA 85 (441) T ss_pred ccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCccc--ccccchh-hhhccHH--HHHHHHHHHHhhc Confidence 1100 000 000000000000 00 001111111100000 0001000 0011110 1112333332222 Q ss_pred cCccccCCCc--hhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEE Q lcl|NC_021297. 69 IEGFRISEDS--EGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYA 140 (480) Q Consensus 69 ~~~~~~~~d~--~~~~~l~~i~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~ 140 (480) .-++.+-.+. .....+..++.. |. -..+...+..+.+.+|.||+.+..+ ..|++ .+..++|..+.+ T Consensus 86 ~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v 159 (441) T protein:vir:94 86 RMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTFRKTSEIEL 159 (441) T ss_pred cCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEE Confidence 2233322111 112233444432 32 2456677888899999999988643 34665 578899999998 Q ss_pred EEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCC Q lcl|NC_021297. 141 ELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYG 220 (480) Q Consensus 141 v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G 220 (480) ..|+.. ++.+.. ...+..+ ......|.++. |+||++.. 
..+.+| T Consensus 160 ~~d~~g--~~~~~~---~~~~~~~-~~~~~~~~~~d-----------------------------vih~k~~~-~dg~~G 203 (441) T protein:vir:94 160 KSDARG--RLYYFH---QRIDSNG-NNIERNVKFED-----------------------------MLDIKFYS-LDGING 203 (441) T ss_pred EECCCc--cEEEEE---EEeccCC-ceeEEEEcccc-----------------------------EEEeccCC-CCCccc Confidence 887532 222111 1111111 01111222222 34444332 234567 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHH---hcchhhhhc--CCCccccccccccchhhhh------hhhhcccCCCCce Q lcl|NC_021297. 221 RSEISPELRKVTDAASRTLMNLQSASQI---LGTPLRVIS--GVTTDELTNDGENTTLDIY------YGRILTLASEAAK 289 (480) Q Consensus 221 ~s~i~~~v~~liD~~~~~~s~~~~~~~~---~~~~~~~i~--g~~~~~~~~~~~~~~~~~~------~~~~~~~~g~~~~ 289 (480) .|.|. .+.+.+.....-..-...+ .+.|..++. |...+....+.-...|+.. .++++.++ ++.+ T Consensus 204 ~spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~-~G~~ 278 (441) T protein:vir:94 204 LSLLD----TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMT 278 (441) T ss_pred cCHHH----HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCce Confidence 77653 3333333222222222222 233444432 2111110000001112111 12344444 3457 Q ss_pred EEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 290 ISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 290 ~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~ 368 (480) +.+++.+. -..|++..+....+|+...++|+..+|....+. |.......+. .-.. -+-..++..+.. 
.+ T Consensus 279 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~q~~~~~~---~tl~----P~~~~ie~eln~--kl 348 (441) T protein:vir:94 279 FDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDYL---STLK----PYITCVCAELNF--KF 348 (441) T ss_pred EEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHHH---HHHH----HHHHHHHHHHhh--hc Confidence 77665432 224788888899999999999999998654432 2111111111 1011 111111111111 11 Q ss_pred hCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HH-HHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 369 MGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQ-MRDWDKQETEDMIDTLYS 446 (480) Q Consensus 369 ~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~-~~-~~~~~~~~~~~~~~~~~~ 446 (480) ... .....+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+- .. +..... ... ..+.... T Consensus 349 ~~~---~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G--~~T~NE~R~~~gl~Pi~ggd~~~~~~~~-n~~-~~~~~~~ 421 (441) T protein:vir:94 349 NDE---YVNREFKFDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDL-NHV-NIELVDE 421 (441) T ss_pred ccc---ccCceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeecc-ccc-ccccccc Confidence 111 11234555555556678888898888888765 667777788887754210 00 000000 000 0000000 Q ss_pred ccccCcccccCCCCCCCCcccccCCCCCCCC Q lcl|NC_021297. 
447 TTKAQADATPKPTVTDTKTETQTSPSGFNRT 477 (480) Q Consensus 447 ~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~ 477 (480) .....+..++....+ |.++- T Consensus 422 ~~~~~~~~~~~~~kg-----------Ge~~e 441 (441) T protein:vir:94 422 YQMNKSRATDKKLKG-----------GEENE 441 (441) T ss_pred cccccccccccccCC-----------CCCCC Confidence 000000001111000 00000 No 175 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=98.24 E-value=2.2e-06 Score=51.61 Aligned_cols=424 Identities=14% Similarity=0.082 Sum_probs=160.1 Q ss_pred CCCH----HHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc----c-- Q lcl|NC_021297. 1 MTTY----HEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD----I-- 69 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~-- 69 (480) ||=+ .+.+.+..+.+.. |.+....++++|+ +.+++.........+.-++.-+-+...++++++.|. + T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 78 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSR--FTLPYLMADVNDDLSSQNAWQDDGASATNFLSNKLSQVLFPAQ 78 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHH--HhccccccCCCCCccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 5432 3334444444443 3333344445542 223332111111111223444555666666666553 2 Q ss_pred Cccc-cC-CC---------ch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce Q lcl|NC_021297. 70 EGFR-IS-ED---------SE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI 127 (480) Q Consensus 70 ~~~~-~~-~d---------~~----~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~ 127 (480) .+|+ .. .| .+ .. +.+...+..++|.....++.++...+|.|-+++. ++. 
T Consensus 79 ~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~---------~~~ 149 (517) T protein:vir:10 79 RSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHP---------DKT 149 (517) T ss_pred CccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEe---------CCC Confidence 1222 11 11 11 11 2234456678999999999999999999865542 122 Q ss_pred eEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-c------CC-----------ceEEEEEEEe-----CCc-EEEEEec Q lcl|NC_021297. 128 PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D------DV-----------AVPDRATLYL-----PDE-TVPLRRN 183 (480) Q Consensus 128 ~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-~------~~-----------~~~~~~~~~~-----~~~-~~~~~~~ 183 (480) ..++.++-.++++.-|. .+ ++...++..... . .. +....+++|+ ++. +.+|... T Consensus 150 ~~~~~~pl~~y~v~~d~-~G-~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~ 227 (517) T protein:vir:10 150 SPIQAVPLHHYCVRRDN-NG-TVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSA 227 (517) T ss_pred CcEEEEEcCeEEEeeCC-Cc-CeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEe Confidence 34666666665555554 33 333333222100 0 00 0001223333 111 1111111 Q ss_pred CCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCc Q lcl|NC_021297. 184 GGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTT 261 (480) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~ 261 (480) .+. ........++..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........|.+.+. |... 
T Consensus 228 d~~-----~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~ 301 (517) T protein:vir:10 228 DDV-----PVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAED-HAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTD 301 (517) T ss_pred Cce-----eeccccccccccCCeeeeeeeecCCCCcccchHHH-hHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccc Confidence 111 00111112357889999998888899999997643 455566666665555555555555544432 1111 Q ss_pred cccccccccchhhhhhhhhcccCCCCceEEEec-cccHHH---HHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHH Q lcl|NC_021297. 262 DELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRN---FAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAI 336 (480) Q Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al 336 (480) .. .......+.+..-..++....++. ..+++. .++.++.-|+..+.... +.. ++... ++.-+ T Consensus 302 ~~-------~l~~~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~-----l~~~~~~rv-TAtEV 368 (517) T protein:vir:10 302 IN-------QFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEA-----MTRRDAERV-TAYEI 368 (517) T ss_pred hh-------hccCCCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----hhccCCccc-cHHHH Confidence 00 001111122211111222333333 223433 34444444444443322 111 11111 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhCCCCcccceeeeEEecCCCC-CCHHHHHHHHHHHHhc-- Q lcl|NC_021297. 337 IATDSRIVKMAERKGRIFGGAWERA--------MRIAMQIMGREVTEEYTRLETVWRDPST-PTVAAKADAVSKLYAN-- 405 (480) Q Consensus 337 ~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~~~~~~~~~~~~~~~~i~v~f~~~~~-~~~~~~ad~~~kl~~~-- 405 (480) + ..+..++..++..+.++ +...+.++....... .+++.+--++. -...+.++.+.+..+. 
T Consensus 369 ~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~--~v~~~~~s~la~l~r~~~~~~i~~~~~~i~ 439 (517) T protein:vir:10 369 Q-------RDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSK--NVSPTILTGIEALGRMAELDKLGTFNGYVS 439 (517) T ss_pred H-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCC--CccceeeccHHHHHHHHHHHHHHHHHHHHH Confidence 2 33344444555554442 222222222222222 23333322211 1111222222222111 Q ss_pred ----ccc----CCCHHH----HHHhCCCCh------hHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccc Q lcl|NC_021297. 406 ----GQG----PIPKEQ----ARIDLGYTA------TQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTET 467 (480) Q Consensus 406 ----~~~----~~s~et----~~~~lg~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ++. .+.... +...+|.+. ++++++.+.+.++ +.......+..++.++...++ T Consensus 440 ~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~-~~~~~~~~~ag~~~~~~~~~~--------- 509 (517) T protein:vir:10 440 MTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQ-EATKYAAEQAGKAIPDMVKNG--------- 509 (517) T ss_pred HhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCC--------- Confidence 111 112222 223355542 3333322211111 111111111111111110000 Q ss_pred ccCCCCCC Q lcl|NC_021297. 468 QTSPSGFN 475 (480) Q Consensus 468 ~~~p~~~~ 475 (480) +.+|.|.- T Consensus 510 ~~~~~~~~ 517 (517) T protein:vir:10 510 QINPQGGQ 517 (517) T ss_pred CCCCCCCC Confidence 00110000 No 176 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.19 E-value=2.9e-06 Score=50.92 Aligned_cols=334 Identities=10% Similarity=0.038 Sum_probs=142.0 Q ss_pred chhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc-CCCchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcC Q lcl|NC_021297. 
35 LKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFG 108 (480) Q Consensus 35 i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G 108 (480) |- .-++.+ ..++.....+.+++.. | .-..+...++...+.+| T Consensus 1 ia--------------------------------~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~G 48 (348) T protein:vir:93 1 MA--------------------------------SLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKG 48 (348) T ss_pred Cc--------------------------------ccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcC Confidence 10 011111 1112223334454432 3 23455667888889999 Q ss_pred eEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcc Q lcl|NC_021297. 109 RAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN 187 (480) Q Consensus 109 ~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (480) .||+++-++ ..|++ .+..++|..+.++.+... ..+. |......+.. ..|.+++ T Consensus 49 na~~~i~r~------~~G~~~~L~~l~~~~v~~~~~~~~-~~~~-----y~~~~~~g~~---~~~~~~e----------- 102 (348) T protein:vir:93 49 NAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQS-RELY-----YSIHAATGNK---LIVHNMD----------- 102 (348) T ss_pred CeEEEEEEC------CCCcEEEEEEEcCCceEEEEeCCC-cEEE-----EEEEcCCCeE---EEEcccc----------- Confidence 999998653 34555 577888888887776432 1111 1111111110 0122222 Q ss_pred cccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccc Q lcl|NC_021297. 188 DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELT 265 (480) Q Consensus 188 ~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~ 265 (480) |+||++.....+.+|.|.+. .+.+.++...+-....+..+..+-.++. +....... 
T Consensus 103 ------------------iih~r~~~~~~~~~G~s~~~----~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~ 160 (348) T protein:vir:93 103 ------------------MLHFKHIVASNMVQGISPID----VLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVSTEK 160 (348) T ss_pred ------------------EEEecCCCCCCceeeccHHH----HHHHHHHHHHHHHHHHHHhcCCCceeEEecCCCCCHHH Confidence 35555443345566777553 2233333222111112333333322221 11111110 Q ss_pred cccccchhh---hhhhhhcccCCCCceEEEeccccHH-HHHHHHHHHHHHHHhccCCChHHhccccc-cchHHHHHHHHH Q lcl|NC_021297. 266 NDGENTTLD---IYYGRILTLASEAAKISEFKAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATD 340 (480) Q Consensus 266 ~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~ 340 (480) .+.-...|+ ...++++.++ ++.++.+++.+..+ .|++..+....+|+...++|+.-+|.... +-++. .... T Consensus 161 ~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~---e~~~ 236 (348) T protein:vir:93 161 RQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKN---EELN 236 (348) T ss_pred HHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH---HHHH Confidence 000011111 1122334443 44677777643322 57787788899999999999999985432 22222 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC Q lcl|NC_021297. 341 SRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG 420 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg 420 (480) ..+...| -.-+-..+++.+.. 
.+...........+++.+..-...+..+.++++.+++.+| +++...+++.+| T Consensus 237 ~~~~~~~---l~P~~~~ie~~l~~--~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G--~~T~NE~R~~~g 309 (348) T protein:vir:93 237 RFYLQHT---LLPIVKQYEEEFNR--KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSG--YYTINDIREWED 309 (348) T ss_pred HHHHHHH---HHHHHHHHHHHHHH--hhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhC Confidence 2222211 01111111111110 1111111111233555555666678888999999998875 667777788888 Q ss_pred CChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccccc Q lcl|NC_021297. 421 YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQT 469 (480) Q Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) +.+-+- -.+. .+..........++......+++. ..+++ T Consensus 310 ~~p~~g--gD~~-------~~~~n~~~~~~~~~~~~~~~gg~~-n~~~~ 348 (348) T protein:vir:93 310 LPPVEG--GDKP-------LISGDLYPIDTPLELRKSLKGGDK-NVNES 348 (348) T ss_pred CCCCCC--cCeE-------eecccccccccchhhcccccCCCC-CcCCC Confidence 754320 0000 000000000000000000011110 01111 No 177 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=98.18 E-value=3.1e-06 Score=50.78 Aligned_cols=366 Identities=10% Similarity=0.001 Sum_probs=138.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcc--cCc-chhc--CcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYRNG--TRR-LKTI--GIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g--~~~-i~~~--~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) .-|.++ .......-....+..-+ ... +... +..+.... -++ +.=...+|+..+.-+-..+|.+.... 
T Consensus 1 Mglf~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-al~--~~~V~~~i~~Ia~~ia~l~~~~~~~~- 72 (384) T protein:vir:49 1 MPIFNI----TNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAET-ALK--NSDLFSIISQLSNDLATAKITTSRKQ- 72 (384) T ss_pred Cccccc----cccCcccccccchhhccccchhhcccccCCceechhh-hhc--cHHHHHHHHHHHHHHhhCceeeecch- Confidence 111110 00000000000000000 000 0000 00011000 001 11123344544444444455543221 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ....+.+-...-....+...+..+.+.+|.||+.+-.+ ..|++ -+.+++|..+.++.++... .+. |.. T Consensus 73 ~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~v~~~~~~~-~~~----y~~ 141 (384) T protein:vir:49 73 LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------ENGRDMKWEYLRPSQVSFNRLDNQN-GLY----YNI 141 (384) T ss_pred hhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eEE----EEE Confidence 11111111111234567778889999999999988653 34554 6778899988877654321 111 111 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) ...+.. ......+.+++ |+||.+....+..+|.|.+. .+.+.++.. 
T Consensus 142 ~~~~~~-~~~~~~~~~~e-----------------------------Vih~~~~~~~~~~~G~s~i~----~~~~~i~~~ 187 (384) T protein:vir:49 142 TFDDPR-IPPKQHVPQGD-----------------------------ILHFRLLSVDGGLTSVSPLM----ALGRELNIQ 187 (384) T ss_pred EecCcc-ccceeEecCcc-----------------------------EEEecCCCCCCceeeccHHH----HHHHHHHHH Confidence 111110 00001122222 34454433334456777553 344444333 Q ss_pred HHH---HHHHHHHhcchhhhhc--CCCccccccccccc--hhhhhhhhhcccCCCCceEEEeccc-cHHHHHHHHHHHHH Q lcl|NC_021297. 239 LMN---LQSASQILGTPLRVIS--GVTTDELTNDGENT--TLDIYYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRK 310 (480) Q Consensus 239 ~s~---~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~ 310 (480) ..- ..+.+...+.|..++. |...++........ ......++++.++ ++.++.++... ....+++..+.... T Consensus 188 ~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~ 266 (384) T protein:vir:49 188 KASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLSQADWTTG 266 (384) T ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecC-CCceEEEccCChhhHHHHHHHHHHHH Confidence 222 2222233334444332 21111100000000 0011223344443 44677777643 22347788899999 Q ss_pred HHHhccCCChHHhccccccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCC Q lcl|NC_021297. 311 EAASITGLPPQYLSSSSENPASAEAIIATDSRIVKM-AERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST 389 (480) Q Consensus 311 ~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~ 389 (480) +|+...++|+..+|....+.+++..++..+...+.. +.-....+...|.+- + .. ++.... ++.. 
T Consensus 267 ~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~------l-----~~---~~~~~~-~~~~ 331 (384) T protein:vir:49 267 QFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCE------V-----DA---DILPAV-DPTG 331 (384) T ss_pred HHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchh------h-----hh---hhhhhh-hccc Confidence 999999999999987655544454544443333222 222222222222110 0 00 000000 0111 Q ss_pred CCHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCC Q lcl|NC_021297. 390 PTVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 390 ~~~~~~ad~~~kl~~~~~~~~s~et~~~~---lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) ...... +..+..++ +.++..+++. .|+.++++.+++. .... .++++++.- T Consensus 332 ~~~~~~---~~~l~~~~--~~t~~e~~~~l~~~g~~~ne~r~~~~-------------~~p~-~gGd~~~~~ 384 (384) T protein:vir:49 332 SNYIGL---INSMVKTG--TLAQNQGLYVLQQAEILPKDLPEGET-------------DSTL-KGGETNEQY 384 (384) T ss_pred hHHHHH---HHHHhhcC--cccHHHHHHHHhhCCCCChhHHHHcC-------------CCCC-CCCCCCCCC Confidence 111222 22344433 3444444443 3666554322210 0000 011111111 No 178 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=98.17 E-value=3.2e-06 Score=50.71 Aligned_cols=394 Identities=11% Similarity=0.043 Sum_probs=156.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHH--HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPN--LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r--~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) |..+. ++.++-..+..+... -..+..+-..... ...+ +.. +.-..+.....+|+..++-+-.-++.+- .. 
T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--v~~---~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~ 73 (409) T protein:vir:93 1 MAKEN-IVTRIKKKLIDNWIDQSTSKLYDFSPWKNR-SFWG--VIN---NTLETNETIFSAITKLSNSMASLPLKMYEDY 73 (409) T ss_pred CCccc-hhhhhhhhhhhhhhccccccccccccccCc-cccc--cch---hhhhccHHHHHHHHHHHHhhhhCceeEeecc Confidence 66532 333322222111000 0000010000000 0001 000 1111112223334433333322233221 11 Q ss_pred chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 78 SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 78 ~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) +.....+..++.. | .-..+...++.+.+.+|.||+++..+ ..|++ .+..++|..+.+..++... .+. T Consensus 74 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~~~~~~~~~-~~~ 146 (409) T protein:vir:93 74 KVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQSR-ELY 146 (409) T ss_pred ccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCCc-EEE Confidence 2223344555432 3 23455677888899999999988653 34544 5778899888887764321 111 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHH Q lcl|NC_021297. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) |......+.. ..+.+++ |+||++.....+.+|.|.|.- +... T Consensus 147 -----y~~~~~~g~~---~~~~~~e-----------------------------Vih~r~~~~~~~~~G~s~i~~-~~~~ 188 (409) T protein:vir:93 147 -----YSIHAATGNK---LIVHNMD-----------------------------MLHFKHIVASNMVQGISPIDV-LKNT 188 (409) T ss_pred -----EEEEcCCceE---EEEcccc-----------------------------EEEeCCCCCCCccccccHHHH-HHHH Confidence 1111111110 1122222 344444333345668776532 3333 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh---hhhhhcccCCCCceEEEeccccH-HHHHHHH Q lcl|NC_021297. 
232 TDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI---YYGRILTLASEAAKISEFKAAEL-RNFAEEM 305 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~~~---~~~~~~~~~g~~~~~~~~~~~~~-~~~~~~l 305 (480) ++..+.+ ... .+..+..+-..+. +........+.-...|+- ..+.++.++ ++.++.+++.... ..+++.. T Consensus 189 i~~~~~~-~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~e~r 264 (409) T protein:vir:93 189 TDFDNAV-RTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred HHHHHHH-HHH--HHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHH Confidence 3321111 111 2222333222221 211111110111111211 122344443 4567777764322 2577877 Q ss_pred HHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEec Q lcl|NC_021297. 306 EVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWR 385 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~ 385 (480) +....+|+...++|+.-+|....+.-| .+......+...+ -.-+-..|++.+.. .+...........+++.+. T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~~~~s--n~e~~~~~f~~~~---l~P~~~~ie~~l~~--~Ll~~~~~~~~~~~~fd~~ 337 (409) T protein:vir:93 265 NLTRERVANVFQLPSVFLNARSNTNFA--KNEELNRFYLQHT---LLPIVKQYEEEFNR--KLLTKTDREKNRYFKFNVK 337 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHHH---HHHHHHHHHHHHHh--hcCCcccccCcceEEeech Confidence 888899999999999999864332111 1222221121111 01111111111111 1111111111223444444 Q ss_pred CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 386 DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) .-...|..+.++++.+++.+| +++.-.+++.+|+.+-+- -.+.. .......... .........++++. 
T Consensus 338 ~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--gD~~~-------~~~n~~~~~~-~~~~~~~~~gG~~n 405 (409) T protein:vir:93 338 SYLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVEG--GDKPL-------ISGDLYPIDT-PLELRKSLKGGDKN 405 (409) T ss_pred hhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCeee-------eccccccccc-chhhcccccCCCCC Confidence 555678889999999998875 667777788888765320 00000 0000000000 00000000000011 Q ss_pred cccc Q lcl|NC_021297. 466 ETQT 469 (480) Q Consensus 466 ~~~~ 469 (480) .+++ T Consensus 406 ~~e~ 409 (409) T protein:vir:93 406 VNES 409 (409) T ss_pred cCCC Confidence 1111 No 179 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=98.16 E-value=3.3e-06 Score=50.62 Aligned_cols=406 Identities=14% Similarity=0.090 Sum_probs=146.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHH-HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPN-LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r-~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~ 76 (480) |. .+++++.++....-.. ...+-++. |.. +.-.+.....-....-....+.-.+|+..++-+-.-++.+ .. T Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~-g~~-~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~ 75 (460) T protein:vir:10 1 MA---NRIIRALRELTGLDNKFNDAFIKYI-GQT-FTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKD 75 (460) T ss_pred Cc---hhHHHHHhhhhccCCCchHHHHHhh-ccc-cCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccC Confidence 33 3444444333221111 11222222 211 1101111111111111122233344554444333223221 11 Q ss_pred Cch-------------------------------hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccc Q lcl|NC_021297. 
77 DSE-------------------------------GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVES 121 (480) Q Consensus 77 d~~-------------------------------~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~ 121 (480) +.. .......++.+ | ....+...+..+.+.+|.||+++..+..+ T Consensus 76 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~- 154 (460) T protein:vir:10 76 TKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDG- 154 (460) T ss_pred CccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCC- Confidence 110 00111122222 2 23455677888999999999998764322 Q ss_pred cCCCceeE-EEEEccceeEEEEeCCCCCeeE-EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccc Q lcl|NC_021297. 122 GDPAGIPL-IRVESPLYMYAELDPRNTRRVT-RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKH 199 (480) Q Consensus 122 ~d~~~~~~-i~~~~p~~~~~v~d~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (480) +..|.++ +..++|..+.+..+........ ..++.|....+ +. ...+.++++++++.... T Consensus 155 -~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~-g~---~~~~~~~evih~r~~~~-------------- 215 (460) T protein:vir:10 155 -INAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQG-DQ---FIEFNEDEVIHTKYANP-------------- 215 (460) T ss_pred -ccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecC-ce---eEEecccceEEEecCCC-------------- Confidence 3356654 7788999888776543321111 01111111111 11 11223333333321110 Q ss_pred cccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccccccccccchhhh--- Q lcl|NC_021297. 200 GLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDI--- 275 (480) Q Consensus 200 ~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~-g~~~~~~~~~~~~~~~~~--- 275 (480) .+.....+.+|.|.+.. +...+........-....+...+.|..++. 
+....+...+.-...|+- T Consensus 216 ----------~~~~~~~~~~G~sp~~~-~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~ 284 (460) T protein:vir:10 216 ----------NFDLQGSHLYGMSPIRA-ILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDK 284 (460) T ss_pred ----------CcccccCccccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhc Confidence 01111234567776532 233333222221111122222222322222 111111111111111211 Q ss_pred ---hhhhhcccCCCCceEEEeccccH-HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHH-----HHHHH Q lcl|NC_021297. 276 ---YYGRILTLASEAAKISEFKAAEL-RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDS-----RIVKM 346 (480) Q Consensus 276 ---~~~~~~~~~g~~~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~-----~l~~k 346 (480) ..+.++.++ ++.++.++..... ..+++..+....+|+...++|++.+|.......++..+..... .|.-. T Consensus 285 g~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~ 363 (460) T protein:vir:10 285 SPDRLSQIAGAS-GEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPD 363 (460) T ss_pred CccccCCceecC-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHH Confidence 112344444 4567777765432 2478888899999999999999999853221111212222222 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH Q lcl|NC_021297. 347 AERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR 426 (480) Q Consensus 347 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~ 426 (480) +...+. .|.+ .+...........+++.|... ..+.+...+..+++.+| +++...+++.+|+.+-+. 
T Consensus 364 ~~~ie~----~ln~------kl~~~~~~~~~~~i~~d~~~l--~~l~~d~~~~~~~~~~g--~~T~NE~R~~~g~~pi~~ 429 (460) T protein:vir:10 364 LVILKQ----AFDK------KFIKRFKGYENAVIEWDISEL--PEMQTDMVAMASWLNTI--PVTPNEIRIAMKYETLNQ 429 (460) T ss_pred HHHHHH----HHHH------hhcCcccccCCceEEeecchh--hhHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCC Confidence 222211 1111 111111111112344433221 11223333444555543 566666666666543210 Q ss_pred HHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 427 EQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 427 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) +-. +...-.. +-.+-+.. .++..+++.|..+ T Consensus 430 ~~g------------D~~~~~~----n~~~~~~~-----~~~~~~~~~nq~~ 460 (460) T protein:vir:10 430 DGM------------DIVFMPS----NKVRIDDV-----SNNLIDSAFNQNQ 460 (460) T ss_pred CCC------------Ceeeecc----cccchhhc-----ccccCCCcccCCC Confidence 000 0000000 00000000 0001111111111 No 180 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.13 E-value=3.9e-06 Score=50.24 Aligned_cols=386 Identities=11% Similarity=0.066 Sum_probs=151.4 Q ss_pred HHHHHHHHHHH--------HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch--hH Q lcl|NC_021297. 12 QGLLARDLPNL--------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE--GL 81 (480) Q Consensus 12 ~~~~~~~~~r~--------~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~--~~ 81 (480) ...+....+|- ..+-...-|-... .+..+.... -++. .-.-.+|+..++-+-.-++.+-.+.+ .. 
T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-al~~--~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~ 75 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGT--KLRQYKDIE-AIRH--SDIFTAVMMIASDLARMPIRVTVNGQINYS 75 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhcccccc--Cccccchhh-hhcc--hHHHHHHHHHHHhhccCceEEecCcccccc Confidence 11111100000 0000000010000 000010000 0110 00111344444333333444322211 12 Q ss_pred HHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 82 EELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 82 ~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ..+..++.. | ....+...+....+.+|.||+++..+ ..|++ .+..++|..+.++.|... .+.+ T Consensus 76 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~g--~~~~--- 144 (416) T protein:vir:81 76 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTFRKTSEIELKSDARG--RLYY--- 144 (416) T ss_pred chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEECCCc--cEEE--- Confidence 334444432 3 23456677888889999999998753 34555 478899999988876432 2211 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) ++...+..+. .....+.++. |+||++.. ..+.+|.|.+. .+.+.+ T Consensus 145 ~~~~~~~~~~-~~~~~~~~~e-----------------------------vihir~~~-~d~~~G~s~i~----~~~~~i 189 (416) T protein:vir:81 145 FHQRIDSNGN-NIERNVKFED-----------------------------MLDIKFYS-LDGINGLSLLD----TLSRTI 189 (416) T ss_pred EEEEecCCCc-eeEEEEcccc-----------------------------EEEeccCC-CCCccccCHHH----HHHHHH Confidence 1111111111 1111222222 23444332 23456777653 233333 Q ss_pred HHHHHHHHHHHHHh---cchhhhhc--CCCccccccccccchhhh------hhhhhcccCCCCceEEEecccc-HHHHHH Q lcl|NC_021297. 
236 SRTLMNLQSASQIL---GTPLRVIS--GVTTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 236 ~~~~s~~~~~~~~~---~~~~~~i~--g~~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~ 303 (480) ....+-......+| +.|..++. |...+....+.-...|+. ..++++.++ ++.++.++.... -..|++ T Consensus 190 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~~e 268 (416) T protein:vir:81 190 ESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQLEVDTEVLKLIR 268 (416) T ss_pred HHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC-CCceeEeccCCHHHHHHHH Confidence 33222222222222 33444332 211110000000111211 112344444 345676665432 224778 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDS-RIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~-~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v 382 (480) ..+....+|+...++|+..+|....+. |.......+. .|.-.+ ..+++.+.. .+... .....+++ T Consensus 269 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~~~~~~~l~P~~--------~~ie~~ln~--~l~~~---~~~~~~~f 334 (416) T protein:vir:81 269 ENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDYLSTLKPYI--------TCVCAELNF--KFNDE---YVNREFKF 334 (416) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHHHHHHHHHH--------HHHHHHHhh--hcccc---ccCceEEE Confidence 888889999999999999998654332 2221111111 111111 111111110 11111 11234555 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH-HHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED-MIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+......|..+.++++.+++.+| +++...+++.+|+.+-+--.. 
....-...- .++.........+..+..+..+ T Consensus 335 ~~~~l~~~D~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~~gd~-~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kg 411 (416) T protein:vir:81 335 DTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNG-SIHRVDLNHVNIELVDEYQMNKSRATDKKLKG 411 (416) T ss_pred echhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCc-ceEeecccccccccccccCcccccccccccCC Confidence 555666678888999998888775 667777788887754220000 000000000 0000000000000000000000 Q ss_pred CCCccccc Q lcl|NC_021297. 462 DTKTETQT 469 (480) Q Consensus 462 d~~~~~~~ 469 (480) ++.++ T Consensus 412 ---Ge~n~ 416 (416) T protein:vir:81 412 ---GEENE 416 (416) T ss_pred ---CCCCC Confidence 01111 No 181 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.13 E-value=3.9e-06 Score=50.24 Aligned_cols=386 Identities=11% Similarity=0.066 Sum_probs=151.4 Q ss_pred HHHHHHHHHHH--------HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch--hH Q lcl|NC_021297. 12 QGLLARDLPNL--------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE--GL 81 (480) Q Consensus 12 ~~~~~~~~~r~--------~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~--~~ 81 (480) ...+....+|- ..+-...-|-... .+..+.... -++. .-.-.+|+..++-+-.-++.+-.+.+ .. T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-al~~--~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~ 75 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGT--KLRQYKDIE-AIRH--SDIFTAVMMIASDLARMPIRVTVNGQINYS 75 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhcccccc--Cccccchhh-hhcc--hHHHHHHHHHHHhhccCceEEecCcccccc Confidence 11111100000 0000000010000 000010000 0110 00111344444333333444322211 12 Q ss_pred HHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 
82 EELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 82 ~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ..+..++.. | ....+...+....+.+|.||+++..+ ..|++ .+..++|..+.++.|... .+.+ T Consensus 76 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~g--~~~~--- 144 (416) T protein:vir:45 76 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTFRKTSEIELKSDARG--RLYY--- 144 (416) T ss_pred chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEECCCc--cEEE--- Confidence 334444432 3 23456677888889999999998753 34555 478899999988876432 2211 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) ++...+..+. .....+.++. |+||++.. ..+.+|.|.+. .+.+.+ T Consensus 145 ~~~~~~~~~~-~~~~~~~~~e-----------------------------vihir~~~-~d~~~G~s~i~----~~~~~i 189 (416) T protein:vir:45 145 FHQRIDSNGN-NIERNVKFED-----------------------------MLDIKFYS-LDGINGLSLLD----TLSRTI 189 (416) T ss_pred EEEEecCCCc-eeEEEEcccc-----------------------------EEEeccCC-CCCccccCHHH----HHHHHH Confidence 1111111111 1111222222 23444332 23456777653 233333 Q ss_pred HHHHHHHHHHHHHh---cchhhhhc--CCCccccccccccchhhh------hhhhhcccCCCCceEEEecccc-HHHHHH Q lcl|NC_021297. 236 SRTLMNLQSASQIL---GTPLRVIS--GVTTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 236 ~~~~s~~~~~~~~~---~~~~~~i~--g~~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~ 303 (480) ....+-......+| +.|..++. |...+....+.-...|+. ..++++.++ ++.++.++.... 
-..|++ T Consensus 190 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~~e 268 (416) T protein:vir:45 190 ESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQLEVDTEVLKLIR 268 (416) T ss_pred HHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC-CCceeEeccCCHHHHHHHH Confidence 33222222222222 33444332 211110000000111211 112344444 345676665432 224778 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDS-RIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~-~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v 382 (480) ..+....+|+...++|+..+|....+. |.......+. .|.-.+ ..+++.+.. .+... .....+++ T Consensus 269 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~~~~~~~l~P~~--------~~ie~~ln~--~l~~~---~~~~~~~f 334 (416) T protein:vir:45 269 ENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDYLSTLKPYI--------TCVCAELNF--KFNDE---YVNREFKF 334 (416) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHHHHHHHHHH--------HHHHHHHhh--hcccc---ccCceEEE Confidence 888889999999999999998654332 2221111111 111111 111111110 11111 11234555 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH-HHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED-MIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+......|..+.++++.+++.+| +++...+++.+|+.+-+--.. 
....-...- .++.........+..+..+..+ T Consensus 335 ~~~~l~~~D~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~~gd~-~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kg 411 (416) T protein:vir:45 335 DTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNG-SIHRVDLNHVNIELVDEYQMNKSRATDKKLKG 411 (416) T ss_pred echhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCc-ceEeecccccccccccccCcccccccccccCC Confidence 555666678888999998888775 667777788887754220000 000000000 0000000000000000000000 Q ss_pred CCCccccc Q lcl|NC_021297. 462 DTKTETQT 469 (480) Q Consensus 462 d~~~~~~~ 469 (480) ++.++ T Consensus 412 ---Ge~n~ 416 (416) T protein:vir:45 412 ---GEENE 416 (416) T ss_pred ---CCCCC Confidence 01111 No 182 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=98.13 E-value=3.9e-06 Score=50.23 Aligned_cols=398 Identities=12% Similarity=0.083 Sum_probs=148.8 Q ss_pred CCCH------HHHHHH-----------HHHHHHHHH---H--HHHHHHHHhcccCcchhcCcccchhhhhhccccchHHH Q lcl|NC_021297. 1 MTTY------HEHVER-----------LQGLLARDL---P--NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVAT 58 (480) Q Consensus 1 ~~~~------~~~i~~-----------l~~~~~~~~---~--r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ 58 (480) |-=. -+|..+ +......|- + -...+-....|-... .+..+.... -++... .-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-al~~~~--V~a 75 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGT--KLRQYKDIE-AIRHSD--IFT 75 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhccccc--Cccccchhh-hhccHH--HHH Confidence 1000 000000 000000000 0 000000000000000 000010000 011110 111 Q ss_pred HHHHHHhhhccCccccCCC--chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EE Q lcl|NC_021297. 
59 YLRTLSDRLDIEGFRISED--SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LI 130 (480) Q Consensus 59 iVd~~~~~l~~~~~~~~~d--~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i 130 (480) +|+..++-+-.-++.+-.+ ......+..++.. | .-..+...+..+.+.+|.||+++..+ .+|++ .+ T Consensus 76 cv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~------~~G~~~~L 149 (441) T protein:vir:98 76 AVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNL 149 (441) T ss_pred HHHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEE Confidence 2333333222223332211 1112234444432 3 22356677888899999999988653 34554 57 Q ss_pred EEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEee Q lcl|NC_021297. 131 RVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLT 210 (480) Q Consensus 131 ~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~ 210 (480) ..++|..+.+..+.. .++.+... ..+..+. .....+.++. |+||+ T Consensus 150 ~~i~~~~v~v~~~~~--g~~~~~~~---~~~~~~~-~~~~~~~~~d-----------------------------viHir 194 (441) T protein:vir:98 150 TFRKTSEIELKLDAR--GRLYYFHQ---RIDSNGN-NIERNVKFED-----------------------------MLDIK 194 (441) T ss_pred EEEcCceeEEEECCC--CcEEEEEE---EeccCcc-eeeEEEcccc-----------------------------EEEec Confidence 889999999888643 22222111 1111110 0111222223 33444 Q ss_pred cccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhhh------hhhhcc Q lcl|NC_021297. 211 NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDIY------YGRILT 282 (480) Q Consensus 211 n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~~~~------~~~~~~ 282 (480) +.. ..+.+|.|.+.- +...++..+....-....+...+.|..++. +.-.+....+.-...|+.. .++++. 
T Consensus 195 ~~~-~dg~~G~spi~~-~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~v 272 (441) T protein:vir:98 195 FYS-LDGINGLSLLDT-LSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVV 272 (441) T ss_pred cCC-CCCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCccee Confidence 322 233457665432 223332211111111122222233443332 2111100000001112111 123444 Q ss_pred cCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 283 LASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 283 ~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) ++ ++.++.+++... -..|++..+....+|+...++|+..+|....+. |-......+ ..... -+-..+++. T Consensus 273 l~-~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~q~~~~y---~~tl~----P~~~~ie~~ 343 (441) T protein:vir:98 273 LD-ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDY---LSTLK----PYITCVCAE 343 (441) T ss_pred cC-CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHH---HHHHH----HHHHHHHHH Confidence 54 345776665432 224778888889999999999999998654432 211111111 11111 111122221 Q ss_pred HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HH-H--HHHHHHHH Q lcl|NC_021297. 362 MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQ-M--RDWDKQET 437 (480) Q Consensus 362 ~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~-~~-~--~~~~~~~~ 437 (480) +.. .+... .....+++........|..++++++.+++.+| +++...+++.+|+.+-+- .. + ....-... 
T Consensus 344 ln~--~L~~~---~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~ 416 (441) T protein:vir:98 344 LNF--KFNDE---YVNREFKFDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNI 416 (441) T ss_pred HHh--hcccc---ccCceEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeeccccccc Confidence 111 11111 11223444445556678888999888888765 667777788877754210 00 0 00000000 Q ss_pred HHHHHHHhcccccCcccccCCCCCCCCccccc Q lcl|NC_021297. 438 EDMIDTLYSTTKAQADATPKPTVTDTKTETQT 469 (480) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) + ..+..+ ...+..++....+ ++.++ T Consensus 417 ~-~~~~~q---~~~~~~~~~~~kg---Ge~ne 441 (441) T protein:vir:98 417 E-LVDEYQ---MNKSRATDKKLKG---GEENE 441 (441) T ss_pred c-cccccc---cccccccccccCC---CCCCC Confidence 0 000000 0000000000000 00000 No 183 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.11 E-value=4.4e-06 Score=49.96 Aligned_cols=395 Identities=11% Similarity=0.031 Sum_probs=154.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh-cCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~ 78 (480) |.. +.++.++-+.+.... ...-+.+-..... 
.+.....-..+.-..+.-...+|+..+.-+-.-++.+- ..+ T Consensus 1 ~~~-~~~~~~~k~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~ 74 (409) T protein:vir:94 1 MAK-ENIVTRIKKKLIDNW-----IDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK 74 (409) T ss_pred Ccc-cccchhhhhHHhhhh-----hcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccc Confidence 655 223333222211100 0000001011100 00010000001111112223334433333322233221 111 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+..++.. | .-..+...++.+.+.+|.||+++.++ ..|.+ .+..++|..+.++.+.... .+. T Consensus 75 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~~-~~~- 146 (409) T protein:vir:94 75 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQSR-ELY- 146 (409) T ss_pred ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCCc-EEE- Confidence 222334454432 3 23455677888899999999988653 34554 6778899998888765321 111 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) |......+.. ..+.++. |+||++.....+.+|.|.+. 
.+...+ T Consensus 147 ----y~~~~~~g~~---~~~~~~d-----------------------------vih~r~~~~~~~~~G~s~l~-~~~~~i 189 (409) T protein:vir:94 147 ----YSIHAATGNK---LIVHNMD-----------------------------MLHFKHIVASNMVQGISPID-VLKNTT 189 (409) T ss_pred ----EEEEcCCceE---EEEcccc-----------------------------EEEecCCCCCCccccccHHH-HHHHHH Confidence 1111111111 0122222 34444332334556777653 233333 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhh--cCCCccccccccccchhhh---hhhhhcccCCCCceEEEecccc-HHHHHHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVI--SGVTTDELTNDGENTTLDI---YYGRILTLASEAAKISEFKAAE-LRNFAEEME 306 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i--~g~~~~~~~~~~~~~~~~~---~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~ 306 (480) +..+.+ ... .+..+..+-.++ .+........+.-...|+- ..+.++.++ ++.++.+++... -..+++..+ T Consensus 190 ~~~~~~-~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~ 265 (409) T protein:vir:94 190 DFDNAV-RTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASEN 265 (409) T ss_pred HHHHHH-HHH--HHHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHH Confidence 322211 111 222223322222 1211111111111111211 122344443 446777776432 225778878 Q ss_pred HHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecC Q lcl|NC_021297. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRD 386 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~ 386 (480) ....+|+...++|+.-+|....+. ...+......+...+ -.-+-..|+..+.. .+...........+++.... 
T Consensus 266 ~~~~~Ia~~fgVPp~~lg~~~~~~--~sn~e~~~~~f~~~~---l~P~~~~ie~~ln~--~Ll~~~~~~~~~~i~fd~~~ 338 (409) T protein:vir:94 266 LTRERVANVFQLPSVFLNARSNTN--FAKNEELNRFYLQHT---LLPIVKQYEEEFNR--KLLTKTDREKNRYFKFNVKS 338 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHHHHHHH---HHHHHHHHHHHHHH--hhCCcccccCcceEEeechh Confidence 888999999999999998643221 111222222222211 00111111111110 11111111112234444445 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcc Q lcl|NC_021297. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTE 466 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) -...|..++++++.+++.+| +++.-.+++.+|+.+-+- -.+.. +......... .........++++.. T Consensus 339 ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--gD~~~-------~~~n~~~~~~-~~~~~~~~kGG~~n~ 406 (409) T protein:vir:94 339 YLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVEG--GDKPL-------ISGDLYPIDT-PLELRKSLKGGDKNV 406 (409) T ss_pred hhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCeEe-------eccccccccc-chhhcccccCCCCCc Confidence 55678888999999998875 566666777777654320 00000 0000000000 000000001110111 Q ss_pred ccc Q lcl|NC_021297. 467 TQT 469 (480) Q Consensus 467 ~~~ 469 (480) +++ T Consensus 407 ~e~ 409 (409) T protein:vir:94 407 NES 409 (409) T ss_pred CCC Confidence 111 No 184 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.11 E-value=4.4e-06 Score=49.95 Aligned_cols=381 Identities=14% Similarity=0.093 Sum_probs=155.8 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCC Q lcl|NC_021297. 
5 HEHVERLQGLLARDLPNLL----EAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISED 77 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~----~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d 77 (480) .=|..++...+..+.+... .+..+.-|.. ..+ ..-+.+.-...+|+..++-+-.-++. ..++ T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-------~~~----~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~ 69 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLLLQWLGVDP-------DTP----RNQLSEATYFACLKILSESLGKLPLKMYQKTER 69 (411) T ss_pred CchHHHHHhhccCcccccccchHHHHHHhcCcc-------cCh----hhhhccHHHHHHHHHHHHhHhhCceeEEEecCC Confidence 2233332222211110000 0001110100 000 00011111223344333333222322 2221 Q ss_pred c---hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCC Q lcl|NC_021297. 78 S---EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTR 148 (480) Q Consensus 78 ~---~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~ 148 (480) . .....+..+++. | ....+...+..+.+.+|.||+++..+ .|.+ .+..++|..+.++.|+.... T Consensus 70 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-------~g~~~~l~~l~~~~v~~~~~~~~~~ 142 (411) T protein:vir:81 70 GIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-------GPQLQALWILPSQYVTIVVDDRGLL 142 (411) T ss_pred ceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCceEEEEEECCceEEEEEcCcccc Confidence 1 112344555432 3 33566778888999999999987642 2333 47789999998888753211 Q ss_pred eeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhH Q lcl|NC_021297. 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) .....+.+.......+.. ..+.+++ |+||+++....+.+|.|.+. 
T Consensus 143 ~~~~~~~~~~~~~~~g~~---~~~~~~e-----------------------------iih~k~~~~~~~~~G~s~~~--- 187 (411) T protein:vir:81 143 GEKNAIWYRYNDPYDGKM---YVFRNDE-----------------------------ILHFKTSVTFDGITGLSVRD--- 187 (411) T ss_pred cccceEEEEEEecCCceE---EEEcccc-----------------------------EEEEcCCCCCCCcccccHHH--- Confidence 111111110000000000 0011112 55665444445667877653 Q ss_pred HHHHHHHHHHHHHHHHHHHHh---cchhhhhcCCC-ccccccccccchhhh------hhhhhcccCCCCceEEEeccccH Q lcl|NC_021297. 229 RKVTDAASRTLMNLQSASQIL---GTPLRVISGVT-TDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 229 ~~liD~~~~~~s~~~~~~~~~---~~~~~~i~g~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) .+...+.....-..-...+| +.|..++.... ......+.-...|+- ..++++.++ ++.++.+++.... T Consensus 188 -~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~ 265 (411) T protein:vir:81 188 -VLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKIIPVP-LGMKLVPLDIKLT 265 (411) T ss_pred -HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecC-CCceEEEccCCHH Confidence 23333333222222222233 33554543211 111111111111211 112344443 4467777764322 Q ss_pred -HHHHHHHHHHHHHHHhccCCChHHhcccccc-chHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021297. 299 -RNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) Q Consensus 299 -~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n-~~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 374 (480) ..+++..+....+|+...++|+..+|....+ -++..... +....|.-.+. .+++.+. ..+...... 
T Consensus 266 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~--------~ie~~l~--~~ll~~~~~ 335 (411) T protein:vir:81 266 DSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLK--------QYEEEIT--YKILSNDLI 335 (411) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHH--------HHHHHHH--hhcCChhhc Confidence 2467777888999999999999999754322 12222111 11111111111 1111111 011111111 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccc Q lcl|NC_021297. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 375 ~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) .....+++.+......|..+.++++.+++.+| +++...+++.+|+.+.+- -.+......-..++.+.. + T Consensus 336 ~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~gl~p~~g--gD~~~~~~n~~pl~~~~~-------~ 404 (411) T protein:vir:81 336 SQGHYFKFNVNVILRADIKTQMDSLSTAVQNG--IMTPNEARDYLDMPADDY--GNNLMANGNYIPLSMLGA-------N 404 (411) T ss_pred CCCcEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeeeeccCccchhhhhh-------h Confidence 12234555556666778889999999998775 667777788888765421 000000000000000000 0 Q ss_pred ccCCCCCCCCcccccCCCCCC Q lcl|NC_021297. 455 TPKPTVTDTKTETQTSPSGFN 475 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~p~~~~ 475 (480) ...++| + T Consensus 405 --~~kgGd------------~ 411 (411) T protein:vir:81 405 --YGKGGD------------S 411 (411) T ss_pred --hccCCC------------C Confidence 000101 0 No 185 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.10 E-value=4.5e-06 Score=49.88 Aligned_cols=396 Identities=9% Similarity=0.085 Sum_probs=141.7 Q ss_pred CCCHH-----------------------------------HHHHHHHHHHHHHHHHHHHHHHHhcccC--cchhcCcccc Q lcl|NC_021297. 
1 MTTYH-----------------------------------EHVERLQGLLARDLPNLLEAEAYRNGTR--RLKTIGIGAP 43 (480) Q Consensus 1 ~~~~~-----------------------------------~~i~~l~~~~~~~~~r~~~~~~YY~g~~--~i~~~~~~~~ 43 (480) |...+ .++..+++........+.....+....- +++.+..... T Consensus 54 ~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~ 133 (576) T protein:vir:96 54 QAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAE 133 (576) T ss_pred chhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCc Confidence 11111 1222333333322222221111111100 0000000000 Q ss_pred hhhhhhccccchHHHHHHHHHhhhccCccccCCCch--hHHHHHHHHH-----h----cCHHHHHHHHHHHHhhcCeEEE Q lcl|NC_021297. 44 PELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE--GLEELWNWWQ-----A----NDLDEESVLGHDDSLTFGRAYI 112 (480) Q Consensus 44 ~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~--~~~~l~~i~~-----~----n~~~~~~~~~~~~a~~~G~a~~ 112 (480) ..++. ....+..++. . ..+..+...+..+.+.+|.+|+ T Consensus 134 -------------------------------~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~ 182 (576) T protein:vir:96 134 -------------------------------PGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNF 182 (576) T ss_pred -------------------------------cchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEE Confidence 00000 0011111110 0 1345567778889999999998 Q ss_pred EEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccc Q lcl|NC_021297. 113 TVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWV 191 (480) Q Consensus 113 ~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (480) ++.... +..|++ .+..++|..+.++.+... ........|....+++ ....+.++.++++... 
T Consensus 183 ~i~~~r----d~~g~~~~L~pl~p~~V~v~~~~dg--~~~~~~~~~~~~~~~~---~~~~~~~~dii~~~~~-------- 245 (576) T protein:vir:96 183 EKVFNK----KNATTMDKFIAVDPSTIFYATDKNG--KIIKGGKRFVQVINKK---VVASFTSREMAMGIRN-------- 245 (576) T ss_pred EEEEec----CCCCceEEEEEeCCceeEEEECCCC--ceeeeeeEEEEecCCc---eEEEecccceEEEeec-------- Confidence 865321 223443 578899999988887542 1111111111111111 1111222222221110 Q ss_pred cccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHh---cchhhhhc--CC-Cccccc Q lcl|NC_021297. 192 VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLRVIS--GV-TTDELT 265 (480) Q Consensus 192 ~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~---~~~~~~i~--g~-~~~~~~ 265 (480) +......+.+|.|.|. .+.+.+...+.-......+| +.|-.+|. |. ...... T Consensus 246 ------------------~~~d~~~~~~G~Spi~----~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~ 303 (576) T protein:vir:96 246 ------------------PRTELSSSGYGLSEVE----IAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRA 303 (576) T ss_pred ------------------CCCCcccCcccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHH Confidence 0011122456777654 23333332222222222333 33443332 31 111111 Q ss_pred cccccchhhh------hhhhh-cccCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHH---- Q lcl|NC_021297. 266 NDGENTTLDI------YYGRI-LTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA---- 333 (480) Q Consensus 266 ~~~~~~~~~~------~~~~~-~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg---- 333 (480) .+.-...|.. ..+++ ..+ +++.++..+... 
.-..|++..+..+.+|+...++|+..+|....+.++| T Consensus 304 ~~~lr~~~~~~~~G~~nag~~p~vl-~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~ 382 (576) T protein:vir:96 304 LENFKREWKSSFSGINGSWQVPVVM-ADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGG 382 (576) T ss_pred HHHHHHHHHHHhccccccccceeec-CCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccc Confidence 1111112221 11222 233 345777777643 2335788889999999999999999998532111100 Q ss_pred -----HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccc Q lcl|NC_021297. 334 -----EAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQ 407 (480) Q Consensus 334 -----~Al~~~~~~l~~k~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~ 407 (480) ..+......+...+ .-....+...|.+ .+.. .....+.+.|.+....+..+... +.+... . T Consensus 383 ~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~------~Ll~----~~~~~~~~~f~r~d~~~~~e~~~-~~~~~~--~ 449 (576) T protein:vir:96 383 NTLNEADPGKKQQQSQNKGLQPLLRFIEDLINT------HIIS----EYSDKYVFQFVGGDTKSELDKIK-ILQEEV--K 449 (576) T ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHHHh------hhch----hccCceEEEeccCCHHHHHHHHH-HHHHHh--c Confidence 11111111111111 1111111111111 0110 01123556676554444433332 222222 3 Q ss_pred cCCCHHHHHHhCCCChhHH-HHHH---------HH------HHHHHHHHHHHHhcccccCcccccCCCCC--CCCccccc Q lcl|NC_021297. 408 GPIPKEQARIDLGYTATQR-EQMR---------DW------DKQETEDMIDTLYSTTKAQADATPKPTVT--DTKTETQT 469 (480) Q Consensus 408 ~~~s~et~~~~lg~~~~~~-~~~~---------~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--d~~~~~~~ 469 (480) ++++...+++.+|+.+-+- +.+. .. ..+..++..+.......+.....+.+..+ ...+.... T Consensus 450 G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~ 529 (576) T protein:vir:96 450 TYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESN 529 (576) T ss_pred CccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccc Confidence 5677777788887653210 0000 00 00001111111111111111111111111 11111112 Q ss_pred CCCCCCCCCCC Q lcl|NC_021297. 
470 SPSGFNRTKTR 480 (480) Q Consensus 470 ~p~~~~~~~~~ 480 (480) .+++.++.=+| T Consensus 530 ~~~~~~~~~~~ 540 (576) T protein:vir:96 530 DPTKIDSPVGT 540 (576) T ss_pred cCCCCCCcccc Confidence 22233322222 No 186 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=98.09 E-value=4.9e-06 Score=49.71 Aligned_cols=380 Identities=13% Similarity=0.091 Sum_probs=155.8 Q ss_pred CCCHHHHHHHHH-HHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhh------hc-cccchHHHHHHHHHhhhccCcc Q lcl|NC_021297. 1 MTTYHEHVERLQ-GLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY------LD-VQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~-~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~------~~-~~~n~~~~iVd~~~~~l~~~~~ 72 (480) |.-..++-+-+. +.+...+.....- +-+.-...........+.-+.. .+ ........+|+..+.-+-..+| T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~ 79 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEES-KAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTI 79 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhh-hhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCce Confidence 333222222211 1221111100000 0000000000000000000000 01 1123445556655555444444 Q ss_pred cc---CCC--chhHHHHHHHHH--hc---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEE Q lcl|NC_021297. 73 RI---SED--SEGLEELWNWWQ--AN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAE 141 (480) Q Consensus 73 ~~---~~d--~~~~~~l~~i~~--~n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v 141 (480) .+ .++ ......+..++. -| ....+...+..+.+.+|.||+++..+.. .+.+ .+..++|..+.+. 
T Consensus 80 ~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~-----g~~~~~L~~l~~~~v~~~ 154 (413) T protein:vir:96 80 QLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVS-----GDKIIGLTPISPYKVTFN 154 (413) T ss_pred EEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-----CCceEEEEEecCceeEEE Confidence 32 211 122233444443 23 2356678888999999999999875321 1233 5778889888877 Q ss_pred EeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccc-cCccCC Q lcl|NC_021297. 142 LDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR-LGNRYG 220 (480) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~-~~~~~G 220 (480) +++.. + + |.....++ .+.++ -|+||+.+.. ..+..| T Consensus 155 ~~~~~---~----~-y~~~~~~~------~~~~~-----------------------------evih~k~~~~~~~~~~G 191 (413) T protein:vir:96 155 VSDDD---L----D-YSITFDNK------EYDPS-----------------------------TLLHFVLNPSIERPFIG 191 (413) T ss_pred EcCCe---E----E-EEEeecCc------EEchh-----------------------------hEEEEeccCCCCCcccc Confidence 65321 1 1 11111110 01111 2455553322 233457 Q ss_pred cccchhhHHHHHHHHHHHHHH---HHHHHHHhcchhhhhcCCC-ccccccccccchhhh------hhhhhcccCCCCceE Q lcl|NC_021297. 221 RSEISPELRKVTDAASRTLMN---LQSASQILGTPLRVISGVT-TDELTNDGENTTLDI------YYGRILTLASEAAKI 290 (480) Q Consensus 221 ~s~i~~~v~~liD~~~~~~s~---~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~ 290 (480) .|.+. .+.+.+.....- ....+...+.|-.++.-.. ......+.-...|+- ..++++.++++..++ T Consensus 192 ~s~~~----~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~ 267 (413) T protein:vir:96 192 TGYKV----ALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNV 267 (413) T ss_pred ccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCcccc Confidence 77553 333333332222 2222233334444443211 111111111111211 112334444444444 Q ss_pred EEe---ccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
291 SEF---KAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ 367 (480) Q Consensus 291 ~~~---~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~ 367 (480) .++ +..+. .+++..+..+.+|+...++|+..+|....+ +..+..+....|.-.+...+. .|.+ . T Consensus 268 ~~~~~~~~~d~-q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~--~~~~~~~~~~~l~P~~~~ie~----~ln~------~ 334 (413) T protein:vir:96 268 QQIKPLTLNDL-AINDAVTLDKKTVAGIFGVPAFLLGVGTYN--KDEFNNFINTKIMSIAQVIQQ----TYNK------L 334 (413) T ss_pred cccccCChhHH-HHHHHHHHHHHHHHHHhCCCHHHcCCCcch--HHHHHHHHHHHHHHHHHHHHH----HHHH------h Confidence 333 22223 467888888999999999999999753221 222222222222221111111 1111 1 Q ss_pred HhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_021297. 368 IMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYST 447 (480) Q Consensus 368 ~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (480) +.. ....+++.+......+..+.++++.+++.+| +++...+++++|+.+.+- ..+. .-..+-. T Consensus 335 ll~-----~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~p~~~--gd~~--------~~~~n~~ 397 (413) T protein:vir:96 335 IVE-----EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLN--ALRRNEFRNWVGMPPDAE--MDDL--------LVLENYL 397 (413) T ss_pred hCC-----CCcEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--ccee--------eeccccc Confidence 111 1234566666677788889999999998875 667777788888866431 0000 0000000 Q ss_pred cccCcccccCCCCCCCCccc Q lcl|NC_021297. 
448 TKAQADATPKPTVTDTKTET 467 (480) Q Consensus 448 ~~~~~~~~~~~~~~d~~~~~ 467 (480) ............++ ++ T Consensus 398 ~~~~~~~~~~~~~~----dt 413 (413) T protein:vir:96 398 QQKDLVNQKKLIQD----ET 413 (413) T ss_pred chhhcccccCCCCC----CC Confidence 00000000001111 11 No 187 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.08 E-value=5.2e-06 Score=49.57 Aligned_cols=419 Identities=10% Similarity=0.053 Sum_probs=158.7 Q ss_pred CCCHHHHHHHHHHHHHHH---------HHHH---HHHH------------HH--hcccCcchhcCcc-cch------hhh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARD---------LPNL---LEAE------------AY--RNGTRRLKTIGIG-APP------ELA 47 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~---------~~r~---~~~~------------~Y--Y~g~~~i~~~~~~-~~~------~~~ 47 (480) -.--+.++.+-.-.|+.. ..+. .++. +. +.+.. ....+.. .|. ... T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es-~s~vtsls~pdaf~~vnVs~ 118 (945) T protein:vir:10 40 YPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPES-LMYLPSISDPDAFFLINLFR 118 (945) T ss_pred CCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCcc-ceecccccCccceeeehhhh Confidence 111122222211111110 0000 0000 00 11111 1010000 000 001 Q ss_pred hhccccchHHHHHHHHHhhhccCcccc---CCCc---------hhHHHHHHHHHh-cCH-------HHHHHHHHHHHhhc Q lcl|NC_021297. 48 YLDVQPGWVATYLRTLSDRLDIEGFRI---SEDS---------EGLEELWNWWQA-NDL-------DEESVLGHDDSLTF 107 (480) Q Consensus 48 ~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~---------~~~~~l~~i~~~-n~~-------~~~~~~~~~~a~~~ 107 (480) +......-...+|+..++-+-.-++.+ .++. .....+..++.+ |.. 
......+..+.+.+ T Consensus 119 ~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~ 198 (945) T protein:vir:10 119 KYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTI 198 (945) T ss_pred hhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhc Confidence 111111122334555544443333321 1111 112345555543 321 12445677899999 Q ss_pred CeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCc Q lcl|NC_021297. 108 GRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGL 186 (480) Q Consensus 108 G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (480) |.+|+.+.++ .+|++ .+..++|.++.+..++... .. .+|....+++. ...+.+...+++.+. T Consensus 199 GNAYieIiRd------~~G~ii~L~pLdPs~Vti~~ddDG~-~~----y~Yv~~idG~~---~~~v~a~DvIlhirn--- 261 (945) T protein:vir:10 199 DRGAIVKIRD------EQGNLVAITPVDGTTIKPILSEDTG-IV----VGYVQEVDGAI---VAHFDKRDVVLFRQN--- 261 (945) T ss_pred CCeEEEEEEC------CCCcEEEEEEECCcceEEEEcCCCc-EE----EEEEEecCCce---EEEecCCceEEEecc--- Confidence 9999988653 34555 5888999998887765432 11 11211112111 111222222211110 Q ss_pred ccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc----chhhhh--cCCC Q lcl|NC_021297. 187 NDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG----TPLRVI--SGVT 260 (480) Q Consensus 187 ~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~----~~~~~i--~g~~ 260 (480) .+-......+|.|.| ..+.+++...+.-......+|. .|..++ .+.. 
T Consensus 262 -----------------------~s~DG~~~GyGlSPI----eaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~ 314 (945) T protein:vir:10 262 -----------------------LTPDVYMYGYSLPPI----EILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPS 314 (945) T ss_pred -----------------------CCCCcccccCCchHH----HHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCcc Confidence 000011122466654 3444455444433333334432 333232 2211 Q ss_pred ccccc-----cccc----cchhhh-----hhhhhcccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhcc Q lcl|NC_021297. 261 TDELT-----NDGE----NTTLDI-----YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSS 325 (480) Q Consensus 261 ~~~~~-----~~~~----~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~ 325 (480) ..... ..+. ...|+. ..+..+.+ +++.++.++.... -..+++..+..+.+|+...++|+..+|. T Consensus 315 ~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVL-deGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~ 393 (945) T protein:vir:10 315 YKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILS-GGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGI 393 (945) T ss_pred ccccccccccCHHHHHHHHHHHHHHhCCcccccceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccc Confidence 10000 0000 011111 01122223 3456777765432 2347788888999999999999999985 Q ss_pred ccccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHh Q lcl|NC_021297. 326 SSENPASAEAIIATDSRIVKM-AERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYA 404 (480) Q Consensus 326 ~~~n~~Sg~Al~~~~~~l~~k-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~ 404 (480) ...+.-| .+.......... ..-....+...+.+.+ .. 
......+.+.|......+..+.++++.++.+ T Consensus 394 ~e~st~S--NiEqq~~~Fv~~tL~Pil~~IEqeLNrkL---l~------~~eg~~i~fdFd~ldl~D~ksraEal~kli~ 462 (945) T protein:vir:10 394 LEGSNKA--TAEVMASLTKAKGLEPLMATISKGFDEVV---SE------FRNEKDIKLWFKEDDLEKERDWWNIIQGQLN 462 (945) T ss_pred CCCCCcc--hHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cc------cccCceeEEEecchhccCHHHHHHHHHHHHh Confidence 4332112 122222222111 1111122222222111 11 1112356778876666777888988888887 Q ss_pred ccccCCCHHHHHHhCCCChhHH-HHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccc--ccCCCCCCCCCCC Q lcl|NC_021297. 405 NGQGPIPKEQARIDLGYTATQR-EQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTET--QTSPSGFNRTKTR 480 (480) Q Consensus 405 ~~~~~~s~et~~~~lg~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~--~~~p~~~~~~~~~ 480 (480) +| +++...+++.+|+.+-+- +... .....-...+......++...+......+++++.. ....++...+.+| T Consensus 463 sG--iLTiNEvRe~lGLpPIeGGD~ll--i~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~k 537 (945) T protein:vir:10 463 TG--FRSINEARMEKGLEPVPWGDVPF--SGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQK 537 (945) T ss_pred CC--CcCHHHHHHHhCCCCCCCcceee--eccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCccc Confidence 65 667777888887754320 0000 00000000000000000000000000000000000 0001111122222 No 188 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=98.07 E-value=5.2e-06 Score=49.54 Aligned_cols=367 Identities=12% Similarity=0.098 Sum_probs=131.1 Q ss_pred HHHHHHHhcccCcchh-----cCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCchhHHHHHHHHHh--c-- Q lcl|NC_021297. 22 LLEAEAYRNGTRRLKT-----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEELWNWWQA--N-- 91 (480) Q Consensus 22 ~~~~~~YY~g~~~i~~-----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~l~~i~~~--n-- 91 (480) +..+.+.+..+..... .+..+.. ...........+|+..++-+-..++.+- .++.....+..++.. 
| T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~v~~---~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~ 77 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLDLDMIEDLSQ---QAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTD 77 (395) T ss_pred CchhhhhhccCccccccccchhccccch---hhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcC Confidence 1111222211111100 0011100 1111122333344444433333333221 122223334444432 3 Q ss_pred -CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEE--EEeCCCCCeeEEEEEEEEEecCCceEEE Q lcl|NC_021297. 92 -DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA--ELDPRNTRRVTRAVRLYTTRDDVAVPDR 168 (480) Q Consensus 92 -~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~--v~d~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (480) ....+...+..+.+..|.+|+++..+ +.. ...++..+.+ +++.. ...+ .....+ + T Consensus 78 ~t~~~f~~~~~~~lll~g~~~~~~~~~--------~~~--~~~~~~~~~~~~~~~~~-----~~~~----~~~~~~---~ 135 (395) T protein:vir:10 78 LSSDSFWQQVIYKLIYDNEVLIVVSDS--------KEL--LIADSFYREEYALYDDI-----FKDV----TVKDYT---Y 135 (395) T ss_pred CCHHHHHHHHHHHHhhCCceEEEEecC--------CCe--EecCCccceeEeecCcc-----eeEE----EEcCce---e Confidence 22345566777778888888766421 111 1122211111 11100 0000 000100 0 Q ss_pred EEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 169 ATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQI 248 (480) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~ 248 (480) ...+.+++ |+||..+......+|.|.|.- ....++ ..+ ..... T Consensus 136 ~~~~~~~e-----------------------------vih~~~~~~~~~~~G~spi~~-~~~~~~---~~~----~~~~~ 178 (395) T protein:vir:10 136 QRTFTMQE-----------------------------VIYLKYNNNKVTHFVESLFED-YGKIFG---RMI----GAQLK 178 (395) T ss_pred eeeecccc-----------------------------EEEEccCCCCcccccchHHHH-HHHHHH---HHH----HHHHh Confidence 01112222 444443333334567665431 222222 222 22222 Q ss_pred hcchhhhhc--CCCccccccccccchhhhh-------hhhhcccCCCCceEEEecccc------HHHHHHHHHHHHHHHH Q lcl|NC_021297. 
249 LGTPLRVIS--GVTTDELTNDGENTTLDIY-------YGRILTLASEAAKISEFKAAE------LRNFAEEMEVFRKEAA 313 (480) Q Consensus 249 ~~~~~~~i~--g~~~~~~~~~~~~~~~~~~-------~~~~~~~~g~~~~~~~~~~~~------~~~~~~~l~~~~~~i~ 313 (480) .+.+--+|. +...++...+.-...|+.. ...++.+ +++.++.+++... ...|++..+....+|+ T Consensus 179 ~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l-~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia 257 (395) T protein:vir:10 179 NYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPL-IEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVA 257 (395) T ss_pred cCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEc-CCCceeeeccccccccchhHHHHHHHHHHHHHHHH Confidence 222222221 1111111111111112111 1112222 2345665554322 2247888888899999 Q ss_pred hccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHH Q lcl|NC_021297. 314 SITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 314 ~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~ 393 (480) ...++|+..+|++..| .+.....+....|.-.+...+..+.. .+....... ..+++.+......|.. T Consensus 258 ~~f~VPp~~l~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~----------kL~~~~~~~--~~~~f~~~~l~~~D~~ 324 (395) T protein:vir:10 258 LMIGIPPGLIYGETAD-LEKNTLVFEKFCLTPLLKKIQNELNA----------KLITQSMYL--KDTRIEIVGVNKKDPL 324 (395) T ss_pred HHhCCCHHHhcCcccC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hhcChhhhc--ccceecchhhhccCHH Confidence 9999999999865333 12212222222222222222221111 111111111 1234455556667888 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 394 AKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) +.++++.+++.+| +++...+++.+|+.+-+-....+..--..-...+.........++..+. 
.+..++ T Consensus 325 ~~~~~~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~k----------gg~~~~ 392 (395) T protein:vir:10 325 QYAEAIDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLK----------GGDEDE 392 (395) T ss_pred HHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccC----------CCCCCC Confidence 8999999998875 5667777788877543210000000000000000000000000000011 111111 Q ss_pred CCC Q lcl|NC_021297. 474 FNR 476 (480) Q Consensus 474 ~~~ 476 (480) .|. T Consensus 393 ~g~ 395 (395) T protein:vir:10 393 SGD 395 (395) T ss_pred CCC Confidence 111 No 189 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=98.07 E-value=5.2e-06 Score=49.54 Aligned_cols=367 Identities=12% Similarity=0.098 Sum_probs=131.1 Q ss_pred HHHHHHHhcccCcchh-----cCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCchhHHHHHHHHHh--c-- Q lcl|NC_021297. 22 LLEAEAYRNGTRRLKT-----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEELWNWWQA--N-- 91 (480) Q Consensus 22 ~~~~~~YY~g~~~i~~-----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~l~~i~~~--n-- 91 (480) +..+.+.+..+..... .+..+.. ...........+|+..++-+-..++.+- .++.....+..++.. | T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~v~~---~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~ 77 (395) T protein:vir:95 1 MSILEKIFKTRKDITYMLDLDMIEDLSQ---QAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTD 77 (395) T ss_pred CchhhhhhccCccccccccchhccccch---hhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcC Confidence 1111222211111100 0011100 1111122333344444433333333221 122223334444432 3 Q ss_pred -CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEE--EEeCCCCCeeEEEEEEEEEecCCceEEE Q lcl|NC_021297. 
92 -DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA--ELDPRNTRRVTRAVRLYTTRDDVAVPDR 168 (480) Q Consensus 92 -~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~--v~d~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (480) ....+...+..+.+..|.+|+++..+ +.. ...++..+.+ +++.. ...+ .....+ + T Consensus 78 ~t~~~f~~~~~~~lll~g~~~~~~~~~--------~~~--~~~~~~~~~~~~~~~~~-----~~~~----~~~~~~---~ 135 (395) T protein:vir:95 78 LSSDSFWQQVIYKLIYDNEVLIVVSDS--------KEL--LIADSFYREEYALYDDI-----FKDV----TVKDYT---Y 135 (395) T ss_pred CCHHHHHHHHHHHHhhCCceEEEEecC--------CCe--EecCCccceeEeecCcc-----eeEE----EEcCce---e Confidence 22345566777778888888766421 111 1122211111 11100 0000 000100 0 Q ss_pred EEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 169 ATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQI 248 (480) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~ 248 (480) ...+.+++ |+||..+......+|.|.|.- ....++ ..+ ..... T Consensus 136 ~~~~~~~e-----------------------------vih~~~~~~~~~~~G~spi~~-~~~~~~---~~~----~~~~~ 178 (395) T protein:vir:95 136 QRTFTMQE-----------------------------VIYLKYNNNKVTHFVESLFED-YGKIFG---RMI----GAQLK 178 (395) T ss_pred eeeecccc-----------------------------EEEEccCCCCcccccchHHHH-HHHHHH---HHH----HHHHh Confidence 01112222 444443333334567665431 222222 222 22222 Q ss_pred hcchhhhhc--CCCccccccccccchhhhh-------hhhhcccCCCCceEEEecccc------HHHHHHHHHHHHHHHH Q lcl|NC_021297. 249 LGTPLRVIS--GVTTDELTNDGENTTLDIY-------YGRILTLASEAAKISEFKAAE------LRNFAEEMEVFRKEAA 313 (480) Q Consensus 249 ~~~~~~~i~--g~~~~~~~~~~~~~~~~~~-------~~~~~~~~g~~~~~~~~~~~~------~~~~~~~l~~~~~~i~ 313 (480) .+.+--+|. +...++...+.-...|+.. ...++.+ +++.++.+++... 
...|++..+....+|+ T Consensus 179 ~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l-~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia 257 (395) T protein:vir:95 179 NYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPL-IEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVA 257 (395) T ss_pred cCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEc-CCCceeeeccccccccchhHHHHHHHHHHHHHHHH Confidence 222222221 1111111111111112111 1112222 2345665554322 2247888888899999 Q ss_pred hccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHH Q lcl|NC_021297. 314 SITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 314 ~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~ 393 (480) ...++|+..+|++..| .+.....+....|.-.+...+..+.. .+....... ..+++.+......|.. T Consensus 258 ~~f~VPp~~l~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~----------kL~~~~~~~--~~~~f~~~~l~~~D~~ 324 (395) T protein:vir:95 258 LMIGIPPGLIYGETAD-LEKNTLVFEKFCLTPLLKKIQNELNA----------KLITQSMYL--KDTRIEIVGVNKKDPL 324 (395) T ss_pred HHhCCCHHHhcCcccC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hhcChhhhc--ccceecchhhhccCHH Confidence 9999999999865333 12212222222222222222221111 111111111 1234455556667888 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 394 AKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) +.++++.+++.+| +++...+++.+|+.+-+-....+..--..-...+.........++..+. .+..++ T Consensus 325 ~~~~~~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~k----------gg~~~~ 392 (395) T protein:vir:95 325 QYAEAIDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLK----------GGDEDE 392 (395) T ss_pred HHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccC----------CCCCCC Confidence 8999999998875 5667777788877543210000000000000000000000000000011 111111 Q ss_pred CCC Q lcl|NC_021297. 
474 FNR 476 (480) Q Consensus 474 ~~~ 476 (480) .|. T Consensus 393 ~g~ 395 (395) T protein:vir:95 393 SGD 395 (395) T ss_pred CCC Confidence 111 No 190 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=98.07 E-value=5.2e-06 Score=49.54 Aligned_cols=367 Identities=12% Similarity=0.098 Sum_probs=131.1 Q ss_pred HHHHHHHhcccCcchh-----cCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCchhHHHHHHHHHh--c-- Q lcl|NC_021297. 22 LLEAEAYRNGTRRLKT-----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEELWNWWQA--N-- 91 (480) Q Consensus 22 ~~~~~~YY~g~~~i~~-----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~l~~i~~~--n-- 91 (480) +..+.+.+..+..... .+..+.. ...........+|+..++-+-..++.+- .++.....+..++.. | T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~v~~---~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~ 77 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLDLDMIEDLSQ---QAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTD 77 (395) T ss_pred CchhhhhhccCccccccccchhccccch---hhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcC Confidence 1111222211111100 0011100 1111122333344444433333333221 122223334444432 3 Q ss_pred -CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEE--EEeCCCCCeeEEEEEEEEEecCCceEEE Q lcl|NC_021297. 92 -DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA--ELDPRNTRRVTRAVRLYTTRDDVAVPDR 168 (480) Q Consensus 92 -~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~--v~d~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (480) ....+...+..+.+..|.+|+++..+ +.. ...++..+.+ +++.. 
...+ .....+ + T Consensus 78 ~t~~~f~~~~~~~lll~g~~~~~~~~~--------~~~--~~~~~~~~~~~~~~~~~-----~~~~----~~~~~~---~ 135 (395) T protein:vir:10 78 LSSDSFWQQVIYKLIYDNEVLIVVSDS--------KEL--LIADSFYREEYALYDDI-----FKDV----TVKDYT---Y 135 (395) T ss_pred CCHHHHHHHHHHHHhhCCceEEEEecC--------CCe--EecCCccceeEeecCcc-----eeEE----EEcCce---e Confidence 22345566777778888888766421 111 1122211111 11100 0000 000100 0 Q ss_pred EEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 169 ATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQI 248 (480) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~ 248 (480) ...+.+++ |+||..+......+|.|.|.- ....++ ..+ ..... T Consensus 136 ~~~~~~~e-----------------------------vih~~~~~~~~~~~G~spi~~-~~~~~~---~~~----~~~~~ 178 (395) T protein:vir:10 136 QRTFTMQE-----------------------------VIYLKYNNNKVTHFVESLFED-YGKIFG---RMI----GAQLK 178 (395) T ss_pred eeeecccc-----------------------------EEEEccCCCCcccccchHHHH-HHHHHH---HHH----HHHHh Confidence 01112222 444443333334567665431 222222 222 22222 Q ss_pred hcchhhhhc--CCCccccccccccchhhhh-------hhhhcccCCCCceEEEecccc------HHHHHHHHHHHHHHHH Q lcl|NC_021297. 249 LGTPLRVIS--GVTTDELTNDGENTTLDIY-------YGRILTLASEAAKISEFKAAE------LRNFAEEMEVFRKEAA 313 (480) Q Consensus 249 ~~~~~~~i~--g~~~~~~~~~~~~~~~~~~-------~~~~~~~~g~~~~~~~~~~~~------~~~~~~~l~~~~~~i~ 313 (480) .+.+--+|. +...++...+.-...|+.. ...++.+ +++.++.+++... ...|++..+....+|+ T Consensus 179 ~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l-~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia 257 (395) T protein:vir:10 179 NYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPL-IEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVA 257 (395) T ss_pred cCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEc-CCCceeeeccccccccchhHHHHHHHHHHHHHHHH Confidence 222222221 1111111111111112111 1112222 2345665554322 2247888888899999 Q ss_pred hccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHH Q lcl|NC_021297. 
314 SITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 314 ~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~ 393 (480) ...++|+..+|++..| .+.....+....|.-.+...+..+.. .+....... ..+++.+......|.. T Consensus 258 ~~f~VPp~~l~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~----------kL~~~~~~~--~~~~f~~~~l~~~D~~ 324 (395) T protein:vir:10 258 LMIGIPPGLIYGETAD-LEKNTLVFEKFCLTPLLKKIQNELNA----------KLITQSMYL--KDTRIEIVGVNKKDPL 324 (395) T ss_pred HHhCCCHHHhcCcccC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hhcChhhhc--ccceecchhhhccCHH Confidence 9999999999865333 12212222222222222222221111 111111111 1234455556667888 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 394 AKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) +.++++.+++.+| +++...+++.+|+.+-+-....+..--..-...+.........++..+. .+..++ T Consensus 325 ~~~~~~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~k----------gg~~~~ 392 (395) T protein:vir:10 325 QYAEAIDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLK----------GGDEDE 392 (395) T ss_pred HHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccC----------CCCCCC Confidence 8999999998875 5667777788877543210000000000000000000000000000011 111111 Q ss_pred CCC Q lcl|NC_021297. 474 FNR 476 (480) Q Consensus 474 ~~~ 476 (480) .|. 
T Consensus 393 ~g~ 395 (395) T protein:vir:10 393 SGD 395 (395) T ss_pred CCC Confidence 111 No 191 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.04 E-value=6.3e-06 Score=49.10 Aligned_cols=419 Identities=14% Similarity=0.089 Sum_probs=156.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCccc-chhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA-PPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |=....-|..|-+ +...+... ....-+.+.- -..+.... +..+++.--...+...+|+..+..+...++.+..+.. T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~-~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~ 77 (540) T protein:vir:41 1 MFNYHLSIKSLEK-YRAIKGDT-DSQALKEDRF-EEYVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDG 77 (540) T ss_pred CCCcccChhhccc-hhhhhccc-cccccccCCC-CccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCcc Confidence 1111111111100 10000000 0000011100 00010011 1222333223456677788888877777776543322 Q ss_pred hHHHHHHHHHhc---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 80 GLEELWNWWQAN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 80 ~~~~l~~i~~~n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) .. ..+. .| ....+...+..+.+.+|.||+.+..+ ..|.+ .+..++|..+.+..+.. 
+ T Consensus 78 ~~---~~~l-pN~~~t~~~f~~~~v~dlll~Gnayv~i~r~------~~G~~~~L~~i~~~~V~v~~~~~---------~ 138 (540) T protein:vir:41 78 GV---EELL-RACRPSFEFILLQALEDLQVFNYCTLEVVRD------DQGEPVRLDYIPAHTVRVHRDGS---------R 138 (540) T ss_pred ch---hhhc-cCCCCCHHHHHHHHHHHHHhcCCeEEEEEEC------CCCcEEEEEEeCCcceEEeEcCc---------e Confidence 21 1221 22 34566778888999999999987653 23544 57888898887665432 1 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) ++...+.. ...++..|.....+.+ ..+. ....++.=-|+||++.....+.+|.|.+. .+..++ T Consensus 139 ~~~~~d~~-~~~~~~~~~~~~~~~~--~~g~----------~~~~~~~~eViHir~~~~~~~~~G~Spi~----~~~~~i 201 (540) T protein:vir:41 139 YMQTWDGI-HVTYFKDYRYEGEVNP--DNGE----------DQDGVGANEIIFIHLPSPICSYYGVPRYL----SAAPSI 201 (540) T ss_pred eEeeecCc-eeeeeecccccceeec--cccc----------cceeecccceEEecCCCCCCCcccccHHH----HHHHHH Confidence 22112211 1222222222111111 0000 00111222367787655556788998764 333343 Q ss_pred HHHHHHHHHHHHHhc---chhhhh--cCCCcccccc-ccc----cchhh-----------hhhhhhcccC-----CCCce Q lcl|NC_021297. 236 SRTLMNLQSASQILG---TPLRVI--SGVTTDELTN-DGE----NTTLD-----------IYYGRILTLA-----SEAAK 289 (480) Q Consensus 236 ~~~~s~~~~~~~~~~---~~~~~i--~g~~~~~~~~-~~~----~~~~~-----------~~~~~~~~~~-----g~~~~ 289 (480) .....-......+|. .|-.+| .|...+.... ... 
...++ -..++++.++ +++.+ T Consensus 202 ~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~ 281 (540) T protein:vir:41 202 LAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVT 281 (540) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCccccee Confidence 333222222233333 233232 2321111100 000 00111 0112233322 23456 Q ss_pred EEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhcccc---ccchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 290 ISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAII--ATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 290 ~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~n~~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) +..+.... -..|++..+....+|+...++|+..+|... .|-++..... +....|.-.+ ..+...|.+.+ T Consensus 282 ~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~----~~ie~~ln~~L- 356 (540) T protein:vir:41 282 FTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQ----EIVSSVLTDFI- 356 (540) T ss_pred EEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHH----HHHHHHHHHhh- Confidence 76665332 224888889999999999999999997432 1212222221 1222222222 22222222211 Q ss_pred HHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 364 IAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMRDWDKQETEDMID 442 (480) Q Consensus 364 ~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~l-g~~~~~~~~~~~~~~~~~~~~~~ 442 (480) +. . .. ..+.+.|....... .+.+..+.+++++| +++..-+++.| |..+-...-+.-. .-... 
T Consensus 357 --~~--~--~~---~~~~i~f~~~~ll~-~D~~~~~~~lv~~G--~lT~NE~Re~L~g~e~gdd~~l~p~-----n~~~~ 419 (540) T protein:vir:41 357 --QL--K--LD---PGARFVFNEEILME-SEFVHNYALLVQCG--VLTPSEVREKLFGLDGGPDMFMVPS-----SIGKS 419 (540) T ss_pred --hh--c--cC---CceEEEecchhhcc-hHHHHHHHHHHhCC--CCCHHHHHHHhCcCcCCCccccccc-----ccccc Confidence 11 1 11 13445564332221 23333445566654 55655556543 5432211101000 00000 Q ss_pred HHhcccccCccccc-CCCCCCCCcc--cccCCC-C-----------CCCCCCC Q lcl|NC_021297. 443 TLYSTTKAQADATP-KPTVTDTKTE--TQTSPS-G-----------FNRTKTR 480 (480) Q Consensus 443 ~~~~~~~~~~~~~~-~~~~~d~~~~--~~~~p~-~-----------~~~~~~~ 480 (480) .+.........+.+ +......+.+ .++..+ . ..+...| T Consensus 420 ~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (540) T protein:vir:41 420 AMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFR 472 (540) T ss_pred cccccccccCCCCccccccccchhcccccCccccccccccccccccccccccC Confidence 00000000000000 0000000000 000000 0 0001111 No 192 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.03 E-value=6.6e-06 Score=48.99 Aligned_cols=393 Identities=11% Similarity=0.011 Sum_probs=159.7 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCC--- Q lcl|NC_021297. 5 HEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISED--- 77 (480) Q Consensus 5 ~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d--- 77 (480) .-+++.|...... .......+...+.+... ...+..+.... -+.+.-...+|+..+.-+-..++. 
..++ T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~-~~~g~~v~~~~---al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~ 76 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYD-TYTGKQISSQR---AMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQ 76 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCcc-ccCCceechhh---hhccHHHHHHHHHHHHHhccCceEEEEecCCcee Confidence 2222222222111 00001111122211110 00111111110 011112233344333333222332 1211 Q ss_pred chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 78 SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 78 ~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) ......+..++.. | ....+...+..+.+.+|.||+++..+ .|.+ .+..++|..+.+.++... .+. T Consensus 77 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-------~g~~~~L~~l~~~~v~~~~~~~~--~~~ 147 (414) T protein:vir:44 77 RATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-------FGEVAELLPVDPGCVVPKLNSSW--EPV 147 (414) T ss_pred ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-------CCcEEEEEEEcCceEEEEECCCC--cEE Confidence 1112234444432 2 34567778888999999999988642 2444 578899999888876432 221 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHH Q lcl|NC_021297. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) |+ .....+.. ..+.+++ |+||.+. ...+++|.|.+.- +... T Consensus 148 ----y~-~~~~~g~~---~~~~~~e-----------------------------vih~~~~-~~d~~~G~s~i~~-~~~~ 188 (414) T protein:vir:44 148 ----YQ-VTFPDGST---DVLSQED-----------------------------IWHVRTL-TLDGLVGLNPIAY-AREA 188 (414) T ss_pred ----EE-EEecCceE---EEEcccc-----------------------------EEEecCC-CCCCcccccHHHH-HHHH Confidence 11 11111111 1122222 3344432 2234668776532 3333 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEeccccH-HHHHH Q lcl|NC_021297. 
232 TDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAEL-RNFAE 303 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~~-~~~~~ 303 (480) ++.......-..+.+...+.|..++.-. .......+.-...|+- ..++++.++ ++.++.++..... ..|++ T Consensus 189 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d~~~~e 267 (414) T protein:vir:44 189 ISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILE-MGLDWKSMALNAEDSQFLE 267 (414) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEccCChHHHHHHH Confidence 3322222222222222223344444311 1111101100111111 112344444 3467777654322 24778 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~ 383 (480) ..+....+|+...++|+..+|....+.-| .+..+...+...+ -.-+-..++..+.. .+...... ....+++. T Consensus 268 ~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~--n~e~~~~~~~~~~---l~P~~~~ie~~ln~--~L~~~~~~-~~~~i~fd 339 (414) T protein:vir:44 268 TRKFQLEEICRLFRVPLHMVQNTDRATFN--NIEELGLGFINYS---LVPYLTRIEQRINT--GLVRKSKQ-GVFYAKFN 339 (414) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHHH---HHHHHHHHHHHHHh--hcCCcccc-CceEEEEe Confidence 88888999999999999999854322111 1111111111111 01111112111111 11111111 12235555 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) +......|..+.++++.+++++| +++...+++.+|+.+-+- . +.+...... +..+.. .. 
T Consensus 340 ~~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g----------g----D~~~~~~n~----~~~~~~-~~ 398 (414) T protein:vir:44 340 AGALLRGDMKSRFEAYATGINWG--IYSPNDCRDLEDMNPRPG----------G----DVYLTPMNM----TTKPSD-GS 398 (414) T ss_pred chhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC----------c----ceecccccc----cccCCc-cc Confidence 55666678889999999988765 667777788888764320 0 000000000 000111 11 Q ss_pred CcccccCCCCCCCCCC Q lcl|NC_021297. 464 KTETQTSPSGFNRTKT 479 (480) Q Consensus 464 ~~~~~~~p~~~~~~~~ 479 (480) +.+.++.++..+.++| T Consensus 399 ~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 399 KAGKQKDNANADETTS 414 (414) T ss_pred cCCCCCCCCCCCCCCC Confidence 1222333444455555 No 193 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.02 E-value=6.7e-06 Score=48.95 Aligned_cols=369 Identities=10% Similarity=0.000 Sum_probs=142.5 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchhHHH Q lcl|NC_021297. 5 HEHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~ 83 (480) .-+++++...-.... .........+-+- . ..+..+... .-+...-...+|+..+.-+-.-+|.+-... .+. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~v~~~---~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~--~~~ 72 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLAS--L-KGNEWVSAE---TALRNSDLFSIINQLSNDLATVKLITSRKK--LQG 72 (382) T ss_pred CccccccccCCcccccccccchhhhcccc--c-cCCcccchH---hhhccHHHHHHHHHHHHhhccCceeeecch--hhh Confidence 122222111100000 0000000000000 0 000011110 001111223344444443333344433221 111 Q ss_pred HHH-HHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEec Q lcl|NC_021297. 
84 LWN-WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD 161 (480) Q Consensus 84 l~~-i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~ 161 (480) |.. -...-....+...+..+.+.+|.||+++-.+ .+|++ .+..++|..+.++.+.... .+. |....+ T Consensus 73 L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd------~~G~~~~l~~i~~~~v~v~~~~~~~-~~~----y~~~~~ 141 (382) T protein:vir:48 73 IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------ENGRDMKWEYLRPSQVSFNRLDNKD-GIY----YNITFD 141 (382) T ss_pred hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCC-eEE----EEEEec Confidence 211 1111133566778888999999999988653 34554 6788899998887754322 111 111111 Q ss_pred CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHH Q lcl|NC_021297. 162 DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMN 241 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~ 241 (480) +.. ......+.+++ |+||.+....+..+|.|.+.. +...++.......- T Consensus 142 ~~~-~~~~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~~~~ 190 (382) T protein:vir:48 142 DPR-IPPKQHVPQND-----------------------------VLHFRLLSVDGGMTSVSPLMA-LSRELDIQKASGNL 190 (382) T ss_pred Ccc-ccceeEEcCcc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHHHHHHH Confidence 100 00001111222 555654444455678876542 34444333222222 Q ss_pred HHHHHHHhcchhhhhcCCCcccccccccc--chh---hhhhhhhcccCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhc Q lcl|NC_021297. 242 LQSASQILGTPLRVISGVTTDELTNDGEN--TTL---DIYYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 242 ~~~~~~~~~~~~~~i~g~~~~~~~~~~~~--~~~---~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~ 315 (480) ..+.+...+.|..++.- ......+.... ..+ ....+.++.++ ++.++.++... .-..+++..+....+|+.. 
T Consensus 191 ~~~~~~ng~~p~~il~~-~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~a 268 (382) T protein:vir:48 191 TINSLKNALNANGILKI-KGGGLLDFKTKLSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVSQLLKQADWTTGQFAKV 268 (382) T ss_pred HHHHHhccCCCceEEEe-CCCCChHHHHHHHHHHHhhccCCCCeeEcC-CCceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 33333333445444431 11111111100 001 11123444454 34677766533 2235778888899999999 Q ss_pred cCCChHHhccccccchHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021297. 316 TGLPPQYLSSSSENPASAEAIIAT-DSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 316 ~~~p~~~~g~~~~n~~Sg~Al~~~-~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~ 394 (480) .++|+..+|....+..+..+.+.. ...|.-.+...+..+...| ...... ++...+ + .+... T Consensus 269 fgVp~~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l-----------~~~~~~---~~~~~~-~---~~~~~ 330 (382) T protein:vir:48 269 YGIPDNVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQKL-----------SCDVDA---DIFPAV-D---PTGSN 330 (382) T ss_pred hCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----------cChhhh---hhhhhh-c---cchhH Confidence 999999998654433233332221 1112222222211111111 110000 111111 1 11223 Q ss_pred HHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccc Q lcl|NC_021297. 395 KADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQ 468 (480) Q Consensus 395 ~ad~~~kl~~~~~~~~s~et~~~~l---g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) ....+.++..++ +.+...+++.+ |+.++++.+.+. . .+.-.++|.. 
+++ T Consensus 331 ~~~~~~~l~~~g--~~t~~e~r~~l~~~g~~~~~~~~~~~------------~----------~~~~~GGd~~-~~~ 382 (382) T protein:vir:48 331 YISRINSLVKTG--TLAQNQGLYILQQAEILPKELPNGEN------------P----------NSTLKGGEED-GQD 382 (382) T ss_pred HHHHHHHHhhcC--ccCHHHHHHHHhhCCCCCcchhhhhc------------C----------CCCCCCCCCC-CCC Confidence 344455665554 45555555543 555543322110 0 0111111111 111 No 194 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=400 Identities=14% Similarity=0.088 Sum_probs=158.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcC-cccchhhhhhccccchHHHHHHHHHhhhccCcccc---CC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIG-IGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~-~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~ 76 (480) |++ .-.|-.....++ ......-......+.....+|+..++-+-.-+|.+ .. T Consensus 1 ~~~------------------------~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~ 56 (723) T protein:vir:94 1 MTT------------------------FPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDG 56 (723) T ss_pred Ccc------------------------cccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCC Confidence 110 000000000000 00000000000111222333444443332223322 11 Q ss_pred CchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCee Q lcl|NC_021297. 77 DSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 77 d~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~ 150 (480) .......+..++.. | ....+...+..+.+.+|.+|+++..+.. +..|.| .+..++|..+.++......... 
T Consensus 57 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r---~~~g~p~~l~~l~~~~~~v~~~~~~~~~~ 133 (723) T protein:vir:94 57 ELDELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGR---TPAGVPDEIWYVYDRVTTIVATRAADAVP 133 (723) T ss_pred ccchhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCC---ccccceeEEEEecCcceEEeecCCCccce Confidence 12222345555543 3 2345666777788999999999865421 123444 3555666555444332211100 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHH Q lcl|NC_021297. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) ......|.... ..+..+.|. .==|+||+......+.+|.|.|. . T Consensus 134 ~~~~~~y~~~~-----------~~G~~~~~~---------------------~~dIiHir~~~~~dg~~G~Spi~----~ 177 (723) T protein:vir:94 134 QAQIIGYVIER-----------TDGVRVPVL---------------------ADEMLWLRFSDPYDPLAVMAPWK----A 177 (723) T ss_pred eeeeeEEEEEe-----------cCceeEEec---------------------ccceEEecCCCCCCCcccccHHH----H Confidence 00000010000 011111110 00145565443345667888663 3 Q ss_pred HHHHHHHHHHHHHHHHHHhcc---hhhhhcCCCccccccccccchhhh------hhhhhcccCC---------CCceEEE Q lcl|NC_021297. 231 VTDAASRTLMNLQSASQILGT---PLRVISGVTTDELTNDGENTTLDI------YYGRILTLAS---------EAAKISE 292 (480) Q Consensus 231 liD~~~~~~s~~~~~~~~~~~---~~~~i~g~~~~~~~~~~~~~~~~~------~~~~~~~~~g---------~~~~~~~ 292 (480) +.+.+...++-......+|.+ |-.+|.--.......+.-...|+. ..++.+.++| ++.++.. T Consensus 178 a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~ 257 (723) T protein:vir:94 178 ARAAVDADFYAATWQRQSFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTS 257 (723) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEE Confidence 344444433333333344443 433443111110000000111211 1123344432 3456666 Q ss_pred eccccH-HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021297. 
293 FKAAEL-RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) Q Consensus 293 ~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~ 370 (480) +..+.. ..|++..+..+.+|+...++|+..+++.+.+..+..+.+. ....|.-.+ ..|++.+.. .+.. T Consensus 258 l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~~~f~~~tL~P~~--------~~ie~~ln~--~Ll~ 327 (723) T protein:vir:94 258 LSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAKAAVWTETLIPQM--------EVMASITDL--QLLP 327 (723) T ss_pred ccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHHHHHHHHHHHHHH--------HHHHHHHhH--hhcc Confidence 654322 2478888888999999999999988765432212222221 111121111 112221111 1111 Q ss_pred CCCcccceeeeEEecC--CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH--HHHH--HHH----------HH Q lcl|NC_021297. 371 REVTEEYTRLETVWRD--PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ--REQM--RDW----------DK 434 (480) Q Consensus 371 ~~~~~~~~~i~v~f~~--~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~--~~~~--~~~----------~~ 434 (480) .....+++.|.. .+..|..+.++++.+++.+| +++..-+++.+|+.+-+ ...+ .-. .- T Consensus 328 ----~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G--~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p 401 (723) T protein:vir:94 328 ----DIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVND--VLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAP 401 (723) T ss_pred ----cccCceEEeecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCc Confidence 111246667753 34577888899898888775 66776778877764321 0000 000 00 Q ss_pred HHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 435 QETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ..++.. 
.++.+...-.++..|.++.+...+...+.+.|.+...|+ T Consensus 402 ~~~e~~-~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 402 AVEEGA-ARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTL 446 (723) T ss_pred cchhhh-HhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhH Confidence 000000 111111111111122333332223333344455555555 No 195 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.93 E-value=1e-05 Score=47.89 Aligned_cols=382 Identities=13% Similarity=0.040 Sum_probs=147.4 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CCC--c Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNL-LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED--S 78 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~-~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d--~ 78 (480) .-+...+.........+. .....++.+...... ..... ..-........+|+..++-+-..+|.+ .++ + T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 75 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDVSF--LVPGY---VRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDI 75 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCcccCc--cccCH---HHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Confidence 111111110000000000 000111111110000 00000 000122344556666665554444432 211 1 Q ss_pred hhHHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 79 EGLEELWNWWQ-A-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~i~~-~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+...+. + | ....+...+..+.+.+|.+|.++.... +..|.+ .+..++|..+.++.+... 
T Consensus 76 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~----~~~g~~~~l~~i~~~~v~~~~~~~~------ 145 (406) T protein:vir:95 76 RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKY----TADGLIDELVPLTPSKVNFLDTPDG------ 145 (406) T ss_pred eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEE----CCCCcEEEEEEEcCceeEEEEcCCe------ Confidence 11122223222 1 2 345667788888888877655543221 234454 567788888877665421 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeeccc-ccCccCCcccchhhHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP-RLGNRYGRSEISPELRKV 231 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~-~~~~~~G~s~i~~~v~~l 231 (480) .++. .++ ..|.+++ |+||+.+. ...+.+|.|.+.- +... T Consensus 146 -~~~~---~~~------~~~~~~e-----------------------------vih~~~~~~~~~~~~G~s~i~~-~~~~ 185 (406) T protein:vir:95 146 -YQVL---YGG------QTFNYDE-----------------------------VLHFIYNPDPERPYIGRGYRVV-LKDI 185 (406) T ss_pred -EEEE---ecc------EEEchhH-----------------------------EEEeeccCCCCCCccccCHHHH-HHHH Confidence 0000 000 0111222 45555332 2233567776532 3333 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCC-ccccccccccchhhh------hhhhhcccCCCCceEEEe---ccccHHHH Q lcl|NC_021297. 232 TDAASRTLMNLQSASQILGTPLRVISGVT-TDELTNDGENTTLDI------YYGRILTLASEAAKISEF---KAAELRNF 301 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~---~~~~~~~~ 301 (480) ++....+..-....+...+.|..++.-.. ......+.....|.- ..+.+..++++...+.++ +..+ ..+ T Consensus 186 i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d-~q~ 264 (406) T protein:vir:95 186 ADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKD-IAI 264 (406) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhH-HHH Confidence 33222222222222222233333332111 111111111111111 112233333333333333 2222 246 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhCCCCcccceee Q lcl|NC_021297. 
302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-QIMGREVTEEYTRL 380 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~i 380 (480) ++..+....+|+...++|++-+|....+ +.. +.. .+...|.-+++.+. ++...-.......+ T Consensus 265 ~e~~~~~~~~Ia~~fgVp~~~lg~~~~~--~~~-----~~~----------~~~~~l~P~~~~ie~~l~~~l~~~~~~~~ 327 (406) T protein:vir:95 265 NEAVELDKRTVAGMFGVPAFLLGIGEFN--RDE-----YNN----------FINSTILPIAKGIEQELTRKLLISPDLYF 327 (406) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCCCch--HHH-----HHH----------HHHHHHHHHHHHHHHHHHHhcCCCCCcEE Confidence 7888888999999999999998743211 111 111 11222222222221 11100011112346 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++.+......|..+.++.+.+++.+| +++...+++++|+.+-+- ..+..--..-..++.... .....+.+.++ T Consensus 328 ~fd~~~l~~~d~~~~~~~~~~l~~~G--~~t~NE~R~~~gl~p~~~--gd~~~~~~n~~~~~~~~~---~~~~k~g~~~~ 400 (406) T protein:vir:95 328 KFNPRSLYAYDLKELAEVGSNMYVRG--IMEGNEVRDWLGLSPKEG--LSELVILENYIPLDKIGD---QSKLKGGDNSG 400 (406) T ss_pred EeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cceeeeccCccchhhccc---ccccCCCCCCC Confidence 66666666778888999999988865 677777888888865321 100000000000000000 00000001111 Q ss_pred CCCCcc Q lcl|NC_021297. 
461 TDTKTE 466 (480) Q Consensus 461 ~d~~~~ 466 (480) .+++++ T Consensus 401 ~~~~~~ 406 (406) T protein:vir:95 401 ADGQTD 406 (406) T ss_pred CCCCCC Confidence 111111 No 196 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=97.92 E-value=1.1e-05 Score=47.78 Aligned_cols=417 Identities=10% Similarity=0.067 Sum_probs=194.8 Q ss_pred CCCH-----------HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhcc Q lcl|NC_021297. 1 MTTY-----------HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~-----------~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~ 69 (480) |.-+ -++-.. ++.-..-..+|+.+..+++-+..+..+-..+ ++.+--..+|.-- |-- T Consensus 46 ~~~~~~~~gg~~~~~~~~e~~-~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~---L~~ 113 (521) T protein:vir:81 46 DNLPASAWNSLTQQFYSTDQK-ISTTKQLVNTYRGLMNNHEVENAVQNIVNDA--------IVFEEGHEVVSLN---LEA 113 (521) T ss_pred CCCcceeecceeeeecccccc-hhhHHHHHHHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---ecc Confidence 1100 000000 0000111223444444444443332210000 0000000000000 000 Q ss_pred CccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCe Q lcl|NC_021297. 70 EGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 70 ~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~ 149 (480) ..++-+-.+...+.+..|++--+|+....+..+.-.+.|+.|.+.-... ...+|-..++.++|+.+..+.--..... 
T Consensus 114 ~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~---~pk~GI~Elr~lDPr~i~~vr~i~k~~~ 190 (521) T protein:vir:81 114 TGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGK---NPKDGIVELRQLDPRNLEYVREIITEDT 190 (521) T ss_pred cccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEcC---CccccceeeeeeCCcceeeeeeeccccc Confidence 0111111223445566666666888999999999999999988865331 3467888899999998876653211100 Q ss_pred eEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccc--eEEeecccccCccCC--cccch Q lcl|NC_021297. 150 VTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP--VVPLTNDPRLGNRYG--RSEIS 225 (480) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP--vv~~~n~~~~~~~~G--~s~i~ 225 (480) .- +.. .+....+-+|.++... |...+.. .. .. ++ =+|| .+.|...-...+.-| .|-|. T Consensus 191 ~~--~~v------~~~~~e~f~Y~~~~~~-~~~~g~~----~~-~~---~~-vkI~~dAI~y~hSGl~d~~~~~i~syLh 252 (521) T protein:vir:81 191 PE--GKI------YKATKEYFIYTVGNSS-YCAGGQV----FS-PN---SR-VKIPRSAITYAHSGLMDCDDKYIIGYLH 252 (521) T ss_pred Cc--cce------ecceeeeeeeecCCcc-cccccee----ec-CC---cc-eeechhheeeeeccceeCCCCeeeecch Confidence 00 000 0011222344443222 1111110 00 00 00 0111 122332211111111 13344 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch--------------------------hhhhhhh Q lcl|NC_021297. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT--------------------------LDIYYGR 279 (480) Q Consensus 226 ~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~--------------------------~~~~~~~ 279 (480) +.++++-. -+++.|.+.+-+....|-+=++-.+.-..+..+...- +-..... 
T Consensus 253 kAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlED 330 (521) T protein:vir:81 253 RAVKPANQ--LKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTED 330 (521) T ss_pred hhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhh Confidence 54544432 2577888888777777776654222222222111100 0111122 Q ss_pred hcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc---cchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 280 ILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE---NPASAEAIIATDSRIVKMAERKGRI 353 (480) Q Consensus 280 ~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~---n~~Sg~Al~~~~~~l~~k~~~~~~~ 353 (480) .|. .+|-+.++..++..+--.-++-++-....++...++|.+-++..+. +..-|..|-.-+.....-+.+++.. T Consensus 331 yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r 410 (521) T protein:vir:81 331 YWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQ 410 (521) T ss_pred hcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHH Confidence 222 1344566777776543344566677778888899999888843322 1123345777778888889999999 Q ss_pred HHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCC Q lcl|NC_021297. 354 FGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGY 421 (480) Q Consensus 354 ~~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~ 421 (480) |..-+.++++.=+-+.|.-...++ ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. |.. 
T Consensus 411 Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~ 490 (521) T protein:vir:81 411 FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKY 490 (521) T ss_pred HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcc Confidence 999999998865555554333333 346677754333333333332221 22222233 488998765 799 Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 422 TATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) ++++++++.+.+++|... .... .|+.+..+- T Consensus 491 tDeei~~~~k~I~~E~~~---~~~~--------~p~~~~~~f 521 (521) T protein:vir:81 491 TDDQMDTEKKQIEEEAND---PRFK--------QTPDEIEDF 521 (521) T ss_pred CHHHHHHHHHHHHHHhhC---CCCC--------CCcccccCC Confidence 999988887666665431 1111 111111111 No 197 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=97.89 E-value=1.2e-05 Score=47.47 Aligned_cols=385 Identities=9% Similarity=0.061 Sum_probs=141.7 Q ss_pred HHHHHHHHHHH----HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchhHHHHHHH Q lcl|NC_021297. 12 QGLLARDLPNL----LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNW 87 (480) Q Consensus 12 ~~~~~~~~~r~----~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~i 87 (480) ...|..+..+. .-+..+.-|...-......+ -.......+ ...|.+..+. 
|-+.-++...+......+..+ T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A-l~~~~V~~~---i~~Ia~~iA~-lp~~~~~~~g~~~~~~~~~~l 75 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQKYLGVSA-LKNSDILTA---TSIIAGDIAR-FPLVKKDVNGDIIHDEDINYL 75 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCcccccchh-hccHHHHHH---HHHHHHhhhh-CeeEEEecCccccccchHHHH Confidence 11111111100 01111111111100000000 000011111 1222333322 111111112222223345555 Q ss_pred HHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEec Q lcl|NC_021297. 88 WQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD 161 (480) Q Consensus 88 ~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~ 161 (480) +.. | ....+...+..+.+.+|.||+++..+. ..|.+ .+..++|.++.+..++.. .+. |..... T Consensus 76 L~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~-----~~g~~~~L~~i~p~~v~v~~~~~~--~~~----y~~~~~ 144 (406) T protein:vir:97 76 LNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDP-----KTNQALQFQFYRPSETTVEETDNH--EIV----YTFTDM 144 (406) T ss_pred hhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-----CCCeEEEEEEECCCeeEEEEcCCc--eEE----EEEEec Confidence 532 3 234667778889999999999886532 23443 677889998887665421 111 111111 Q ss_pred CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHH Q lcl|NC_021297. 162 DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMN 241 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~ 241 (480) ..+.. ..+.++++ +||++.. ..+..|.|.+. 
.+.+.+.....- T Consensus 145 ~~~~~---~~~~~~ev-----------------------------ih~r~~~-~dg~~G~spi~----~~~~~i~~~~a~ 187 (406) T protein:vir:97 145 LTAKQ---VKCFAHDV-----------------------------IHWKFFS-HDTILGRSPLL----SLGDEIDLQTGG 187 (406) T ss_pred CCceE---EEEccccE-----------------------------EEecCCC-CCCcccccHHH----HHHHHHHHHHHH Confidence 11110 11222232 3343322 12345666553 233333322221 Q ss_pred HHHHHHHhc---chhhhh-cCCCccccccccccchhhh-----hhhhhcccCCCCceEEEecccc-HHHHHHHHHHHHHH Q lcl|NC_021297. 242 LQSASQILG---TPLRVI-SGVTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKE 311 (480) Q Consensus 242 ~~~~~~~~~---~~~~~i-~g~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~ 311 (480) ......+|. .|-.++ ++....+...+.-...|+. ..++++.++ ++.++.+++... -..+++..+..+.+ T Consensus 188 ~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~~~~ 266 (406) T protein:vir:97 188 INTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREGSVGGSPLVFD-STMEYTPLEIDTNVLQLITSNNFSTAQ 266 (406) T ss_pred HHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhcccccCceeecC-CCceEEEccCCHHHHHHHHHHHhhHHH Confidence 111222222 222111 1211111111111111211 112344443 345776665332 22477777888999 Q ss_pred HHhccCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCC Q lcl|NC_021297. 312 AASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTP 390 (480) Q Consensus 312 i~~~~~~p~~~~g~~~~n~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~ 390 (480) |+...++|+..+|....+.......+ +....|.-.+. .|++.+.. .+..... .....+.|. ... 
T Consensus 267 Ia~afgVPp~~lg~~~~~~~~e~~~~~f~~~~l~P~~~--------~ie~~l~~--kll~~~~---~~~~~i~fd--~~~ 331 (406) T protein:vir:97 267 IAKALRVPSYKLGVNSPNQSVAQLMEDYVTNDLPFYFD--------AITSELGL--KTLNDKD---RRLYHIEFD--TRS 331 (406) T ss_pred HHHHhCCCHHHcCCCCCcchHHHHHHHHHHHHHHHHHH--------HHHHHHhh--hhcChhh---ccceeEEEe--cCc Confidence 99999999999986543321121111 11111111111 12211111 1111111 112333442 122 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccC Q lcl|NC_021297. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTS 470 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) +....++++.+++++| +++...+++.+|+.+-+-....+.. ........+ .. ++......+...+. T Consensus 332 ~~~~~~~~~~~~~~~g--~~T~NE~R~~~g~~p~~~~~gD~~~-------~~~n~~~~~----~~-~~~~~~~~~~~~gg 397 (406) T protein:vir:97 332 VTGRNVDEIVKLVNNQ--ILTPNQGLVELGKQKSTDPNMDRYQ-------SSLNYVFLD----KK-EEYQDKVGIKGKGG 397 (406) T ss_pred cchhhHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeEe-------eccCccchh----cc-cccccccccccCCC Confidence 3455667777887765 5677777777776542110000000 000000000 00 00000111111222 Q ss_pred CCCCCCCCC Q lcl|NC_021297. 471 PSGFNRTKT 479 (480) Q Consensus 471 p~~~~~~~~ 479 (480) +++..+.+| T Consensus 398 ~~~~~~~~~ 406 (406) T protein:vir:97 398 EVNAEEDKS 406 (406) T ss_pred CCCCCCCCC Confidence 222222333 No 198 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=97.89 E-value=1.3e-05 Score=47.43 Aligned_cols=407 Identities=11% Similarity=0.115 Sum_probs=193.0 Q ss_pred CCCHHH---HHHHHHHH------HHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhh-ccC Q lcl|NC_021297. 
1 MTTYHE---HVERLQGL------LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-DIE 70 (480) Q Consensus 1 ~~~~~~---~i~~l~~~------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l-~~~ 70 (480) |.+.-- +...+... -..-..+|+.+..+++-+..+. .||+..+-+= ..+ T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~pEvd~Av~---------------------eIvneaiv~d~~~~ 105 (521) T protein:vir:10 47 IDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYHEVDNAID---------------------EIINDAIVQEDNRD 105 (521) T ss_pred CCccccccchhhhhhccccccchHHHHHHHHHHHhhccchhhHHH---------------------hhhcceEEecCCCc Confidence 222100 00000000 0000112222223232222221 1122111000 001 Q ss_pred cccc---------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEE Q lcl|NC_021297. 71 GFRI---------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAE 141 (480) Q Consensus 71 ~~~~---------~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v 141 (480) ++.+ +-.+...+.+..|++--+|+....+..+.-.+.|+.|.+.-.+. ....+|-..++.++|+.+-.+ T Consensus 106 pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~--~~pk~GI~Elr~lDPr~i~~v 183 (521) T protein:vir:10 106 TVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDP--ARPKDGIKELRLLDPRNVEYY 183 (521) T ss_pred eEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeC--CCccccceeeeeeCCcceeee Confidence 1111 11122344555666666888899999999999999988853321 123567788888999877544 Q ss_pred EeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeeccccc--Cc Q lcl|NC_021297. 142 LDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRL--GN 217 (480) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~--~~ 217 (480) .--... ....+..+ . ....+-+|.+....+|...++. ...=+||. |.|.+.... .. 
T Consensus 184 r~i~k~--~~~~~~v~-----~-~~~e~f~Y~~~~~~~~~~~g~~------------~~~vkI~~daI~y~hSGL~d~~~ 243 (521) T protein:vir:10 184 RVNLKS--NENGNDVY-----K-GVKEFFTYGATEDNRYNISGNS------------NNLVQIPIDAIVYSHSGKVDIDG 243 (521) T ss_pred eeecCC--CCCcchhh-----c-cceeeeeeccCCCceecCCCCC------------CcceeechhheeeecccceeCCC Confidence 321111 00001000 0 0112234443322223222211 11011222 334432222 22 Q ss_pred cCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc-------------------------- Q lcl|NC_021297. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT-------------------------- 271 (480) Q Consensus 218 ~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-------------------------- 271 (480) ....|-|.+.++++-. -+++.|.+.+.+....|-+=++-.+.-..+..+... T Consensus 244 ~~i~syLhkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddr 321 (521) T protein:vir:10 244 KTIVGYLHNVIKPANQ--LKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSS 321 (521) T ss_pred CceeccchhhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccch Confidence 3345556665555432 257788888877777777655422222222211110 Q ss_pred hhhhhhhhhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc--cchHHHHHHHHHHHHHHH Q lcl|NC_021297. 272 TLDIYYGRILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE--NPASAEAIIATDSRIVKM 346 (480) Q Consensus 272 ~~~~~~~~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~--n~~Sg~Al~~~~~~l~~k 346 (480) .+-......|. 
.+|-+.++..++...--.-++-++-....++...++|.+-+...++ +..-+..|-.-+.....- T Consensus 322 k~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KF 401 (521) T protein:vir:10 322 NNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKY 401 (521) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHH Confidence 01111222232 1344566777776653344566677777888888998877754421 211122466677788888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhcccc------CCCHH Q lcl|NC_021297. 347 AERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQG------PIPKE 413 (480) Q Consensus 347 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~------~~s~e 413 (480) +.+++..|..-+.++++.=+-+.|.--..++ ..|.+.|..-..-.....++.+.. +.+...+ .+|.+ T Consensus 402 I~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~d 481 (521) T protein:vir:10 402 IRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHE 481 (521) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchH Confidence 9999999999999998876555554333333 346777754333333333332221 2222223 57899 Q ss_pred HHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 414 QARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 414 t~~~~-lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) ++++. |..++++++++.+.++++... .... 
.|+.+..|- T Consensus 482 yi~k~ILr~tDeeik~~~k~I~~E~~~---~~~~--------~p~~e~~df 521 (521) T protein:vir:10 482 YVMKNILRMSDEDIKTEREKIDGELKD---SVYK--------NPEDPMEEF 521 (521) T ss_pred HHHHHHhcCCHhHHHHHHHHHHHhhhC---CCCC--------CCcchhhcC Confidence 98765 799999988877666655321 1111 111111111 No 199 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=97.85 E-value=1.5e-05 Score=47.04 Aligned_cols=397 Identities=14% Similarity=0.091 Sum_probs=157.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHH-HHHHhcccCc------c--------hhcCcccchhhhhhccccchHHHHHHHHHh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLE-AEAYRNGTRR------L--------KTIGIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~-~~~YY~g~~~------i--------~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~ 65 (480) |-...+ ..+.+... +..|. |..- + ...+..+... .-+.+.-...+|+..++ T Consensus 1 ~~~~~~----------~~~~~~~~~~~~~~-g~~~s~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~ci~~Ia~ 66 (437) T protein:vir:10 1 MKQGKQ----------RALGRIKSSFLKWL-GVPISLTDGSFWSAWGGMGSSSGETVTAD---SALQLSAVWSCVRLIAE 66 (437) T ss_pred CCcchh----------hhhhhhHHhhhhhc-CCcccCCchhHHHhhcccccCCCceechH---hhhccHHHHHHHHHHHH Confidence 332211 11122211 11221 2100 0 0011111111 00111112223443333 Q ss_pred hhccCcc---ccCCCc----hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEE Q lcl|NC_021297. 66 RLDIEGF---RISEDS----EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRV 132 (480) Q Consensus 66 ~l~~~~~---~~~~d~----~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~ 132 (480) -+-.-++ ....+. .....+..++.. | ....+...+..+.+.+|.||+++-.+ .|++ .+.. 
T Consensus 67 ~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-------~g~~~~L~~ 139 (437) T protein:vir:10 67 TIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-------AGVLIGLEL 139 (437) T ss_pred HHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEE Confidence 3222222 221111 112334454432 2 34566778888999999999988642 2444 4778 Q ss_pred EccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecc Q lcl|NC_021297. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~ 212 (480) ++|..+.+..+.. ..+. |+... ..+.. ..+.++. |+||++. T Consensus 140 l~p~~v~i~~~~~--g~~~----y~~~~-~~g~~---~~~~~~d-----------------------------Iih~r~~ 180 (437) T protein:vir:10 140 MLPQRTTVKRLTS--GALQ----YTYRN-VDGTV---STLAEDD-----------------------------VFHVRGF 180 (437) T ss_pred EcCcceEEEECCC--CeEE----EEEEe-cCceE---EEEcccc-----------------------------EEEecCc Confidence 8998888776532 1111 11111 11111 1122222 3444433 Q ss_pred cccCccCCcccchhhHHHHHHHHHHHHHH---HHHHHHHhcchhhhhcCCC-ccccccccccchhhhh------hhhhcc Q lcl|NC_021297. 213 PRLGNRYGRSEISPELRKVTDAASRTLMN---LQSASQILGTPLRVISGVT-TDELTNDGENTTLDIY------YGRILT 282 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~liD~~~~~~s~---~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~~------~~~~~~ 282 (480) . ..+.+|.|.|. .+.+.+...+.- ....+...+.|..++.-.. ......+.-...|.-. .++++. T Consensus 181 ~-~d~~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~v 255 (437) T protein:vir:10 181 S-LDGLMGLTPIQ----YAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMV 255 (437) T ss_pred C-CCCcccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCccee Confidence 2 34467887653 333333332222 2222232333544443211 1111111111112110 123444 Q ss_pred cCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
283 LASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 283 ~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) ++ ++.++.++.... -..|++..+....+|+...++|+..+|....+...+..+......+...+- .-+-..++.. T Consensus 256 l~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl---~P~~~~ie~~ 331 (437) T protein:vir:10 256 LE-AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTL---RPWLTRIEQA 331 (437) T ss_pred cc-CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHH---HHHHHHHHHH Confidence 44 446777776432 224788888889999999999999998543332222223322222222110 1111112111 Q ss_pred HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH-HHHHHHHHHHHHHHH Q lcl|NC_021297. 362 MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ-REQMRDWDKQETEDM 440 (480) Q Consensus 362 ~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~-~~~~~~~~~~~~~~~ 440 (480) +.. .+........ ..+++.+......|..++++++.++..+| +++...+++.+|+.+-+ -....-... .-.. T Consensus 332 l~~--kll~~~e~~~-~~~~fd~~~ll~~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~gg~~~~~~~~--~~~~ 404 (437) T protein:vir:10 332 ARR--SLLRPGERDQ-FYAEFSVEGLLRADSAGRAAFYSTMTQNG--LMTRDECRAKENLPPMGGNAAVLTVQS--ALLP 404 (437) T ss_pred HHh--hccCccccCc-eEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcceEeecC--cccc Confidence 111 1111111111 23555556666778889999999888765 66777778888775421 111000000 0000 Q ss_pred HHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 441 IDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 441 ~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ++.. ..... 
+....++..+.....++.+++.+ | T Consensus 405 ~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~e---~ 437 (437) T protein:vir:10 405 IDKL---GEHTT-ATAAQDALKAWLYQEEKTRATQE---R 437 (437) T ss_pred hhhc---cCcCC-CcchhccccccCCCCCCCCcccc---C Confidence 1110 00000 00000000000000111111111 1 No 200 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=97.82 E-value=1.7e-05 Score=46.73 Aligned_cols=421 Identities=11% Similarity=-0.023 Sum_probs=161.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCccc--chhhhhhccccchHHHHHHHHHhhhc----cC--ccc-c Q lcl|NC_021297. 4 YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA--PPELAYLDVQPGWVATYLRTLSDRLD----IE--GFR-I 74 (480) Q Consensus 4 ~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~--~~~~~~~~~~~n~~~~iVd~~~~~l~----~~--~~~-~ 74 (480) .+..++.+..++. |.+-...++++++ +.+++..... .......+..-+-+...++++++.|. +- +|+ . T Consensus 1 mk~~~~~~~~~lk-R~~~e~~w~e~a~--~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:63 1 MKTTAAMLWEKLR-DGSVEQRAIEFAK--TTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHh-ccchHHHHHHHHH--hhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCccccc Confidence 3444444444442 2233344445552 2232221110 00111112334445666666666543 21 222 1 Q ss_pred --CC--------Cch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEE Q lcl|NC_021297. 75 --SE--------DSE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) Q Consensus 75 --~~--------d~~----~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~ 133 (480) ++ +.. .. +.+...+..++|.....++.++...+|.+-+++.. ++. 
+++.+ T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~--------~~~-~~~~~ 148 (510) T protein:vir:63 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDS--------DAA-TVVAW 148 (510) T ss_pred CCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcC--------CCc-EEEEE Confidence 11 111 11 23455677789999999999999999998666532 232 46666 Q ss_pred ccceeEEEEeCCCCCeeEEEEEEEEEe-----c-----------CCceEEEEEEEe-----CCc----E-EEEEecCCcc Q lcl|NC_021297. 134 SPLYMYAELDPRNTRRVTRAVRLYTTR-----D-----------DVAVPDRATLYL-----PDE----T-VPLRRNGGLN 187 (480) Q Consensus 134 ~p~~~~~v~d~~~~~~~~~~~~~~~~~-----~-----------~~~~~~~~~~~~-----~~~----~-~~~~~~~~~~ 187 (480) +-.+.++.-|. .+ ++...++.+.-. + +.+....+++|+ ++. + ++++.++.. T Consensus 149 pl~~y~v~~d~-~G-~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~- 225 (510) T protein:vir:63 149 SLRSYAVRRDA-TG-RWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVR- 225 (510) T ss_pred EcceeEEeeCC-Cc-CeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCce- Confidence 66664444443 23 333333322111 0 001111223332 111 0 111111110 Q ss_pred cccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccc Q lcl|NC_021297. 188 DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELT 265 (480) Q Consensus 188 ~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~ 265 (480) .......++..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........|...+. |.... 
T Consensus 226 -----~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~--- 296 (510) T protein:vir:63 226 -----VGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV--- 296 (510) T ss_pred -----eccccccccccCceeeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccch--- Confidence 0111123466789999998888899999997643 456666777666666665555555543321 11000 Q ss_pred cccccchhhhhhhhhcccCCCCceEEEecc-ccHH---HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHH Q lcl|NC_021297. 266 NDGENTTLDIYYGRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDS 341 (480) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~ 341 (480) ........+.+..-..++....++.. .+++ ..++.++.-|+..+... + ...++... ++.-++.... T Consensus 297 ----~~~~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-l----~~~~~~rv-TAtEV~~r~~ 366 (510) T protein:vir:63 297 ----DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A----NQRDAERV-TAEEVRITAE 366 (510) T ss_pred ----hhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHhh-c----ccCCCCCc-CHHHHHHHHH Confidence 00111111222111112233344332 3343 34444444444443321 1 11122222 3333332211 Q ss_pred HHHHH----HHHHHHH-HHHHHHHHHHHHHHHhCC-CCcccc-eeeeEEecCCCCCCHHHHHHHHH---HHHhccc---- Q lcl|NC_021297. 342 RIVKM----AERKGRI-FGGAWERAMRIAMQIMGR-EVTEEY-TRLETVWRDPSTPTVAAKADAVS---KLYANGQ---- 407 (480) Q Consensus 342 ~l~~k----~~~~~~~-~~~~l~~~~~~~~~~~~~-~~~~~~-~~i~v~f~~~~~~~~~~~ad~~~---kl~~~~~---- 407 (480) .+... ..+.... +.+-+++.+.++.+ .|. ..+.+. ....+++-.++.+ .+.++.+. +.++... 
T Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~q~l~~~~~~aq 443 (510) T protein:vir:63 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPALSR--SAAVQSMLNASQVIAGLAPIAQ 443 (510) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCchhcccceecchhHHHH--HHHHHHHHHHHHHHHHhcCchh Confidence 11111 1222222 22223333433322 121 112221 1122333222222 12222222 2121111 Q ss_pred --cCCCHH----HHHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 408 --GPIPKE----QARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 408 --~~~s~e----t~~~~lg~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) ..+... .+...+|.+ +++++.+.+.++++......+.....++.++.+..+.+- T Consensus 444 ~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 444 LDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred hhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 112222 233445653 334444433222222211111111122222222222222 No 201 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.81 E-value=1.7e-05 Score=46.68 Aligned_cols=395 Identities=11% Similarity=0.032 Sum_probs=157.0 Q ss_pred CCCH--HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh-cCcccchhhhhhccccchHHHHHHHHHhhhccCcccc-CC Q lcl|NC_021297. 1 MTTY--HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SE 76 (480) Q Consensus 1 ~~~~--~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~ 76 (480) |.-. +-+++++-+....+.. ..-..+-..... .+.....-..+.-........+|+..+.-+-.-++.+ .. 
T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~ 75 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWI-----DQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYED 75 (412) T ss_pred CccchhhhhhhhhhhhHhhhhh-----cccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeec Confidence 6555 2233332222111100 000000010000 0000000000111111222333443333332233332 11 Q ss_pred CchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCee Q lcl|NC_021297. 77 DSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 77 d~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~ 150 (480) .+.....+..++.. | .-..+...++.+.+.+|.||+++.++ ..|++ .+..++|..+.+..+.... .+ T Consensus 76 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~~-~~ 148 (412) T protein:vir:26 76 YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQSR-EL 148 (412) T ss_pred cccccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCCc-EE Confidence 22233345555542 3 23455677888999999999988753 34554 5778899999888875422 11 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHH Q lcl|NC_021297. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) . |.. ....+.. ..+.++++ +||++.....+.+|.|.+.- +.. T Consensus 149 ~----y~~-~~~~g~~---~~~~~~ev-----------------------------ih~~~~~~~~~~~G~s~i~~-~~~ 190 (412) T protein:vir:26 149 Y----YSI-HAATGNK---LIVHNMDM-----------------------------LHFKHIVASNMVQGISPIDV-LKN 190 (412) T ss_pred E----EEE-EcCCceE---EEEccccE-----------------------------EEeCCCCCCCCcccccHHHH-HHH Confidence 1 111 1111111 11223333 44443333345667776532 333 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhh---hhhhhhcccCCCCceEEEecccc-HHHHHHH Q lcl|NC_021297. 
231 VTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAE-LRNFAEE 304 (480) Q Consensus 231 liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~ 304 (480) .++..+. .... .+.....+-..+. +........+.-...|+ ...+.++.++ ++.++.+++... -..|++. T Consensus 191 ~i~~~~a-~~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~e~ 266 (412) T protein:vir:26 191 TTDFDNA-VRTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVAS 266 (412) T ss_pred HHHHHHH-HHHH--HHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHH Confidence 3332111 1111 1222222221221 11111110000011111 1122344443 446777765432 2247787 Q ss_pred HHHHHHHHHhccCCChHHhccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENP-ASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~ 383 (480) .+....+|+...++|+.-+|....+. ++... ....+...+ -.-+-..|++.+.. .+...........+++. T Consensus 267 ~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~---~~~~f~~~~---l~P~~~~ie~~ln~--kLl~~~~~~~~~~~~fd 338 (412) T protein:vir:26 267 ENLTRERVANVFQLPSVFLNARSNTNFAKNEE---LNRFYLQHT---LLPIVKQYEEEFNR--KLLTKTDREKNRYFKFN 338 (412) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH---HHHHHHHHH---HHHHHHHHHHHHHh--hcCCcccccCcceEEee Confidence 78888999999999999998643221 12211 111111111 01111111111111 11111111112335555 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) +..-...+..+.++++.+++.+| +++...+++.+|+.+-+- ..+.. +......... 
.........+++ T Consensus 339 ~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g--gD~~~-------~~~n~~~~~~-~~~~~~~~~gG~ 406 (412) T protein:vir:26 339 VKSYLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVEG--GDKPL-------ISGDLYPIDT-PLELRKSLKGGD 406 (412) T ss_pred chhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCeee-------eccccccccc-chhhcccccCCC Confidence 55566678889999999998875 667777788888755320 00000 0000000000 000000000000 Q ss_pred Cccccc Q lcl|NC_021297. 464 KTETQT 469 (480) Q Consensus 464 ~~~~~~ 469 (480) +.++++ T Consensus 407 ~n~~e~ 412 (412) T protein:vir:26 407 KNVNES 412 (412) T ss_pred CCcCCC Confidence 111111 No 202 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=97.78 E-value=2e-05 Score=46.38 Aligned_cols=400 Identities=12% Similarity=0.052 Sum_probs=193.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) |.+..++|. +|..+..+++-+..+..+-..+ ++.+--..+|.- + |--..++-+-.+.. T Consensus 64 ~~~~~~LI~-----------~YR~ma~~pEvd~Av~eIvnea--------iv~d~~~~pV~l--~-l~~~e~s~sik~kI 121 (516) T protein:vir:10 64 ISGTKDLIN-----------TYRQLTNNPEVERAVANIVNEA--------VVYEKGHKVVSL--D-LDDTEFSSSIKDKI 121 (516) T ss_pred cccHHHHHH-----------HHHHhhhccchhHHHHHhhcce--------eEecCCCceEEE--E-ecccccchHHHHHH Confidence 444333322 3444555555444332210000 000000000000 0 00001111112233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 
81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+.-.+.|+.|.+...+ ...+|-..++.++|+.+..+.- ..+. T Consensus 122 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid----~~k~GI~elr~lDPr~i~~vR~-------------i~~~ 184 (516) T protein:vir:10 122 LEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMP----NPKEGIVELRRLDPRHVEYYRE-------------IVTS 184 (516) T ss_pred HHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEec----CcccceeeeeeeCCcceeeEEe-------------eecc Confidence 4555666666688889999999999999999886543 3457888899999987766542 1111 Q ss_pred cCCc-----eEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCcc--CCcccchhhHHHH Q lcl|NC_021297. 161 DDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNR--YGRSEISPELRKV 231 (480) Q Consensus 161 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~--~G~s~i~~~v~~l 231 (480) +.++ ....+-+|..+.. .|...+... . |+.-=+||- +.|...-..... .=.|-|.+.++++ T Consensus 185 ~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~~----~-----~~~~ikI~~daI~y~hSGl~d~~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 185 DVGGTSVVKGYREFFVYTTGNE-GYAYNGRLF----E-----PNTRIKIPRSAIVYAHSGLQDCSDRGIVGYLHNAVKPA 254 (516) T ss_pred cCcchhhhhceeeeeeeecCcc-ceecccccc----C-----CCCceecchhheeeeecCcccCCCCceeceehhhhHhH Confidence 1111 0112223333322 122222110 0 000001221 223321111111 0023344444443 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhhhhcc--- Q lcl|NC_021297. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYGRILT--- 282 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~~~~~--- 282 (480) -. -+++.|.+.+.+....|-+=++-.+.-..+..+... .+-......|. 
T Consensus 255 NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRR 332 (516) T protein:vir:10 255 NQ--LKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR 332 (516) T ss_pred Hh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc Confidence 22 257778888877777776655422222222211110 01111222232 Q ss_pred cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 283 LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENP---ASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) .+|-+.++..++.+.--.-++-++-....++...++|.+-+....+.+ .-+..|-.-+.....-+.+++..|..-+. T Consensus 333 eGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~ 412 (516) T protein:vir:10 333 DGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFL 412 (516) T ss_pred CCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 134456677777665334456667777788889999988886443211 23345666677778888899999999888 Q ss_pred HHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHH-------HHHHHHhccccCCCHHHHHHh-CCCChhHHH Q lcl|NC_021297. 360 RAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKAD-------AVSKLYANGQGPIPKEQARID-LGYTATQRE 427 (480) Q Consensus 360 ~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad-------~~~kl~~~~~~~~s~et~~~~-lg~~~~~~~ 427 (480) ++++.=+-+.|.-...++ ..|.+.|..-..-.....++ .+.++-.-....+|.+++++. |..++++++ T Consensus 413 ~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~ 492 (516) T protein:vir:10 413 DPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIA 492 (516) T ss_pred HHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHH Confidence 888865555554333333 35667775433333333332 223322111246799998765 799999988 Q ss_pred HHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 
428 QMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++.+.++++... .... ++.++.+. T Consensus 493 ~~~k~I~~E~~~---~~~~------~p~~e~~f 516 (516) T protein:vir:10 493 QEEKQIEKEANV---KRFQ------NPENEDDF 516 (516) T ss_pred HHHHHHHHhhhC---CCCC------CCCccccC Confidence 887666655421 1110 11111111 No 203 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=97.77 E-value=2.1e-05 Score=46.28 Aligned_cols=372 Identities=9% Similarity=-0.045 Sum_probs=141.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcch--hcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchhHH Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLK--TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLE 82 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~--~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~ 82 (480) .-+...+...-.....+.........+. -.. .-+..+... .-..+.-...+|+..+.-+-..++.+-. ..... T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~---~~~~~~~v~~~i~~ia~~ia~~p~~~~~-~~~~~ 75 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPD-FLSTLNGSEWVSAE---SALRNSDLFSIINQLSNDLATVKLTASR-KQLQG 75 (386) T ss_pred Ccccccccccccccccccccccccccch-hcccccCCceechh---hhhcchHHHHHHHHHHHhhccCceeecc-chhHH Confidence 0000000000000000000000000000 000 000000000 0001111123333333333333444332 11111 Q ss_pred HHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEec Q lcl|NC_021297. 83 ELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD 161 (480) Q Consensus 83 ~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~ 161 (480) .+.+-...-....+...+..+.+.+|.||+.+-.+ ..|.+ .+..++|..+.+..+.... 
.+ .|....+ T Consensus 76 l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~v~~~~~~~-~~----~y~~~~~ 144 (386) T protein:vir:48 76 IIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------ENGRDMKWEYLRPSQVSFNRLDNKD-GI----YYNITFD 144 (386) T ss_pred HhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC------CCCcEEEEEEecCceeEEEEcCCCc-eE----EEEEEec Confidence 11111111234466777888999999999987653 34444 5778888888877654321 11 1111111 Q ss_pred CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHH Q lcl|NC_021297. 162 DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMN 241 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~ 241 (480) +.. ......+.+++ |+||++....+..+|.|.+.. +...+.....+..- T Consensus 145 ~~~-~~~~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~i~~-~~~~i~~~~~~~~~ 193 (386) T protein:vir:48 145 DPR-IPPKQHVPQGD-----------------------------VLHFKLLSVDGGLTSVSPLMA-LSRELNIQKASDKL 193 (386) T ss_pred Ccc-ccceeEecCcc-----------------------------EEEecCCCCCCceeeccHHHH-HHHHHHHHHHHHHH Confidence 110 00011111222 455555444445678886642 22223222222222 Q ss_pred HHHHHHHhcchhhhhcCCCccccccccccc---hhh---hhhhhhcccCCCCceEEEeccccHH-HHHHHHHHHHHHHHh Q lcl|NC_021297. 242 LQSASQILGTPLRVISGVTTDELTNDGENT---TLD---IYYGRILTLASEAAKISEFKAAELR-NFAEEMEVFRKEAAS 314 (480) Q Consensus 242 ~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~---~~~---~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~l~~~~~~i~~ 314 (480) ....+...+.|-.++.-... ..++.... .+. ...++++.++ ++.++.++..+..+ .|++..+....+|+. 
T Consensus 194 ~~~~~~ng~~~~~ii~~~~~--~~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 270 (386) T protein:vir:48 194 TLNSLKNALNANGILKIKGG--GLLDFKTKLSRSRQAMKQMQGGPLVLD-DLEEFTPLEIKSNVSQLLKQADWTTGQFAK 270 (386) T ss_pred HHHHHhccCCcceEEEeCCC--CCHHHHHHHHHHHHHhhcCCCCceecC-CCceEEEcCCChhHHHHHHHHHHHHHHHHH Confidence 22222223334444431111 11111000 111 1122334443 45677777643222 478888888999999 Q ss_pred ccCCChHHhccccccchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHH Q lcl|NC_021297. 315 ITGLPPQYLSSSSENPASA-EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 315 ~~~~p~~~~g~~~~n~~Sg-~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~ 393 (480) ..++|+..+|..+.+..+. ..+.+....|.-.+...+..+...| .. .+++.+......+.. T Consensus 271 ~fgVPp~~lg~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l-----------~~-------~~~~~~~~~~~~d~~ 332 (386) T protein:vir:48 271 VYGIPENVVGGQGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQKL-----------SC-------DVDADILPAVDPTGS 332 (386) T ss_pred HhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------cc-------hhhcchhhhhccChH Confidence 9999999998654432222 2222222222222222222222211 11 111111122224445 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhCC---CChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccc Q lcl|NC_021297. 394 AKADAVSKLYANGQGPIPKEQARIDLG---YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQ 468 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~et~~~~lg---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) ..+..+.++..++ +++...+++.+| +.+.++.+.+ .. ...+.. +++.. 
+++ T Consensus 333 ~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~~~~~~~------------~~--------~~~~~~-gGd~~-~~~ 386 (386) T protein:vir:48 333 NSVSRINSMVKSG--TLAQNQGLYILQQAEILPKELPEGE------------NP--------NKTTLK-GGEIN-GED 386 (386) T ss_pred HHHHHHHHHHhCC--CcCHHHHHHHhhcCCCCCccchhhc------------CC--------CCCccC-CCCCC-CCC Confidence 6666777777765 556665666554 3333221110 00 000111 11100 000 No 204 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=97.74 E-value=2.3e-05 Score=46.01 Aligned_cols=418 Identities=11% Similarity=0.063 Sum_probs=192.6 Q ss_pred CCCHHH--------HHHH--HHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHE--------HVER--LQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~--------~i~~--l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |..+.. ++.- -++.-..-..+|+.+..+++-+..+..+-..+ ++.+--..+|.-- |--. T Consensus 46 ~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~---L~~~ 114 (521) T protein:vir:65 46 DNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVNDA--------IVFEEGHEVVSLN---LEAT 114 (521) T ss_pred CCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---eccc Confidence 111100 0000 00000111223333444443333332210000 0000000000000 0000 Q ss_pred ccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCee Q lcl|NC_021297. 71 GFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 71 ~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~ 150 (480) .++-+-.+...+.+..|++--+|+....+..+.-.+.|+.|.+.-.. ....+|-..++.++|+.+..+.--...... 
T Consensus 115 ~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid---~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~ 191 (521) T protein:vir:65 115 GFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG---KNPKDGIVELRQLDPRNLEYVREIITEDTP 191 (521) T ss_pred ccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEc---CCccccceeeeeeCCcceeeeeeecccccC Confidence 11111122334556666666688899999999999999998886533 134678888999999988766532111000 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCccCC--cccchh Q lcl|NC_021297. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRYG--RSEISP 226 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~~G--~s~i~~ 226 (480) - +.. .+....+-+|.++...+ ...+.. .. |+.-=+||- +.|...-.....-| .|-|.+ T Consensus 192 ~--~~v------~~~~~e~f~Y~~~~~~~-~~~g~~----~~-----~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhk 253 (521) T protein:vir:65 192 E--GKI------YKATKEYFIYTVGNSSY-CAGGQV----FS-----PNSRVKIPRSAITYAHSGLMDCDDKYIIGYLHR 253 (521) T ss_pred C--cce------ecceeeeeeeecCCcce-ecccee----ec-----CCcceeechhheeeeeccceeCCCCeeeecchh Confidence 0 000 00112223444432221 111110 00 000001111 22322211111111 133445 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch--------------------------hhhhhhhh Q lcl|NC_021297. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT--------------------------LDIYYGRI 280 (480) Q Consensus 227 ~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~--------------------------~~~~~~~~ 280 (480) .++++-. -+++.|.+.+.+....|-+=++-.+.-..+..+...- +-...... 
T Consensus 254 AiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDy 331 (521) T protein:vir:65 254 AVKPANQ--LKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDY 331 (521) T ss_pred hhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhh Confidence 5544432 2577888888777777776654222222222111100 01111222 Q ss_pred cc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccc---hHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 281 LT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENP---ASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 281 ~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~---~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) |. .+|-+.++..++..+--.-++-++-....++...++|.+-++..+++. .-|..|-.-+.....-+.+++..| T Consensus 332 WLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rF 411 (521) T protein:vir:65 332 WLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQF 411 (521) T ss_pred cccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHH Confidence 22 134456677777654334456667777788888899987764332211 223457777788888899999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCCC Q lcl|NC_021297. 355 GGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGYT 422 (480) Q Consensus 355 ~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~~ 422 (480) ..-+.++++.=+-+.|.-...++ ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. 
|..+ T Consensus 412 s~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~t 491 (521) T protein:vir:65 412 SEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYT 491 (521) T ss_pred HHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC Confidence 99999998875555554333333 346677754333333333332221 22222233 489998765 7999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 423 ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 423 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) +++++++.+.+++|... .... .|..+..+- T Consensus 492 Deei~~~~k~I~~E~~~---~~~~--------~p~~~~~~f 521 (521) T protein:vir:65 492 DDQMDTEKKQIEEEAND---PRFK--------QTPDEIEDF 521 (521) T ss_pred HHHHHHHHHHHHHhhhC---CCCC--------CCcccccCC Confidence 99988887666655431 1110 111111111 No 205 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=97.74 E-value=2.3e-05 Score=45.98 Aligned_cols=409 Identities=11% Similarity=0.086 Sum_probs=190.1 Q ss_pred CCC---------------HHHHHHHHH----HHH---HHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHH Q lcl|NC_021297. 1 MTT---------------YHEHVERLQ----GLL---ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVAT 58 (480) Q Consensus 1 ~~~---------------~~~~i~~l~----~~~---~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ 58 (480) -+| ..-+-+.+. ... ..-..+|+.+..+++-+..+.. 
T Consensus 36 ~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~e--------------------- 94 (523) T protein:vir:68 36 LDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLMTNYEVDNAVSE--------------------- 94 (523) T ss_pred CCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHhhccchhhHHHH--------------------- Confidence 000 000001000 000 0111222333333333322211 Q ss_pred HHHHHHhhh-ccCc---------cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 59 YLRTLSDRL-DIEG---------FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 59 iVd~~~~~l-~~~~---------~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) ||+..+-+= ..++ |+-+-.+...+.+..|++--+|+....+..+.-.+.|+.|++...+.. ...+|-. T Consensus 95 IVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k--~pk~GI~ 172 (523) T protein:vir:68 95 IVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPK--RPKEGIK 172 (523) T ss_pred hhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCC--Cccccce Confidence 121111000 0011 111112233455666666668889999999999999999988765322 2356878 Q ss_pred EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce-- Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV-- 206 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv-- 206 (480) .++.++|+.+-.+.--.... ...+..+. ....+-+|.+... .|...+... ... 
+.+ +||- T Consensus 173 Elr~lDPr~i~~vr~i~~~~--~~g~~vi~------~~~e~f~Y~~~~~-~~~~~g~~~----~~~----~~i-kI~~dA 234 (523) T protein:vir:68 173 ELRRLDPRQVQYVREVITTT--EAGVKIVK------GYKEYFIYDTSHE-SYACDGRIY----EAG----TKI-KIPKAA 234 (523) T ss_pred eeeeeCCcceeEEEeecCCC--Ccchhhhh------hhhhheeeccccc-ccccccccc----CCC----cce-ecchhh Confidence 88899998764443211100 00000000 0111223444321 121111100 000 011 2221 Q ss_pred EEeecccccCccC--CcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc------------- Q lcl|NC_021297. 207 VPLTNDPRLGNRY--GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT------------- 271 (480) Q Consensus 207 v~~~n~~~~~~~~--G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~------------- 271 (480) |.|.+.....+.- =.|-|.+.++++-. -+++.|.+.+-+....|-+=++-.+.-..+..+... T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQ--LkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKl 312 (523) T protein:vir:68 235 IVYAHSGLVDCCGKNIIGYLHRAIKPANQ--LKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRI 312 (523) T ss_pred eeeeeccceeCCCCceeccchhhhHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhccee Confidence 3343322111111 11334444444322 256778888877777776655422222222211110 Q ss_pred -------------hhhhhhhhhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc--ccchHH Q lcl|NC_021297. 272 -------------TLDIYYGRILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS--ENPASA 333 (480) Q Consensus 272 -------------~~~~~~~~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~--~n~~Sg 333 (480) .+-......|. .+|-+.++..++.++--.-++-++-....++...++|.+-+.... 
-|-.-| T Consensus 313 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~ 392 (523) T protein:vir:68 313 AYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAG 392 (523) T ss_pred EEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcceecccc Confidence 01111122222 134456677777664334456667777788888888887774332 122223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhcc Q lcl|NC_021297. 334 EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANG 406 (480) Q Consensus 334 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~ 406 (480) ..|-.-+.....-+.+++..|..-+.++++.=+-+.|.--..++ ..|.+.|..-..-.....++.+.. +.+.. T Consensus 393 ~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 472 (523) T protein:vir:68 393 TSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMA 472 (523) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHh Confidence 36777778888889999999999999999876555554333333 356777754333333333333221 22222 Q ss_pred ccC----CCHHHHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 407 QGP----IPKEQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 407 ~~~----~s~et~~~~-lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) .+. +|.+++++. |..++++++++.+.++++..+ ...... ..+.++. 
T Consensus 473 dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~---~~~~~p-----~~e~~~f 523 (523) T protein:vir:68 473 EPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKE---ARFQDP-----DQEQEDF 523 (523) T ss_pred hhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhc---CCCCCC-----chhhhcC Confidence 233 489998765 799999988887666665431 111100 0111111 No 206 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=97.72 E-value=2.5e-05 Score=45.83 Aligned_cols=419 Identities=11% Similarity=0.072 Sum_probs=189.4 Q ss_pred CCCH------HHHHHH----HHHHHH---HHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhh Q lcl|NC_021297. 1 MTTY------HEHVER----LQGLLA---RDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~------~~~i~~----l~~~~~---~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) +..- .-+.+. +..... .-..+|+.+..+++-+..+..+-..+ ++.+--..+|.-- | T Consensus 45 I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~---L 113 (524) T protein:vir:10 45 IETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNYEVDNAVQEIVSDA--------IVYEDDKEVVALN---L 113 (524) T ss_pred eccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---e Confidence 1110 000000 001111 11123333333333332222110000 0000000000000 0 Q ss_pred ccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCC Q lcl|NC_021297. 68 DIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNT 147 (480) Q Consensus 68 ~~~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~ 147 (480) -...|+-+-.+...+.+..|++--+|+....+..+.-.+.|+.|.+.-.+. ....+|-..++.++|+.+-.+.--.. 
T Consensus 114 d~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~--~~pk~GI~Elr~lDPr~i~~vr~i~~- 190 (524) T protein:vir:10 114 DGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINP--KKMKDGVQELRRLDPRQVQYIREIVT- 190 (524) T ss_pred cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeC--CCccccceeeeeeCCccceeeeeecc- Confidence 000011111223345556666666888999999999999999988854321 12356778888899987754432111 Q ss_pred CeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCccC--Cccc Q lcl|NC_021297. 148 RRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRY--GRSE 223 (480) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~~--G~s~ 223 (480) +....+..+. + ...+-+|.++.- .|...+.. .... +.+ +||- |.|.+.-...+.- =.|- T Consensus 191 -~~~~~~~vi~-----~-~~e~f~Y~~~~~-~~~~~~~~----~~~~----~~i-kI~~dAIvy~~SGL~d~~~~~i~sy 253 (524) T protein:vir:10 191 -RMEDGVKIVD-----G-YREFFVYDTGHE-SYCADGRI----YSAG----TKV-KIPRAAVVYAHSGLLDCCGKNIIGY 253 (524) T ss_pred -cCcccchhhc-----c-hhhheeecCCCc-ccccCcce----ecCC----cce-ecchhheeeeccCcccCCCCceecc Confidence 1111111100 0 111223333211 11111100 0000 011 2222 2222221111110 1133 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhh Q lcl|NC_021297. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYY 277 (480) Q Consensus 224 i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~ 277 (480) |.+.++++-. -+++.|.+.+-+....|-+=++-.+.-..+..+... .+-... 
T Consensus 254 LhkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMl 331 (524) T protein:vir:10 254 LQRAIKPANQ--LKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMT 331 (524) T ss_pred chHhhHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhH Confidence 4444444322 257778888877777776655422222222211111 011111 Q ss_pred hhhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc---cchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 278 GRILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE---NPASAEAIIATDSRIVKMAERKG 351 (480) Q Consensus 278 ~~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~---n~~Sg~Al~~~~~~l~~k~~~~~ 351 (480) ...|. .+|-+.++..++...--.-++-++-....++...++|.+-++.... +..-+..|-.-+.....-+.+++ T Consensus 332 EDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR 411 (524) T protein:vir:10 332 EDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQ 411 (524) T ss_pred hhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHH Confidence 22232 1344566777776653344566677777888899999888843321 22233456777788888899999 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-C Q lcl|NC_021297. 352 RIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-L 419 (480) Q Consensus 352 ~~~~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-l 419 (480) ..|..-+.++++.=+-+.|.--..++ ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. 
| T Consensus 412 ~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL 491 (524) T protein:vir:10 412 NKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFL 491 (524) T ss_pred HHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHh Confidence 99999999999876555554333333 356777754333333333332221 22222233 488998765 7 Q ss_pred CCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 420 GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 420 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ..++++++++.+.++++..+ ...... ..+.++. T Consensus 492 r~tDeei~~~~k~I~~E~k~---~~~~~~-----~~~~~~f 524 (524) T protein:vir:10 492 QMTDEEINQEAKQIEEESKE---ARFQNP-----DEEEEDF 524 (524) T ss_pred ccCHHHHHHHHHHHHHHhhc---CCCCCC-----ChhhhcC Confidence 99999988887666665431 111100 0111111 No 207 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=97.70 E-value=2.7e-05 Score=45.66 Aligned_cols=429 Identities=11% Similarity=0.022 Sum_probs=170.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHH---HHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLE---AEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-- 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~---~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-- 75 (480) |.+-++...-|......+..+... ...||+-.-++...+... .-+..+. ......-++++-...+....+.+. 
T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~-~ly~~m~-~D~hi~s~l~~Rk~av~~~~w~v~p~ 78 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESI-KTFQLMM-RDPAVAASVNIIKMFVRKVNWRFVPP 78 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchH-HHHHHHh-hChHHHHHHHHHHHHHhcCCceEecC Confidence 888777766666543333322211 123343111111111111 1222332 133444444444433444444431 Q ss_pred C----Cch---hHHHHHHHHHh--cCHHHHHHHHHHHHhhcCeE-EEEEecCccccc------CCCceeEEEEEccceeE Q lcl|NC_021297. 76 E----DSE---GLEELWNWWQA--NDLDEESVLGHDDSLTFGRA-YITVSHPDVESG------DPAGIPLIRVESPLYMY 139 (480) Q Consensus 76 ~----d~~---~~~~l~~i~~~--n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~------d~~~~~~i~~~~p~~~~ 139 (480) + +.+ ..+.+.+++.. +.|..++..+ .+|..+|.+ ++++|....... -.+|...+..+.++ T Consensus 79 ~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~R--- 154 (488) T protein:vir:95 79 KGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIR--- 154 (488) T ss_pred CCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccceeeeeeeeccccccccccccccCCeeeeeeeeec--- Confidence 1 111 12345555543 2455666665 478899986 677885321100 01222222111110 Q ss_pred EEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccc---eEEeecccccC Q lcl|NC_021297. 140 AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP---VVPLTNDPRLG 216 (480) Q Consensus 140 ~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP---vv~~~n~~~~~ 216 (480) ...-++++.-..+++... .+.+............ +. ...... 
-.+| ++.+.++...+ T Consensus 155 ----------pq~~~~~f~~d~d~~l~~----~~~~~~~~~~~~~~~~--~~---~~~~~~-~~lP~~kfi~~~~~~~~g 214 (488) T protein:vir:95 155 ----------NQSTLDKWYFDEDFRRVT----GVRQNLRNVSHIAGAI--NL---GERPLT-RKLPRAKFMLFKYDDEYG 214 (488) T ss_pred ----------CcccccceeeccCCCcee----eccccccccccccccc--cc---cccccc-ccccccceEEEeecCCCC Confidence 000001111011111100 0000000000000000 00 000001 1133 34566777888 Q ss_pred ccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc--cccccccccchh----hhhhhh------hcccC Q lcl|NC_021297. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTT--DELTNDGENTTL----DIYYGR------ILTLA 284 (480) Q Consensus 217 ~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~--~~~~~~~~~~~~----~~~~~~------~~~~~ 284 (480) .++|.+.+..-.-..+- =+..+..++.-++.|..|..+.+|... ....+......+ ++..+. -..++ T Consensus 215 ~p~g~gLlr~~~w~~~f-K~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP 293 (488) T protein:vir:95 215 NPEGRSPLLNAYVPWKY-KVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWP 293 (488) T ss_pred ccchhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeec Confidence 89998876532222211 145566677777877777777666321 111111111111 111110 01111 Q ss_pred -CCCceEE---------EeccccHHHHHHHHHHHHHHHHhccCCChHHhccc--c-ccchHHHHHHH-HHHHHHHHHHHH Q lcl|NC_021297. 285 -SEAAKIS---------EFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS--S-ENPASAEAIIA-TDSRIVKMAERK 350 (480) Q Consensus 285 -g~~~~~~---------~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~--~-~n~~Sg~Al~~-~~~~l~~k~~~~ 350 (480) |-+.+++ .....+...|...++..-.+|++.. +|.+ + ....++-|+.. 
...-....+..- T Consensus 294 ~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~i------LGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aD 367 (488) T protein:vir:95 294 RYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAF------MSDVLAMGQSKYGSFSLADSKTSLLAMSVDIL 367 (488) T ss_pred cccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHH------hccccccccCcchhhhHHHHHHHHHHHHHHHH Confidence 1112221 1112233345555555555555553 2211 0 00112223221 122222333333 Q ss_pred HHHHHHHHH-HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCC----HHHHHHhCCCChhH Q lcl|NC_021297. 351 GRIFGGAWE-RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP----KEQARIDLGYTATQ 425 (480) Q Consensus 351 ~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s----~et~~~~lg~~~~~ 425 (480) .+.+...+. +++.-++.++.... ....++.|....+.++.+.++++.+|+.+|. .++ .+.+++.+|+...+ T Consensus 368 a~~i~~tln~~li~~l~~~Nfg~~---~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~-~i~~~~~~~~i~e~~gip~~~ 443 (488) T protein:vir:95 368 LKQIKNVINRDLVAQTYALNMWDD---EEHVQITYDDIETPDLEAIGSYIQKTVAVGA-LEVDKELSNKLREHIGLPPAD 443 (488) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCC---CCccEEEecCcChhhHHHHHHHHHHHHhCCC-ccccHHHHHHHHHHhCCCCCC Confidence 344455553 46665566653322 1235678888888888999999999998874 344 35678888886542 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 426 REQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 426 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ..+ .... ....+.....+..... 
....+..+|.+.+++..| T Consensus 444 ~~e----------~~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~ 484 (488) T protein:vir:95 444 ESQ----------PVSE---KLSPNSQSRSGDGYKT-AGEGTAKTPSAKDPSTAN 484 (488) T ss_pred CCc----------cccc---cCCCCCCCCCCcccCC-CcccCCcccccccchhhh Confidence 111 0000 0111111111111111 111233344445555555 No 208 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=97.70 E-value=2.7e-05 Score=45.66 Aligned_cols=408 Identities=12% Similarity=0.074 Sum_probs=191.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) +.+..++| .+|+.+..+++-+..+..+-..+ ++.+--..+|.-- |--..++-+-.+.. T Consensus 69 ~~~~~eLI-----------~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~---L~~~~~s~~iK~kI 126 (524) T protein:vir:10 69 MKTTRELI-----------DTYRNLMNNYEVDNAVSEIVSDA--------IVYEDDTEVVALN---LDKSKFSPKIKNMM 126 (524) T ss_pred cchHHHHH-----------HHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---ecCcCcchHHHHHH Confidence 11111111 12333333343333322210000 0000000000000 00000111111233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|++--+|+....+..+.-.+.|+.|++...+.. ...+|-..++.++|+.+-.+.--.. +.......+. 
T Consensus 127 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k--~pk~GI~Elr~lDPr~i~~vr~i~~--~~~~~~~vi~-- 200 (524) T protein:vir:10 127 LDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPK--RPKEGIKELRRLDPRQVQYVREIIT--ETEAGTKIVK-- 200 (524) T ss_pred HHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCC--CccccceeeeeeCCccceeeeeecc--CCCccchhhc-- Confidence 455666666668889999999999999999988765322 2356778888899987654432100 0000000000 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCccC--CcccchhhHHHHHHHHH Q lcl|NC_021297. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRY--GRSEISPELRKVTDAAS 236 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~~--G~s~i~~~v~~liD~~~ 236 (480) ....+-+|.++. ..|...+.... .. +.+ +||- |.|.+.....+.- =.|-|.+.++++-. - T Consensus 201 ----~~~e~f~Y~~~~-~~y~~~g~~~~----~~----~~i-kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQ--L 264 (524) T protein:vir:10 201 ----GYKEYFIYDTAH-ESYACDGRMYE----AG----TKI-KIPKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQ--L 264 (524) T ss_pred ----chhhheeeccCc-cccccCccccC----CC----cce-ecchhheeeeeccceeCCCCceeccchhhhHHHHh--h Confidence 011122344432 12222211100 00 111 2221 3343322211111 11334444444322 2 Q ss_pred HHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhhhhcc---cCCCC Q lcl|NC_021297. 237 RTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYGRILT---LASEA 287 (480) Q Consensus 237 ~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~~~~~---~~g~~ 287 (480) +++.|.+.+-+....|-+=++-.+.-..+..+... .+-......|. 
.+|-+ T Consensus 265 kmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrg 344 (524) T protein:vir:10 265 KLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAV 344 (524) T ss_pred hHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcc Confidence 56778888877777776655422222222111100 01111222232 13445 Q ss_pred ceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 288 AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) Q Consensus 288 ~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 364 (480) .++..++.++--.-++-++-....++...++|.+-+.... -|-.-|..|-.-+.....-+.+++..|..-+.++++. T Consensus 345 TEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~ 424 (524) T protein:vir:10 345 TEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKT 424 (524) T ss_pred cceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6677777664334456667777788889999988873221 1222344677778888888999999999999999987 Q ss_pred HHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCCChhHHHHHHHH Q lcl|NC_021297. 365 AMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGYTATQREQMRDW 432 (480) Q Consensus 365 ~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~~~~~~~~~~~~ 432 (480) =+-+.|.--..++ ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. |..++++++++.+. 
T Consensus 425 qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~ 504 (524) T protein:vir:10 425 NLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQ 504 (524) T ss_pred hhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHH Confidence 6555554333333 356777754333333333333221 22222233 489998765 79999998888766 Q ss_pred HHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 433 DKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++++..+ ...... ..+.++. T Consensus 505 I~~E~k~---~~~~~~-----~~~~~~f 524 (524) T protein:vir:10 505 IEEESKE---ARFQDP-----DQEQEDF 524 (524) T ss_pred HHHHhhc---CCCCCC-----chhhhcC Confidence 6665431 111100 0111111 No 209 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=97.69 E-value=2.8e-05 Score=45.51 Aligned_cols=408 Identities=12% Similarity=0.075 Sum_probs=191.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) +.+..++| .+|+.+..+++-+..+..+-..+ ++..--..+|.-- |--..++-+-.+.. T Consensus 69 ~~~~~eLI-----------~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~---L~~~~~s~~iK~kI 126 (524) T protein:vir:72 69 MKTTRELI-----------DTYRNLMNNYEVDNAVSEIVSDA--------IVYEDDTEVVALN---LDKSKFSPKIKNMM 126 (524) T ss_pred cchHHHHH-----------HHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---ecCcCcchHHHHHH Confidence 11111111 12333333343333322210000 0000000000000 00000111111233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 
81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|++--+|+....+..+.-.+.|+.|++...... ...+|-..++.++|+.+-.+.--.. +.......+. T Consensus 127 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k--~pk~GI~Elr~lDPr~i~~vr~i~~--~~~~~~~vi~-- 200 (524) T protein:vir:72 127 LDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPK--RPKEGIKELRRLDPRQVQYVREIIT--ETEAGTKIVK-- 200 (524) T ss_pred HHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC--CccccceeeeeeCCccceeeeeecc--CCCccchhhc-- Confidence 455666666668889999999999999999988754321 2356778888899987654432100 0000000000 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCccC--CcccchhhHHHHHHHHH Q lcl|NC_021297. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRY--GRSEISPELRKVTDAAS 236 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~~--G~s~i~~~v~~liD~~~ 236 (480) ....+-+|.++. ..|...+.... . .+.+ +||- |.|.+.....+.- =.|-|.+.++++-. - T Consensus 201 ----~~~e~f~Y~~~~-~~y~~~g~~~~----~----~~~i-kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQ--L 264 (524) T protein:vir:72 201 ----GYKEYFIYDTAH-ESYACDGRMYE----A----GTKI-KIPKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQ--L 264 (524) T ss_pred ----chhhheeeccCc-cccccCccccC----C----Ccce-ecchhheeeeeccceeCCCCceeccchhhhHhHHh--h Confidence 011122344432 12222211100 0 0111 2221 3344322211111 11334444444322 2 Q ss_pred HHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhhhhcc---cCCCC Q lcl|NC_021297. 237 RTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYGRILT---LASEA 287 (480) Q Consensus 237 ~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~~~~~---~~g~~ 287 (480) +++.|.+.+-+....|-+=++-.+.-..+..+... .+-......|. 
.+|-+ T Consensus 265 kmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrg 344 (524) T protein:vir:72 265 KLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAV 344 (524) T ss_pred hHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcc Confidence 56778888877777776655422222222111100 01111222232 13445 Q ss_pred ceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 288 AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) Q Consensus 288 ~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 364 (480) .++..++.++--.-++-++-....++...++|.+-+.... -|-.-|..|-.-+.....-+.+++..|..-+.++++. T Consensus 345 TEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~ 424 (524) T protein:vir:72 345 TEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKT 424 (524) T ss_pred cceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6677777664334456667777788889999988873221 1222344677778888888999999999999999987 Q ss_pred HHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCCChhHHHHHHHH Q lcl|NC_021297. 365 AMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGYTATQREQMRDW 432 (480) Q Consensus 365 ~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~~~~~~~~~~~~ 432 (480) =+-+.|.--..++ ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. |..++++++++.+. 
T Consensus 425 qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~ 504 (524) T protein:vir:72 425 NLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQ 504 (524) T ss_pred hhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHH Confidence 6555554333333 356777754333333333333221 22222233 489998765 79999998888766 Q ss_pred HHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 433 DKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) ++++..+ .... .|+++..|- T Consensus 505 I~~E~k~---~~~~--------~~~~~~~~f 524 (524) T protein:vir:72 505 IEEESKE---ARFQ--------DPDQEQEDF 524 (524) T ss_pred HHHHhhc---CCCC--------CCchhhhcC Confidence 6665431 1111 011111111 No 210 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=97.67 E-value=3e-05 Score=45.39 Aligned_cols=420 Identities=11% Similarity=-0.018 Sum_probs=159.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccc--hhhhhhccccchHHHHHHHHHhhhc----cC--ccc-c Q lcl|NC_021297. 4 YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAP--PELAYLDVQPGWVATYLRTLSDRLD----IE--GFR-I 74 (480) Q Consensus 4 ~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~--~~~~~~~~~~n~~~~iVd~~~~~l~----~~--~~~-~ 74 (480) -+..++.+..++. |.+-...++++++ +.+++...... ....-.+..-+-+...++++++.|. +- +|+ . T Consensus 1 mk~~~~~~~~~lk-r~~~e~~w~e~a~--~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:78 1 MKSTAAMLWEKLR-DGSVEQRAIEFAK--TTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHh-ccchHHHHHHHHH--hhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 3445555555442 2233344555552 22322211000 0000112333445666666666543 21 222 1 Q ss_pred --CC--------Cch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEE Q lcl|NC_021297. 
75 --SE--------DSE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) Q Consensus 75 --~~--------d~~----~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~ 133 (480) ++ +.. .. +.+...+..++|.....++.++...+|.+-+++.. ..+ +++.+ T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~-------~~~--~~~~~ 148 (510) T protein:vir:78 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS-------DEA--TVVAW 148 (510) T ss_pred CCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeC-------CCC--eEEEE Confidence 11 001 11 23444567789999999999999999998665532 122 35666 Q ss_pred ccceeEEEEeCCCCCeeEEEEEEEEEe-------c---------CCceEEEEEEEeC-----Cc----E-EEEEecCCcc Q lcl|NC_021297. 134 SPLYMYAELDPRNTRRVTRAVRLYTTR-------D---------DVAVPDRATLYLP-----DE----T-VPLRRNGGLN 187 (480) Q Consensus 134 ~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~---------~~~~~~~~~~~~~-----~~----~-~~~~~~~~~~ 187 (480) +-.+.++.-|. .+ ++...++.+.-. . .......+++|+. +. + ++++.++. . T Consensus 149 pl~~y~v~~d~-~G-~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~-~ 225 (510) T protein:vir:78 149 SLRSYAVRRDA-TG-RWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV-R 225 (510) T ss_pred EcceeEEeeCC-Cc-CeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCe-e Confidence 65554444443 23 343333332111 0 0011112333321 10 1 11111111 0 Q ss_pred cccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc Q lcl|NC_021297. 188 DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND 267 (480) Q Consensus 188 ~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~ 267 (480) + ......++..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........|...+. -+. . 
T Consensus 226 ---i--~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~-p~g--~--- 293 (510) T protein:vir:78 226 ---V--GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAK--G--- 293 (510) T ss_pred ---e--ccccccccccCCeeeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhcCCcccC-Ccc--c--- Confidence 0 111123456789999998888899999997644 455566677665555555554454442221 000 0 Q ss_pred cccchhhh-hhhhhcccCCC--CceEEEecc-ccHH---HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHH Q lcl|NC_021297. 268 GENTTLDI-YYGRILTLASE--AAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATD 340 (480) Q Consensus 268 ~~~~~~~~-~~~~~~~~~g~--~~~~~~~~~-~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~ 340 (480) .....+.. ..+.+ .+|. +....++.. ++++ ..++.++.-|+..+... + ...++... ++.-++... T Consensus 294 ~~~~~l~~~~~g~~--v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~-l----~~~~~~rv-TAtEV~~r~ 365 (510) T protein:vir:78 294 AVVDDYQDAEMGDY--VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A----NQRDAERV-TAEEVRITA 365 (510) T ss_pred cchhhhccCCCcee--ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhhc-c----ccCCCCCc-CHHHHHHHH Confidence 00000111 11222 1222 223333332 2333 33444444444433221 1 11122222 333333221 Q ss_pred HHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhCC-CCccc-ceeeeEEecCCCCCCHHHHHHHHH---HHHhccc--- Q lcl|NC_021297. 341 SRIVKM----AERKGRI-FGGAWERAMRIAMQIMGR-EVTEE-YTRLETVWRDPSTPTVAAKADAVS---KLYANGQ--- 407 (480) Q Consensus 341 ~~l~~k----~~~~~~~-~~~~l~~~~~~~~~~~~~-~~~~~-~~~i~v~f~~~~~~~~~~~ad~~~---kl~~~~~--- 407 (480) ..+... ..+.... +..-+++.+.++.+ .+. ....+ .....+++-.++.+ .+.++.+. +.++... 
T Consensus 366 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~q~l~~~~~~~ 442 (510) T protein:vir:78 366 EEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPALSR--SAAVQSMLNASQVIAGLAPIA 442 (510) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCcccccceeeecccHHHH--HHHHHHHHHHHHHHHHhcChh Confidence 111111 1222222 22223333443322 111 11111 22233444333222 22222222 2121111 Q ss_pred ---cCCCHH----HHHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 408 ---GPIPKE----QARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 408 ---~~~s~e----t~~~~lg~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) ..+... .+...+|.+ +++++++.+.+.+++........+...+.++.++ .+.| T Consensus 443 q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~-------------~~~g 509 (510) T protein:vir:78 443 QLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTN-------------ALAG 509 (510) T ss_pred hhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc-------------cCCC Confidence 112222 233445653 3445444433222221111111111111111111 1222 Q ss_pred C Q lcl|NC_021297. 474 F 474 (480) Q Consensus 474 ~ 474 (480) . T Consensus 510 ~ 510 (510) T protein:vir:78 510 V 510 (510) T ss_pred C Confidence 2 No 211 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=97.65 E-value=3.3e-05 Score=45.16 Aligned_cols=420 Identities=14% Similarity=0.120 Sum_probs=163.6 Q ss_pred CCC-------HHHHHHHHHHHHHHHH-HHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc---- Q lcl|NC_021297. 1 MTT-------YHEHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD---- 68 (480) Q Consensus 1 ~~~-------~~~~i~~l~~~~~~~~-~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~---- 68 (480) |-| ..+.+.+....+..++ +-...++++|+- .+++.-.........-++.-+-+...++++++.|. 
T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~--tlP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~lt 78 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKL--TLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLF 78 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHH--hcccccCCCCCcccccccccchHHHHHHHHHHHHHHhhc Confidence 433 1223444444444333 223344444422 22222111111111112333445666666666543 Q ss_pred cC--cc-ccC-CCc---------hhH-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCC Q lcl|NC_021297. 69 IE--GF-RIS-EDS---------EGL-----------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 69 ~~--~~-~~~-~d~---------~~~-----------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~ 124 (480) +- +| +.. .|+ ... +.+...+..++|.....++.++...+|.+-+++.. T Consensus 79 pp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~-------- 150 (515) T protein:vir:70 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPS-------- 150 (515) T ss_pred CCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeC-------- Confidence 21 22 221 010 011 22444467789999999999999999999766632 Q ss_pred CceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-c-----------------CCceEEEEEEE-----eCCc-EEEE Q lcl|NC_021297. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D-----------------DVAVPDRATLY-----LPDE-TVPL 180 (480) Q Consensus 125 ~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-~-----------------~~~~~~~~~~~-----~~~~-~~~~ 180 (480) ++. +++++-.+.++--|. .+ ++...++.+.-. . ..+....+++| .++. +.+| T Consensus 151 ~~~--~~~~pl~~y~v~~d~-~G-~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~ 226 (515) T protein:vir:70 151 KGA--MSAVPMHHYVVNRDT-NG-DLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKIN 226 (515) T ss_pred CCC--eEEEEcCeEEEeeCC-Cc-CeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEE Confidence 222 555665664444443 23 344333322100 0 00000112232 2222 1122 Q ss_pred EecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--C Q lcl|NC_021297. 
181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G 258 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g 258 (480) ....+. .......-+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........|.+.+. | T Consensus 227 ~e~d~~-----~~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g 300 (515) T protein:vir:70 227 QSADDI-----PVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAED-YSGDLFVIQFLSEAMARGAALMADIKYLIRPGS 300 (515) T ss_pred EecCce-----eeccccccccccCCceeeeeeecCCCCcccchHHH-hhHHHHHHHHHHHHHHHHHHHhcCCCeeeCccc Confidence 211111 11111223367789999988888899999997654 556677777777777777777777765542 1 Q ss_pred CCccccccccccchhhhhh-hhhcccCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHH Q lcl|NC_021297. 259 VTTDELTNDGENTTLDIYY-GRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAI 336 (480) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al 336 (480) ... ...+..+. +.+..-..++....++.. .+++.....++.+...|....-+.. ..-.++.+. ++.-+ T Consensus 301 ~~~--------~~~l~~~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~-l~~rd~~rv-TAtEV 370 (515) T protein:vir:70 301 QTD--------VDHFVNSGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMET-MTRRDAERV-TAVEI 370 (515) T ss_pred ccc--------hhhccccCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhh-hhccCCccc-cHHHH Confidence 111 01111111 222111112233334432 3444433444333333332221110 011111122 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCCcccceeeeEEecCCCCCCHHHH---HHHHHHHHhc Q lcl|NC_021297. 337 IATDSRIVKMAERKGRIFGGAWERAMR--------IAMQIMGREVTEEYTRLETVWRDPSTPTVAAK---ADAVSKLYAN 405 (480) Q Consensus 337 ~~~~~~l~~k~~~~~~~~~~~l~~~~~--------~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~---ad~~~kl~~~ 405 (480) + ..+.+++..++..+.++-. +++.-..-..+.+. +++.+-. +.+..+. ++.+..+.+. 
T Consensus 371 ~-------~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~--v~~~~vs--~l~~L~r~q~~~~i~~~~q~ 439 (515) T protein:vir:70 371 Q-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSEL--VDPVIVT--GIEALGRMAELDKLANFAQY 439 (515) T ss_pred H-------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhh--cccceeh--hHHHHHHHHHHHHHHHHHHH Confidence 2 3345555556665555422 11111222222222 3333311 1222222 2222222111 Q ss_pred ------cc----cCCCH----HHHHHhCCC------ChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 406 ------GQ----GPIPK----EQARIDLGY------TATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 406 ------~~----~~~s~----et~~~~lg~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) +. -.+.. +.+...+|. ++++++++.+.++++ ...+..+..- +++.+...+..- T Consensus 440 i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~---~~~~~~~~~~--~~a~~~~~~~~~-- 512 (515) T protein:vir:70 440 MSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQA---QQEAMLNEGV--AKAVPGVIQQEM-- 512 (515) T ss_pred HHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHH---HHHHHHHHhh--hhhcccchhhhh-- Confidence 10 01111 222233332 344444443222221 1111111111 111111111110 Q ss_pred cccc Q lcl|NC_021297. 466 ETQT 469 (480) Q Consensus 466 ~~~~ 469 (480) .+ + T Consensus 513 ~~-~ 515 (515) T protein:vir:70 513 KE-G 515 (515) T ss_pred cc-C Confidence 00 0 No 212 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=97.64 E-value=3.4e-05 Score=45.05 Aligned_cols=419 Identities=15% Similarity=0.115 Sum_probs=161.9 Q ss_pred CCCHHH----HHHHHHHHHHHHHHH-HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc----cC- Q lcl|NC_021297. 1 MTTYHE----HVERLQGLLARDLPN-LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD----IE- 70 (480) Q Consensus 1 ~~~~~~----~i~~l~~~~~~~~~r-~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~~- 70 (480) |+.+.. .+++..+.+..++.. ...++++|+ +.+++.-.......+.-++.-+-+...++++++.|. 
+- T Consensus 5 ~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~--~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 82 (516) T protein:vir:96 5 IDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSK--LTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQ 82 (516) T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHH--hhcccccCCCCCccccCCcccchHHHHHHHHHHHHHhhhcCCC Confidence 444433 344444555444333 344455542 222222111111111113334455666666666553 21 Q ss_pred -cc-ccCC-C----------c---hhH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce Q lcl|NC_021297. 71 -GF-RISE-D----------S---EGL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI 127 (480) Q Consensus 71 -~~-~~~~-d----------~---~~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~ 127 (480) +| +..- | . +.. +.+...+..++|.....++.++...+|.+-+++. +++ T Consensus 83 ~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d--------~~~- 153 (516) T protein:vir:96 83 RSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKP--------SKG- 153 (516) T ss_pred CcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEec--------CCC- Confidence 22 2211 1 0 111 2344566778999999999999999999876552 222 Q ss_pred eEEEEEccceeEEEEeCCCCCeeEEEEEEEE-E------e----------c-CCceEEEEEEE-----eCCcEE-EEEec Q lcl|NC_021297. 128 PLIRVESPLYMYAELDPRNTRRVTRAVRLYT-T------R----------D-DVAVPDRATLY-----LPDETV-PLRRN 183 (480) Q Consensus 128 ~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~-~------~----------~-~~~~~~~~~~~-----~~~~~~-~~~~~ 183 (480) .++.++-.+.++.-|. .+ ++...++... . . . ..+....+++| .++..+ +|... T Consensus 154 -~~~~~pl~~y~v~~d~-~G-~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~ 230 (516) T protein:vir:96 154 -AISAIPMHHYVVNRDT-NG-DLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSA 230 (516) T ss_pred -CEEEEEcCeEEEeeCC-CC-CeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEe Confidence 2555655554444443 22 3333222110 0 0 0 00001122333 233222 22221 Q ss_pred CCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCc Q lcl|NC_021297. 
184 GGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTT 261 (480) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~ 261 (480) ++.. ......-+|..+|++.+..+...++.||+|-..+ ..+-+..+|.+.-...........|.+.+. |... T Consensus 231 d~~~-----~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~ 304 (516) T protein:vir:96 231 DDIP-----VGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAED-YSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTD 304 (516) T ss_pred Ccee-----eccccccccccCCeeeeeeeecCCCCcccchHHH-hhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccc Confidence 1111 1111223456789999998888899999997643 455566677666666666666666554432 1110 Q ss_pred cccccccccchhhh-hhhhhcccCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHHHH Q lcl|NC_021297. 262 DELTNDGENTTLDI-YYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAIIA 338 (480) Q Consensus 262 ~~~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al~~ 338 (480) ...+.. ..+.+..-..++....++.. .+++.....++.+...|....-+. .+.. ++... ++.-++ T Consensus 305 --------~~~l~~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~r~~~rv-TAtEV~- 372 (516) T protein:vir:96 305 --------VDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERV-TAVEIQ- 372 (516) T ss_pred --------hhhhccCCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccc-cHHHHH- Confidence 001111 11222111112233344432 334433333333333332222110 0111 11111 232222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHH---HHHHHHHHh--- Q lcl|NC_021297. 339 TDSRIVKMAERKGRIFGGAWERA--------MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAK---ADAVSKLYA--- 404 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~--------~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~---ad~~~kl~~--- 404 (480) ..+.+++..++..+.++ +..++...+...+... +++.+-.+ -+.+.. 
++.+....+ T Consensus 373 ------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp~~~--v~~~~vs~--l~~l~r~~~~~~i~~~~~~i~ 442 (516) T protein:vir:96 373 ------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFTSDL--VDPVIITG--IEALGRMAELDKLANFAQYMS 442 (516) T ss_pred ------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCcccc--ccceeech--HHHHHHHHHHHHHHHHHHHHH Confidence 33444555555555543 1112223343333222 33333211 111222 222222211 Q ss_pred ----ccc---cCCCHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccc Q lcl|NC_021297. 405 ----NGQ---GPIPKE----QARIDLGYT------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTET 467 (480) Q Consensus 405 ----~~~---~~~s~e----t~~~~lg~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ..+ ..+... .+...+|.+ +++++++.+.+.+.++. ........++.+ ....++..+. T Consensus 443 ~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~-~~~a~~~~~~~~-----~~~~~~~~~~ 516 (516) T protein:vir:96 443 LPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQA-QMLEEGVAKAVP-----GVIQQELKEA 516 (516) T ss_pred HHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH-HHHHHHhhhhhh-----HHhhcccccC Confidence 111 111222 223334543 33333332222111111 111111111111 0000000000 No 213 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=97.62 E-value=3.6e-05 Score=44.94 Aligned_cols=413 Identities=12% Similarity=0.072 Sum_probs=167.8 Q ss_pred CCCHHHHHHHHHHHHH---HHHHHHHHHHHHhcccCcchhcCcc-------------cch---hhhhhccccchHHHHHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLA---RDLPNLLEAEAYRNGTRRLKTIGIG-------------APP---ELAYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~---~~~~r~~~~~~YY~g~~~i~~~~~~-------------~~~---~~~~~~~~~n~~~~iVd 61 (480) |..+-+...+-++.-. ..-.++....+-+.+ |+.+.+... ... 
-+..+.-......-+++ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~-~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~ 79 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFAN-HPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMS 79 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcc-cCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 2222111000000000 000111111122111 111110000 000 00000001122222233 Q ss_pred HHHhhhccCccccC---CC----chhHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeE--- Q lcl|NC_021297. 62 TLSDRLDIEGFRIS---ED----SEGLEELWNWWQA-NDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL--- 129 (480) Q Consensus 62 ~~~~~l~~~~~~~~---~d----~~~~~~l~~i~~~-n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~--- 129 (480) +-...+....+.+. ++ .+..+.+.+++.+ .+|...... +.+|..||.+ ++++|... +|... T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~~~------~g~~~~~~ 152 (528) T protein:vir:10 80 KRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIELDWSLQ------GREWLPQA 152 (528) T ss_pred HHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEEEEEeec------CCceeEEE Confidence 32223334444431 12 1223445666644 246665554 4568889986 56667421 23332 Q ss_pred EEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEe Q lcl|NC_021297. 130 IRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPL 209 (480) Q Consensus 130 i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~ 209 (480) +.+++|+.+ .|++... . .++...+.. .....+ +++ ++.+ T Consensus 153 ~~~r~~~~f--~~~~~~~-----------------------------~-~l~~~~~~~-----~g~~l~-~~k---~iv~ 191 (528) T protein:vir:10 153 FDHRPQSWF--QLNPDDQ-----------------------------D-ELRLRDNSI-----AGEVLQ-PFG---WIMH 191 (528) T ss_pred eeeecccce--eeccCCC-----------------------------c-EEeccCCCC-----Cceeec-CCC---eEEE Confidence 222222211 1111110 0 010000000 000000 111 2334 Q ss_pred ecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hhhhhhhhccc-C Q lcl|NC_021297. 
210 TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LDIYYGRILTL-A 284 (480) Q Consensus 210 ~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~----~~~~~~~~~~~-~ 284 (480) .++...+.++|.+.+.. +--..=-=+..+.+++.-++.|+.|.++.+ -+....+++.... .+++.+....+ . T Consensus 192 ~~~~~~g~p~g~gLlr~-~~w~~~fK~~~~~~w~~f~E~yG~P~~igk--y~~~a~~~ek~~L~~al~~i~~~~~~iiP~ 268 (528) T protein:vir:10 192 KPRSRSGYVARSGLFRV-LAWPYLFKHYSTADLAEMLEIYGLPIRLGK--YPPGTPDEEKVTLLRAVTGLGHAAAGIIPE 268 (528) T ss_pred eecCCCCCccccchHHH-HHHHHHHHHhhHHHHHHHHHHcCCCeEEEe--cCCCCCHHHHHHHHHHHHHHhhCcEEEecC Confidence 45666677888887643 222222225678888888999999976664 2211111111111 12222223233 3 Q ss_pred CCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccc-cccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_021297. 285 SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS-SENPASAEAIIA-TDSRIVKMAERKGRIFGGAWE-RA 361 (480) Q Consensus 285 g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~-~~n~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~-~~ 361 (480) |...+|.+....+...|...++..-.+|++..- -+.+... +.+-.+..|+.. ...-....++.-.+.+...+. ++ T Consensus 269 ~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL--GqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~l 346 (528) T protein:vir:10 269 SMSIDFQEASKGSAEPFMAMMRWCDDSMSKAIL--GGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDL 346 (528) T ss_pred CceeEEeecCCCChhHHHHHHHHHHHHHHHHHh--hhhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 456777776556667777777777777766641 0111110 001112223321 222333344444456666674 47 Q ss_pred HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 362 MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMI 441 (480) Q Consensus 362 ~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~ 441 (480) ++.++.++...........++.|....+.++.+.++.+.+++..|. 
.++.+.+.+.+|+......+ ..+ T Consensus 347 i~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~-~i~~~~i~e~~gip~p~~~e----------~~~ 415 (528) T protein:vir:10 347 LWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGV-QVPVNWVQEQLGIPLPANGE----------AVL 415 (528) T ss_pred HHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCCc----------ccc Confidence 7777777654332223346778888889999999999999998873 47999999999986542111 000 Q ss_pred HHHhcccccCcccccCCCCCCCCcccccCCCCCCCCCC---------C Q lcl|NC_021297. 442 DTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKT---------R 480 (480) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~---------~ 480 (480) ............+.+.+.... ......+...+.... + T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~d~~~~~~~~~ 461 (528) T protein:vir:10 416 GDQAGAGIAQLSRRPGPRIAA--LAQVIGPRYRDQEALDQVLASLPAQ 461 (528) T ss_pred cCCCcccccccCccccccccc--ccccccccccccchHHHHHHHHHHH Confidence 000000000000000000000 000000000000000 0 No 214 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.60 E-value=4e-05 Score=44.72 Aligned_cols=390 Identities=12% Similarity=0.035 Sum_probs=156.4 Q ss_pred CCCHHHHHHHHHHHHHHHH-HHHHH---------HHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDL-PNLLE---------AEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~-~r~~~---------~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |- +..+|........ .+... ...|+.+- .++ .+..+.. 
..-+.+.-...+|+..++-+-.- T Consensus 1 MG----~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~-~~~~v~~---~~al~~~~v~~ci~~ia~~iA~l 71 (422) T protein:vir:13 1 MG----FLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKF-GIK-LNFSVRG---KRALKENTVYVCTKIRAESIGKL 71 (422) T ss_pred Cc----hhhhhhhccCCccchhhhhhhccccccCcchhhhhc-ccc-CCcccch---hhhhccHHHHHHHHHHHHhhhhC Confidence 22 3333322211100 00000 00111110 000 0000000 00011122233344444333233 Q ss_pred cccc--CCCchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEE Q lcl|NC_021297. 71 GFRI--SEDSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAEL 142 (480) Q Consensus 71 ~~~~--~~d~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~ 142 (480) ++.+ ..++.....+..++.. | ........+..+.+.+|.||+.+..+ ..|++ .+..++|.++.+++ T Consensus 72 p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~~~~ 145 (422) T protein:vir:13 72 SLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERD------RKGKIIGLYPINSDNVTKII 145 (422) T ss_pred ceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEECCcceEEEE Confidence 3332 1111112234455432 3 23467788889999999999988653 24554 67889999999888 Q ss_pred eCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcc Q lcl|NC_021297. 143 DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s 222 (480) |.........-+ +|......+.. 
..+.++ -|+|+..+....+.+|.| T Consensus 146 ~~~~~~~~~~~~-~y~~~~~~g~~---~~~~~~-----------------------------eiih~~~~~~~~~~~G~s 192 (422) T protein:vir:13 146 DDDNFLSSLSKV-WYVVTDKNGKE---HKLLPD-----------------------------EMLHFIGDITLDGLIGIK 192 (422) T ss_pred cCCcceeccceE-EEEEEeCCCeE---EEEccc-----------------------------ceEEEcCCCCCCCccccc Confidence 754211110011 11111111110 011122 234444333345567887 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEE Q lcl|NC_021297. 223 EISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISE 292 (480) Q Consensus 223 ~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~ 292 (480) .+. .+.+.+....+-......+|. .|..++.-. ...+...+.-...|+. ..+.++.++ ++.++.+ T Consensus 193 ~~~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~ 267 (422) T protein:vir:13 193 PLD----YLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLP-FGYQFQP 267 (422) T ss_pred HHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecC-CCceeee Confidence 663 233333333222222222333 354444321 1111100111111211 112344443 3467766 Q ss_pred ecccc-HHHHHHHHHHHHHHHHhccCCChHHhcccccc-chHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 293 FKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAE--AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 293 ~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n-~~Sg~--Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~ 368 (480) +.... -..+++..+....+|+...++|+..+|....+ .++.. .+.+....|.-.+...+..|.. 
.+ T Consensus 268 l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~----------~L 337 (422) T protein:vir:13 268 ISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQD----------KL 337 (422) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHH----------hh Confidence 65432 22467888888999999999999999854322 11111 1111111121112111111111 11 Q ss_pred hCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_021297. 369 MGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 369 ~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ...........+++.+......|..+.++++.+++++| +++...+++.+|+.+-+- -.+. .-.. T Consensus 338 l~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--gD~~------------~~~~ 401 (422) T protein:vir:13 338 FSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGG--FIEANEARRRENLPPVEG--GDRL------------LVNG 401 (422) T ss_pred CChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCee------------eecc Confidence 11111111123444445556678888999999998875 677777888888765320 0000 0000 Q ss_pred ccCcccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 449 KAQADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) . -.+-... +++.+..|..|+. T Consensus 402 n----~~~l~~~-----~~~~~~~g~~~g~ 422 (422) T protein:vir:13 402 N----MIPIEMA-----GEQYKKGGEKGGK 422 (422) T ss_pred C----ccchhhc-----ccccccCCCcCCC Confidence 0 0000000 0000011111111 No 215 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=97.59 E-value=4.1e-05 Score=44.64 Aligned_cols=421 Identities=14% Similarity=0.125 Sum_probs=163.3 Q ss_pred CCCH--------HHHHHHHHHHHHHHHHH-HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc--- Q lcl|NC_021297. 
1 MTTY--------HEHVERLQGLLARDLPN-LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD--- 68 (480) Q Consensus 1 ~~~~--------~~~i~~l~~~~~~~~~r-~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~--- 68 (480) |-.. ++.+++..+.+..++.. ...++++|+ +.+++.........+.-++.-+-+...++++++.|. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~--~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~l 78 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSK--LTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVL 78 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHH--hhcccccCCCCCcccccccccchHHHHHHHHHHHHHhhh Confidence 3222 22344444444433322 344455542 223322111111111113334455666666666543 Q ss_pred -cC--ccc-cC--CC---------c---hhH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccC Q lcl|NC_021297. 69 -IE--GFR-IS--ED---------S---EGL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD 123 (480) Q Consensus 69 -~~--~~~-~~--~d---------~---~~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d 123 (480) +- +|+ .. +. . +.. +.+...+..++|.....++.++...+|.|-+++. T Consensus 79 tpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d-------- 150 (516) T protein:vir:10 79 FPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKP-------- 150 (516) T ss_pred cCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEec-------- Confidence 21 222 11 10 0 011 2344456778999999999999999999865542 Q ss_pred CCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe-------cC-----------CceEEEEEEEe-----CCcEEEE Q lcl|NC_021297. 124 PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-------DD-----------VAVPDRATLYL-----PDETVPL 180 (480) Q Consensus 124 ~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~~-----------~~~~~~~~~~~-----~~~~~~~ 180 (480) +++ .++.++-.++++.-|. .+ ++...++...-. .. 
.+....+++|+ ++..+.+ T Consensus 151 ~~~--~~~~~pl~~y~v~~d~-~G-~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~ 226 (516) T protein:vir:10 151 SKG--AISAIPMHHYVVNRDT-NG-DLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWEL 226 (516) T ss_pred CCC--CeEEEEcCeEEEeeCC-CC-CeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEE Confidence 122 2555655665444443 33 343333321100 00 00111223332 2222222 Q ss_pred EecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--C Q lcl|NC_021297. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G 258 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g 258 (480) ....+.. .......-+|..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........|.+.+. | T Consensus 227 ~~~~d~~----~~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~-~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g 301 (516) T protein:vir:10 227 KQSADDI----PVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAED-YSGDLFVIQFLSEAVARGAALMADIKYLIRPGA 301 (516) T ss_pred EEeeCce----eeccccccccccCCeeeeeeeecCCCCcccchHHH-hhHHHHHHHHHHHHHHHHHHHhcCCCcccCccc Confidence 2211110 01111123466789999988888899999997643 455566666666556665665566554442 1 Q ss_pred CCccccccccccchhhhhhhhhcccCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhccCCChHHhcc-ccccchHHHHH Q lcl|NC_021297. 259 VTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAI 336 (480) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~n~~Sg~Al 336 (480) ..... .....+.+.+..-..++....++.. .+++.....++.+...|....-+.. +.. ++... 
++.-+ T Consensus 302 ~~~~~-------~l~~~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~--l~~rd~~rv-TAtEV 371 (516) T protein:vir:10 302 QTDVD-------HFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMET--MTRRDAERV-TAVEI 371 (516) T ss_pred ccchh-------hhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhh--hhccCCccc-cHHHH Confidence 11000 0011111222211112233344432 3444433444433333332221110 111 11111 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCCcccceeeeEEecCCCCCCHHHHH---HHHHHHHh- Q lcl|NC_021297. 337 IATDSRIVKMAERKGRIFGGAWERAMR--------IAMQIMGREVTEEYTRLETVWRDPSTPTVAAKA---DAVSKLYA- 404 (480) Q Consensus 337 ~~~~~~l~~k~~~~~~~~~~~l~~~~~--------~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a---d~~~kl~~- 404 (480) + ..+.+++..++..+.++-. ..+.......+.....+.+. .+.+...++ +.+..+.+ T Consensus 372 ~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~v----~~i~~L~raq~~~~i~~~~q~ 440 (516) T protein:vir:10 372 Q-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVII----TGIEALGRMAELDKLANFAQY 440 (516) T ss_pred H-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChhhcCccee----hhHHHHHHHHHHHHHHHHHHH Confidence 2 3445555555555554322 11111222333332222221 112222222 22222111 Q ss_pred ------cccc---CCC----HHHHHHhCCC------ChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 405 ------NGQG---PIP----KEQARIDLGY------TATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 405 ------~~~~---~~s----~et~~~~lg~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) ..+. .+. -+.+...+|. ++++++++.+.+.+..+... ....... +..+..++.-. T Consensus 441 i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~-~~~~~~~-----~~~~~~~~~~~ 514 (516) T protein:vir:10 441 MSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQM-LEEGVAK-----AVPGVIQQELK 514 (516) T ss_pred HHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHH-HHHHhhh-----cccchhhhhhh Confidence 1100 011 1333444443 34455554433322222111 1111111 11111111111 Q ss_pred cc Q lcl|NC_021297. 466 ET 467 (480) Q Consensus 466 ~~ 467 (480) +. 
T Consensus 515 ~~ 516 (516) T protein:vir:10 515 EA 516 (516) T ss_pred cC Confidence 11 No 216 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=97.59 E-value=4.1e-05 Score=44.62 Aligned_cols=391 Identities=11% Similarity=0.031 Sum_probs=169.1 Q ss_pred HHH--HHHHHHHHHHHHHHHHHHhcccC----cchh-cCcccchhhhhhccccchHHHHHHHHHhhhccCcccc--CCCc Q lcl|NC_021297. 8 VER--LQGLLARDLPNLLEAEAYRNGTR----RLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--SEDS 78 (480) Q Consensus 8 i~~--l~~~~~~~~~r~~~~~~YY~g~~----~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~--~~d~ 78 (480) |++ |..+..+...-.+.+..|..|-. .|.. ....-...+..+. ......-++++....+....|.+ ++++ T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~-~D~~i~s~l~~rk~av~~~~w~i~p~~~~ 79 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEIL-SDAQVKTVWGQRQLAVVSREWKVEAGGDR 79 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHh-hChHHHHHHHHHHHHHhcCCceEEcCCCC Confidence 333 22222221111112222322210 1100 0000001111111 12233334444444444445544 2222 Q ss_pred ----hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeEEE---EEccceeEEEEeCCCCCee Q lcl|NC_021297. 79 ----EGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIR---VESPLYMYAELDPRNTRRV 150 (480) Q Consensus 79 ----~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~i~---~~~p~~~~~v~d~~~~~~~ 150 (480) +..+.+.+.++.-+|...+..+. ++..||.+ ++++|.. .+|...+. +++|+.+ .||+... 
T Consensus 80 ~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~------~~g~~~~~~l~~r~~~~f--~~d~~~~--- 147 (488) T protein:vir:99 80 PIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGR------DDRYITLEAIKVRNRRRF--RYDQDGG--- 147 (488) T ss_pred hHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEee------cCCeeeEeeeeeecccce--eecCCCc--- Confidence 23356777777767887777765 68889986 6667742 13443332 3333211 1221110 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHH Q lcl|NC_021297. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) ..+....+. ......|.+++. ++ ..++...+.++|.|.+..-... T Consensus 148 ---------------------------l~~~~~~~~-----~~g~~lp~~~~~--i~-~~~~~~~g~p~g~gLl~~~~w~ 192 (488) T protein:vir:99 148 ---------------------------LRLLTPNNM-----FEGEPCPAPYFW--HF-STGADNDDEPYGLGLAHWLYWP 192 (488) T ss_pred ---------------------------eEEeccCCC-----CCccccccCceE--EE-EeecCCCCCcccchHHHHHHHH Confidence 001100000 011111222221 22 2345566778888877542222 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc-ccccccc----hhhhhhhhhccc-CCCCceEEEeccccHHHHHHH Q lcl|NC_021297. 231 VTDAASRTLMNLQSASQILGTPLRVISGVTTDEL-TNDGENT----TLDIYYGRILTL-ASEAAKISEFKAAELRNFAEE 304 (480) Q Consensus 231 liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~-~~~~~~~----~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~ 304 (480) .=-=+..+.+++.-++.|+.|.++.+ -++.. .++.... ...++......+ .|...++.+....+...|... 
T Consensus 193 -~~fK~~~~~~w~~f~E~yG~P~~igk--y~~~~a~~~ek~~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~l 269 (488) T protein:vir:99 193 -VFFKRNGIKFWLIFLDKFGMPTAVGR--YDDKTATPEDKAKLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTL 269 (488) T ss_pred -HHHHHhhHHHHHHHHHHcCCceeeee--cCCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHH Confidence 11225567788888899999976554 22111 1111111 122233333333 344566766655555667777 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWE-RAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~i~v 382 (480) ++..-.+|++..- -+.+.....+ .| -|+. ....-....++.-.+.+...+. ++++.++.++.... ....+ T Consensus 270 i~~~d~~Isk~iL--Gqtlts~~~~-Gs-~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~----~~p~~ 341 (488) T protein:vir:99 270 HDTMDATIAKVGL--GQVASTQGTP-GR-LGNDDLQADVRLDLVKADADLICESFNLGPARWLTEWNFPGA----QPPRV 341 (488) T ss_pred HHHHHHHHHHHHh--hhhhcccccc-cc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCc----CCcee Confidence 7777676665531 0111111111 11 1221 1222333444444455666664 46776666654322 22456 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCC Q lcl|NC_021297. 383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTD 462 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) .|....+.++.+.++.+.+++..+-..++.+.+.+.+|+......+ + . . .+.+.... T Consensus 342 ~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~-------~---~----~---------~~~~~~~~ 398 (488) T protein:vir:99 342 YRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQA-------E---A----T---------APTPSTEF 398 (488) T ss_pred EecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCccccc-------c---c----c---------cCCCcccC Confidence 7777788899999999999988522457899999999987542111 0 0 0 00000000 Q ss_pred CCcccccCCCC-CCCCCCC Q lcl|NC_021297. 
463 TKTETQTSPSG-FNRTKTR 480 (480) Q Consensus 463 ~~~~~~~~p~~-~~~~~~~ 480 (480) .++.....+.+ .-....+ T Consensus 399 ~~~~~~~~~~~~~~~~~~~ 417 (488) T protein:vir:99 399 AEGDQPSDPAAAMAPQLAE 417 (488) T ss_pred CCCCCCCCchHHHHHHHHH Confidence 00000000000 0000001 No 217 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.58 E-value=4.3e-05 Score=44.55 Aligned_cols=382 Identities=11% Similarity=-0.007 Sum_probs=150.3 Q ss_pred HHHHHHHHHHH-H---HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CCCch--h Q lcl|NC_021297. 10 RLQGLLARDLP-N---LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSE--G 80 (480) Q Consensus 10 ~l~~~~~~~~~-r---~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~--~ 80 (480) -|.+....+.. . .....-.+-|... .+..+... .-+.+.-...+|+..++-+-.-++.+ .++.+ . T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~ 74 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWLGINP---SETYVNGK---SCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKRVP 74 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHhcCCc---Ccceechh---hhhccHHHHHHHHHHHHhhhhCceEEEEecCCeeecc Confidence 01000000000 0 0000000001000 00011000 00111112333444333332223322 11111 1 Q ss_pred HHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 81 LEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 81 ~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ...+..++.. 
| ....+...++.+.+.+|.||+++-.+ ..|.+ .+.+++|..+.++.++........-+ T Consensus 75 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~ 148 (409) T protein:vir:10 75 DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFK------KNGEIKGLYPLKSDGMKIFVDDTGLLNSENNV 148 (409) T ss_pred CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEEcCCceEEEEcCCccccccceE Confidence 2234444432 3 33466778889999999999988653 34444 57788999888877653211111111 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHH Q lcl|NC_021297. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~ 234 (480) .|.. ....+.. ..+.+++ |+||.+.. ..+.+|.|.|. .+.+. T Consensus 149 ~y~~-~~~~g~~---~~~~~~e-----------------------------vih~r~~~-~d~~~G~s~i~----~~~~~ 190 (409) T protein:vir:10 149 WYLY-TDDLGQR---HKFMSDE-----------------------------ILHFKGLT-ADGLAGLSVIE----LLNHL 190 (409) T ss_pred EEEE-EeCCcee---EEecccc-----------------------------EEEecCcC-CCCcccccHHH----HHHHH Confidence 1111 1111110 0112222 34444321 23466777653 33333 Q ss_pred HHHHHHHHHHHHHHh---cchhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEecccc-HHHHHH Q lcl|NC_021297. 235 ASRTLMNLQSASQIL---GTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 235 ~~~~~s~~~~~~~~~---~~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~ 303 (480) +....+-......+| +.|-.++.-. .......+.-...|+- ..+.++.++ ++.++.+++.+. 
-..+++ T Consensus 191 i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e 269 (409) T protein:vir:10 191 IENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLP-IGYKFEPISQKLVDAQFLE 269 (409) T ss_pred HHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecC-CCceEEEccCChhhHHHHH Confidence 333322222222233 3344343311 1111111100111111 122344443 345776665432 224678 Q ss_pred HHHHHHHHHHhccCCChHHhccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v 382 (480) ..+....+|+...++|+..+|.... +.++. ......+...+ -.-+-..|++.+.. .+...........+++ T Consensus 270 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~---e~~~~~f~~~~---l~P~~~~ie~~ln~--kL~~~~~~~~~~~~~f 341 (409) T protein:vir:10 270 NSQLTIRQIASVFGVKMHQLNDLDRATHSNI---TEQNREFYIDT---LQSILNMYELEINY--KLFLISEIKNGFYSKF 341 (409) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCCccccH---HHHHHHHHHHH---HHHHHHHHHHHHHH--hhcCchhccCCcEEEE Confidence 8888899999999999999985432 21222 21111111111 01111112211110 1111111112234555 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCC Q lcl|NC_021297. 383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTD 462 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) .+......|..+.++++.+++.+| +++...+++.+|+.+-+- -.+.. .....-.....++ +...+++ T Consensus 342 d~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~lgl~p~~g--gD~~~-------~~~n~~~~~~~~~--~~~kgGe 408 (409) T protein:vir:10 342 NVDTILRADIKTRYESYKEAIQNG--FKTPNEIRELEEDEPLEG--GDVLL-------INGNMIPVKMAGE--QYSKGGE 408 (409) T ss_pred echhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCeee-------eccCccchhhccc--cccccCC Confidence 555666678889999999998876 566666677777754320 00000 0000000000000 0000000 Q ss_pred CC Q lcl|NC_021297. 
463 TK 464 (480) Q Consensus 463 ~~ 464 (480) + T Consensus 409 -~ 409 (409) T protein:vir:10 409 -K 409 (409) T ss_pred -C Confidence 0 No 218 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=97.57 E-value=4.4e-05 Score=44.49 Aligned_cols=412 Identities=10% Similarity=0.097 Sum_probs=191.3 Q ss_pred CCCH----------------HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHH Q lcl|NC_021297. 1 MTTY----------------HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~----------------~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~ 64 (480) |.+. -.-...|| .+|+.+..+++-+..+..+-..+ ++..--..+|.-- T Consensus 51 ~~~~~~~g~~~~~y~~~e~~~~~~~eLI-------~~YR~ma~~pEvd~Av~eIVnea--------Iv~~~~~~pV~l~- 114 (524) T protein:vir:98 51 LNNQKYAGVFQQFYSGQDPAIQNKEQLI-------NTYRGIMSYPEVENAVSEIIDDA--------IVNEQGKDIITMD- 114 (524) T ss_pred CCcceecceeeeeccccccccchHHHHH-------HHHHHHhhccchhhHHHhhhcce--------eEecCCCceEEEE- Confidence 1110 00011111 13333444444333332210000 0000000000000 Q ss_pred hhhccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeC Q lcl|NC_021297. 65 DRLDIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDP 144 (480) Q Consensus 65 ~~l~~~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~ 144 (480) |--..++-+-.+...+.+..|++--+|+....+..+.-.+.|+.|.+.-... 
....|-..++.++|+.+-.|..- T Consensus 115 --L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~---~~~kGI~ELr~lDPr~i~~vr~~ 189 (524) T protein:vir:98 115 --LAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHK---DESKGIRELRQLDPRCMELIRES 189 (524) T ss_pred --ecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcC---CCCcceeeeeeeCCccceeeeec Confidence 0000111111223345556666666888999999999999999988864332 22458788899999887665532 Q ss_pred CCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCccCC-c Q lcl|NC_021297. 145 RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRYG-R 221 (480) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~~G-~ 221 (480) -. .....+...+. ....+-+|.+... .|...+.. +. ++.-=+||- |.|.+.....+.-| . T Consensus 190 ~~-~~~~~~~~v~~------~~~e~f~Y~~~~~-~~~~~g~~----~~-----~~~~ikI~~dAIvy~hSGL~d~~~~ii 252 (524) T protein:vir:98 190 IT-ETLDGGVKVFR------GYREFFVYSAPKA-GYTYNGQI----YQ-----ANQKIKIPRSAIVYAHSGLEDCSNNII 252 (524) T ss_pred cc-cccccchhhcc------ceeeeeeeccCCC-ccccccce----ec-----CCCceeechhheeeeccCcccCCCCee Confidence 11 11111111110 0112233433222 11111100 00 000002221 22222211111000 1 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhh Q lcl|NC_021297. 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDI 275 (480) Q Consensus 222 s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~ 275 (480) |-|.+.++++-. -+++.|.+.+.+....|-+=++-.+.-..+..+... .+-. 
T Consensus 253 syLhkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~ms 330 (524) T protein:vir:98 253 GYLHRAVKPANQ--LRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLS 330 (524) T ss_pred eehhHhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccc Confidence 334444444322 257788888887777777665422222222211111 0111 Q ss_pred hhhhhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcc-c-cccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 276 YYGRILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSS-S-SENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 276 ~~~~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~-~~n~~Sg~Al~~~~~~l~~k~~~~ 350 (480) .....|. .+|-+.++..++...--.-++-++-....++...++|.+-+.. + +-|..-|..|-.-+.....-+.++ T Consensus 331 MlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rL 410 (524) T protein:vir:98 331 MTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTL 410 (524) T ss_pred hhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHH Confidence 1122222 1344566777776653344566677777888888898877742 1 223223346777778888889999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhcccc----CCCHHHHHHh- Q lcl|NC_021297. 351 GRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQG----PIPKEQARID- 418 (480) Q Consensus 351 ~~~~~~~l~~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~----~~s~et~~~~- 418 (480) +..|..-+.++++.=+-+.|.-...++ ..|.+.|..-..-.....++.+.. +.+...+ .+|.+++++. 
T Consensus 411 R~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~I 490 (524) T protein:vir:98 411 QIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEI 490 (524) T ss_pred HHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHH Confidence 999999999998865555554333333 346677754333333333332221 2222222 5789998765 Q ss_pred CCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 419 LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 419 lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) |..++++++++.++++++... .... ....+.++. T Consensus 491 Lr~tDeei~~~~k~I~~E~k~---~~~~-----~p~~e~~~f 524 (524) T protein:vir:98 491 LRMSDEDIDEQAKLIEEESKE---ERFK-----NPEAEEENF 524 (524) T ss_pred hccCHHHHHHHHHHHHHHHhC---CCCc-----CCccccccC Confidence 799999998887776666431 1111 001111111 No 219 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.55 E-value=4.6e-05 Score=44.36 Aligned_cols=398 Identities=14% Similarity=0.062 Sum_probs=154.7 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHH------------HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhh Q lcl|NC_021297. 1 MTTY-HEHVERLQGLLARDLPNL------------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~r~------------~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |..- ..++.+... ..+..+ ..+...+-|... ..+..+.... -++. 
.=.-.+|+..++-+ T Consensus 1 ~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~--~~g~~v~~~~-al~~--~~V~~~i~~ia~~i 72 (434) T protein:vir:43 1 MSKSLGKVLSSATS---APRSSLFGWGGKTIRLTDGAFWSQFLGRES--SSGKKVTVDK-AMKL--SAVWACVRLISTSV 72 (434) T ss_pred Cccchhhhhhhccc---ccchhhhcccccccccCchHHHHHHhcCCc--cCCceechhh-hhcc--HHHHHHHHHHHHhh Confidence 3221 111111111 111110 001111111100 0111111110 0111 00112333333322 Q ss_pred ccCcc---ccCCC----chhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEc Q lcl|NC_021297. 68 DIEGF---RISED----SEGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVES 134 (480) Q Consensus 68 ~~~~~---~~~~d----~~~~~~l~~i~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~ 134 (480) -.-++ ....+ ......+..++.. |. -..+...+..+.+.+|.+|+++... .|++ .+..++ T Consensus 73 a~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-------~G~~~~L~~l~ 145 (434) T protein:vir:43 73 AGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-------AGRPAALDFLL 145 (434) T ss_pred hhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-------CCcEEEEEEEc Confidence 22222 21211 1122345555532 32 3466777888999999999887532 3554 567889 Q ss_pred cceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccc Q lcl|NC_021297. 135 PLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) Q Consensus 135 p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~ 214 (480) |..+.+..+.. ..+. |+... ..+. ...|.+++ |+||++. . T Consensus 146 p~~v~~~~~~~--g~~~----y~~~~-~~g~---~~~~~~~e-----------------------------Vih~~~~-~ 185 (434) T protein:vir:43 146 PSRVDLECDEN--GRLK----YFYTT-KKGA---RREIERTN-----------------------------MLHIPAF-T 185 (434) T ss_pred CcceEEEEcCC--CeEE----EEEEe-cCce---EEEEcccc-----------------------------EEEecCc-C Confidence 99888777643 1111 11111 1111 01122222 2344433 2 Q ss_pred cCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcC-CCccccccccccchhhh-----hhhhhcccCC Q lcl|NC_021297. 
215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISG-VTTDELTNDGENTTLDI-----YYGRILTLAS 285 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g-~~~~~~~~~~~~~~~~~-----~~~~~~~~~g 285 (480) ..+.+|.|.+. .+.+.+.....--.-...+|. .|-.++.- ........+.-...++. ..++++.++ T Consensus 186 ~dg~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~~vl~- 260 (434) T protein:vir:43 186 LDGRIGLSAIR----YGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRSPVLE- 260 (434) T ss_pred CCCccccCHHH----HHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCccccC- Confidence 23456776553 233333332222222223333 34333321 11111100000111111 112344444 Q ss_pred CCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 286 EAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) Q Consensus 286 ~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 364 (480) ++.++.++.... -..|++..+....+|+...++|+..+|....+...+..+......+...+ -.-+-..|++.+.. T Consensus 261 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~---L~P~~~~ie~~ln~ 337 (434) T protein:vir:43 261 QGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFS---ISSITNQIQQCVNK 337 (434) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHH---HHHHHHHHHHHHHh Confidence 356777776432 22478888888999999999999998753322111223332222222111 01111111111110 Q ss_pred HHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 365 AMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTL 444 (480) Q Consensus 365 ~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~ 444 (480) .+....... ...+++.+......|..+.++++.++..+| +++...+++.+|+.+-+- -.+..-...-..++.. 
T Consensus 338 --kL~~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--gD~~~~~~n~~~~~~~ 410 (434) T protein:vir:43 338 --RLLTAPERI-RYYAEFSLEGFLKADSAGRAAWYSTMAQNG--FMTRNEGRRKENLPELPG--GDILTVQSNLVPIDQL 410 (434) T ss_pred --hcCChhhhc-CceEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeEeeccCccchhhh Confidence 111111111 133555555666778889999999998875 667777788887754321 0000000000000110 Q ss_pred hcccccCcccccCCCCCCCCccccc Q lcl|NC_021297. 445 YSTTKAQADATPKPTVTDTKTETQT 469 (480) Q Consensus 445 ~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) ....... +..+......++.+.++ T Consensus 411 ~~~~~~~-~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 411 GQSNKSQ-AVRAALMNWFSQPEPQE 434 (434) T ss_pred hccCCCc-chhhhhhccCCCCCCCC Confidence 0000000 00000011011111111 No 220 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.55 E-value=4.6e-05 Score=44.36 Aligned_cols=368 Identities=11% Similarity=0.064 Sum_probs=138.6 Q ss_pred HHHHHHHHHHHHHHHHHH----HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNL----LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~----~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) .-++.++.-......... ..+..+.-|.-.... +.. +..+.+.-...+|+..++-+-..++.+..+. . 
T Consensus 1 Mg~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~----v~~---~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~-~ 72 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQLSY----VSA---LSALQNTNVYSVINRIASDVSSAHFKTENTA-T 72 (383) T ss_pred CCcccccccccccccccccccchhhhhhhccCccccc----cch---hHhhcchHHHHHHHHHHHhhccCceeecccc-h Confidence 001100000000000000 000001111000000 000 1111111223334444443333455543322 1 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) ...+..-...-........+..+.+.+|.||+++.... ..+...+|.++.+..+. .. +.++... T Consensus 73 ~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---------~~~~p~~~~~v~~~~~~--~~-----~~~~~~~ 136 (383) T protein:vir:10 73 LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---------LEHIPNSDVQINYLPGN--MG-----IVYTVLE 136 (383) T ss_pred hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---------eeEeecCcceEEEEEcC--Cc-----eEEEEEE Confidence 11222111111345667788888999999999875311 11222333333322221 10 1111111 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecc--cccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) ..++. ...|.+++ |+||++. ....+.+|.|.+.. +...++....+ T Consensus 137 ~~~~~---~~~~~~~e-----------------------------vih~r~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~ 183 (383) T protein:vir:10 137 SNDRP---KMVLRQDQ-----------------------------MLHFRLMPDPQYRYLIGRSPLES-LQNALNLDDKA 183 (383) T ss_pred cCCce---EEEEcccc-----------------------------eEEeccCCCCcccccccccHHHH-HHHHHHHHHHH Confidence 11111 01122222 3444422 11234567776642 33333332222 Q ss_pred HHHHHHHHHHhcchhhhhc--CCCccccccccccchhhhh-----hhhhcccCCCCceEEEeccc--cHHHHHHHHHHHH Q lcl|NC_021297. 
239 LMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDIY-----YGRILTLASEAAKISEFKAA--ELRNFAEEMEVFR 309 (480) Q Consensus 239 ~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~~~~-----~~~~~~~~g~~~~~~~~~~~--~~~~~~~~l~~~~ 309 (480) ..-....+...+.|-.++. |...+....+.-...|+.. .+.++.++ ++.++..+... ..+.+.+..+... T Consensus 184 ~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~~~l~e~~~~~~ 262 (383) T protein:vir:10 184 SKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLP-DGFDYTQLEMKTDVFKALADNSAYSA 262 (383) T ss_pred HHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccC-CCceEEecCCChhHHHHHHHHHHHHH Confidence 2222222232333433333 2111110000001112111 12344443 45667666543 3443456777888 Q ss_pred HHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCC Q lcl|NC_021297. 310 KEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST 389 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~ 389 (480) .+|+...++|+..+|....+.+.+..+.... ..|...|.-+++.+.......... ..+++.+..... T Consensus 263 ~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~-----------~~~~~~l~P~~~~ie~~l~~~l~~--~~~~f~~~~l~~ 329 (383) T protein:vir:10 263 DQISKAFGVPSDILGGGTSTESQHSNIDQIK-----------ATYLANLNSYVNPIVDELRLKMNA--PDLELDIKDMLD 329 (383) T ss_pred HHHHHHhCCCHHHcCCccCCCCccccHHHHH-----------HHHHHHHHHHHHHHHHHHHHhhCC--ceEEeechhhhc Confidence 9999999999999986421111111111111 111112222222111100000000 136667777778 Q ss_pred CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccccc Q lcl|NC_021297. 390 PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQT 469 (480) Q Consensus 390 ~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) .|..+.++++.+++.+| +++...+++.+|..+-+-..+ .......++ .. 
T Consensus 330 ~d~~~~~~~~~~~~~~G--~~t~nE~R~~lg~~p~~~~d~------------------~~~~~~~~~-~~---------- 378 (383) T protein:vir:10 330 VDDSILINQVSNLAKSG--VLGAEQAQFILTRSGFLPDNL------------------PEFKPLTNE-TK---------- 378 (383) T ss_pred cCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCcccCCcc------------------cccCCCccc-CC---------- Confidence 89999999999998875 567767777776543110000 000000011 11 Q ss_pred CCCCCCC Q lcl|NC_021297. 470 SPSGFNR 476 (480) Q Consensus 470 ~p~~~~~ 476 (480) .|+++ T Consensus 379 --gGd~e 383 (383) T protein:vir:10 379 --GGDDK 383 (383) T ss_pred --CCCCC Confidence 12222 No 221 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=97.52 E-value=5.1e-05 Score=44.12 Aligned_cols=410 Identities=12% Similarity=0.003 Sum_probs=161.4 Q ss_pred CCCHHH--HHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhh---hhhccccchHHHHHHHHHhhhccCcccc- Q lcl|NC_021297. 1 MTTYHE--HVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPEL---AYLDVQPGWVATYLRTLSDRLDIEGFRI- 74 (480) Q Consensus 1 ~~~~~~--~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~---~~~~~~~n~~~~iVd~~~~~l~~~~~~~- 74 (480) |.=+-. -...|.++.............|..-+.-++..+....+++ .++.-.....+-.+++....+....+++ T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~ 80 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQ 80 (446) T ss_pred CcccccCCCchhhhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceec Confidence 211100 0111122221221122222223321111222222221111 1111112222233333222233334443 Q ss_pred CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCcee--EEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 
75 SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIP--LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 75 ~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~--~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) +.+++..+.+.+++..-.|...... +.++..||.+ .++||....+.. -.++. .+....|...-=.+|.. .+.+. T Consensus 81 p~~~~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~-~p~~~~d~~~~~~~~~~r~~~~~~-~~~~~ 157 (446) T protein:vir:98 81 HGDKRIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGARDN-MPATVLDDIVNYHPLQVMLIANDN-GRIVD 157 (446) T ss_pred CccHHHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeeccccc-ccchhhccccccccccceeeeccC-Ccccc Confidence 4556667778888876666555544 5788889986 666775321100 00000 00011111110011110 00000 Q ss_pred EEEEEEEEecCCceEEEEE-EEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHH Q lcl|NC_021297. 152 RAVRLYTTRDDVAVPDRAT-LYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) ... .+...+.. .|.+-. .+....... ......+.+ .++.--++.+.++...+.|+|.|.+..-.-. T Consensus 158 ~~~--------~~~~~~~~~~~~~~~--~~~~~~~~~-~~~~~g~~~--~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~ 224 (446) T protein:vir:98 158 GDT--------VTASQYKSGYWVPLP--PYRIGDPPK-KVDVVGSHV--RLPSHKRLFINYNTKGNNPWGTSCLTSVLDY 224 (446) T ss_pred ccc--------cchhhcccccccCcc--cchhhhhhh-hcccCcccc--cccccceEEEEecCCCCCccccchHHHHHHH Confidence 000 00000000 000000 000000000 000000001 1122233567778888889999876432111 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhc---CCCcccccccccc--------chhh----hhhhh-hcc-----cCCCCce Q lcl|NC_021297. 231 VTDAASRTLMNLQSASQILGTPLRVIS---GVTTDELTNDGEN--------TTLD----IYYGR-ILT-----LASEAAK 289 (480) Q Consensus 231 liD~~~~~~s~~~~~~~~~~~~~~~i~---g~~~~~~~~~~~~--------~~~~----~~~~~-~~~-----~~g~~~~ 289 (480) .. -=+..+-+++.-++.|+.|.++.+ |.+..+..+.+.. ..++ +.... ++. 
-+|-..+ T Consensus 225 ~~-fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie 303 (446) T protein:vir:98 225 SI-FKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVG 303 (446) T ss_pred HH-HHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEE Confidence 11 125567778888899999987765 3332222111100 0010 00000 000 1122333 Q ss_pred EEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_021297. 290 ISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWE-RAMRIAMQ 367 (480) Q Consensus 290 ~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~ 367 (480) +.+........|...++..-.+|++..--..-.+|...+...|. |+. ....-....++.-.+.+...+. ++++-++. T Consensus 304 ~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~-ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~ 382 (446) T protein:vir:98 304 ALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTG-RASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIR 382 (446) T ss_pred eeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44433333335666666666777765422211222221111221 221 1112222223333345555663 57776777 Q ss_pred HhCCCCcccc--eeeeEEecCCCCCCHHHHHHHHHHHHhccccC-CCHHHHHHhCCCChhHHHH Q lcl|NC_021297. 368 IMGREVTEEY--TRLETVWRDPSTPTVAAKADAVSKLYANGQGP-IPKEQARIDLGYTATQREQ 428 (480) Q Consensus 368 ~~~~~~~~~~--~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~-~s~et~~~~lg~~~~~~~~ 428 (480) ++......-. ..-.+.|....+.|+.+.++++.++++.|... .+.+.+++.+|+.+.+-.. 
T Consensus 383 lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 383 LNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred hCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 7654332211 11223455556788899999999999887421 1356788888875432212 No 222 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.52 E-value=5.2e-05 Score=44.07 Aligned_cols=393 Identities=11% Similarity=0.067 Sum_probs=170.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC------cch-hcCcccchhhhhhccccchHHHHHHHHHhhhccCccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR------RLK-TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~------~i~-~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) ++...+-+...+.. +. +. .++-.+-. .|. ..+.. ..-+..+. ......-++++....+....|. T Consensus 15 ~~~~~~~~~~~ia~---~~-~~---~~~~~~~~~~~~~~~iLr~~~~~-~~~y~~m~-~D~~i~s~l~~Rk~av~~~~w~ 85 (491) T protein:vir:10 15 FGEPDKSLSSQIAT---RA-RS---IDFFALGMYLPNPDPVLKALGKD-IRVYRELR-ADAHVGGCVRRRKAAVKALEWG 85 (491) T ss_pred cccCChHHHHHHHh---hh-cc---cccccccCCccchHHHHHhcCCC-HHHHHHHh-hChHHHHHHHHHHHHHhCCCcE Confidence 44433322222221 10 00 01100000 000 00000 01111221 2333444444444444455565 Q ss_pred c---CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeE---EEEEccceeEEEEeCCC Q lcl|NC_021297. 74 I---SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL---IRVESPLYMYAELDPRN 146 (480) Q Consensus 74 ~---~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~---i~~~~p~~~~~v~d~~~ 146 (480) + .++++..+.+.+++++-+|......+. ++..||.+ ++++|.. .+|... +.+++|+.+ .||+.. 
T Consensus 86 i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~------~~g~~~~~~l~~r~~~~f--~~d~~~ 156 (491) T protein:vir:10 86 LDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGK------VGNYIVPIDVVGKPADWF--VYDPEN 156 (491) T ss_pred EecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEee------cCCeeEEEEeeeecccce--eeccCC Confidence 4 234455678888888878888888775 78899987 6667742 133333 333333322 122211 Q ss_pred CCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchh Q lcl|NC_021297. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) ...|...++.. ..... .+++ ++.+.++...+.++|.+.+.. T Consensus 157 ------------------------------~l~~~~~~~~~-----~g~~l-~~~k---~i~~~~~~~~~~p~g~gLl~~ 197 (491) T protein:vir:10 157 ------------------------------QLRFRSKDHWM-----QGEEL-PARK---FLVPRQEATYLNPYGFPDLSM 197 (491) T ss_pred ------------------------------ceEEecCCCCC-----Cccee-cCCC---EEEEEecCCCCCcccchhHHH Confidence 11111111100 00000 0111 345566667777888887754 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hhhhhhhhccc-CCCCceEEEecc--ccHH Q lcl|NC_021297. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LDIYYGRILTL-ASEAAKISEFKA--AELR 299 (480) Q Consensus 227 ~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~----~~~~~~~~~~~-~g~~~~~~~~~~--~~~~ 299 (480) +--..---+..+.+++.-++.|+.|+++.+ -+....+++.... .++..+..... .|...++.+... .+.. T Consensus 198 -~~w~~~fK~~~~~~w~~f~E~yG~P~~igk--y~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~ 274 (491) T protein:vir:10 198 -CFWPTTFKKGGLKFWVQFTEKYGSPMLVGK--HPRSASDGEKNLLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSAD 274 (491) T ss_pred -HHHHHHHHHHHHHHHHHHHHHcCCCeEEEe--cCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChh Confidence 322222346678888888999999976654 2221111211111 22222223333 345666766542 3455 Q ss_pred HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 
300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) .|...++..-.+|++..- -+.+... + .++.|+.. ...-....++.-.+.+...+.++++-++.++..... T Consensus 275 ~y~~li~~~d~~Isk~iL--GqtlTt~--~-~gs~a~~~vh~~v~~di~~~D~~~i~~tln~li~~l~~~N~~~~~---- 345 (491) T protein:vir:10 275 VYERLLHFCRGEVSIALL--GQNQTTE--A-TSTRASAQAGLEVTDDIRDGDKAVVSEAMNMLIRWICDLNFDGAD---- 345 (491) T ss_pred HHHHHHHHHHHHHHHHHh--hhhcccC--c-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC---- Confidence 676666655555555431 0111111 1 11122211 122222233333455566677777777777654322 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCC Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) ...+.|..+. ......++.+.+++..|. .++.+.+.+.+|+...+..+. .. ..... ...+.. T Consensus 346 ~p~f~~~~~~-e~~~~~a~~~~~L~~~G~-~i~~~~i~e~~Gip~~~~~~~----------~~-----~~~~~-~~~~~~ 407 (491) T protein:vir:10 346 RPVFDMWEQE-QVDEIQAGRDQKLTQAGA-RFTPAYFKRAYNLQDGDLDER----------PL-----PVSAV-DTVGAA 407 (491) T ss_pred cceEEecCcC-chhHHHHHHHHHHHhCCC-cCCHHHHHHHhCCCCCCcCcc----------cc-----ccCCC-CCcccc Confidence 2455665443 333567899999998874 578999999999865432110 00 00000 000000 Q ss_pred CCCCCCcccccCCC-CCCCCCCC Q lcl|NC_021297. 459 TVTDTKTETQTSPS-GFNRTKTR 480 (480) Q Consensus 459 ~~~d~~~~~~~~p~-~~~~~~~~ 480 (480) ......+..+..+. 
-.+....| T Consensus 408 ~~~~~~~~~~~~~d~~~~~~~~~ 430 (491) T protein:vir:10 408 SFAEFEAPDQDALDAALNTLSAR 430 (491) T ss_pred cccccCCCCCCchHHHHHHHHHH Confidence 00000000000000 00000001 No 223 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.49 E-value=5.7e-05 Score=43.84 Aligned_cols=406 Identities=10% Similarity=0.051 Sum_probs=166.9 Q ss_pred CCCHHHHHHH-----HHHHHHHHHHHHHHHHHHhcccCcchhcCcc-------------cc---hhhhhhccccchHHHH Q lcl|NC_021297. 1 MTTYHEHVER-----LQGLLARDLPNLLEAEAYRNGTRRLKTIGIG-------------AP---PELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~~~~~i~~-----l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~-------------~~---~~~~~~~~~~n~~~~i 59 (480) |.-+-+.--+ .+.+.. .+++..+.+-+.+ |+...+... .. +-+.++--......-+ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~--~~~~~~~~~~~~~-~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~ 77 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQ--TSRLAGLAKEFAQ-HPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAE 77 (526) T ss_pred CCeeeCCCCCccCccccchhh--hhhhhhhhhhccc-CCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHH Confidence 1111110000 000000 0111111111111 111100000 00 0000000011122222 Q ss_pred HHHHHhhhccCcccc--C-CC----chhHHHHHHHHHhc-CHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeE- Q lcl|NC_021297. 60 LRTLSDRLDIEGFRI--S-ED----SEGLEELWNWWQAN-DLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL- 129 (480) Q Consensus 60 Vd~~~~~l~~~~~~~--~-~d----~~~~~~l~~i~~~n-~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~- 129 (480) +.+-...+....+++ + ++ .+..+.+.+++..- +|......+ .+|.-||.+ ++++|... +|... 
T Consensus 78 l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~-ldA~~~G~s~~Ei~w~~~------~g~~~~ 150 (526) T protein:vir:79 78 MSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDA-LDGIGHGYSCIELEWALQ------GREWMP 150 (526) T ss_pred HHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHH-HhhhhhcceeEEEEEeec------CCceeE Confidence 222222233334443 1 12 23344566767553 577766664 458889986 66666431 23222 Q ss_pred --EEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceE Q lcl|NC_021297. 130 --IRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVV 207 (480) Q Consensus 130 --i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv 207 (480) +.+.+|+.+. ||+.... ...+.. ... ..... .+++ ++ T Consensus 151 ~~l~~r~~~~F~--~~~~~~~----------------------------~l~~~~-~~~------~g~~l-~~~k---~i 189 (526) T protein:vir:79 151 LAFHHRPQSWFQ--LNPEDQN----------------------------ELRLRD-NSP------AGEAL-QPFG---WI 189 (526) T ss_pred EEeeeecccceE--eccCCCc----------------------------EEEecC-CCC------Cceee-cCCc---eE Confidence 2223332111 2211110 000100 000 00001 1112 23 Q ss_pred EeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hhhhhhhhccc Q lcl|NC_021297. 208 PLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LDIYYGRILTL 283 (480) Q Consensus 208 ~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~----~~~~~~~~~~~ 283 (480) .+.++...+.++|.+.+..-.-..+ -=+..+.+++.-++.|+.|+++.+ -+....+++.... 
..++.+....+ T Consensus 190 v~~~~~~~g~p~g~gLlr~~~w~~~-fK~~~~~~w~~F~E~yG~P~~igk--y~~~a~~~ek~~L~~av~~i~~da~~ii 266 (526) T protein:vir:79 190 IHRPRARSGYVARSGLFRVLAWPYL-FRHYATSDLAEMLEIYGLPIRLGK--YPPGTADEEKATLLRAVTGLGHAAAGII 266 (526) T ss_pred EEeecCCcCCccccchHHHHHHHHH-HHHhhHHHHHHHHHHcCCceEEEe--cCCCCCHHHHHHHHHHHHHHhcCcEEEe Confidence 3455666677888877643211111 124577888888899999977665 2111111111111 22233333333 Q ss_pred -CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccc-cccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_021297. 284 -ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS-SENPASAEAIIA-TDSRIVKMAERKGRIFGGAWE- 359 (480) Q Consensus 284 -~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~-~~n~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~- 359 (480) .|...+|.+....+...|...++..-.+|++..- -+.+... +.+..+.-|+.- ...-....+..-.+.+...|. T Consensus 267 P~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL--GqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~ 344 (526) T protein:vir:79 267 PETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVL--GGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSR 344 (526) T ss_pred cCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHh--hhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3456677776555666777777777777766630 0111110 001111123321 222233333344455666674 Q ss_pred HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021297. 360 RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 360 ~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~ 439 (480) ++++.++.++...........++.|....+.++.+.++.+.+++..|. .++.+.+.+.+|+......+ . 
T Consensus 345 ~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~-~i~~~~i~e~~gip~~~~~e----------~ 413 (526) T protein:vir:79 345 DLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGL-EIPSAWVYDKLGIPQPAKNE----------P 413 (526) T ss_pred HHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCC-cCCHHHHHHHhCCCCCCCch----------h Confidence 477777777654332222346778888888999999999999998874 47999999999985432111 0 Q ss_pred HHHHHhcccccCcccccCCCCCCCCcccccCCCCC-CCCCC-----------C Q lcl|NC_021297. 440 MIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGF-NRTKT-----------R 480 (480) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~-~~~~~-----------~ 480 (480) ..... .++.+.+..+.........+.+. ...+. + T Consensus 414 ~l~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~ 459 (526) T protein:vir:79 414 VLRPA-------AQPAILSRQHGQRVAALATIVGPRYGDQQALDKALADLPAK 459 (526) T ss_pred hcccc-------CCccccccccccccccccccccccCchhhHHHHHHHHHHHH Confidence 00000 00000000000000000000000 00000 0 No 224 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=97.48 E-value=5.8e-05 Score=43.81 Aligned_cols=390 Identities=14% Similarity=0.100 Sum_probs=154.6 Q ss_pred HHHHHHHHHHHHHHHHHHhcccCcchhcC--c--ccchhhhhh-------ccc--------cchHHHHHHHHHhhhccCc Q lcl|NC_021297. 11 LQGLLARDLPNLLEAEAYRNGTRRLKTIG--I--GAPPELAYL-------DVQ--------PGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 11 l~~~~~~~~~r~~~~~~YY~g~~~i~~~~--~--~~~~~~~~~-------~~~--------~n~~~~iVd~~~~~l~~~~ 71 (480) |-++. ...-..+....+........-+ . .+....... -.. 
+.-...+|+..++-+-.-+ T Consensus 1 ~~~~~--~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp 78 (432) T protein:vir:81 1 MPDEK--KLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMP 78 (432) T ss_pred CCchh--hcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCc Confidence 11111 1122222333433221110000 0 000000000 000 0111223333333222223 Q ss_pred ccc---CCCc--h-hHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeE Q lcl|NC_021297. 72 FRI---SEDS--E-GLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMY 139 (480) Q Consensus 72 ~~~---~~d~--~-~~~~l~~i~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~ 139 (480) +.+ ..+. + ....+..++.. |. -..+...+..+.+.+|.||+++... +|++ .+.+++|..+. T Consensus 79 ~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-------~g~~~~L~~l~~~~v~ 151 (432) T protein:vir:81 79 LTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-------DGRIESLQYLANDRLT 151 (432) T ss_pred eeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEcCCceE Confidence 321 1111 1 12334455532 32 2356677888899999999887642 2443 56788999888 Q ss_pred EEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccC Q lcl|NC_021297. 140 AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRY 219 (480) Q Consensus 140 ~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~ 219 (480) +..++.. ++. |.....+ +.. ..+.++. |+||++.. ..+.+ T Consensus 152 v~~~~~g--~~~----y~~~~~~-g~~---~~~~~~~-----------------------------iih~r~~~-~dg~~ 191 (432) T protein:vir:81 152 ITTDPKG--NTA----YRYRRTD-GQM---IDIPKQQ-----------------------------IWKIMGYS-LDGEN 191 (432) T ss_pred EEECCCC--cEE----EEEEecC-ceE---EEEcccc-----------------------------EEEecCCC-CCCcc Confidence 8876532 111 1111111 111 1112222 23343321 22356 Q ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHhcc---hhhhhcCCCccccccccc---cchhh--hhhhhhcccCCCCceEE Q lcl|NC_021297. 
220 GRSEISPELRKVTDAASRTLMNLQSASQILGT---PLRVISGVTTDELTNDGE---NTTLD--IYYGRILTLASEAAKIS 291 (480) Q Consensus 220 G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~---~~~~i~g~~~~~~~~~~~---~~~~~--~~~~~~~~~~g~~~~~~ 291 (480) |.|.|. .+.+++...+.--.....+|.+ |-.++. .+.. ..++.. ...+. ...++++.++ ++.++. T Consensus 192 G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~-~~~~-l~~e~~~~~~~~~~~~~nag~~~vl~-~g~~~~ 264 (432) T protein:vir:81 192 GLSAIR----YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQ-IDRF-LTDDQYDSFAKKVSGSVEAGRAPLLE-GGMDVK 264 (432) T ss_pred cccHHH----HHHHHHHHHHHHHHHHHHHHhcCCCcceEEe-cCCC-CCHHHHHHHHHHHhhhhcCCCceecC-CCceEE Confidence 776553 3344444333332233333333 322222 1111 111110 11111 1123455554 345777 Q ss_pred Eecccc-HHHHHHHHHHHHHHHHhccCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021297. 292 EFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIM 369 (480) Q Consensus 292 ~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~ 369 (480) ++.... -..+++..+..+.+|+...++|+..+|....+ .+.+..++.....+...+- .-+-..|++.+.. .+. T Consensus 265 ~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl---~P~~~~ie~~l~~--kLl 339 (432) T protein:vir:81 265 SLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTL---SPWLRRIEQSIAL--NLL 339 (432) T ss_pred EccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHH---HHHHHHHHHHHHh--hcc Confidence 765432 23477888889999999999999999854322 1222333332222222110 1111112211111 121 Q ss_pred CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHH-HHHHHHHHHHHHHHHHHhccc Q lcl|NC_021297. 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQRE-QMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 370 ~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 448 (480) ..... ....+++.+......|..+.++++.+++++| +++...+++++|+.+-+-. ...-+.. .-..++.. 
T Consensus 340 ~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~t~NE~R~~~glpp~~g~~~~~~~~~--~~~pl~~~---- 410 (432) T protein:vir:81 340 SPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNG--LMTRDEAREIEGLPKLGGNAAVLTVQS--AMVPLDSI---- 410 (432) T ss_pred Ccccc-CceEEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCCcceEeecC--cccchhhh---- Confidence 11111 1123444444556678889999999998875 6777778888877542100 0000000 00000000 Q ss_pred ccCcccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 449 KAQADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) +.+..+++..+++ .+++... +. T Consensus 411 --~~~~~~~~~~~~~-n~~~~~~-----~~ 432 (432) T protein:vir:81 411 --GLQASPEPASGLG-NQQQDKV-----SK 432 (432) T ss_pred --ccCCCCCCCCCCC-Ccccccc-----cC Confidence 0011111111110 1111111 11 No 225 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=97.48 E-value=5.9e-05 Score=43.78 Aligned_cols=369 Identities=15% Similarity=0.165 Sum_probs=139.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC----- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS----- 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~----- 75 (480) |- ...|+..-+. . -...+.++ .++.......+.........+.-...+|+..++-+-..++.+. 
T Consensus 1 mg-~~~~~~~~~~---~---~~~~~~~~----~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~ 69 (403) T protein:vir:10 1 MG-FKSWITEKLN---P---GQRIIRDM----EPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNI 69 (403) T ss_pred Cc-chhhhhhccc---h---hhhhhhcc----cccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccc Confidence 44 3333322111 0 00001110 1111111111100001001111111223222222222222211 Q ss_pred ---CCchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCC Q lcl|NC_021297. 76 ---EDSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNT 147 (480) Q Consensus 76 ---~d~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~ 147 (480) .+......+..++.. | ....+...+..+++.+|.||+++.. . .+..++|..+.+..+.. T Consensus 70 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~----------~-~l~~l~~~~~~v~~~~~-- 136 (403) T protein:vir:10 70 VTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG----------T-SLYHVPAALMQVEADAN-- 136 (403) T ss_pred cccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC----------c-eeEeecCcceEEEEcCC-- Confidence 111112334555543 3 2245667788889999999976531 1 24456665554433321 Q ss_pred CeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeeccc----ccCccCCccc Q lcl|NC_021297. 148 RRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP----RLGNRYGRSE 223 (480) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~----~~~~~~G~s~ 223 (480) . .. +.|. . ..+ ..|..++ |+||.... ...+.+|.|. T Consensus 137 ~-~~---~~~~-~-~~~-----~~~~~~e-----------------------------iih~~~~~~~~~~~~~~~G~s~ 176 (403) T protein:vir:10 137 K-FI---KKFI-F-NNQ-----INYRVDE-----------------------------IIFIKDNSYVCGTNSQISGQSR 176 (403) T ss_pred c-eE---EEEE-e-cCc-----eeecccc-----------------------------eEEecccccccCCCCCcccccH Confidence 1 10 0110 0 000 0011111 22222111 1144567776 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCC-Cccccccccccchhhh------hhhhhcccCCCCceEEEe Q lcl|NC_021297. 
224 ISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEF 293 (480) Q Consensus 224 i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~ 293 (480) +. .+.+.++....-..-...+|. .|-.++... ...+...+.-...|+. ..++++.+++ +.++..+ T Consensus 177 i~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~~ 251 (403) T protein:vir:10 177 VA----TVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERKQEELQLDYNPSTGQSSVLILDG-GMKAKPY 251 (403) T ss_pred HH----HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhCCcccCcceeecCC-CceeEEe Confidence 53 333444333222222223333 343344311 1111000000111211 1123444443 3556555 Q ss_pred cc-ccH--HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021297. 294 KA-AEL--RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) Q Consensus 294 ~~-~~~--~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~ 370 (480) +. .+. ..|++..+....+|+...++|+..+|....++.+...+.+....|.-.+...+..+...| + T Consensus 252 ~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~L-----------~ 320 (403) T protein:vir:10 252 SQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRPNIELFYYMTIIPMLNKLTSSLTFFF-----------G 320 (403) T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------C Confidence 42 111 246788888899999999999999974322222222233333333333333332222211 1 Q ss_pred CCCcccceeeeEEecC--CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_021297. 371 REVTEEYTRLETVWRD--PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 371 ~~~~~~~~~i~v~f~~--~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ..+.+.+.. ....+..+.++++.+++..| +++...+++.+|+.+-+.+.. +...-.. 
T Consensus 321 -------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G--~lT~NE~R~~~gl~pi~~~~~------------d~~~~p~ 379 (403) T protein:vir:10 321 -------YKITPNTKEVAALTPDKEAEAKHLTSLVNNG--IITGNEARSELNLEPLDDEQM------------NKIRIPA 379 (403) T ss_pred -------ceeeeccchhhhcccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCcccc------------ccccccc Confidence 123333332 23456778888888888765 667777888888765321111 1111111 Q ss_pred cc--CcccccCCCCCCCCcccccC Q lcl|NC_021297. 449 KA--QADATPKPTVTDTKTETQTS 470 (480) Q Consensus 449 ~~--~~~~~~~~~~~d~~~~~~~~ 470 (480) .. .......+++++..+.+++. T Consensus 380 n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 380 NVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred ccccccccCCCCcCCCCCCCcCCC Confidence 00 00111111111111111111 No 226 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=97.48 E-value=5.9e-05 Score=43.78 Aligned_cols=389 Identities=12% Similarity=0.005 Sum_probs=156.1 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcc---ccCCCc--- Q lcl|NC_021297. 6 EHVERLQGLLARD-LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGF---RISEDS--- 78 (480) Q Consensus 6 ~~i~~l~~~~~~~-~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~---~~~~d~--- 78 (480) =++..+.+..... ......+...+-+... ...+..+... .-+.+.....+|+..++-+-.-++ ...++. 
T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~-~~~g~~v~~~---~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~ 76 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYD-TYTGKRISSQ---RAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTR 76 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcc-cccCceechh---hhhccHHHHHHHHHHHHhhhhCceEEEEecCCccee Confidence 0111111110000 0000111111111100 0011111111 001112223334444433322222 222211 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+..++.. | ....+...+..+.+.+|.||+++..+ .|++ -+.+++|..+.+..+... .+. T Consensus 77 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-------~g~~~~L~~l~~~~v~~~~~~~~--~~~- 146 (413) T protein:vir:48 77 VVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-------LGEVVELLPIDPGCVEPKLNSQW--QPV- 146 (413) T ss_pred ecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-------CCcEEEEEEEcCceEEEEEcCCc--eEE- Confidence 112335555532 2 34567788889999999999988642 2444 477889998888876432 121 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) |+ .....+.. ..|.+++++ ||.+.. ..+++|.|.+.. +...+ T Consensus 147 ---y~-~~~~~g~~---~~~~~~evi-----------------------------h~~~~~-~d~~~G~s~i~~-~~~~i 188 (413) T protein:vir:48 147 ---YQ-VTFPDGSV---DVLTQDEIW-----------------------------HVRTLT-LDGLVGLNPIAY-AREAI 188 (413) T ss_pred ---EE-EEecCceE---EEEccccEE-----------------------------EecCcC-CCCcccccHHHH-HHHHH Confidence 11 11111111 123333333 333221 233567776532 33333 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCc-cccccccccchhhh------hhhhhcccCCCCceEEEecccc-HHHHHHH Q lcl|NC_021297. 
233 DAASRTLMNLQSASQILGTPLRVISGVTT-DELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEE 304 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~g~~~-~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~ 304 (480) +.......-....+...+.|-.++..... .....+.-...|+- ..++++.++ ++.++.++.... -..+++. T Consensus 189 ~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~ 267 (413) T protein:vir:48 189 SLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILE-MGLDWKSMALNAEDSQFLET 267 (413) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEeccCChhHHHHHHH Confidence 32222221222222222334444432111 11000111111211 112344443 346777765432 2247788 Q ss_pred HHHHHHHHHhccCCChHHhccccc-cchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSE-NPASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~ 381 (480) .+....+|+...++|+..+|.... +.++..... +....|.-.+. .+++.+.. .+...... ....++ T Consensus 268 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~--------~ie~~l~~--~L~~~~~~-~~~~~~ 336 (413) T protein:vir:48 268 RKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLT--------RIEQRINT--GLVRESKQ-GKFYAK 336 (413) T ss_pred HHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHH--------HHHHHHHh--hccCcccc-CCeEEE Confidence 888899999999999999985432 212222111 11111111111 11111110 11111111 123455 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......|..+.++++.+++++| +++...+++++|+.+-+- - +...-.. 
+..+.+..+ T Consensus 337 fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~g~~p~~g--g------------D~~~~~~----n~~~~~~~~ 396 (413) T protein:vir:48 337 FNAGALLRGDMKSRFEAYATGINWG--IYSPNDCRDLEDMNPRPG--G------------DVYLTPM----NMTTSPSAG 396 (413) T ss_pred EechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------ceeeccc----ccccccccc Confidence 5556666678888999999988775 566666777777754320 0 0000000 000111111 Q ss_pred CCCcccccCCCCCCCCCC Q lcl|NC_021297. 462 DTKTETQTSPSGFNRTKT 479 (480) Q Consensus 462 d~~~~~~~~p~~~~~~~~ 479 (480) +. .+.....++.+.++| T Consensus 397 ~~-~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 397 DD-NGKKKESGDADKTAS 413 (413) T ss_pred cc-CCCCCCCCCccccCC Confidence 11 111111223334444 No 227 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.48 E-value=5.9e-05 Score=43.76 Aligned_cols=411 Identities=9% Similarity=0.041 Sum_probs=167.6 Q ss_pred CCCHHHHHHHH-----HHHHHHHHHHHHHHHHHhcccCcchhcCccc-------------ch---hhhhhccccchHHHH Q lcl|NC_021297. 1 MTTYHEHVERL-----QGLLARDLPNLLEAEAYRNGTRRLKTIGIGA-------------PP---ELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~~~~~i~~l-----~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~-------------~~---~~~~~~~~~n~~~~i 59 (480) |..+-+.--+- +.+. ...++.-..+-+.+ |+...+.... .. -+..+--......-+ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~--~~~~~~~~~~~~~~-~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~ 77 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREP--QTSRLAGLAKEFAQ-HPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAE 77 (526) T ss_pred CCeeECCCCCccccccccch--hhhhhhhhhhhhcc-cCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHH Confidence 11111100000 0000 00111111111111 1111000000 00 000000001111112 Q ss_pred HHHHHhhhccCcccc--C-C----CchhHHHHHHHHHhc-CHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeE- Q lcl|NC_021297. 
60 LRTLSDRLDIEGFRI--S-E----DSEGLEELWNWWQAN-DLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL- 129 (480) Q Consensus 60 Vd~~~~~l~~~~~~~--~-~----d~~~~~~l~~i~~~n-~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~- 129 (480) +.+-...+....+.+ + + +.+..+.+.+++..- +|......+. +|.-||.+ ++++|... +|... T Consensus 78 l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~------~g~~~~ 150 (526) T protein:vir:99 78 MSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQ------GREWMP 150 (526) T ss_pred HHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeec------CCceeE Confidence 222222223333433 1 1 223345667777553 5777776654 68889986 66667431 23222 Q ss_pred --EEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceE Q lcl|NC_021297. 130 --IRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVV 207 (480) Q Consensus 130 --i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv 207 (480) +.+.+|+.+ .||+.... ...++. +.. ..... .+++ ++ T Consensus 151 ~~l~~r~~~~f--~~~~~~~~----------------------------~l~~~~-~~~------~g~~l-~~~k---~i 189 (526) T protein:vir:99 151 LAFHHRPQSWF--QLNPEDQN----------------------------ELRLRD-NSP------AGEAL-QPFG---WI 189 (526) T ss_pred EEeeeecccce--eeccCCCc----------------------------EEEecC-CCC------Cceee-cCCC---eE Confidence 223333211 11211110 001100 000 00001 1122 23 Q ss_pred EeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hhhhhhhhccc Q lcl|NC_021297. 208 PLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LDIYYGRILTL 283 (480) Q Consensus 208 ~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~----~~~~~~~~~~~ 283 (480) .+.++...+.++|.+.+..-.-..+ -=+..+.+++.-++.|+.|.++.+ -+....++..... 
..++.+....+ T Consensus 190 ~~~~~~~~g~p~g~gLlr~~~w~~~-fK~~~~~~w~~f~E~yG~P~~igk--y~~~a~~~ek~~L~~av~~i~~d~~~ii 266 (526) T protein:vir:99 190 IHRPRARSGYVARSGLFRVLAWPYL-FRHYATSDLAEMLEIYGLPIRLGK--YPPGTADEEKATLLRAVTGLGHAAAGII 266 (526) T ss_pred EEeecCCcCCccccchHHHHHHHHH-HHHhhHHHHHHHHHHcCCceEEEe--cCCCCCHHHHHHHHHHHHHHhhCcEEEe Confidence 3455666777888887643211111 124577888888899999977665 1111111111111 22233233333 Q ss_pred -CCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhcccc-ccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_021297. 284 -ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-ENPASAEAII-ATDSRIVKMAERKGRIFGGAWE- 359 (480) Q Consensus 284 -~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~n~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~- 359 (480) .|...+|.+....+...|...++..-.+|++..- -+.+.... .+..+..|+. ....-....+..-.+.+...|. T Consensus 267 P~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL--GqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~ 344 (526) T protein:vir:99 267 PETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVL--GGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSR 344 (526) T ss_pred cCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHh--hhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3456677776555666777777777777766630 01111100 0111122332 1222233334444455666774 Q ss_pred HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021297. 360 RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 360 ~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~ 439 (480) ++++.++.++...........++.|....+.++.+.++.+.+++..|. .++.+.+.+.+|+......+ . 
T Consensus 345 ~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~-~i~~~~i~e~~Gip~~~~~e----------~ 413 (526) T protein:vir:99 345 DLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGL-EIPSAWVYDKLGIPQPAKNE----------P 413 (526) T ss_pred HHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCC-ccCHHHHHHHhCCCCCCCcc----------c Confidence 477777777654432223456778888889999999999999998873 48999999999985442111 0 Q ss_pred HHHHHhcccccCcccccCCCCCCC-CcccccCCCCCCC---CCCC Q lcl|NC_021297. 440 MIDTLYSTTKAQADATPKPTVTDT-KTETQTSPSGFNR---TKTR 480 (480) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~d~-~~~~~~~p~~~~~---~~~~ 480 (480) .+.... .+......+....... ++.....|....- -..+ T Consensus 414 ~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~ 456 (526) T protein:vir:99 414 VLRSAA--QPAILSRQHGQRVAALATIVGPRYGDQQALDKALADL 456 (526) T ss_pred ccCCCC--CCcccccccccccccccccccccCcchhhHHHHHHHH Confidence 000000 0000000000000000 0000000000000 0000 No 228 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=97.39 E-value=7.8e-05 Score=43.11 Aligned_cols=377 Identities=12% Similarity=0.085 Sum_probs=138.8 Q ss_pred HHHhcccCcchh--cCc-----ccchhhh-------hhccccchHHHHHHHHHhhhccCcc---ccCCCchh-HHHHHHH Q lcl|NC_021297. 26 EAYRNGTRRLKT--IGI-----GAPPELA-------YLDVQPGWVATYLRTLSDRLDIEGF---RISEDSEG-LEELWNW 87 (480) Q Consensus 26 ~~YY~g~~~i~~--~~~-----~~~~~~~-------~~~~~~n~~~~iVd~~~~~l~~~~~---~~~~d~~~-~~~l~~i 87 (480) -++|.+...... ... ...+..+ -++... .-.+|+..++-+-.-++ +...++.. 
...+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~~--V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~l 78 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNSD--VLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYL 78 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccHH--HHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHH Confidence 111111111100 000 0000000 111110 11234433333322233 22222221 2234444 Q ss_pred HHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEec Q lcl|NC_021297. 88 WQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD 161 (480) Q Consensus 88 ~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~ 161 (480) +.. | ....+...+..+.+.+|.||+++.++.. .|.+ .+..++|..+.+..++.. .+ +|+.... T Consensus 79 L~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~-----g~~~~~l~~l~p~~v~v~~~~~~--~~----~y~~~~~ 147 (417) T protein:vir:38 79 MNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPI-----TNEPAMFEFYAPSQTQVDTSDPD--NI----IYRFTPY 147 (417) T ss_pred HhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-----CCEEEEEEEeCCceEEEEEcCCC--eE----EEEEEEc Confidence 432 3 2345667788889999999999865321 2333 366788888876554321 11 1111111 Q ss_pred CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHH Q lcl|NC_021297. 162 DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMN 241 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~ 241 (480) +++. ..++..+. |+||.+.. ..+.+|.|.+. .+.+.+....+- T Consensus 148 ~~~~---~~~~~~~d-----------------------------viH~r~~~-~d~~~G~s~l~----~~~~~i~~~~~~ 190 (417) T protein:vir:38 148 NSSM---QKVCGFED-----------------------------VIHWKFFS-YDTIMGRSPLL----SLGDEIGLQESG 190 (417) T ss_pred CCcE---EEEecCcc-----------------------------eEEecCCC-CCCccccCHHH----HHHHHHHHHHHH Confidence 1110 01111122 34555432 23456777653 233333322222 Q ss_pred HHHHHHHhc---chhhhhcC-CCccccccccccchhhh-----hhhhhcccCCCCceEEEecccc-HHHHHHHHHHHHHH Q lcl|NC_021297. 
242 LQSASQILG---TPLRVISG-VTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKE 311 (480) Q Consensus 242 ~~~~~~~~~---~~~~~i~g-~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~ 311 (480) ..-...+|. .|-.++.- ........+.-...|+. ..+.++.++ ++.++.+++... -..|++..+..+.+ T Consensus 191 ~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~~~~ 269 (417) T protein:vir:38 191 VSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVD-ATMDYQPLEVDTNVLNLINSNNYSTAQ 269 (417) T ss_pred HHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecc-CCceEEEccCCHHHHHHHHHHHhhHHH Confidence 222222333 34333321 11111100100111111 112344443 346776665332 22477888888999 Q ss_pred HHhccCCChHHhccccccchHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCC Q lcl|NC_021297. 312 AASITGLPPQYLSSSSENPASAEAI--IATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST 389 (480) Q Consensus 312 i~~~~~~p~~~~g~~~~n~~Sg~Al--~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~ 389 (480) |+...++|+..+|..+.+ +|...+ .+....|.-.+...+..+.. .+..... .....+.|... . T Consensus 270 Ia~~fgVPp~~lg~~~~~-s~~e~~~~~~~~~tl~P~~~~ie~~l~~----------~Ll~~~~---~~~~~~~fd~~-~ 334 (417) T protein:vir:38 270 IAKALRVPAYRLAQNSPN-QSVKQLADDYIRNDLPFYFEPITSEFEL----------KLLDDAQ---RHQYCIGFDTK-S 334 (417) T ss_pred HHHHhCCCHHHhCCCCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHh----------hhcChhh---cccceEEechh-h Confidence 999999999999854322 222211 11111222222211111111 1111111 12234555211 1 Q ss_pred CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH---HHHHH-HHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 390 PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ---REQMR-DWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 390 ~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) ..... ..++.+++.+| +++...+++.+|+.+-+ .+++. ...- ...+... ..+. .......+++... 
T Consensus 335 l~~~~-~~~~~~~~~~G--~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~----~~~d~~~-~~~~--~~~~~~kgg~~~~ 404 (417) T protein:vir:38 335 VNGLP-IADVNTAVNGG--LWTGNEGRAELGKKPLKDPNMDRIQSTLNT----VFLDQKE-AYQA--EHAAELKGGDTNA 404 (417) T ss_pred hhHHH-HHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeeeecccc----ccccccc-cccc--ccccccCCCCCCC Confidence 11122 22355566554 66777778887775321 11110 0000 0000000 0000 0111111122111 Q ss_pred ccccCCCCCCCCC Q lcl|NC_021297. 466 ETQTSPSGFNRTK 478 (480) Q Consensus 466 ~~~~~p~~~~~~~ 478 (480) +.....+|.|..| T Consensus 405 ~~~~~~~~~~~~~ 417 (417) T protein:vir:38 405 KGNQNGSGTNANS 417 (417) T ss_pred CCCCcCCCCcCCC Confidence 2222222333333 No 229 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.32 E-value=9.7e-05 Score=42.59 Aligned_cols=395 Identities=11% Similarity=0.034 Sum_probs=153.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh-cCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~ 78 (480) |..+ .++.++-+.+.... +.+-..+-+.... .+.....-..+.-....-...+|+..+.-+-.-++.+- ..+ T Consensus 1 ~~~~-~~~~~~k~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~ 74 (409) T protein:vir:96 1 MAKE-NIVTRIKKKLIDNW-----IDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK 74 (409) T ss_pred Cccc-cchhhhhhHHhhhh-----hccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeeccc Confidence 5552 23333222211100 0000001111110 00000000001111112223333333333222233221 112 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 
79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+.+++.. | .-..+...++.+.+.+|.||+++-++ ..|++ .+..++|..+.++.++... .+ T Consensus 75 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~~-~~-- 145 (409) T protein:vir:96 75 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQSR-EL-- 145 (409) T ss_pred ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCCc-EE-- Confidence 223345555532 3 23455677888999999999988653 34443 5778889888887764321 11 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) +|......+.. ..|.++. |+||++.....+.+|.|.+.- +...+ T Consensus 146 ---~y~~~~~~g~~---~~~~~~e-----------------------------vih~r~~~~~~~~~G~s~l~~-~~~~i 189 (409) T protein:vir:96 146 ---YYSIHAATGNK---LIVHNMD-----------------------------MLHFKHIVASNMVQGISPIDV-LKNTT 189 (409) T ss_pred ---EEEEEcCCceE---EEEcccc-----------------------------EEEeCCCCCCCccccccHHHH-HHHHH Confidence 11111111110 0122222 455554333455678876632 33333 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhh---hhhhhhcccCCCCceEEEecccc-HHHHHHHHH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAE-LRNFAEEME 306 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~ 306 (480) +..+. .... .+..+..+-.++. +........+.-...|+ ...++++.++ ++.++.+++... 
-..+++..+ T Consensus 190 ~~~~~-~~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~ 265 (409) T protein:vir:96 190 DFDNA-VRTF--NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASEN 265 (409) T ss_pred HHHHH-HHHH--HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHH Confidence 32211 1111 1222222211221 22111111111011111 1122344443 446777665432 224777778 Q ss_pred HHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecC Q lcl|NC_021297. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRD 386 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~ 386 (480) ....+|+...++|+.-+|....+.-| .+......+...+ -.-+-..++..+. ..+...........+++.... T Consensus 266 ~~~~~Ia~~fgVPp~~lg~~~~~~~s--~~e~~~~~f~~~~---l~P~~~~ie~~l~--~~Ll~~~~~~~g~~i~fd~~~ 338 (409) T protein:vir:96 266 LTRERVANVFQLPSIFLNARSNTNFA--KNEELNRFYLQHT---LLPIVKQYEEEFN--RKLLTKTDREKNRYFKFNVKS 338 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHHH---HHHHHHHHHHHHH--hhcCCcccccCcceEEeechh Confidence 88899999999999999854322111 1111111111111 0001111111111 011111111112234444445 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcc Q lcl|NC_021297. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTE 466 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) -.-.+..++++++.+++.+| +++.-.+++.+|+.+-+- -.+.. 
+...........+......+++ +.+ T Consensus 339 ll~~d~~~~~e~~~~~~~~G--~~T~NE~R~~~g~~pi~g--gD~~~-------~~~n~~~~~~~~~~~~~~~gG~-~n~ 406 (409) T protein:vir:96 339 YLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVEG--GDKPL-------ISGDLYPIDTPLELRKSLKGGD-KNV 406 (409) T ss_pred hhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCC--cceee-------ecccccccccchhhcccccCCC-CCc Confidence 55678888999999998875 567777788887754320 00000 0000000000000000011111 111 Q ss_pred ccc Q lcl|NC_021297. 467 TQT 469 (480) Q Consensus 467 ~~~ 469 (480) +++ T Consensus 407 ~e~ 409 (409) T protein:vir:96 407 NES 409 (409) T ss_pred CCC Confidence 111 No 230 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=97.29 E-value=0.00011 Score=42.39 Aligned_cols=350 Identities=17% Similarity=0.144 Sum_probs=133.5 Q ss_pred HHHHHHHH-----HHH---HHHHHHhcccCcchhc-Ccccchhh---------------------------hhhccccch Q lcl|NC_021297. 12 QGLLARDL-----PNL---LEAEAYRNGTRRLKTI-GIGAPPEL---------------------------AYLDVQPGW 55 (480) Q Consensus 12 ~~~~~~~~-----~r~---~~~~~YY~g~~~i~~~-~~~~~~~~---------------------------~~~~~~~n~ 55 (480) ...|..-. +++ .-..+|.-|+.++... +...+..- ...-+.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 11121110 111 1122333343332111 11000000 000001122 Q ss_pred HHHHHHHHHhhhccCccccCCCchhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeEEEEE-ecCcccccCCCcee- Q lcl|NC_021297. 
56 VATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITV-SHPDVESGDPAGIP- 128 (480) Q Consensus 56 ~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~i~~~--n~---~~~~~~~~~~~a~~~G~a~~~v-~~~~~~~~d~~~~~- 128 (480) ...+|+..++-+-.-++.+-.+.+..+.+..++.. |. ...+...++.+.+ .|.+|+.+ .. +.+|.+ T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~ll-lGnay~~~i~r------~~~G~~~ 153 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQ-LGEAFVLPMAH------GSDGYPI 153 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHHh-hCCcEEEEEEE------CCCCcEE Confidence 23344444443322233221111111112222221 21 2233344444444 48898764 33 234554 Q ss_pred EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEE Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVP 208 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~ 208 (480) .+.+++|..+.+.++.+. .+ +| +..+.. +.=.|+| T Consensus 154 ~L~pl~p~~v~v~~~~~g--~~-----~y-----------------------~~~~~~---------------~~~eiiH 188 (409) T protein:vir:83 154 RFRVVPPWLVNVELKKGA--RR-----EY-----------------------RIGGLN---------------VTDEILH 188 (409) T ss_pred EEEEECCcceEEEEcCCc--eE-----EE-----------------------EEcccc---------------CccceEE Confidence 477888887776665321 10 11 000000 0014677 Q ss_pred eecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc---ccchhhh----hhhhhc Q lcl|NC_021297. 209 LTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG---ENTTLDI----YYGRIL 281 (480) Q Consensus 209 ~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~---~~~~~~~----~~~~~~ 281 (480) |+......+.+|.|.|+- ....++..+....-..+.+...+.|..++. .... ..++. -...|+. 
..++.+ T Consensus 189 ir~~~~~~~~~G~spi~~-~~~~i~~~~a~~~~~~~~f~nga~p~gil~-~~~~-ls~e~~~~~~~~~~~~~~~nag~~~ 265 (409) T protein:vir:83 189 IRYQGNTADAHGHGPLES-AAPRQVVIGLLQKYVQNLAETGGVPLYWLG-VERR-LSETEAVDLMDRWIESRSKYAGHPA 265 (409) T ss_pred eCCCCCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceEee-cCCC-CCHHHHHHHHHHHHHhhCCccCccc Confidence 776555566788887642 333333222221111222222233444432 1111 11111 0111211 122333 Q ss_pred ccCCCCceE---EEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccch---HH---HHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 282 TLASEAAKI---SEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA---SA---EAIIATDSRIVKMAERKGR 352 (480) Q Consensus 282 ~~~g~~~~~---~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~---Sg---~Al~~~~~~l~~k~~~~~~ 352 (480) .+.++ .++ ..++..++ .|++..+....+|+...++|++.+|....+.. |. ..+.+....|.-.+.+.+. T Consensus 266 il~~g-~~~~~~~~~s~~d~-q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~ 343 (409) T protein:vir:83 266 LVTGG-ATLNQAKSMSAQDL-SLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMA 343 (409) T ss_pred eecCC-cccccccCCCHHHH-HHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 33332 222 22333333 47888888899999999999999985332211 11 1111222222222222222 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHH Q lcl|NC_021297. 353 IFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDW 432 (480) Q Consensus 353 ~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~ 432 (480) .+.. .+... ...+++.+..-...+..++++++.+++++| +++...+++.+|+.+ T Consensus 344 ~l~~----------~Ll~~-----~~~~~f~~~~llr~d~~~r~~~~~~~~~~G--~lT~NE~R~~~glpp--------- 397 (409) T protein:vir:83 344 ALDR----------WALPS-----PQHLELNRDDYTRPSLVERATAYKIMIEAG--VMEPNEARAMERLHS--------- 397 (409) T ss_pred HHHH----------hhCCC-----CcEEEeehhhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCC--------- Confidence 2111 11111 123555555555677788888888777654 444333333322211 Q ss_pred HHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 
433 DKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ..+++.-..+++ T Consensus 398 ----------------~~ggd~l~~~gv 409 (409) T protein:vir:83 398 ----------------EAAAVRLSGGGV 409 (409) T ss_pred ----------------CCCCcccCCCCC Confidence 011111112222 No 231 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=97.28 E-value=0.00011 Score=42.34 Aligned_cols=406 Identities=12% Similarity=0.075 Sum_probs=192.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ++..-. .+.||+ +|..+..+++-+..+..+-..+ ++.+--..+|.-- |-...|+-+-.+.. T Consensus 55 ~~~~~~-~~eLI~-------~YR~ma~~pEvd~Av~eIvne~--------iv~d~~~~pV~l~---ld~~~~s~~iK~kI 115 (511) T protein:vir:56 55 SEGTIP-VKELIK-------SYRALAEYHEVDDAIQEIVDEA--------IVYENDKEVVWLN---LDNTDFSENIKAKI 115 (511) T ss_pred ccCccc-hHHHHH-------HHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---ecccCcchHHHHHH Confidence 110000 012222 2333444443333322210000 0000000000000 00000111112233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|++--+|+....+..+.-.+.|+.|.+.-.. .++|-..++.++|+.+-.+..-.. 
.....+..+ T Consensus 116 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid-----~k~GI~eLr~lDPr~i~~vr~i~~--~~~~~~~v~--- 185 (511) T protein:vir:56 116 NEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILD-----KDNNIIELRPLNPMKMELVREIQK--ETIDGVEVV--- 185 (511) T ss_pred HHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec-----cccceeehhhcCcccchhhhhhhc--ccccccccc--- Confidence 4555666666688889999999999999998886432 245778888899987766543111 111001000 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeeccccc----CccCCcccchhhHHHHHHH Q lcl|NC_021297. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRL----GNRYGRSEISPELRKVTDA 234 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~----~~~~G~s~i~~~v~~liD~ 234 (480) . ....+-+|.+.......... .... ...++ +||. |.|.+.... ......|-|.+.++++-. T Consensus 186 --~-~~~ey~~Y~~~~~~~~~~~~-------~~~~-~~~~v-kI~~daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQ- 252 (511) T protein:vir:56 186 --K-GTLEYYVYKQSDYKMPSWMS-------ATNR-AQTSF-RIPKDAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQ- 252 (511) T ss_pred --c-ceeeeeEecCCCcccCcccc-------cccc-cccce-eechhheeeecccceeccCCCCeeeccchhhhHHHHh- Confidence 0 01122233332211100000 0000 00111 2332 222322211 112244556565555432 Q ss_pred HHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhhhhcc---cCC Q lcl|NC_021297. 235 ASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYGRILT---LAS 285 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~~~~~---~~g 285 (480) -+++.|.+.+.+....|-+=++-.+.-..+..+... .+-......|. 
.+| T Consensus 253 -Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGg 331 (511) T protein:vir:56 253 -LKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGS 331 (511) T ss_pred -hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCC Confidence 257788888877777776655422222222211110 01111222232 134 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE----NPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~----n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) -+.++..++.+.--.-++-++-....++...++|.+-+....+ |-.-|..|-.-+.....-+.+++..|..-+.++ T Consensus 332 rgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~ 411 (511) T protein:vir:56 332 KGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDP 411 (511) T ss_pred CccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4566777776653344566677777888899999888863322 111234566777788888999999999999999 Q ss_pred HHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHHH---HHhccccC----CCHHHHHHh-CCCChhHHHHH Q lcl|NC_021297. 362 MRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVSK---LYANGQGP----IPKEQARID-LGYTATQREQM 429 (480) Q Consensus 362 ~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad~~~k---l~~~~~~~----~s~et~~~~-lg~~~~~~~~~ 429 (480) ++.=+-+.|.--..++ ..|.+.|..-..-.....++.+.. +.+...+. +|.+++++. 
|.+++++++++ T Consensus 412 Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~ 491 (511) T protein:vir:56 412 LKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAM 491 (511) T ss_pred HHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHH Confidence 8876555554333333 356777754333333333333221 22222333 499998765 79999998888 Q ss_pred HHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 430 RDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) .+.++++..+ .... .++++. T Consensus 492 ~k~I~~E~k~---~~~~--------~~e~~f 511 (511) T protein:vir:56 492 QSEIDEEETN---PRFQ--------QDDQGF 511 (511) T ss_pred HHHHHHhhcC---CCCC--------CcccCC Confidence 7666655431 1111 111222 No 232 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=97.27 E-value=0.00011 Score=42.26 Aligned_cols=400 Identities=13% Similarity=0.073 Sum_probs=193.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..+..+ |+ .+|+.+..+++-+..+..+-..+ ++.+--..+|.-- |--..++-+-.+.. T Consensus 64 ~~~~~e----LI-------~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~---L~~~~~s~~ik~kI 121 (516) T protein:vir:10 64 ISGTKD----LI-------NTYRQLINNPEVERAVANIVNEA--------IVYERGHKVVSLD---LDDTDFGSNVKEKI 121 (516) T ss_pred cchHHH----HH-------HHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---ecccCcchHHHHHH Confidence 222222 21 23333444444333332210000 0000000000000 00001111112233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 
81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+.-.+.|+.|++...+ ...+|-..++.++|+.+..+.- ..+. T Consensus 122 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid----~~k~GI~Elr~lDPr~i~~vR~-------------i~~~ 184 (516) T protein:vir:10 122 LEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP----NPKKGIAELRRLDPRFMEYYRE-------------IVTS 184 (516) T ss_pred HHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec----CccccceeeeeeCCcceeeEee-------------eccc Confidence 4555666666688889999999999999999885543 3457888899999987766542 1111 Q ss_pred cCCce-----EEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCccCC--cccchhhHHHH Q lcl|NC_021297. 161 DDVAV-----PDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRYG--RSEISPELRKV 231 (480) Q Consensus 161 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~~G--~s~i~~~v~~l 231 (480) +..+. ...+-+|.++.. .|...+.. +... +++ +||- |.|.+......+.| .|-|.+.++++ T Consensus 185 ~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~----~~~~----~~i-kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 185 DIGGTTIVKGYREFFIYTTGNE-GYSYNGRI----FEPN----TRI-KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPA 254 (516) T ss_pred ccccchhhhhhhheeeeccCcc-ccccccce----eCCC----cce-eechhheeeecccceeCCCCceeeeehhhhHhH Confidence 11110 112233443332 22222211 0000 000 1221 33443222222211 23344555444 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhhhhcc--- Q lcl|NC_021297. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYGRILT--- 282 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~~~~~--- 282 (480) -. -+++.|.+.+-+....|-+=++-.+.-..+..+... .+-......|. 
T Consensus 255 NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRR 332 (516) T protein:vir:10 255 NQ--LKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR 332 (516) T ss_pred Hh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc Confidence 32 257788888877777777655422222222211110 01111222232 Q ss_pred cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 283 LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENP---ASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) .+|-+.++..++.+.--.-++-++-....++...++|.+-+....+.+ .-|..|-.-+.....-+.+++..|..-+. T Consensus 333 eGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~ 412 (516) T protein:vir:10 333 DGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFL 412 (516) T ss_pred CCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 134456677777665334456667777788889999988886443311 22345666677788888999999999999 Q ss_pred HHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHH-------HHHHHHhccccCCCHHHHHHh-CCCChhHHH Q lcl|NC_021297. 360 RAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKAD-------AVSKLYANGQGPIPKEQARID-LGYTATQRE 427 (480) Q Consensus 360 ~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad-------~~~kl~~~~~~~~s~et~~~~-lg~~~~~~~ 427 (480) ++++.=+-+.|.--..++ ..|.+.|..-..-.....++ .+.++-.-....+|.+++++. |..++++++ T Consensus 413 ~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~ 492 (516) T protein:vir:10 413 DPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIA 492 (516) T ss_pred HHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHH Confidence 998876555554333333 34677775433333333332 222222111246799998765 799999988 Q ss_pred HHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 
428 QMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++.+.++++..+ .+.. ++.+.++. T Consensus 493 ~e~k~I~~E~~~---~~~~------~p~~~~~f 516 (516) T protein:vir:10 493 QEEKQIEQEAGI---KRFQ------NPENEDDF 516 (516) T ss_pred HHHHHHHHhhhC---CCCC------CCCccccC Confidence 887666655421 1100 11111111 No 233 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=97.27 E-value=0.00011 Score=42.26 Aligned_cols=400 Identities=13% Similarity=0.073 Sum_probs=193.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..+..+ |+ .+|+.+..+++-+..+..+-..+ ++.+--..+|.-- |--..++-+-.+.. T Consensus 64 ~~~~~e----LI-------~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~---L~~~~~s~~ik~kI 121 (516) T protein:vir:10 64 ISGTKD----LI-------NTYRQLINNPEVERAVANIVNEA--------IVYERGHKVVSLD---LDDTDFGSNVKEKI 121 (516) T ss_pred cchHHH----HH-------HHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEE---ecccCcchHHHHHH Confidence 222222 21 23333444444333332210000 0000000000000 00001111112233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEe Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+.-.+.|+.|++...+ ...+|-..++.++|+.+..+.- ..+. 
T Consensus 122 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid----~~k~GI~Elr~lDPr~i~~vR~-------------i~~~ 184 (516) T protein:vir:10 122 LEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP----NPKKGIAELRRLDPRFMEYYRE-------------IVTS 184 (516) T ss_pred HHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec----CccccceeeeeeCCcceeeEee-------------eccc Confidence 4555666666688889999999999999999885543 3457888899999987766542 1111 Q ss_pred cCCce-----EEEEEEEeCCcEEEEEecCCcccccccccccccccccccce--EEeecccccCccCC--cccchhhHHHH Q lcl|NC_021297. 161 DDVAV-----PDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRYG--RSEISPELRKV 231 (480) Q Consensus 161 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv--v~~~n~~~~~~~~G--~s~i~~~v~~l 231 (480) +..+. ...+-+|.++.. .|...+.. +... +++ +||- |.|.+......+.| .|-|.+.++++ T Consensus 185 ~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~----~~~~----~~i-kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 185 DIGGTTIVKGYREFFIYTTGNE-GYSYNGRI----FEPN----TRI-KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPA 254 (516) T ss_pred ccccchhhhhhhheeeeccCcc-ccccccce----eCCC----cce-eechhheeeecccceeCCCCceeeeehhhhHhH Confidence 11110 112233443332 22222211 0000 000 1221 33443222222211 23344555444 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc--------------------------hhhhhhhhhcc--- Q lcl|NC_021297. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT--------------------------TLDIYYGRILT--- 282 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~--------------------------~~~~~~~~~~~--- 282 (480) -. -+++.|.+.+-+....|-+=++-.+.-..+..+... .+-......|. T Consensus 255 NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRR 332 (516) T protein:vir:10 255 NQ--LKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR 332 (516) T ss_pred Hh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc Confidence 32 257788888877777777655422222222211110 01111222232 Q ss_pred cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
283 LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENP---ASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) .+|-+.++..++.+.--.-++-++-....++...++|.+-+....+.+ .-|..|-.-+.....-+.+++..|..-+. T Consensus 333 eGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~ 412 (516) T protein:vir:10 333 DGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFL 412 (516) T ss_pred CCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 134456677777665334456667777788889999988886443311 22345666677788888999999999999 Q ss_pred HHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHH-------HHHHHHhccccCCCHHHHHHh-CCCChhHHH Q lcl|NC_021297. 360 RAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKAD-------AVSKLYANGQGPIPKEQARID-LGYTATQRE 427 (480) Q Consensus 360 ~~~~~~~~~~~~~~~~~~----~~i~v~f~~~~~~~~~~~ad-------~~~kl~~~~~~~~s~et~~~~-lg~~~~~~~ 427 (480) ++++.=+-+.|.--..++ ..|.+.|..-..-.....++ .+.++-.-....+|.+++++. |..++++++ T Consensus 413 ~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~ 492 (516) T protein:vir:10 413 DPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIA 492 (516) T ss_pred HHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHH Confidence 998876555554333333 34677775433333333332 222222111246799998765 799999988 Q ss_pred HHHHHHHHHHHHHHHHHhcccccCcccccCCCC Q lcl|NC_021297. 428 QMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++.+.++++..+ .+.. ++.+.++. 
T Consensus 493 ~e~k~I~~E~~~---~~~~------~p~~~~~f 516 (516) T protein:vir:10 493 QEEKQIEQEAGI---KRFQ------NPENEDDF 516 (516) T ss_pred HHHHHHHHhhhC---CCCC------CCCccccC Confidence 887666655421 1100 11111111 No 234 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=97.26 E-value=0.00011 Score=42.23 Aligned_cols=372 Identities=11% Similarity=0.006 Sum_probs=157.2 Q ss_pred HHHHHHhcccCcch----------------hcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CCCc--h-- Q lcl|NC_021297. 23 LEAEAYRNGTRRLK----------------TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDS--E-- 79 (480) Q Consensus 23 ~~~~~YY~g~~~i~----------------~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~--~-- 79 (480) ..+.++++++..-. ..+..+... .-+.+.-...+|+..+.-+-.-++.+ ..+. + T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~ 77 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPE---TALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIA 77 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechH---HhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceecc Confidence 22223333321100 001111110 00111112344444444333333322 2121 1 Q ss_pred hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 80 GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 80 ~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ....+.+++.. | ....+...+..+.+.+|.||+++..+ ..|++ .+..++|..+.+..+... . 
T Consensus 78 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~------~~G~~~~L~pl~~~~v~v~~~~~g--~---- 145 (419) T protein:vir:57 78 FDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRN------GRGDITELIPINPHKVIVLKGPDG--M---- 145 (419) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCcceEEEECCCc--e---- Confidence 13345565542 2 34566778888999999999988653 34554 677888888877654321 1 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) .+|... ..+ .++..+. |+|+.+.. ..+.+|.|.+.. +...++ T Consensus 146 -~~y~~~-~~~-----~~~~~~~-----------------------------vih~r~~~-~d~~~G~s~i~~-~~~~i~ 187 (419) T protein:vir:57 146 -PYYDIP-SIG-----EILPMRM-----------------------------VHHIKSFS-LDGYIGTSPIQT-NPDVLG 187 (419) T ss_pred -EEEEEc-CCc-----eEEchhh-----------------------------EEEecCcC-CCCcccccHHHH-HHHHHH Confidence 112111 111 0111112 23444322 234678876642 333333 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCC-Ccccccccccc----chhhh------hhhhhcccCCCCceEEEecccc-HHHH Q lcl|NC_021297. 234 AASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGEN----TTLDI------YYGRILTLASEAAKISEFKAAE-LRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i~g~-~~~~~~~~~~~----~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~ 301 (480) ....+..-....+...+.|-.+|.-- .......++.. ..|.- ..++++.++ ++.++.++.... -..| T Consensus 188 ~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~ 266 (419) T protein:vir:57 188 LGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQ-EGMTYKQLSQDNEKAQL 266 (419) T ss_pred HHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecC-CCceEEEcCCChhhHHH Confidence 22222111112222223343333210 00111111110 11111 112344444 446777766432 2247 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCccc Q lcl|NC_021297. 
302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEE 376 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~-----~~~~~~~~~~ 376 (480) ++..+....+|+...++|+..+|....+.-|. +......+. ...|.-++..+. .+...... . T Consensus 267 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~~~~~f~----------~~~l~P~~~~ie~~l~~~ll~~~~~-~ 333 (419) T protein:vir:57 267 LQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNN--IEHQGLQYV----------IYTMLAILKRHESAMMRDLLLPSER-R 333 (419) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCCcccc--HHHHHHHHH----------HHHHHHHHHHHHHHHHhhccCcccc-C Confidence 78888888999999999999998543221121 111111111 112222222211 11111111 1 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccccc Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ...+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+- - |.+.-.. +..+.+ T Consensus 334 ~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--g------------D~~~~~~--n~~~~~ 395 (419) T protein:vir:57 334 DFYIEFNVSSLLRGDQKSRYESYALGRQWG--WLSVNDIRRMENLTPIPG--G------------DKYLTPL--NMVDSK 395 (419) T ss_pred CeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------Ceeeecc--cccccc Confidence 234555555666778889999998988765 667777788888754321 0 0000000 000000 Q ss_pred CCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 457 KPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 457 ~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .....+. 
...+..|....-.+|| T Consensus 396 ~~~~~~~-~~~~~~~~~~~~~~~~ 418 (419) T protein:vir:57 396 ALTGIGK-ATPQQLKDIEAILCTR 418 (419) T ss_pred ccccccC-CCcccCcchhhhhhcc Confidence 0000011 1112222233334455 No 235 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.21 E-value=0.00013 Score=41.88 Aligned_cols=383 Identities=14% Similarity=0.086 Sum_probs=158.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcch-hc--------------CcccchhhhhhccccchHHHHHHHHHh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK-TI--------------GIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~-~~--------------~~~~~~~~~~~~~~~n~~~~iVd~~~~ 65 (480) |..|+=.++ ...+..-...++..+.|..... .. +..+.++. -.+. .=...+|+..+. T Consensus 1 ~~~~~~~~~-----~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-al~~--~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYTID-----LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDER-ILQI--STVWRCVSLIST 72 (424) T ss_pred CCCCccccc-----cCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHH-hhcc--HHHHHHHHHHHH Confidence 444322111 1222223333444444432110 00 00011100 0111 112233444443 Q ss_pred hhccCcccc---CCCc---h--hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEE Q lcl|NC_021297. 66 RLDIEGFRI---SEDS---E--GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIR 131 (480) Q Consensus 66 ~l~~~~~~~---~~d~---~--~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~ 131 (480) -+-.-++.+ ..+. + ....+..++.. | .-..+...+..+.+.+|.||+++-.+ ..|++ .+. 
T Consensus 73 ~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~ 146 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLL 146 (424) T ss_pred hhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEE Confidence 332223321 1111 1 22345555542 3 23456677888999999999988643 34544 577 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeec Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n 211 (480) +++|..+.+..+.. .+. |.. ..++. ...|.+++ |+||++ T Consensus 147 ~l~~~~v~v~~~~~---~~~----y~~-~~~g~----~~~~~~~e-----------------------------Vihir~ 185 (424) T protein:vir:18 147 PLQSANMDVKLVGK---KVV----YRY-QRDSE----YADFSQKE-----------------------------IFHLKG 185 (424) T ss_pred EecCcceEEEEcCC---eEE----EEE-EeCCe----EEEecccc-----------------------------EEEecC Confidence 78888877655421 111 111 11110 01122222 344443 Q ss_pred ccccCccCCcccchhhHHHHHHHHHHHHHHHHHHH---HHhcchhhhhcCCCccccccccc---cchhhh-----hhhhh Q lcl|NC_021297. 212 DPRLGNRYGRSEISPELRKVTDAASRTLMNLQSAS---QILGTPLRVISGVTTDELTNDGE---NTTLDI-----YYGRI 280 (480) Q Consensus 212 ~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~---~~~~~~~~~i~g~~~~~~~~~~~---~~~~~~-----~~~~~ 280 (480) .. ..+.+|.|.|. .+.+++...++-..-.. ...+.|-.++.-... ...++.. ...|+. ..+++ T Consensus 186 ~~-~dg~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~l~~e~~~~~~~~~~~~~~~~nag~~ 259 (424) T protein:vir:18 186 FG-FTGLVGLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGGPVKKRL 259 (424) T ss_pred cC-CCCcccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCc-CCCHHHHHHHHHHHHHHhCCcccCCc Confidence 21 23456776553 23333332222222222 222334333331110 0111110 111211 11234 Q ss_pred cccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
281 LTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 281 ~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) +.++ ++.++.++.... -..|++..+....+|+...++|+..+|....+...+.+++.....+...+- .-+-..++ T Consensus 260 ~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl---~P~~~~ie 335 (424) T protein:vir:18 260 WILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTL---QPYISRWE 335 (424) T ss_pred eecc-CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHH---HHHHHHHH Confidence 4454 346777665432 234778888889999999999999998554433333334433333322211 00111111 Q ss_pred HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021297. 360 RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 360 ~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~ 439 (480) +.+.. .+...... ....+++.+......|..+.++.+.++..+| +++...+++.+|+.+-+- -.+ T Consensus 336 ~~ln~--~L~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~g--gD~-------- 400 (424) T protein:vir:18 336 NSIQR--WLIPSKDV-GRLHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNMPPLPG--GDV-------- 400 (424) T ss_pred HHHHh--hcCCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCe-------- Confidence 11110 11111111 1234555666667778889999999988765 566666777777654310 000 Q ss_pred HHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 
440 MIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) ..- ..+..+-...+ .+.+| .|+++ T Consensus 401 ----~~~----~~n~~~l~~~~-----~~~~~--~~n~a 424 (424) T protein:vir:18 401 ----AMR----QAQYVPITDLG-----TNKEP--RNNGA 424 (424) T ss_pred ----eee----ccCccchhhhh-----ccCCc--cccCC Confidence 000 00000000000 00000 01111 No 236 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=97.14 E-value=0.00016 Score=41.47 Aligned_cols=366 Identities=12% Similarity=0.083 Sum_probs=134.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHH----HHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPN----LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r----~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |- +..++.......... ......+..|.-.. ..+.. ..-+.+.-...+|+..++-+-..+|.+.. T Consensus 1 Mg----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~v~~---~~al~~~~v~~~i~~ia~~ia~~p~~v~~ 69 (385) T protein:vir:10 1 MG----LLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQL----SYVSA---LSALQNTNVYSVINRIASDVASAHFKTEN 69 (385) T ss_pred Cc----cccchhcccccccccccccchhhhhhhccccCc----cccCH---HHhhccHHHHHHHHHHHHHHhhCceeeec Confidence 21 111100000000000 00000111010000 00111 10011112234455554444444565532 Q ss_pred CchhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 77 DSEGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 77 d~~~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .. +..++++ | ....+...+..+.+.+|.||+++.... ..+..++|..+.+..+. .. 
T Consensus 70 ~~-----~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---------~~~~p~~~~~v~~~~~~--~~---- 129 (385) T protein:vir:10 70 TA-----TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---------LEHIPNSDVQINYLPGN--MG---- 129 (385) T ss_pred cc-----hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---------eeEeecCCceEEEEEcC--Cc---- Confidence 21 1223322 2 334556677888889999999875321 11223333333333221 10 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecc--cccCccCCcccchhhHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRK 230 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~--~~~~~~~G~s~i~~~v~~ 230 (480) +.++.....++.. ..+.+++ |+||++. ....+.+|.|.+.. +.. T Consensus 130 -~~~~~~~~~~~~~---~~~~~~e-----------------------------iihik~~~~~~~~~~~G~s~i~~-~~~ 175 (385) T protein:vir:10 130 -IVYTVLESNDRPQ---MVLRQDQ-----------------------------MLHFRLMPDPQYRYLIGRSPLES-LQN 175 (385) T ss_pred -eEEEEEEcCCceE---EEEcccc-----------------------------EEEeccCCCCcccccccccHHHH-HHH Confidence 1111111111100 1122222 3444431 12234567776532 333 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh-----hhhhhcccCCCCceEEEeccc--cHHHH Q lcl|NC_021297. 231 VTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAA--ELRNF 301 (480) Q Consensus 231 liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~--~~~~~ 301 (480) .++.......-..+.+...+.|..++. |...+....+.-...|+. ..++++.++ ++.++.+++.. ..+.+ T Consensus 176 ~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~~~l 254 (385) T protein:vir:10 176 ALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLP-DGFDYTQLEMKTDVFKAL 254 (385) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccC-CCceEEecCCChhHHHHH Confidence 333222222222222222233433333 111111000000111111 112344444 34667665543 34434 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021297. 
302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~ 381 (480) .+..+....+|+...++|+..+|......+++..+............ -+-..+++.+.. .+.+ ..++ T Consensus 255 ~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~----P~~~~ie~~l~~--~l~~-------~~~~ 321 (385) T protein:vir:10 255 ADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLN----SYVNPIVDELRL--KMNA-------PDLE 321 (385) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHH----HHHHHHHHHHHH--hhCC-------ceEE Confidence 57778889999999999999998642221222222211111111111 111111111111 1111 1355 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......|..+.++++.+++++| +++...+++.+|+.+-+-..+.+ ......... ++ T Consensus 322 f~~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~p~~~~~~----------------~~~~~~~~~---~g 380 (385) T protein:vir:10 322 LDIKDMLDVDDSALINQVSNLAKSG--VLGAEQAQFILTRSGFLPDNLPE----------------FKPLTTQVK---GG 380 (385) T ss_pred eechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCccCCCCCcc----------------ccCcccccC---CC Confidence 5666666788889999999988765 55555555555432200000000 000000000 11 Q ss_pred CCCccc Q lcl|NC_021297. 462 DTKTET 467 (480) Q Consensus 462 d~~~~~ 467 (480) +. 
.++ T Consensus 381 ~~-~dn 385 (385) T protein:vir:10 381 DE-GDN 385 (385) T ss_pred CC-CCC Confidence 10 011 No 237 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.13 E-value=0.00016 Score=41.44 Aligned_cols=398 Identities=13% Similarity=0.100 Sum_probs=158.1 Q ss_pred CCCHHH--HHHHHHHHHHHHHHHHHH---HHHHhccc-Cc----chhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHE--HVERLQGLLARDLPNLLE---AEAYRNGT-RR----LKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~--~i~~l~~~~~~~~~r~~~---~~~YY~g~-~~----i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |.++.- ++.++...+....+.... ......+. +. ....+..+.+.. -++. .=...+|+..++-+-.- T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~-a~~~--~aV~~~v~~Ia~~ia~l 77 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADA-IMRL--DAVAACVKLVSQAVAAM 77 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHh-hhcc--hHHHHHHHHHHHhhccC Confidence 666533 233332222211110000 00000000 00 000111111110 0011 11122233333222222 Q ss_pred ccc---cCCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEcccee Q lcl|NC_021297. 71 GFR---ISED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYM 138 (480) Q Consensus 71 ~~~---~~~d---~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~ 138 (480) ++. ...| +.....+..++.. | .-..+...+..+.+.+|.||+++... 
+|++ .+..++|..+ T Consensus 78 p~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-------~g~~~~L~~l~p~~v 150 (432) T protein:vir:97 78 PLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-------DGRIESLQYLANDRL 150 (432) T ss_pred ceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEcCcce Confidence 322 1111 1122334555532 3 23456677888999999999887642 2443 5678999999 Q ss_pred EEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCcc Q lcl|NC_021297. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~ 218 (480) .++.+... ++. |... ..++.. ..+.++. |+|+++.. ..+. T Consensus 151 ~v~~~~~g--~~~----y~~~-~~~g~~---~~~~~~~-----------------------------iih~r~~~-~dg~ 190 (432) T protein:vir:97 151 TITTDTKG--NTA----YRYR-RTDGQM---IDIPRQQ-----------------------------IWKIMGYS-LDGE 190 (432) T ss_pred EEEEcCCC--cEE----EEEE-ecCceE---EEEcccc-----------------------------EEEecCcC-CCCc Confidence 88876532 221 1111 111111 1122222 23343322 2335 Q ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCCCccccccccccchhhh------hhhhhcccCCCCce Q lcl|NC_021297. 219 YGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGVTTDELTNDGENTTLDI------YYGRILTLASEAAK 289 (480) Q Consensus 219 ~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~ 289 (480) +|.|.|. .+.+.+...+.-......+|. .|-.++. .+.. 
..++ ....++- ..++++.++ ++.+ T Consensus 191 ~G~spi~----~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~-~~~~-l~~e-~~~~~~~~~~~~~nag~~~vl~-~g~~ 262 (432) T protein:vir:97 191 NGLSAIR----YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQ-IDRF-LTDD-QYDSFSKKVSGSVEAGRAPLLE-GGMD 262 (432) T ss_pred ccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCcceeEe-cCCC-CCHH-HHHHHHHHHhhhhcCCCceecC-CCce Confidence 6776553 333444333333333333333 3322332 1111 1111 1111111 123344454 3467 Q ss_pred EEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 290 ISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENP-ASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ 367 (480) Q Consensus 290 ~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~ 367 (480) +.++..+. -..+++..+....+|+...++|++.+|....+. +.+..+......+...+- .-+-..|++.+.. . T Consensus 263 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl---~P~~~~ie~~ln~--k 337 (432) T protein:vir:97 263 VKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTL---SPWLRRIEQSIAL--N 337 (432) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHHH---HHHHHHHHHHHhh--h Confidence 77765432 235778888889999999999999998543221 112233333222222111 0111112221111 1 Q ss_pred HhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHH-HHHHHHhc Q lcl|NC_021297. 368 IMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETE-DMIDTLYS 446 (480) Q Consensus 368 ~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~-~~~~~~~~ 446 (480) +...... ....+++.+......|..+.++++.+++++| +++...+++.+|+.+-+-. ......... ..++.. 
T Consensus 338 Ll~~~e~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~glpp~~g~--~~~~~~~~~~~pl~~~-- 410 (432) T protein:vir:97 338 LLTPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNG--LMTRDEAREIEGLPKLGGN--AAVLTVQSAMVPLDSI-- 410 (432) T ss_pred ccCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCC--cceEeecccccchhhh-- Confidence 1111111 1123445555556678889999999998875 6677777888776442200 000000000 000000 Q ss_pred ccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 447 TTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 447 ~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ..+..+++..+++ .+++ +.=|| T Consensus 411 ----~~~~~~~~~~~~~-~~~~-------~~~~~ 432 (432) T protein:vir:97 411 ----GLQASPEPASGLG-NQQQ-------DKVSK 432 (432) T ss_pred ----cccCCCCCCCCCC-Cccc-------ccccC Confidence 0011111111111 1111 11111 No 238 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=96.97 E-value=0.00023 Score=40.54 Aligned_cols=400 Identities=13% Similarity=0.081 Sum_probs=158.5 Q ss_pred CCCHHH--HHHHHHHHHHHHHHHHHH---HHHHhccc-C----cchhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHE--HVERLQGLLARDLPNLLE---AEAYRNGT-R----RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~--~i~~l~~~~~~~~~r~~~---~~~YY~g~-~----~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |.++.- ++.++-..+...-+.-.. -.....+. + .....+..+... 
.-+.+.-...+|+..++-+-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~---~al~~~~V~~~i~~Ia~~ia~l 77 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNAD---AIMRLDAVAACVKLVSQAIAAM 77 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchh---hhhcchHHHHHHHHHHHhhhhC Confidence 666543 233332222221110000 00000000 0 000111111111 0011122223344333333222 Q ss_pred ccc---cCCCc---hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce-eEEEEEcccee Q lcl|NC_021297. 71 GFR---ISEDS---EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYM 138 (480) Q Consensus 71 ~~~---~~~d~---~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~-~~i~~~~p~~~ 138 (480) +|. ...+. .....+..++.. | ....+...+..+.+.+|.||+++... +|+ ..+..++|..+ T Consensus 78 p~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-------~g~~~~L~~l~~~~v 150 (432) T protein:vir:10 78 PLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-------DGRIESLQYLANDRL 150 (432) T ss_pred ceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEcCCce Confidence 332 11111 122334555432 3 23456677888999999999887542 244 35778899999 Q ss_pred EEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCcc Q lcl|NC_021297. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~ 218 (480) .++.|... ++. |+....+ +.. ..+.++.+ +||++.. ..+. 
T Consensus 151 ~v~~~~~g--~~~----y~~~~~~-g~~---~~~~~~~i-----------------------------ih~~~~~-~dg~ 190 (432) T protein:vir:10 151 TITTDTKG--NTA----YRYRRTD-GQM---IDIPKQQI-----------------------------WKIMGYS-LDGE 190 (432) T ss_pred EEEEcCCC--cEE----EEEEecC-ceE---EEEcCccE-----------------------------EEecCCC-CCCc Confidence 88876432 221 1111111 111 11222222 3333321 2335 Q ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCCCccccccccc---cchhhh--hhhhhcccCCCCceE Q lcl|NC_021297. 219 YGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGVTTDELTNDGE---NTTLDI--YYGRILTLASEAAKI 290 (480) Q Consensus 219 ~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~---~~~~~~--~~~~~~~~~g~~~~~ 290 (480) .|.|.|. .+.+.+...+.-......+|. .|-.++.- +.. ..++.. ...+.- ..++++.++ ++.++ T Consensus 191 ~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~-~~~-l~~e~~~~~~~~~~~~~nag~~~vl~-~g~~~ 263 (432) T protein:vir:10 191 NGLSAIR----YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQI-DRF-LTDDQYDSFAKKVSGSVEAGRAPLLE-GGMDV 263 (432) T ss_pred ccccHHH----HHHHHHHHHHHHHHHHHHHHhcCCCcceEEec-CCC-CCHHHHHHHHHHHhhhhhCCCceecC-CCceE Confidence 6776553 334444433333222333333 34333331 111 111110 111111 113444554 34677 Q ss_pred EEecccc-HHHHHHHHHHHHHHHHhccCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 291 SEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 291 ~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~ 368 (480) .++.... -..|++..+....+|+...++|++.+|....+ .+.+..++.....+...+- .-+-..|++.+.. 
.+ T Consensus 264 ~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl---~P~~~~ie~~ln~--kL 338 (432) T protein:vir:10 264 KSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMTL---SPWLRRIEQSIAL--NL 338 (432) T ss_pred EEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHHH---HHHHHHHHHHHHh--hh Confidence 7765432 22477888889999999999999999854322 1222333322222221110 0011112111111 11 Q ss_pred hCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_021297. 369 MGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 369 ~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ....... ...+++........|..++++++.+++++| +++...+++.+|+.+-+-. ........ .+.... T Consensus 339 ~~~~~~~-~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G--~~T~NE~R~~~glppi~g~--~~~~~~~~-----~~~pl~ 408 (432) T protein:vir:10 339 LSPAERR-RYFADFDTSALLRADSAARSSYYSQLVNNG--LMTRDEAREIEGLPKLGGN--AAVLTVQS-----AMVPLD 408 (432) T ss_pred cCccccC-ceEEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCC--cceEeecC-----cccchh Confidence 1111111 123444444555678888999999998875 6777777888877542100 00000000 000000 Q ss_pred ccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 
449 KAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ..+.+..+++..++ ++..+..=|| T Consensus 409 ~~~~~~~~~~~~~~--------~~~~~~~~~~ 432 (432) T protein:vir:10 409 SIGLQASPEPASGL--------GNQQQDKVSK 432 (432) T ss_pred hhcccCCCCCCCCC--------CCcccccccC Confidence 00001111111111 0111111111 No 239 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=96.96 E-value=0.00023 Score=40.48 Aligned_cols=386 Identities=13% Similarity=0.074 Sum_probs=156.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcc-hhc--------------CcccchhhhhhccccchHHHHHHHHHh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRL-KTI--------------GIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i-~~~--------------~~~~~~~~~~~~~~~n~~~~iVd~~~~ 65 (480) |..++-.+. +..+..-...++..+.|.... +.. +..+.++ .-+.+.-.-.+|+..++ T Consensus 1 ~~~~~~~~~-----~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~---~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYTID-----LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDE---RILQISTVWRCVSLIST 72 (424) T ss_pred CCCCcceEe-----ecCCCchHHHHHhhhcccccccccccccccccccccccccccccHH---HhhccHHHHHHHHHHHH Confidence 433322111 112222233333333332111 000 0011110 00111112233444433 Q ss_pred hhccCcccc---CCCc-----hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEE Q lcl|NC_021297. 66 RLDIEGFRI---SEDS-----EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIR 131 (480) Q Consensus 66 ~l~~~~~~~---~~d~-----~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~ 131 (480) -+-.-++.+ ..+. .....+.+++.. | ....+...+..+.+.+|.||+++-.+ ..|++ .+. 
T Consensus 73 ~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~ 146 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLL 146 (424) T ss_pred hhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEE Confidence 333333322 1111 112345555542 3 23456677888999999999988543 34554 577 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeec Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n 211 (480) .++|..+.+..+.. .+. |.. ..++.. ..|.+++ |+||++ T Consensus 147 pl~~~~V~v~~~~~---~~~----y~~-~~~g~~----~~~~~~e-----------------------------Iih~r~ 185 (424) T protein:vir:18 147 PLQSANMDVKLVGK---KVV----YRY-QRDSEY----ADFSQKE-----------------------------IFHLKG 185 (424) T ss_pred EecCcceEEEEcCC---eEE----EEE-EeCCeE----EEecccc-----------------------------EEEecC Confidence 88888887655421 111 111 111110 1122222 344443 Q ss_pred ccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc---cchhhh-----hhhhhccc Q lcl|NC_021297. 212 DPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE---NTTLDI-----YYGRILTL 283 (480) Q Consensus 212 ~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~---~~~~~~-----~~~~~~~~ 283 (480) .. ..+.+|.|.+.- +...++.......-..+.+...+.|-.++.-... ...++.. ...|+- ..++++.+ T Consensus 186 ~~-~dg~~G~spi~~-~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-~l~~e~~~~~~~~~~~~~~g~nag~~~vl 262 (424) T protein:vir:18 186 FG-FTGLVGLSPIAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGGPVKKRLWIL 262 (424) T ss_pred cC-CCCcccccHHHH-HHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCc-CCCHHHHHHHHHHHHHHhCCcccCCceec Confidence 21 234567775532 2222322111111112222223334444431111 0111110 111211 12234444 Q ss_pred CCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
284 ASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) Q Consensus 284 ~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 362 (480) + ++.++.+++.+. -..|++..+....+|+...++|+..+|....+...+..++.....+...+- .-+-..|++.+ T Consensus 263 ~-~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl---~P~~~~ie~~l 338 (424) T protein:vir:18 263 E-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTL---QPYISRWENSI 338 (424) T ss_pred c-CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHH---HHHHHHHHHHH Confidence 4 346777665432 224778888889999999999999998543332223333333332222110 11111111111 Q ss_pred HHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 363 RIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMID 442 (480) Q Consensus 363 ~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~ 442 (480) .. .+...... ....+++.+......|..++++.+.++..+| +++...+++.+|+.+-+- - + T Consensus 339 ~~--~L~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~g--G------------D 399 (424) T protein:vir:18 339 QR--WLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLPG--G------------D 399 (424) T ss_pred Hh--hcCCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------C Confidence 11 11111111 1233555556666788889999999988765 566666777777654210 0 0 Q ss_pred HHhcccccCcccccCCCCCCCCcccccCCCCCCCC Q lcl|NC_021297. 443 TLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRT 477 (480) Q Consensus 443 ~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~ 477 (480) .+.... +..+-... 
+.+.+| ..|++ T Consensus 400 ~~~~~~----n~~~l~~~-----~~~~~p-~~~ga 424 (424) T protein:vir:18 400 VAMRQS----QYVPITDL-----GTNKEP-RNNGA 424 (424) T ss_pred eeeecc----CccchHhh-----hccCCC-ccCCC Confidence 000000 00000000 000111 11111 No 240 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=96.95 E-value=0.00024 Score=40.39 Aligned_cols=389 Identities=13% Similarity=0.097 Sum_probs=165.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc--CCC- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--SED- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~--~~d- 77 (480) --|+..+-.-|...-.-...++..+ ||+-.. ......-++.+-...+....|++ +.+ T Consensus 39 gltp~~l~~iL~~a~~gd~~~~~~L--~~dm~~------------------~D~hi~s~l~~Rk~av~~~~w~I~p~~~~ 98 (512) T protein:vir:19 39 GVTPNRAAQMLRDAERGDLTAQADL--AFDMEE------------------KDTHLFSELSKRRLAIQALEWRIAPARDA 98 (512) T ss_pred CCCHHHHHHHHHHhhCCCHHHHHHH--HHHHHh------------------hChHHHHHHHHHHHHHhCCCceEecCCCC Confidence 1122322222221111111222211 221110 01111112222222223334443 222 Q ss_pred ----chhHHHHHHHHHhc-CHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCee Q lcl|NC_021297. 78 ----SEGLEELWNWWQAN-DLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 78 ----~~~~~~l~~i~~~n-~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~ 150 (480) .+..+.+.+++..- +|......+ .+|..||.+ ++++|... +....| .+.+++|+.+ .|++..... 
T Consensus 99 ~~~~~~~a~~v~~~l~~~~~f~~~~~~l-ldA~~~G~s~~Ei~w~~~----~g~~~~~~~~~r~~~~f--~~~~~~~~~- 170 (512) T protein:vir:19 99 SAQEKKDADMLNEYLHDAAWFEDALFDA-GDAILKGYSMQEIEWGWL----GKMRVPVALHHRDPALF--CANPDNLNE- 170 (512) T ss_pred CHHHHHHHHHHHHHHhcCCCHHHHHHHH-HhhhhhcceeeeeEeeee----CCceeeeeeeeeccccc--eeccCCCcE- Confidence 12334566666543 577777765 468889986 56667421 111122 2333443321 122211100 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHH Q lcl|NC_021297. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) + .++. ... ..... .+++ ++.+.++...+.++|.+.+.. +-- T Consensus 171 ---l------------------------r~~~-~~~------~G~~l-~~~k---~i~~~~~~~~g~p~g~gLlr~-~~w 211 (512) T protein:vir:19 171 ---L------------------------RLRD-ASY------HGLEL-QPFG---WFMHRAKSRTGYVGTNGLVRT-LIW 211 (512) T ss_pred ---E------------------------EecC-CCC------Cceee-cCCc---eEEEeccCCCCCcccccHHHH-HHH Confidence 0 0000 000 00000 1112 334455666777888887643 222 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhhhhhcccC-CCCceEEEeccccHHHHHHHH Q lcl|NC_021297. 231 VTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 231 liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~----~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l 305 (480) ..=-=+..+.+++.-++.|+.|.++.+ -+....++.... 
..+++.+....++ |...+|.+....+...|...+ T Consensus 212 ~~~fK~~~~~~w~~f~E~yG~P~~igk--y~~~a~~~ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li 289 (512) T protein:vir:19 212 PFIFKNYSVRDFAEFLEIYGLPMRVGK--YPTGSTNREKATLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMI 289 (512) T ss_pred HHHHHHHHHHHHHHHHHHcCCCeeEEe--cCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHH Confidence 222235678888889999999976554 222111111111 1223333333333 445667665555566677777 Q ss_pred HHHHHHHHhccCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021297. 306 EVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWE-RAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~i~v~ 383 (480) +..-.+|++..- -+.+....++. +.-|+. ....-....+..-.+.+...+. ++++-++.++............+. T Consensus 290 ~~~d~~Isk~iL--GqtlTs~~g~~-Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~ 366 (512) T protein:vir:19 290 GWAEKAISKAIL--GGTLTTEAGDK-GARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIV 366 (512) T ss_pred HHHHHHHHHHHh--hhhhccccccc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEE Confidence 777777766531 01111110010 111221 1222333334444455666774 577777777654332222346678 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) |....+.++...++.+.++. .| ..++.+.+.+.+|+......+- ... ..+..+.+.+..... 
T Consensus 367 f~~~e~eDl~~~a~~~~~l~-~G-~~i~~~~i~e~~Gip~~~~~e~----------~~~----~~~~~~~~~~~~~~~-- 428 (512) T protein:vir:19 367 FDTSEAGDITALSDAIPKLA-AG-MRIPVSWIQEKLHIPQPVGDEA----------VFT----IQPVVPDNGSQKEAA-- 428 (512) T ss_pred ecCCChhhHHHHHHHHHHHh-cC-CCCCHHHHHHHhCCCCCCCccc----------ccc----CCCcccccccccccc-- Confidence 88888899999999998886 55 3579999999999854421110 000 000000000000000 Q ss_pred CcccccCCC--CCCCC--CCC Q lcl|NC_021297. 464 KTETQTSPS--GFNRT--KTR 480 (480) Q Consensus 464 ~~~~~~~p~--~~~~~--~~~ 480 (480) ...+..|. ..+.. +.. T Consensus 429 -~~~~~~~~~~~~d~~~~~~~ 448 (512) T protein:vir:19 429 -LSAEDIPQEDDIDRMGVSPE 448 (512) T ss_pred -ccccCCCchhhHhHHhhhHH Confidence 00000000 00000 000 No 241 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=96.87 E-value=0.00029 Score=40.00 Aligned_cols=379 Identities=12% Similarity=0.118 Sum_probs=150.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCcch-h---------------cCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 7 HVERLQGLLARDLPNLLEAEAYRNGTRRLK-T---------------IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 7 ~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~-~---------------~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) ++-++...+.-.-.-...+..++..+..-. . .+..+.... -++. .=...+|+..++-+-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~-al~~--~~v~~cv~~Ia~~iA~l 77 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPET-AMKL--AAVYSCIYVLSSSLAQM 77 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCCCCCccccchhhhhhhccccCCceechHH-hhcc--HHHHHHHHHHHHHHhhC Confidence 222211111111111222233332221000 0 000010000 0011 01122344333333222 Q ss_pred cccc---CCCc--h-hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEcccee Q lcl|NC_021297. 
71 GFRI---SEDS--E-GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYM 138 (480) Q Consensus 71 ~~~~---~~d~--~-~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~ 138 (480) ++.+ .++. + ....+..++.. | ....+...+..+.+.+|.+|+.+-.+ ..|.+ .+.+++|..+ T Consensus 78 p~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~------~~G~~~~L~~l~~~~v 151 (424) T protein:vir:45 78 PLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRN------RRGEVISLDCCMPWET 151 (424) T ss_pred ceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEecCceE Confidence 3321 1111 1 12234555432 3 23356677888999999999987643 34555 4777888777 Q ss_pred EEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCcc Q lcl|NC_021297. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~ 218 (480) .+..+. .++. |.. ....+. ..+.+++ |+||++. ...+. T Consensus 152 ~i~~~~---~~~~----y~~-~~~~~~----~~~~~~e-----------------------------Vih~r~~-~~d~~ 189 (424) T protein:vir:45 152 TLMNTG---GRYT----YGL-YNEYGA----FAISPDD-----------------------------MIHIRAL-GNNQK 189 (424) T ss_pred EEEEcC---CeEE----EEE-EecCce----EEECccc-----------------------------EEEecCc-CCCCc Confidence 654321 1111 100 001110 0112222 3444432 23456 Q ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHh---cchhhhhcCCCccccccccc---cchhhh-------hhhhhcccCC Q lcl|NC_021297. 219 YGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLRVISGVTTDELTNDGE---NTTLDI-------YYGRILTLAS 285 (480) Q Consensus 219 ~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~---~~~~~~-------~~~~~~~~~g 285 (480) +|.|.+. .+.+.+...+.-..-...+| +.|..++.-.. . ..++.. ...|.. 
..++++.++ T Consensus 190 ~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~-l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~- 262 (424) T protein:vir:45 190 MGLSPIM----QHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-G-LNKESWGWLKDQWQKASQALRRQENKTMLLP- 262 (424) T ss_pred ccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-C-CCHHHHHHHHHHHHHHhccccccCCceeEcC- Confidence 6777653 33344433322222222233 33444443111 1 111111 111211 112344444 Q ss_pred CCceEEEeccccHH-HHHHHHHHHHHHHHhccCCChHHhccccccchHH-HH--HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 286 EAAKISEFKAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASA-EA--IIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 286 ~~~~~~~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg-~A--l~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) ++.++.+++....+ .|++..+....+|+...++|+..+|....+.-|. .. +.+....|.--+.. |++. T Consensus 263 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~--------ie~~ 334 (424) T protein:vir:45 263 ADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTN--------WEQE 334 (424) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHH--------HHHH Confidence 34677777644322 4788888889999999999999998543221121 11 11111112221211 1111 Q ss_pred HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 362 MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMI 441 (480) Q Consensus 362 ~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~ 441 (480) +.. .+...........+++........|..++++.+.+++++| +++...+++.+|+.+-+- ..... 
T Consensus 335 ln~--kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g--~~T~NE~R~~~gl~pi~g----------gD~~~ 400 (424) T protein:vir:45 335 LNR--RLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDG--WMSRNEARAFEDMNPVEG----------LDEML 400 (424) T ss_pred HHH--hcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC----------cceee Confidence 110 1111111111123555555555678888999999988776 566767778777754210 00000 Q ss_pred HHHhcccccCcccccCCCCCCCCcccccCCCC Q lcl|NC_021297. 442 DTLYSTTKAQADATPKPTVTDTKTETQTSPSG 473 (480) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~ 473 (480) ...+..... ++..+ +. .++++++. T Consensus 401 ~~~n~~~~~-~~~~~-~~------~~~~~~~~ 424 (424) T protein:vir:45 401 VSVNAANPA-GDFKP-PK------NDEGKTNE 424 (424) T ss_pred ecccccccc-cccCC-CC------CCCCCCCC Confidence 011111100 01111 11 11111111 No 242 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=96.65 E-value=0.00044 Score=39.00 Aligned_cols=389 Identities=12% Similarity=0.063 Sum_probs=155.4 Q ss_pred HHHHHHHHHHHHHHHHHHHhcc----cC-cchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CCCc--- Q lcl|NC_021297. 10 RLQGLLARDLPNLLEAEAYRNG----TR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDS--- 78 (480) Q Consensus 10 ~l~~~~~~~~~r~~~~~~YY~g----~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~--- 78 (480) -++..+-....+-..-..++.. -. .....+..+.+.. -++. .-...+|+..++-+-.-+|.+ ..+. 
T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~-al~~--~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~ 77 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAMLGGVRSSHSKAGVMITPET-ALAL--SAVRACVTLLAESVAQLPVELYRRDKNGGRQ 77 (421) T ss_pred CCCcchhcccccccCcchhhHHHhhhhccCcccCCceechHH-hhcc--HHHHHHHHHHHHhhccCceEEEEEcCCCcee Confidence 1111111111110011111111 00 0000111111110 0111 112223443333332223321 1111 Q ss_pred -hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 79 -EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 79 -~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) .....+..++.. | ....+...+..+.+.+|.||+++-++ .+|++ .+..++|..+.++.++.. . T Consensus 78 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~g--~-- 147 (421) T protein:vir:10 78 RATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRD------GKGYPKELIPINPKKVIVLKGPDG--M-- 147 (421) T ss_pred ecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEecCceEEEEECCCc--e-- Confidence 112234555432 3 23456677888999999999988653 34555 477788888877665321 1 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHH Q lcl|NC_021297. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) .+|.... .++++. . . -|+|+.+.. ..+.+|.|.+. .+... T Consensus 148 ---~~y~~~~------------~g~~~~--------------~-------~--eiih~~~~~-~d~~~G~spi~-~~~~~ 187 (421) T protein:vir:10 148 ---PYYEIPE------------IGETLP--------------M-------R--MMHHVKVFS-LDGYIGSSPIQ-TNADV 187 (421) T ss_pred ---EEEEEcC------------CCcEEc--------------h-------h--hEEEecCcC-CCCcccccHHH-HHHHH Confidence 1111111 111110 0 0 134454432 34567887664 23333 Q ss_pred HHHHHHHHHHHH-HHHHHhcchhhhhcCC-Ccccccccccc----chhhh------hhhhhcccCCCCceEEEeccccHH Q lcl|NC_021297. 
232 TDAASRTLMNLQ-SASQILGTPLRVISGV-TTDELTNDGEN----TTLDI------YYGRILTLASEAAKISEFKAAELR 299 (480) Q Consensus 232 iD~~~~~~s~~~-~~~~~~~~~~~~i~g~-~~~~~~~~~~~----~~~~~------~~~~~~~~~g~~~~~~~~~~~~~~ 299 (480) ++.. ....+.. +.+...+.|-.+|.-. .......++.. ..|+- ..+.++.++ ++.++.++.....+ T Consensus 188 i~~~-~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d 265 (421) T protein:vir:10 188 LGLN-LAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQ-EGMSYKQMSQDNEK 265 (421) T ss_pred HHHH-HHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEecCCChhH Confidence 3321 2222222 2222223344444311 01011011111 11111 112344444 34677777643222 Q ss_pred -HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 300 -NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 300 -~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) .|++..+..+.+|+...++|+..+|....+.-|. +......+...+ -.-+-..|++.+.. .+...... ... T Consensus 266 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~~~~~f~~~t---l~P~~~~ie~~ln~--kL~~~~~~-~~~ 337 (421) T protein:vir:10 266 AQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNN--IEHQGLQFVMYT---LLAWLKRHEGALQR--DLLLPSER-RDL 337 (421) T ss_pred HHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCcccc--HHHHHHHHHHHH---HHHHHHHHHHHHhh--hccCcccc-CCe Confidence 4778888899999999999999997543221111 111111111110 00111112111111 11111111 112 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCC Q lcl|NC_021297. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) .+++........|..+.++.+.+++.+| +++...+++.+|+.+-+- -.+. .-..+-.. 
..+..+.+ T Consensus 338 ~v~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--gD~~--------~~~~n~~~--~~~~~~~~ 403 (421) T protein:vir:10 338 YIEFNVSGLLRGDQKSRYESYALGRQWG--WLSVNDIRRMENLPPIAG--GDKY--------LTPLNMVD--SAQIIPGD 403 (421) T ss_pred EEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--ccee--------eecccccc--ccccccCC Confidence 3444445555678888999998988765 667777788887754320 0000 00000000 00111111 Q ss_pred CCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 459 TVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 459 ~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ..++..+++..|.-+|+ T Consensus 404 -----~~~~~~~~~e~d~~~~~ 420 (421) T protein:vir:10 404 -----KKPTAQQMAEIDTILSR 420 (421) T ss_pred -----CCcccccCccccccccc Confidence 11222333344444555 No 243 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=96.65 E-value=0.00044 Score=38.99 Aligned_cols=373 Identities=14% Similarity=0.120 Sum_probs=137.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHH--HHHHHhcccCcchhcCcccchhhhhhccc-cchHHHHHHHHHhhhccCcccc--- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLL--EAEAYRNGTRRLKTIGIGAPPELAYLDVQ-PGWVATYLRTLSDRLDIEGFRI--- 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~--~~~~YY~g~~~i~~~~~~~~~~~~~~~~~-~n~~~~iVd~~~~~l~~~~~~~--- 74 (480) |- |.+.+.. +.+-. .-..++.........+ . ..+ .++. ......+|+..++.+-.-++.+ T Consensus 1 Mg--------~~~~f~~-k~~~~~~~~~~~~~~~~~~~~~~--~-~~~--~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~ 66 (403) T protein:vir:80 1 MG--------LFNFFRR-KTRSEPTNAISWFLTQEAYDTLA--I-PGY--TRLSDNPEVRMAVHKIAELISSMTIHLMQN 66 (403) T ss_pred Cc--------ccccccc-cccccccchhhhhcccccccccc--c-chh--hhhhhhHHHHHHHHHHHHhhhhCceEEEEe Confidence 11 0000100 00000 0000000000000000 0 000 0110 1122334454444332223321 Q ss_pred CCC--chhHHHHHHHHHh--cC---HHHHHHHHHHHHhhc--CeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeC Q lcl|NC_021297. 
75 SED--SEGLEELWNWWQA--ND---LDEESVLGHDDSLTF--GRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDP 144 (480) Q Consensus 75 ~~d--~~~~~~l~~i~~~--n~---~~~~~~~~~~~a~~~--G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~ 144 (480) .++ ......+..++.. |. -..+...++.+.+.. |.||+++.. +..|++ .+..++|..+.++.+. T Consensus 67 ~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~------~~~g~~~~L~~l~p~~v~~~~~~ 140 (403) T protein:vir:80 67 TDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKY------TTSGLIDELIPLAPSKVSFVDTD 140 (403) T ss_pred cCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEE------cCCCcEEEEEEEcCCeeEEEEcC Confidence 111 1122334454432 32 234445566666665 567776643 234554 5667888877766553 Q ss_pred CCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccc-cCccCCccc Q lcl|NC_021297. 145 RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR-LGNRYGRSE 223 (480) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~-~~~~~G~s~ 223 (480) ... . + +|.. ..|.++ -|+||..+.. ..+..|.|. T Consensus 141 ~g~-~----~-~y~~----------~~~~~~-----------------------------eiih~~~~~~~~~~~~G~s~ 175 (403) T protein:vir:80 141 TGY-Q----I-WYQG----------KAYNYD-----------------------------EVLHFIVNPDPEKPYMGRGY 175 (403) T ss_pred Cce-E----E-EEee----------cccchh-----------------------------hEEEEeccCCCcCccccccH Confidence 210 0 0 1100 011122 2344442222 223446664 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcC-CCccccccccccchh-hh-----hhhhhcccCCCCceEEEe Q lcl|NC_021297. 224 ISPELRKVTDAASRTLMNLQSASQILG---TPLRVISG-VTTDELTNDGENTTL-DI-----YYGRILTLASEAAKISEF 293 (480) Q Consensus 224 i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g-~~~~~~~~~~~~~~~-~~-----~~~~~~~~~g~~~~~~~~ 293 (480) + ..+.+.++....-..-...++. .|-.++.- ........+.....+ +. 
..+..+.++++..++.++ T Consensus 176 ~----~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 251 (403) T protein:vir:80 176 R----VVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQV 251 (403) T ss_pred H----HHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeeccccccccee Confidence 4 2344444433322222223333 33333321 111111111111111 11 112333344433333333 Q ss_pred c---cccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021297. 294 K---AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) Q Consensus 294 ~---~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~ 370 (480) . ..+. .+++..+....+|+...++|++-+|....+ +.....+....|.-.+...+. .|.+ .+.. T Consensus 252 ~~l~~~d~-q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~~~~~f~~~~l~P~~~~ie~----~l~~------kll~ 318 (403) T protein:vir:80 252 KPLSLKDL-AIHETVELDKRTVAGIFGVPAFLLGVGKYD--KDEYNNFINSTILPIAKGIEQ----ELTR------KLLI 318 (403) T ss_pred ccCCHHHH-HHHHHHHHhHHHHHHHhCCCHHHcCCCCcc--HHHHHHHHHHHHHHHHHHHHH----HHHH------hccC Confidence 2 2233 467888888999999999999999743222 221111111111111111111 1110 1111 Q ss_pred CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_021297. 371 REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKA 450 (480) Q Consensus 371 ~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (480) ..+ ..+++....-...|..++++.+.+++.+| +++...+++.+|+.+.+-- .+......-..++.. ... 
T Consensus 319 ---~~~-~~~~f~~~~ll~~d~~~~~~~~~~~~~~G--i~t~NE~R~~~gl~p~~gg--d~~~~~~n~~pl~~~---~~~ 387 (403) T protein:vir:80 319 ---SPD-LYFKFNPRSLYAYDLKELAEVGSNMYVRG--LMEGNEVRDWLGLSPKEGL--SELVILENYIPLDKI---GDQ 387 (403) T ss_pred ---CCC-cEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCC--CeEeecccccchhhc---cch Confidence 111 23443334555678888999999998875 6677778888887654210 000000000000000 000 Q ss_pred CcccccCCCCCCCCcc Q lcl|NC_021297. 451 QADATPKPTVTDTKTE 466 (480) Q Consensus 451 ~~~~~~~~~~~d~~~~ 466 (480) ....+.+..+.+++++ T Consensus 388 ~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 388 NKLKGGEKGGADGQTD 403 (403) T ss_pred hhccCCCCCCCCCCCC Confidence 0001111111122112 No 244 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=96.57 E-value=0.0005 Score=38.67 Aligned_cols=396 Identities=14% Similarity=0.100 Sum_probs=165.5 Q ss_pred CCC-----HHHHH------HHHHHHHHHHHHHHHHHHHH-hcccC----cc-hhcCcccchhhhhhccccchHHHHHHHH Q lcl|NC_021297. 1 MTT-----YHEHV------ERLQGLLARDLPNLLEAEAY-RNGTR----RL-KTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~-----~~~~i------~~l~~~~~~~~~r~~~~~~Y-Y~g~~----~i-~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) |+. .-..+ +-+..+....... +..+ +.|-. ++ +..+... .-+..+. ......-++++- T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~---~~~~~~~~~~p~~~~il~~~~~~~-~~y~~m~-~D~~i~s~l~~R 75 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATRARS---IDFFALGMYLPNPDPVLKALGKDI-RVYRELR-ADAHVGGCVRRR 75 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhhccc---cccccccccCcchhHHHhhccCCH-HHHHHHh-hChHHHHHHHHH Confidence 111 00111 1122222111111 1111 11100 00 0000000 1111221 223333344443 Q ss_pred HhhhccCcccc---CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeE---EEEEccc Q lcl|NC_021297. 
64 SDRLDIEGFRI---SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL---IRVESPL 136 (480) Q Consensus 64 ~~~l~~~~~~~---~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~---i~~~~p~ 136 (480) ...+....|.+ .++++..+.+.+++++-+|......+ .+|..||.+ ++++|.. .+|... +.+++|+ T Consensus 76 k~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~------~~g~~~~~~l~~r~~~ 148 (491) T protein:vir:79 76 KAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGK------VGNYIVPIDVVGKPAD 148 (491) T ss_pred HHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEee------cCCeeeEEeeeeeccc Confidence 33344445554 23444567888888877788777766 568889986 5666742 133332 3344443 Q ss_pred eeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccC Q lcl|NC_021297. 137 YMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~ 216 (480) .+. ||+.. ...+....+.. ..... .+++ ++.+.++...+ T Consensus 149 ~f~--~d~~~------------------------------~l~l~~~~~~~-----~g~~l-p~~k---~i~~~~~~~~g 187 (491) T protein:vir:79 149 WFV--YDPEN------------------------------QLRFRSKEHWV-----QGEEL-PARK---FLVPRQEATYL 187 (491) T ss_pred cee--eccCC------------------------------ceEEeecCCCC-----Cceee-cCCC---eEEEEecCCCC Confidence 221 22211 01111111000 00001 1112 34455666777 Q ss_pred ccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhhhhhccc-CCCCceEE Q lcl|NC_021297. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTL-ASEAAKIS 291 (480) Q Consensus 217 ~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~----~~~~~~~~~~~~-~g~~~~~~ 291 (480) .++|.|.+.. +--..--=+..+.+++.-++.|+.|.++.+ -+....+++... ...+..+...++ .|...++. 
T Consensus 188 ~p~g~gLl~~-~~w~~~fK~~~~~~w~~f~E~~G~P~~igk--y~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ 264 (491) T protein:vir:79 188 NPYGFPDLSM-CFWPTTFKKGGLKFWVQFTEKYGSPMLVGK--HPRSASDAETNLLLDRLEDMVQDAVAVIPDDSSIEIK 264 (491) T ss_pred CcccchhHHH-HHHHHHHHHhhHHHHHHHHHHcCCCeEEEe--cCCCCCHHHHHHHHHHHHHHhcCeEEEecCCceeEEE Confidence 7888887754 222222235567788888899999976654 221111111111 122233223333 34566776 Q ss_pred Eecc--ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 292 EFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 292 ~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~ 368 (480) +... .+...|...++..-.+|+...- -+.+.. ++ .++.|+. ....-....++.-.+.+...+.++++-++.+ T Consensus 265 ea~~~~g~~~~y~~li~~~d~~Isk~iL--GqtlTt--~~-~gs~a~~~vh~~v~~~i~~~D~~~i~~tln~li~~l~~~ 339 (491) T protein:vir:79 265 EAAGKSGSADVYERLLHFCRGEVSIALL--GQNQTT--EA-TSTRASAQAGLEVTDDIRDGDKAIVVEAMNMLIRWICDL 339 (491) T ss_pred eccCCCCChhHHHHHHHHHHHHHHHHHh--hhhhcc--Cc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6542 3455676666655555555431 001111 11 1111221 1222233333444455666777777777777 Q ss_pred hCCCCcccceeeeEEecCCCCCCH-HHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_021297. 369 MGREVTEEYTRLETVWRDPSTPTV-AAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYST 447 (480) Q Consensus 369 ~~~~~~~~~~~i~v~f~~~~~~~~-~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (480) ++.... ...+.|..+ .+. ...++.+.++++.|. .++.+.+.+.+|+...+..+. .. . 
T Consensus 340 N~~~~~----~p~f~~~e~--ee~~~~~a~~~~~L~~~G~-~i~~~~~~e~~Gip~~~~~e~----------~~---~-- 397 (491) T protein:vir:79 340 NFDGAA----RPVFDMWEQ--EQVDEIQAGRDEKLTRAGA-RFTPAYFKRAYNLQDGDLDER----------PL---P-- 397 (491) T ss_pred cCCCCC----cceEeecCc--CchhHHHHHHHHHHHhCCC-ccCHHHHHHHhCCCCCCCCcc----------cc---C-- Confidence 764321 234455433 333 457888889998874 578999999999865432110 00 0 Q ss_pred cccCcccccCCCCCCCCcccccCCC-CCCCCCCC Q lcl|NC_021297. 448 TKAQADATPKPTVTDTKTETQTSPS-GFNRTKTR 480 (480) Q Consensus 448 ~~~~~~~~~~~~~~d~~~~~~~~p~-~~~~~~~~ 480 (480) ...++..+........+.++.... -.+....+ T Consensus 398 -~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 430 (491) T protein:vir:79 398 -VSAVDAVGAASFAEFEAPDQDALDAALNALSAR 430 (491) T ss_pred -cCcccccccccccccCCCCCcchHHHHHHHHHH Confidence 000000000000000000000000 00000011 No 245 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=96.57 E-value=0.0005 Score=38.66 Aligned_cols=392 Identities=10% Similarity=0.012 Sum_probs=143.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-hcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---C-CC-- Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAY-RNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---S-ED-- 77 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~Y-Y~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~-~d-- 77 (480) .-++++|. .+.........+ ..|.--.............+.-..+.+...+|+..++-+-.-++.+ . 
++ T Consensus 1 Mg~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~ 76 (423) T protein:vir:81 1 MGFLQKLG----LAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGR 76 (423) T ss_pred CchhHhhc----cccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCce Confidence 12222221 000000000000 0000000000000000011111123344455665555443333322 1 11 Q ss_pred ch-hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce-eEEEEEccceeEEEEeCCCCCeeE Q lcl|NC_021297. 78 SE-GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 78 ~~-~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~-~~i~~~~p~~~~~v~d~~~~~~~~ 151 (480) +. ....+..++.+ | ....+...+..+.+.+|.||+++...... .+. ..+..+++..+.+..+......+. T Consensus 77 ~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~----~~~~~~l~p~~~~~v~~~~~~~~~~~~~ 152 (423) T protein:vir:81 77 ERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGV----DTPTLDIRPIPVSWVQRRAYKDGWGSLD 152 (423) T ss_pred eeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc----CcceEEEeecccceeeeeeccCCCcceE Confidence 11 12334555543 2 24556677788899999999988643211 111 123333333222211111001110 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHH Q lcl|NC_021297. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) +.+. ......+.. ..+.+++ |+|+++.......+|.|.+. 
.+ T Consensus 153 Y~~~--~~~~~~g~~---~~~~~~e-----------------------------vih~r~~~~~~~~~G~spi~----~~ 194 (423) T protein:vir:81 153 YIII--ESGDNDGRS---VKVPGER-----------------------------VIHRHGYNPKTMKRGKSPVQ----SL 194 (423) T ss_pred EEEE--EecCCCceE---EEEcccc-----------------------------eEEecCCCCCCccccccHHH----HH Confidence 0000 000000000 0011111 34555333333456887653 33 Q ss_pred HHHHHHHHHHHHHHHHHhc---chhhhhcCC--Cc-cccccccc---cchhhh-------hhhhhcccCCCCceEEEecc Q lcl|NC_021297. 232 TDAASRTLMNLQSASQILG---TPLRVISGV--TT-DELTNDGE---NTTLDI-------YYGRILTLASEAAKISEFKA 295 (480) Q Consensus 232 iD~~~~~~s~~~~~~~~~~---~~~~~i~g~--~~-~~~~~~~~---~~~~~~-------~~~~~~~~~g~~~~~~~~~~ 295 (480) .+.+...+.-......+|. .|-.+|+-. .. ....++.. ...|+. ..++++.++ ++.++.+++. T Consensus 195 ~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~ 273 (423) T protein:vir:81 195 RDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLE-DGMKAENFHT 273 (423) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecC-CCceEEeccC Confidence 4444433332222333333 344444311 11 01111110 011111 113444554 3467777654 Q ss_pred ccHH-HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCC Q lcl|NC_021297. 296 AELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-REV 373 (480) Q Consensus 296 ~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-~~~ 373 (480) +..+ .|++..+....+|+...++|+..+|....+.-|. +..+...+...+ -.-+-..|++.+.. .+.. .+. 
T Consensus 274 s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn--~e~~~~~f~~~~---L~P~~~~ie~~l~~--~L~~~~~~ 346 (423) T protein:vir:81 274 TSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSN--VREFRKALYGDN---LGSWIRIIQDVMNL--FLLPRVGI 346 (423) T ss_pred ChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCccc--HHHHHHHHHHHH---HHHHHHHHHHHHhh--hhcCcccc Confidence 4322 4777778888999999999999997532221121 222222222111 00111112211111 1111 111 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcc Q lcl|NC_021297. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 374 ~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) ......+++.+..-...|..++++++.+++.. .|+++...+++.+|+.+.+- . |.+.- +.+.. T Consensus 347 ~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~-~G~~T~NE~R~~~gl~p~~g--G------------D~~~~--p~n~~ 409 (423) T protein:vir:81 347 DNEKFYFEFNLEEKLRASFEEAAEIKRAAVGN-VAWMTINEVRAMDNLPSIDG--G------------DDLAR--PLNTE 409 (423) T ss_pred ccCccEEEecchhhhccCHHHHHHHHHHHHhC-CCCcCHHHHHHHhCCCCCCC--c------------ceeec--ccccc Confidence 11112344444555667888888887776543 24666666677777654320 0 00000 00001 Q ss_pred cccCCCCCCCCccc Q lcl|NC_021297. 454 ATPKPTVTDTKTET 467 (480) Q Consensus 454 ~~~~~~~~d~~~~~ 467 (480) ....++.++++.++ T Consensus 410 ~~~~~~~~~~~~~t 423 (423) T protein:vir:81 410 FGDSEDAPGEEVET 423 (423) T ss_pred cCccCCCCCCCCCC Confidence 11111111111122 No 246 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=96.54 E-value=0.00053 Score=38.56 Aligned_cols=372 Identities=10% Similarity=0.065 Sum_probs=139.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CCCchhH Q lcl|NC_021297. 
5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSEGL 81 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~~~ 81 (480) .=++..+..........-..+...+.+.... .+..+... .-+.+.-+..+|+..++-+-.-++.+ .+++... T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~vt~~---~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~~~ 75 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRY--SGVYVTDS---NILQSSDVYELLQDISNQMVLADIVVEDEFGNEIKD 75 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhccccc--CccccChh---hhhccHHHHHHHHHHHHhhcccceEEEcCCCcccch Confidence 2222222211111000001111222221111 11111111 00111223444444444433333332 1222222 Q ss_pred HHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEE Q lcl|NC_021297. 82 EELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 82 ~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ..+..++.+ | ....+...++.+.+.+|.||+++-. .+. . -+..+.|..+... ..+| T Consensus 76 ~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~----------~~~-~--~~~~~~~~~~~~~-------~~~~ 135 (394) T protein:vir:62 76 DIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNG----------AQI-H--LASNVFTELDDNL-------VEHF 135 (394) T ss_pred hhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEec----------cee-e--ccccceEEECCce-------EEEE Confidence 334445443 3 2345667788899999999998732 111 1 1123333332210 0000 Q ss_pred EEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHH Q lcl|NC_021297. 158 TTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASR 237 (480) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~ 237 (480) ..+. .. |..=-|+|+.+.. ..+.+|.|.+. .+.+++.. 
T Consensus 136 ---------------~~~~-~~---------------------~~~~eiih~r~~~-~d~~~G~s~~~----~~~~~i~~ 173 (394) T protein:vir:62 136 ---------------NIGG-HE---------------------IPPCMIRHVKNIG-ADHLRGKGILD----LGRDTLEG 173 (394) T ss_pred ---------------eeCC-EE---------------------echhheEEecCcC-CCCccccChHH----HHHHHHHH Confidence 0000 00 0111245555432 34467777653 33333333 Q ss_pred HHHHH---HHHHHHhcchhhhhcCCCccccccccc----cchhhh------hhhhhcccC-CCCceEEEeccccH-HHHH Q lcl|NC_021297. 238 TLMNL---QSASQILGTPLRVISGVTTDELTNDGE----NTTLDI------YYGRILTLA-SEAAKISEFKAAEL-RNFA 302 (480) Q Consensus 238 ~~s~~---~~~~~~~~~~~~~i~g~~~~~~~~~~~----~~~~~~------~~~~~~~~~-g~~~~~~~~~~~~~-~~~~ 302 (480) ..+-. .......+.|..++. .......++.. ...|.- ..+++..++ |.+.++.++..... ..++ T Consensus 174 ~~~~~~~~~~~~~ng~~~~~il~-~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~ 252 (394) T protein:vir:62 174 VMSAEKTLTDKYKKGGLLTFLLN-LDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTL 252 (394) T ss_pred HHHHHHHHHHHHHccCCcceEEE-eCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHH Confidence 32222 222232334444443 11111111110 111111 112333333 34556656654322 2477 Q ss_pred HHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 303 EEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 303 ~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v 382 (480) +..+....+|+...++|+.-+|....++.+...+.+....|.-.+ ..+++.+.. .+.+.. 
+...+.+ T Consensus 253 e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~~~~~~l~P~~--------~~ie~~l~~--kll~~~---~~~~~~~ 319 (394) T protein:vir:62 253 AYLNVYKKDLGKFLGINVDTYTELIKEDIEKAMMYIHNKAVRPIM--------KNFEDHLSL--LFYAQN---SGKRIKF 319 (394) T ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHHHHHHHHH--------HHHHHHHhh--hhcCcc---ccCceEE Confidence 888888999999999999999854322111111111111221111 112111111 122211 1234667 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCC Q lcl|NC_021297. 383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTD 462 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) .|......+....++++.+++.+| +++...+++.+|+.+-+-..-.+.. .......+ ....+...+..++. T Consensus 320 ~fd~~~~~~~~~~~~~~~~~~~~g--~~T~NE~R~~~gl~p~~~~~gd~~~---~~~n~~~~---~~~~~~~~~~kgge- 390 (394) T protein:vir:62 320 KINILDFVTYSNKTNIGYNLVRTA--ITSPDNVADMLGFPKQNTKESQAIY---ISNDVTEI---GKKEATDGSLGGGE- 390 (394) T ss_pred EechhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeee---cccccccc---cccccccccCCCCC- Confidence 776555555566777788887765 5677777888777542110000000 00000000 00000000000000 Q ss_pred CCccc Q lcl|NC_021297. 463 TKTET 467 (480) Q Consensus 463 ~~~~~ 467 (480) +.++ T Consensus 391 -~~en 394 (394) T protein:vir:62 391 -ENEN 394 (394) T ss_pred -CCCC Confidence 0011 No 247 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=96.20 E-value=0.00089 Score=37.32 Aligned_cols=262 Identities=11% Similarity=0.042 Sum_probs=112.0 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccc Q lcl|NC_021297. 
64 SDRLDIEGFRI-SEDSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPL 136 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~ 136 (480) +. .-++.+ ..++.....+..++.. | ....+...++.+.+.+|.||+.+..+ .+|++ .+..++|. T Consensus 1 ia---~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~------~~G~~~~l~~l~~~ 71 (278) T protein:vir:78 1 MA---SLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPD 71 (278) T ss_pred Cc---cceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEEC------CCCcEEEEEEECCc Confidence 11 111111 1112222334444331 2 34566778889999999999987653 34554 67788999 Q ss_pred eeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccC Q lcl|NC_021297. 137 YMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~ 216 (480) .+.+..+.... .. +|......+.. ..+.+++ |+||++..... T Consensus 72 ~v~v~~~~~~~--~~----~y~~~~~~g~~---~~~~~~e-----------------------------vih~~~~~~~~ 113 (278) T protein:vir:78 72 VVEMLIENQSR--EL----YYSIHAATGNK---LIVHNMD-----------------------------MLHFKHIVASN 113 (278) T ss_pred eeEEEEcCCCc--eE----EEEEEcCCceE---EEEcccc-----------------------------EEEECCCCCCC Confidence 88877664322 11 11111111111 1122222 45555443345 Q ss_pred ccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcC--CCccccccccccchhh---hhhhhhcccCCCCceEE Q lcl|NC_021297. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISG--VTTDELTNDGENTTLD---IYYGRILTLASEAAKIS 291 (480) Q Consensus 217 ~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g--~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~ 291 (480) +++|.|.+.. +...++. ..+-....+..++.+-..+.- ....+...+.-...|+ ...++++.++ ++.++. 
T Consensus 114 ~~~G~s~~~~-~~~~i~~---~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~ 188 (278) T protein:vir:78 114 MVQGISPIDV-LKNTTDF---DNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIE 188 (278) T ss_pred CeeeccHHHH-HHHHHHH---HHHHHHHHHHHhcCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecC-CCceEE Confidence 5678876542 3333332 222112223344433333321 1111111110011111 1223344444 345676 Q ss_pred Eeccc-cHHHHHHHHHHHHHHHHhccCCChHHhccccc-cchHH-HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 292 EFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASA-EAIIAT-DSRIVKMAERKGRIFGGAWERAMRIAMQ 367 (480) Q Consensus 292 ~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg-~Al~~~-~~~l~~k~~~~~~~~~~~l~~~~~~~~~ 367 (480) +++.. .-..+++..+....+|+...++|+.-+|.... +-++. .+.+.. ...+.-.++..+ .+|.+ . T Consensus 189 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~----~~ln~------~ 258 (278) T protein:vir:78 189 PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE----EEFNR------K 258 (278) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH----HHHHh------h Confidence 66543 22346788888899999999999999985432 22221 122111 111221111111 11111 1 Q ss_pred HhCCCCcccceeeeEEecCCCC Q lcl|NC_021297. 368 IMGREVTEEYTRLETVWRDPST 389 (480) Q Consensus 368 ~~~~~~~~~~~~i~v~f~~~~~ 389 (480) +....... ....+.|.-... T Consensus 259 L~~~~e~~--~g~~~~f~~~~l 278 (278) T protein:vir:78 259 LLTKTDRE--KIGILNLTLNLI 278 (278) T ss_pred cCChhHhc--CCceEEEecccC Confidence 11111000 123344421111 No 248 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=96.19 E-value=0.00089 Score=37.30 Aligned_cols=341 Identities=13% Similarity=0.075 Sum_probs=131.6 Q ss_pred CCCHHHHHHHHHHHHHHHHH-HHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLP-NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~-r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |. |+. .+..+.. ....+.....+.... ..+..+.... -++. .=.-.+|+..++-+-..++. ++ T Consensus 1 M~----~~~----~f~~r~~~~~~~~~~~~~~~~~~-~~~~~v~~~~-al~~--~av~~cv~~ia~~ia~~p~~--~~-- 64 (359) T protein:vir:10 1 MS----ILN----PFERRSSITPNNYYPFMVQNGSI-VPNSLVDATE-ALKN--SDLYAVTSLISSDIAGTRFI--GN-- 64 (359) T ss_pred Cc----ccc----hhhccccCCCCcchhhhhccccc-cCCcccCHHH-hhcc--hHHHHHHHHHHHhhhcCccc--cc-- Confidence 11 111 0111000 000000000000000 0111111110 0111 00112344444333333332 11 Q ss_pred hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 80 GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 80 ~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ..+..++.+ | .-..+...+..+.+.+|.||+++-++ ..|.+ .+..++|..+.+..++. . + T Consensus 65 --~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~g~~~~l~~l~~~~v~i~~~~~---~----~ 129 (359) T protein:vir:10 65 --QVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKG------DNSLMKELRLIPSNAITIDLTDD---T----L 129 (359) T ss_pred --hHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEEC------CCCeEEEEEEeCCceEEEEEcCC---e----E Confidence 122333333 3 22345567777888999999988653 34554 46778888777655431 1 1 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHH Q lcl|NC_021297. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~ 234 (480) +|.......+. ...+.++++++++... + +.....+..|.|.+. .+.+. 
T Consensus 130 ~y~~~~~~~~~---~~~~~~~evih~~~~~------------------------~-~~~~~dg~~G~spi~----~~~~~ 177 (359) T protein:vir:10 130 TYEVNQFDDYP---SAKYNASEMIHVKIMA------------------------Y-GVDTLHNLVGHSPLE----SLTSE 177 (359) T ss_pred EEEEEecCCce---EEEEcccceEEeccCC------------------------C-CCCccCccccccHHH----HHHHH Confidence 12111111111 1122333333322110 0 001123345776553 23333 Q ss_pred HHHHHHHHHHHHHHh---cchhhhhcC--CCccccccccccchhhhh-----hhhhcccCCCCceEEEeccccHH-HHHH Q lcl|NC_021297. 235 ASRTLMNLQSASQIL---GTPLRVISG--VTTDELTNDGENTTLDIY-----YGRILTLASEAAKISEFKAAELR-NFAE 303 (480) Q Consensus 235 ~~~~~s~~~~~~~~~---~~~~~~i~g--~~~~~~~~~~~~~~~~~~-----~~~~~~~~g~~~~~~~~~~~~~~-~~~~ 303 (480) +.....-..-...+| +.|-.++.- ...+....+.-...|+.. .+++..++ ++.++.+++....+ .+++ T Consensus 178 i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~le 256 (359) T protein:vir:10 178 IGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLD-QSADFSTVSINADVANYLN 256 (359) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecC-CCcceeeecCCHHHHHHHH Confidence 333222222222222 334334321 111111111011112111 12344444 44677666544333 4788 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIV-KMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~-~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v 382 (480) ..+....+|+...++|++.+|....+.++...++..+...+ ....-....+...|.+ .+ ..+. ...+ T Consensus 257 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~------~~-----~~~~-~~~~ 324 (359) T protein:vir:10 257 SMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLISELRIKCDS------SI-----GVDM-SPIT 324 (359) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh------hh-----cccc-hhhh Confidence 88889999999999999999865444344444444333221 2222222222111111 00 0000 0001 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH Q lcl|NC_021297. 
383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ 425 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~ 425 (480) .| +.......+.+++++| +++...+++.++..+== T Consensus 325 ~~------d~~~~~~~~~~~~~~G--~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 325 DY------SNSVFKADILNWVKEG--IIEPTEAKTLLESKGII 359 (359) T ss_pred hc------CHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC Confidence 11 2233333455666654 56666667666542221 No 249 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=96.02 E-value=0.0011 Score=36.77 Aligned_cols=419 Identities=13% Similarity=0.065 Sum_probs=158.5 Q ss_pred CC---CHHH---HHHHHHHHHHHHHHHHHHHHHHhc-ccC-cchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcc Q lcl|NC_021297. 1 MT---TYHE---HVERLQGLLARDLPNLLEAEAYRN-GTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~---~~~~---~i~~l~~~~~~~~~r~~~~~~YY~-g~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) || ++.- -+.+....=.. .. .+.|+ -.+ +....+... .-+..+.-......-++++....+....+ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~--~~----~~~~~~~e~~~~lr~~~~~-~ly~~m~e~D~~i~s~l~~rk~av~~~~w 73 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVV--DG----WTVWDPFEQTPELQWPQSV-AVYSRMDNEDSRVTSLLEAISLPIRSTPW 73 (469) T ss_pred CCCcccCCCCccchhhhhhcccc--cc----hhhccccccccccccccch-HHHHHHHhhChHHHHHHHHHHHHHhcCCc Confidence 22 2211 11111110000 00 01111 000 000000000 01111111123333333333333334444 Q ss_pred cc--C-CCchhHHHHHHHHH-----------------hcCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeEEE Q lcl|NC_021297. 73 RI--S-EDSEGLEELWNWWQ-----------------ANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 73 ~~--~-~d~~~~~~l~~i~~-----------------~n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~i~ 131 (480) ++ + ++++..+.+.+.+. +..+..++.++...+..||.+ +++||..... ..+|+..+. 
T Consensus 74 ~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~--~~dG~~~~~ 151 (469) T protein:vir:10 74 RIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQ--SPDGRFWLR 151 (469) T ss_pred eEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccc--cCCCceeee Confidence 43 1 22222222222211 124566777778888889987 6688854321 124555444 Q ss_pred EEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeC-CcEEEEEecCCcccccccccccccccccccc----- Q lcl|NC_021297. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLP-DETVPLRRNGGLNDQWVVDGDVIKHGLGVVP----- 205 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP----- 205 (480) .+.+.. .+.. .++.... +++ ....+.... ..... ....+.++.+| T Consensus 152 ~l~~rp----------~~~i--~~~~~~~-~~~-l~~~~~~~~~~~~~~---------------~~~~~~~~~~~lp~~k 202 (469) T protein:vir:10 152 KLAPRP----------QWTI--SKFNVAP-DGG-LESIEQIAPPARTRG---------------SLYVANIAPPEIPVNR 202 (469) T ss_pred eeeecC----------cccc--eeeeecc-CCc-eeeeeecCccccccc---------------ccccCCCCccccccCc Confidence 333321 0000 0111111 111 111110000 00000 00001111111 Q ss_pred eEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhh----h--hhhh Q lcl|NC_021297. 206 VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLD----I--YYGR 279 (480) Q Consensus 206 vv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~----~--~~~~ 279 (480) ++.+.++...+.++|.|.+..-....+ -=+..+.+++.-++.|+.|.++.+- +..-.++....... + +... 
T Consensus 203 ~i~~~~~~~~g~p~g~gLlr~~~~~~~-fK~~~~~~w~~f~EryG~P~~vgky--~~~a~~~ek~~l~~a~~~~~~g~~a 279 (469) T protein:vir:10 203 LVVYTRNKRPGQWQGKSILRSAYKHWL-LKDKLLRIEAATAERNGMGIPVGTA--SSATDEDEVRKMAALARSVRGGINA 279 (469) T ss_pred EEEEEecCCCCCcccchhHHHHHHHHH-HHHHHHHHHHHHHHHcCCcceEEec--CCCCCHHHHHHHHHHHHHHhcCCce Confidence 455667778888999988754222211 2245777888888999999876542 21111111111111 1 1122 Q ss_pred hcccC-CCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 280 ILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAW 358 (480) Q Consensus 280 ~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l 358 (480) ...++ |...++.+.+ .+...|...++..-.+|++..--..-....++++.+.|.. ...-....++.-.+.+...+ T Consensus 280 ~~iip~~~~ie~~ea~-g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~v---h~ev~~d~~~sDa~~i~~tl 355 (469) T protein:vir:10 280 GVGLAQGQILELLGVS-GNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASV---LEDPFTQAVHAYATSICRIA 355 (469) T ss_pred EEEccCCceEEEeecC-CCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHH---HHHHHHHHHHHHHHHHHHHH Confidence 22232 4455565543 3445666666666666665531000001111111111221 12222223333334555566 Q ss_pred H-HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhcccc---CCCHHHHHHhCCCChhHHHHHHHHHH Q lcl|NC_021297. 359 E-RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG---PIPKEQARIDLGYTATQREQMRDWDK 434 (480) Q Consensus 359 ~-~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~---~~s~et~~~~lg~~~~~~~~~~~~~~ 434 (480) . ++++-++.++-+.. ....++.|.... ......++.+.++++.|.. -++.+.+.+.+|+......+...... 
T Consensus 356 n~~li~~l~~lN~g~~---~~~P~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~ 431 (469) T protein:vir:10 356 NQHIIEDLVDINFGVD---TPAPVLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPE 431 (469) T ss_pred HHHHHHHHHHhcCCCC---CCccEEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccch Confidence 4 46665566653221 123456775544 4456678889899988752 25677888999886443211100000 Q ss_pred HHHHHHHHHHhcccccCcccccCCCCCCC-CcccccCCCCC-CCCCCC Q lcl|NC_021297. 435 QETEDMIDTLYSTTKAQADATPKPTVTDT-KTETQTSPSGF-NRTKTR 480 (480) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~-~~~~~~~p~~~-~~~~~~ 480 (480) +. ++.+. +.+.+..... ...+.+++... +.+.-. T Consensus 432 ~~---------~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 467 (469) T protein:vir:10 432 EP---------AAVPN---QSAAPARTRSSGNADARARAPKADQGVLF 467 (469) T ss_pred hc---------ccCCC---CCccccccCCCCCcccccccCCChHHhhc Confidence 00 00000 0001111000 00000111000 000000 No 250 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=95.93 E-value=0.0012 Score=36.52 Aligned_cols=361 Identities=9% Similarity=0.037 Sum_probs=138.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCch Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~ 79 (480) |. +...+.+.. ......|++.- . ..+. 
............+|+..++-+-..++.+- .+.+ T Consensus 1 Mg----~f~~~f~~~-------~~~~~~~~~~~----~-~~~~---~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~ 61 (385) T protein:vir:95 1 MG----LFDSVFKRH-------SELSWMYDLEF----L-QDKS---KKAYLKQIALNTVVEMVARTISQSEFRVMKNNTK 61 (385) T ss_pred Cc----hhhhhhccC-------cccccccchhh----h-hccc---hhhhhhhHHHHHHHHHHHHHHcccceeeeecCcc Confidence 22 222222111 11111111110 0 0000 01111122334445555544433444331 1222 Q ss_pred hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 80 GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 80 ~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ....+..++.. | ....+...++.+.+.+|.||++.... .+.+ ...++.|.. ..++.. . T Consensus 62 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~-------~~~~~~~~~~~~~~-~~~~~~------~-- 125 (385) T protein:vir:95 62 EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDE-------GHFFVADDFEKEDE-LGLYSH------R-- 125 (385) T ss_pred ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecC-------CCeeeccccccccc-cccccc------c-- Confidence 23345555532 3 23456677888999999999865321 1111 111111110 000000 0 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) ++....+.. ..... +..--|+||++.......+|.|.+. .+.. 
T Consensus 126 --~~~~~~~~~--~~~~~-----------------------------~~~~eiih~~~~~~~~~~~G~s~~~----~~~~ 168 (385) T protein:vir:95 126 --FTNVLVNDF--EFKRV-----------------------------FTMDDVIYLKYNNQKLDAFSLGLFE----DYGE 168 (385) T ss_pred --ceeeeeccc--ceeee-----------------------------eccccEEEecCCCCCcccccchHHH----HHHH Confidence 000000000 00000 1111245555544334456766442 2223 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc---ccchhhh-------hhhhhcccCCCCceEEEeccc------- Q lcl|NC_021297. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDG---ENTTLDI-------YYGRILTLASEAAKISEFKAA------- 296 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~---~~~~~~~-------~~~~~~~~~g~~~~~~~~~~~------- 296 (480) .+.. ......+.+.+.-++.-.......++. -...|+. ..+.++.++ ++.++.+++.. T Consensus 169 ~i~~----~~~~~~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~-~g~~~~~l~~~~~~~~s~ 243 (385) T protein:vir:95 169 IFGR----MIDLQMLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLT-EGLAYEEHSNRGAAQSAQ 243 (385) T ss_pred HHHH----HHHHHHhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcC-CCceeEeecccccccCCH Confidence 3322 222333333332222110000111111 0111211 111233333 44566655421 Q ss_pred cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 297 ~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) .-..|++..+....+|+...++|+..++++..| .+...+.+....|.-.+...+..+.. .+........ 
T Consensus 244 ~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~----------~L~~~~~~~~ 312 (385) T protein:vir:95 244 QFSELNELKKTVLTDVARMIGVPPSLVLGEMAD-LEKTIESYLQFCINPLLRKIEAELNS----------KFFYQDEYLN 312 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcC-HHHHHHHHHHHHHHHHHHHHHHHHHh----------hcCChhhccc Confidence 123588888999999999999999999754322 12212222222222222111111111 1111111111 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccccc Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ..+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+-+.-.+ ..- +.+-.... T Consensus 313 -~~~~fd~~~l~~~D~~~~~~~~~~~~~~g--~lt~NE~R~~~g~~p~~~~~gd~------------~~~--~~n~~~~~ 375 (385) T protein:vir:95 313 -DDMHIKVVGIDKRDPLKLSEAIDKLVASG--TFTRNQVRIMTGEEPADDPELDK------------FII--TKNLQSAD 375 (385) T ss_pred -ceEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCce------------eee--cccceecc Confidence 23555666667788889999999998875 56777778888775421000000 000 00000000 Q ss_pred CCCCCCCCcc Q lcl|NC_021297. 457 KPTVTDTKTE 466 (480) Q Consensus 457 ~~~~~d~~~~ 466 (480) ...+++...+ T Consensus 376 ~~kgge~~~e 385 (385) T protein:vir:95 376 AFKGGESNEE 385 (385) T ss_pred cccCCCCCCC Confidence 0111111111 No 251 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=95.70 E-value=0.0016 Score=35.92 Aligned_cols=424 Identities=12% Similarity=0.100 Sum_probs=185.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHH-----hcccCcchh---------------cCccc------chhhhhhccccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAY-----RNGTRRLKT---------------IGIGA------PPELAYLDVQPG 54 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~Y-----Y~g~~~i~~---------------~~~~~------~~~~~~~~~~~n 54 (480) |.+.+.|-+.=-..+...+ +..+..+ =+|...|+. ++... -..|+.+-..+. T Consensus 1 ~~~~~~w~~~de~~~~~~~--~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~p 78 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNF--LSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDP 78 (533) T ss_pred CCCcchhhhhhHHHHHHHh--hchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCc Confidence 8887766543222221111 1111111 122212210 00000 001111111122 Q ss_pred hHHHHHHHHHhhhcc-----Ccccc-CCCchhHHHHH-HHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce Q lcl|NC_021297. 55 WVATYLRTLSDRLDI-----EGFRI-SEDSEGLEELW-NWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI 127 (480) Q Consensus 55 ~~~~iVd~~~~~l~~-----~~~~~-~~d~~~~~~l~-~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~ 127 (480) =+...|+..++-... .++.+ .++.+..+.+. .|..-.+|+....+..+.-.+.|+.|.+.-.. ..+.|- T Consensus 79 EVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik----~~k~GI 154 (533) T protein:vir:58 79 LISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEK----GSDGTI 154 (533) T ss_pred chhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccC----Ccccch Confidence 333333333322211 11111 11222222222 34444578999999999999999999987432 234677 Q ss_pred eEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccc-- Q lcl|NC_021297. 128 PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP-- 205 (480) Q Consensus 128 ~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP-- 205 (480) ..++.+||..+-.+++...+ . .+-+|++..... 
.....+ -.|| T Consensus 155 ~elr~lDPr~i~~vr~~~t~--~-----------------eyyvy~~~~~~~----~s~~~~------------~kI~~d 199 (533) T protein:vir:58 155 EKFQVVSPYIFSKRYNPETD--T-----------------WYYVITDVYRNV----VSGYFN------------EDIPEE 199 (533) T ss_pred hhheecCCeeeEEEEeeccc--e-----------------EEEeeccccccc----ccCccc------------cccchh Confidence 78899999988888765332 1 112333321110 111000 0111 Q ss_pred -eEEeecc-cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-------------- Q lcl|NC_021297. 206 -VVPLTND-PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-------------- 269 (480) Q Consensus 206 -vv~~~n~-~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-------------- 269 (480) |+++..- .+..+..+.|-|.+.++++-. -+++.|.+.+-+....|-+=++-.+.-..+..+. T Consensus 200 aI~y~~SGl~d~~~~~iisyLhkAiKp~NQ--LkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNk 277 (533) T protein:vir:58 200 DVIHFSHKIDTNFFPYGRSYLESARAIWNQ--LRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRD 277 (533) T ss_pred heeeeeeccccCCCCceehhhhHHHHHHHH--HHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccc Confidence 2222211 222345566666665555432 2567777777777666665554222222111110 Q ss_pred -------cch-----hh---hhhhhhcc---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccch Q lcl|NC_021297. 270 -------NTT-----LD---IYYGRILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA 331 (480) Q Consensus 270 -------~~~-----~~---~~~~~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~ 331 (480) +.+ +. ......|. .+|-..++..++...+ .-++-++-..+.++...++|.+-++...+.. 
T Consensus 278 lvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~l-gemeDV~YF~kkLy~ALnVP~sRl~~e~~fg- 355 (533) T protein:vir:58 278 YWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKV-DLAEDVEYMLNRLISALKVPKAFIGYEGDVN- 355 (533) T ss_pred eEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCC-CcHHHHHHHHHHHHHHhCCCeeecCCCCCCc- Confidence 000 00 11222222 1234566777776553 4456677777888899999988776543321 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH---HHHhcccc Q lcl|NC_021297. 332 SAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS---KLYANGQG 408 (480) Q Consensus 332 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~---kl~~~~~~ 408 (480) -|..|-.-......-+.+++..|..-|++-+ .+.|.-... ++++.|..-..-.....++.+. .+.+...+ T Consensus 356 r~~eItRDEiKF~KFI~rLR~rF~~ll~~qL----ilk~iit~e---ew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dp 428 (533) T protein:vir:58 356 AKNTLATQDIKFNNTIKRIQGFFVEELERMV----RMNKEFADQ---DFRLVMNRSNSIVEGERFAVIEQRIGIAERLKG 428 (533) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHhccc----ccccCcchh---heeeeeeccchHHHHHHHHHHHHHHHHHHHhcc Confidence 2234544555566677777777776665422 223322222 3467775433333333333332 23344467 Q ss_pred CCCHHHHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhcccccC------------cccccCCCCC---CCCcccccCCC Q lcl|NC_021297. 409 PIPKEQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQ------------ADATPKPTVT---DTKTETQTSPS 472 (480) Q Consensus 409 ~~s~et~~~~-lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~---d~~~~~~~~p~ 472 (480) .+++.++++. |..+++.+++.+. ++++. .+..+.....+ +++.+.+.++ +..++.+..++ T Consensus 429 yvgk~yi~k~ILr~tdei~~q~e~-ie~E~---~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 504 (533) T protein:vir:58 429 WVREDWIYSNILQIPYDLKPQEEV-AEAAG---GGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELG 504 (533) T ss_pred hhhHHHHHHHHhcCChhhhHHHHH-HHHhh---cCCCCCCCCcccccCCcccCccccCcccCCCChhhHhcccCCccccc Confidence 8889887765 6888765444332 22221 11111111000 0111111110 00000000011 Q ss_pred -------------CCCCCCCC Q lcl|NC_021297. 
473 -------------GFNRTKTR 480 (480) Q Consensus 473 -------------~~~~~~~~ 480 (480) ..+|++.+ T Consensus 505 ~~~~~~~a~~~~~~~~g~~~~ 525 (533) T protein:vir:58 505 GELNLGGAFEEFEEETGGGEE 525 (533) T ss_pred ccccccccchhhhhhcCCccc Confidence 11122222 No 252 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=385 Identities=14% Similarity=0.050 Sum_probs=149.1 Q ss_pred CCCHHHHHHHHHHHHH---HHHHHHHHH-----------HHHhcccCc--chh-------cCcccchhhhhhccccchHH Q lcl|NC_021297. 1 MTTYHEHVERLQGLLA---RDLPNLLEA-----------EAYRNGTRR--LKT-------IGIGAPPELAYLDVQPGWVA 57 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~---~~~~r~~~~-----------~~YY~g~~~--i~~-------~~~~~~~~~~~~~~~~n~~~ 57 (480) |- ++..+...-. ...++..-- -+.+.|-.+ +.. .+..+.... -++ +.-.. T Consensus 1 Mg----l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~-al~--~~~V~ 73 (431) T protein:vir:10 1 MG----LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETR-ALR--NMAVL 73 (431) T ss_pred Cc----chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhh-hhc--cHHHH Confidence 22 1111111000 000110000 001111000 000 011111000 001 11122 Q ss_pred HHHHHHHhhhccCccc---cCCCc--hhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce Q lcl|NC_021297. 58 TYLRTLSDRLDIEGFR---ISEDS--EGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI 127 (480) Q Consensus 58 ~iVd~~~~~l~~~~~~---~~~d~--~~~~~l~~i~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~ 127 (480) .+|+..++-+-.-++. ..++. .....+..++.. |. 
-..+...++.+.+.+|.||+++-.+ ...- T Consensus 74 ~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~------~g~~ 147 (431) T protein:vir:10 74 RCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWS------GNRP 147 (431) T ss_pred HHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCce Confidence 3333333322222332 11111 122345555542 32 2355667888999999999988642 1222 Q ss_pred eEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceE Q lcl|NC_021297. 128 PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVV 207 (480) Q Consensus 128 ~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv 207 (480) ..+..++|..+.+..+... .+. |.... ..+.. ..+..+. |+ T Consensus 148 ~~L~pl~~~~v~~~~~~~~--~~~----y~~~~-~~g~~---~~~~~~d-----------------------------Vi 188 (431) T protein:vir:10 148 IRLIPMDRGSAKGRLTSTW--QIV----YDYTT-PTGDK---IELPARE-----------------------------VF 188 (431) T ss_pred EEEEEEcCceeEEEEcCCC--eEE----EEEEe-CCceE---EEEchhh-----------------------------EE Confidence 4567788888877665321 111 11111 11110 0111222 24 Q ss_pred EeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCC-Cccccccccccchhhh------hh Q lcl|NC_021297. 208 PLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGV-TTDELTNDGENTTLDI------YY 277 (480) Q Consensus 208 ~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~i~g~-~~~~~~~~~~~~~~~~------~~ 277 (480) ||++. ...+.+|.|.+. .+.+.+.....-......+|. .|-.++.-. ...+...+.-...|+. .. 
T Consensus 189 Hir~~-~~dg~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~ 263 (431) T protein:vir:10 189 HLRDL-SIDGVSGVSRVK----LSGNALELAEQAERAASRTFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENA 263 (431) T ss_pred EecCc-CCCCcccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccc Confidence 44432 234466777653 233333332222222223333 343333211 1111101100111211 11 Q ss_pred hhhcccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 278 GRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGG 356 (480) Q Consensus 278 ~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~ 356 (480) ++++.++ ++.++.++.... --.+++..+....+|+...++|++.+|....+ ++..++.....+...+ -.-+-. T Consensus 264 g~~~vl~-~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~--t~sn~eq~~~~f~~~t---L~P~~~ 337 (431) T protein:vir:10 264 GSWMLLE-EGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS--WGSGIEQLAIFFIQYG---LSHWFV 337 (431) T ss_pred CCceecC-CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC--ccccHHHHHHHHHHHH---HHHHHH Confidence 2344454 446777776432 23567877888899999999999999864322 1222222222222211 011111 Q ss_pred HHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccc--cCCCHHHHHHhCCCChhHHHHHHHHHH Q lcl|NC_021297. 357 AWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQ--GPIPKEQARIDLGYTATQREQMRDWDK 434 (480) Q Consensus 357 ~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~--~~~s~et~~~~lg~~~~~~~~~~~~~~ 434 (480) .|++.+.. .+....... ...+++.+..-...|..++++++.+++.+|. 
++++...+++.+|+.+-+-..- T Consensus 338 ~ie~~ln~--~Ll~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~g----- 409 (431) T protein:vir:10 338 SWEQAAAR--AFLPEKMLG-QRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVA----- 409 (431) T ss_pred HHHHHHHh--hccChhhcC-CceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccc----- Confidence 12211111 111111111 2345555555667788899999999887652 3456666666665533210000 Q ss_pred HHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCC Q lcl|NC_021297. 435 QETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGF 474 (480) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~ 474 (480) +.+.... ...+.+ +.+.+|..+ T Consensus 410 -------D~~~~p~----n~~~~~-------~~~~~p~~~ 431 (431) T protein:vir:10 410 -------DQLRNPM----TQKQKG-------SGDEPPATT 431 (431) T ss_pred -------cceeccc----ccccCC-------CCCCCCCCC Confidence 0000000 000000 111112222 No 253 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=95.35 E-value=0.0022 Score=35.11 Aligned_cols=381 Identities=13% Similarity=0.053 Sum_probs=149.1 Q ss_pred HHHHHHHHH-HHHH----HHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CCCc--h Q lcl|NC_021297. 10 RLQGLLARD-LPNL----LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDS--E 79 (480) Q Consensus 10 ~l~~~~~~~-~~r~----~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~--~ 79 (480) -+......+ ...- .-+-...-|-... ..+..+.+.. -++ +.-...+|+..++-+-.-++.+ ..+. 
+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s-~~~~~v~~~~-al~--~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~ 76 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARS-EAGQVVTPAS-ALS--LTVLQNCVTLLAESIAQLPVELYERSGDDRKP 76 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhccccc-ccCcccChHH-hhc--cHHHHHHHHHHHHhhccCceEEEEecCCCccc Confidence 000000000 0000 0000000110000 0111111110 011 1112233444443333333322 2221 1 Q ss_pred -hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 80 -GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 80 -~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) ....+..++.. | .-..+...+..+.+.+|.||+++..+ ..|++ -+..++|..+.+..+... .+ T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~------~~G~~~~L~~i~~~~v~i~~~~~~--~~-- 146 (419) T protein:vir:80 77 ATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRD------QDGVIQGLYPLDNEAVTVMKGPDL--KP-- 146 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEecCceEEEEECCCc--eE-- Confidence 12345555542 3 23456677888899999999988653 34555 477888888877665321 11 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) +| ...+.. . +..==|+|+.++. ..+.+|.|.+.- + . T Consensus 147 ---~y-----------------------~~~~~~---~----------~~~~~i~h~~~~~-~d~~~G~s~i~~-~---~ 182 (419) T protein:vir:80 147 ---MY-----------------------RVAGAD---P----------LPQRLVHHVRWMS-INGYTGLSPVLL-H---A 182 (419) T ss_pred ---EE-----------------------EEcCcc---c----------cchhheEEecCCC-CCCcccccHHHH-H---H Confidence 11 000000 0 0000144555443 345678886642 2 3 Q ss_pred HHHHHHHHHHHHHHHHh---cchhhhhcCC-Cccccccccccc----hhhh------hhhhhcccCCCCceEEEeccccH Q lcl|NC_021297. 
233 DAASRTLMNLQSASQIL---GTPLRVISGV-TTDELTNDGENT----TLDI------YYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~---~~~~~~i~g~-~~~~~~~~~~~~----~~~~------~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) +.++..+.-..-...+| +.|-.+|.-. ......+++... .|+. ..++++.++ ++.++.++..... T Consensus 183 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~s~~ 261 (419) T protein:vir:80 183 NAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQ-EGMKFKPLSMTNV 261 (419) T ss_pred HHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecC-CCceEEeccCChh Confidence 33332222222222222 3344444310 111111111111 1110 113345554 3467777764433 Q ss_pred H-HHHHHHHHHHHHHHhccCCChHHhccccc-cchHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021297. 299 R-NFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEA--IIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) Q Consensus 299 ~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~A--l~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 374 (480) + .+++..+....+|+...++|+..+|.... +-++... +.+....|.-.+...+. .|.+ .+...... T Consensus 262 d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~----~l~~------kll~~~~~ 331 (419) T protein:vir:80 262 DAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQ----AKTR------DLLLPSER 331 (419) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH----HHhh------hccCcccc Confidence 3 47788888899999999999999975322 2111111 11111112111111111 1111 11111111 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccc Q lcl|NC_021297. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 375 ~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) . ...+++.+......|..+.++.+.+++.+| +++...+++.+|+.+-+- -.+. .-. 
........+ T Consensus 332 ~-~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--GD~~--------~~~--~n~~~~~~~ 396 (419) T protein:vir:80 332 K-QYFIEYNLAGLLRGDQSSRYAAYAVGRQWG--WLSINDIRRLENMPPVKG--GDIY--------LSP--MNMVDASKP 396 (419) T ss_pred C-CeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--ccee--------eec--ccccccccc Confidence 1 123555555566678889999888888765 566666777777654320 0000 000 000000001 Q ss_pred ccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 455 TPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .+.+.+. ++++.+ ..++- -| T Consensus 397 ~~~~~~~---~~~~~~--~~~~~-~~ 416 (419) T protein:vir:80 397 QPIPMGK---TEPTKA--ALDEI-GR 416 (419) T ss_pred ccccCCC---CCchhh--hHHHH-Hh Confidence 1111110 000000 00000 11 No 254 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=95.13 E-value=0.0027 Score=34.66 Aligned_cols=308 Identities=12% Similarity=0.064 Sum_probs=114.3 Q ss_pred eeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccccc-----ceEEeec Q lcl|NC_021297. 137 YMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVV-----PVVPLTN 211 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i-----Pvv~~~n 211 (480) =.-.+|...........+. |. ......++.+..++....++... +++.+.+ =++.+.+ T Consensus 1 v~Eivw~~~~g~~~~~~l~-~r---~~~~~~~f~~~~~~~l~~~~~~~-------------~~g~~~~~lp~~kfi~~~~ 63 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLA-WR---PPRTISRFDVAPDGGLVAIEQWG-------------VFGKATVRIPVDRLVVFVN 63 (355) T ss_pred CeEEEEEeeCCeEEEeeee-ec---CccceeeeeeccCCceeEEEecC-------------CCCCCcceeccCCEEEEEe Confidence 0012343221111111110 00 00001111111112211111111 1111111 1455667 Q ss_pred ccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-----------cchhh------ Q lcl|NC_021297. 
212 DPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-----------NTTLD------ 274 (480) Q Consensus 212 ~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-----------~~~~~------ 274 (480) +...+.++|.+.+..-.-..+ -=+..+.+++.-++.|+.|+.+.+|-......+... ..... T Consensus 64 ~~~~g~p~G~gLlr~~~w~~~-fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~ 142 (355) T protein:vir:78 64 EREGANWLGQSLLRQAYKNWL-LKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFR 142 (355) T ss_pred CCCCCCccchhhHHHHHHHHH-HHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhh Confidence 778888999987754222222 124567778888888888887877654322111110 00000 Q ss_pred hhhhhh-cccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_021297. 275 IYYGRI-LTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGR 352 (480) Q Consensus 275 ~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~-~~~~l~~k~~~~~~ 352 (480) .+.... +.-.|...++.+.... ...|...++..-.+|++..-- +.+........+.-|+.. ...-....++.-.+ T Consensus 143 ~g~~a~~iip~g~~ie~~ea~g~-~~~~~~~i~~~d~~Isk~iLG--qtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~ 219 (355) T protein:vir:78 143 AGEAAGGYIPHGANFTLTGVQGK-LPEMDGPIRYHDEQIARAVLA--HFLTLGGDKSTGSYALGDTFASFFTGSLNAVMK 219 (355) T ss_pred CCcceeEeecCCceEEEeecCCC-cccHHHHHHHHHHHHHHHHhh--hhhccccCCccchhhHHHHHHHHHHHHHHHHHH Confidence 111111 1112334555543322 222444444444555544311 001000000011112222 22223333333445 Q ss_pred HHHHHHH-HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH----HHHHHhCCCChhHHH Q lcl|NC_021297. 353 IFGGAWE-RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPK----EQARIDLGYTATQRE 427 (480) Q Consensus 353 ~~~~~l~-~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~----et~~~~lg~~~~~~~ 427 (480) .+...|. ++++-++.++..... ....+.|.. ...+..+.++.+.+++..|.. ++. ..+++.+|+.+.... 
T Consensus 220 ~i~~~ln~~li~~l~~lN~~~~~---~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~-~~~~~~~~~~~e~~gip~p~~~ 294 (355) T protein:vir:78 220 HIADVTQQHVVEDLVDQNWGPEE---PAPRLVPAQ-LGKEQPVTAEAIRALVECGAF-TADPELEKDLRARYGLPAPAER 294 (355) T ss_pred HHHHHHHHHHHHHHHHhcCCCCC---CCCEEEecC-cChhHHHHHHHHHHHHhCCCc-cccHHHHHHHHHHhCCCCCCCC Confidence 5556664 466666666533321 224566754 345666788999999888743 443 357788887533211 Q ss_pred HHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCC------------CCCCCCC Q lcl|NC_021297. 428 QMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSG------------FNRTKTR 480 (480) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~------------~~~~~~~ 480 (480) +. . . ................+........+...|.. .=.+++| T Consensus 295 ~~-~-----~----~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~~~~~~~~~~~~~ 349 (355) T protein:vir:78 295 DD-G-----A----DAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRRGPLRRRPRHPAHR 349 (355) T ss_pred Cc-c-----c----CCccccccccccccccCCccccccccccCCCCCChhhhHHHHHHhhccccC Confidence 00 0 0 00000000000000000000000011111100 0011111 No 255 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=94.89 E-value=0.0012 Score=36.65 Aligned_cols=196 Identities=10% Similarity=0.079 Sum_probs=82.4 Q ss_pred cCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhh--hhcccCCCCceEEEecc Q lcl|NC_021297. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYG--RILTLASEAAKISEFKA 295 (480) Q Consensus 218 ~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~ 295 (480) .|. +..|-+.+..-...... .+.. +....+ ..+.+.+.+-++-+++. 
T Consensus 1 V~k-------~~~l~~~~~~~~~~~~~--------r~~~----------------~~~~~~~~~~~~ld~~~e~~e~~~~ 49 (201) T protein:vir:10 1 MWK-------AKGLADLCDDSDGAARL--------RLAQ----------------VDNNSGVGQAIGIDADSEEYNVLNS 49 (201) T ss_pred Ccc-------chHHHHHhcCChHHHHH--------HHHH----------------HHHhhhhhhhheeecCCcceeeeec Confidence 222 12222222110000000 0000 000000 11222233233433322 Q ss_pred ccHHHHHHHHHHHHHHHHhccCCChHHh-cccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021297. 296 AELRNFAEEMEVFRKEAASITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 373 (480) ++..+.+.+......+++.+++|..-| |..... ++||..=..-|...+. ...+..+.+.|+++++++.. T Consensus 50 -~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~nyyd~i~--~~Qe~~l~p~le~l~~~~~~------ 120 (201) T protein:vir:10 50 -DIGGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALETFYGYVD--RKRKAELLPLLEFLLPFIVT------ 120 (201) T ss_pred -CcCChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHhhcC------ Confidence 334445566667888999999997666 433332 3467654444444433 33356778888888876432 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCC-ChhHHHHHHHHHHHHHHHHHHHHhcccccCc Q lcl|NC_021297. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGY-TATQREQMRDWDKQETEDMIDTLYSTTKAQA 452 (480) Q Consensus 374 ~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (480) + .++.+.|.+-...+..+.|+...+.+++. . .+-..|. +++++.+. .++. .... .-+ T Consensus 121 ~---~~~~~~f~pL~~~s~kekAei~~~~a~a~------~-~~~~~g~i~~~e~r~~---L~~~------~~~~---~~~ 178 (201) T protein:vir:10 121 E---QEWSVEFNPLSQVSDKDKSEILEKNVNSV------A-ALIAAGIIDADEARDT---LRAI------STEV---KIG 178 (201) T ss_pred C---CCceEeeCCCCCCCHHHHHHHHHHHHHHH------H-HHHHcCCCCHHHHHHH---HHhc------CCcC---CCC Confidence 1 25889999999999999998777665432 1 1223453 33332211 1111 0000 000 Q ss_pred ccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 
453 DATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 453 ~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) +..-+++....+.+ ..+-.+.|+ T Consensus 179 ~~~~~~~~~~~e~~-dp~~~~~~~ 201 (201) T protein:vir:10 179 EGSIQTEVVINESE-DPLDVSANN 201 (201) T ss_pred CCCCCccccccccC-CCCCCCCCC Confidence 00001111100000 000012222 No 256 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=94.05 E-value=0.0055 Score=32.95 Aligned_cols=385 Identities=10% Similarity=0.033 Sum_probs=154.1 Q ss_pred HHHHHHHHHHHHH--HHHHHHhc---ccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCCc--- Q lcl|NC_021297. 10 RLQGLLARDLPNL--LEAEAYRN---GTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISEDS--- 78 (480) Q Consensus 10 ~l~~~~~~~~~r~--~~~~~YY~---g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~--- 78 (480) -+......+.... .....|.. |-... ..+..+.... -++. .=...+|+..+.-+-.-++. ..++. T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s-~~~~~vt~~~-al~~--~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~ 76 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRS-DSGQVVTPAS-ALAL--TVLQNCVTLLAESIAQLPIELYERSGEDRKP 76 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCc-cCCcccchHH-hhcc--HHHHHHHHHHHHhhccCceEEEEecCCcccc Confidence 0000000000000 00001110 10000 0111111110 1111 11233444444333222322 22111 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+..++.. | .-..+...++.+.+.+|.+|+++..+ .+|.+ -+..++|..+.+..+... 
.+ T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~------~~G~~~~l~pl~~~~v~v~~~~~~--~~-- 146 (419) T protein:vir:14 77 ATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRD------SDGVIQGLYPLDNEAVTVMRGSDL--KP-- 146 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEecCceEEEEECCCc--eE-- Confidence 112245555542 3 33456677888999999999998653 34555 477888888877665321 11 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) +|... .... +. .+ =|+|+.++. ..+.+|.|.+.- +. T Consensus 147 ---~y~~~-------------~~~~-------------~~-~~---------~i~h~~~~~-~dg~~G~s~i~~-~~--- 182 (419) T protein:vir:14 147 ---VYRVR-------------GSDP-------------MP-QR---------LVHHVRWMS-INGYTGLSPVLL-HA--- 182 (419) T ss_pred ---EEEEc-------------cCcc-------------cc-hh---------heeEecCcC-CCCcccccHHHH-HH--- Confidence 11000 0000 00 00 134444432 245678887642 33 Q ss_pred HHHHHHHHHHHHHHHHh---cchhhhhcCCC-ccccccccccch----hhh------hhhhhcccCCCCceEEEeccccH Q lcl|NC_021297. 233 DAASRTLMNLQSASQIL---GTPLRVISGVT-TDELTNDGENTT----LDI------YYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~---~~~~~~i~g~~-~~~~~~~~~~~~----~~~------~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) ..+...++-..-...+| +.|..+|.-.. .....+++.... |+. ..++++.++ ++.++.+++.... T Consensus 183 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~ 261 (419) T protein:vir:14 183 NAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQ-EGMTFRPLSMTNV 261 (419) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecC-CCceEEEccCChh Confidence 33333222222222333 33444443110 100011111111 111 113345554 3467777765433 Q ss_pred H-HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021297. 
299 R-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 299 ~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 377 (480) + .+++..+....+|+...++|+..+|....+.-|+ +......+...+ -.-+-..+++.+.. .+...... .. T Consensus 262 d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~--~E~~~~~f~~~~---L~P~~~~ie~~l~~--kll~~~~~-~~ 333 (419) T protein:vir:14 262 DAALIDALRLSALDIARIYKIPAHMVNELERATFSN--IEHQSLQFVIYT---LLPWVKRHEQAKTR--DLLLPSER-KQ 333 (419) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc--HHHHHHHHHHHH---HHHHHHHHHHHHhh--hccCcccc-CC Confidence 3 4678778888999999999999997532221121 222222222211 01111111111111 11111111 12 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccC Q lcl|NC_021297. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ..+++.+......|..+.++++.+++++| +++...+++.+|+.+-+- -.. ..-..+ ....+.+.+. T Consensus 334 ~~i~fd~~~l~r~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--GD~--------~~~~~n--~~~~~~~~~~ 399 (419) T protein:vir:14 334 YFIEYNLAGLLRGDQSSRYAAYAVGRQWG--WLSINDIRRLENMPPVKG--GDI--------YLSPMN--MVDASKPQQL 399 (419) T ss_pred eEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCe--------eeeccc--cccccccccc Confidence 33555555566678888999999988765 567777777777654320 000 000000 0000000110 Q ss_pred CCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 
458 PTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 458 ~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) + .++++..+++.++...+ T Consensus 400 ~-----~~~~~~~~~~~~e~~~~ 417 (419) T protein:vir:14 400 P-----VGKSEPTKAAIDEIGRI 417 (419) T ss_pred c-----CCCCCCccccccchhcc Confidence 0 11222222344444444 No 257 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=93.83 E-value=0.0062 Score=32.67 Aligned_cols=414 Identities=14% Similarity=0.019 Sum_probs=143.1 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHH-----------------------HHhcccCcch--hcCcccchhhhhhccccchHHH Q lcl|NC_021297. 5 HEHVERLQGLLARD-LPNLLEAE-----------------------AYRNGTRRLK--TIGIGAPPELAYLDVQPGWVAT 58 (480) Q Consensus 5 ~~~i~~l~~~~~~~-~~r~~~~~-----------------------~YY~g~~~i~--~~~~~~~~~~~~~~~~~n~~~~ 58 (480) .-|+.+|....... +.....+. .+-.|..... ..+..+.. ..-..+..... T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~---~~a~~~~~v~~ 77 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLAT---QAYQANGPVFA 77 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccch---hhhhccHHHHH Confidence 44555555443210 11111110 1111110000 00000100 11122234445 Q ss_pred HHHHHHhhhccCcccc---CCCc--h-hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeEEEEEecCccccc--CCCc Q lcl|NC_021297. 59 YLRTLSDRLDIEGFRI---SEDS--E-GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESG--DPAG 126 (480) Q Consensus 59 iVd~~~~~l~~~~~~~---~~d~--~-~~~~l~~i~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~--d~~~ 126 (480) +|+..++-+-.-++.+ .++. + ....+..++.+ | ....+...+..+.+.+|.||+.+..+..+.. 
+..| T Consensus 78 ~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g 157 (466) T protein:vir:81 78 CMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVD 157 (466) T ss_pred HHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCc Confidence 5555555443334432 1111 1 12233444433 2 3345677788899999999999876543211 2223 Q ss_pred e-eEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccc Q lcl|NC_021297. 127 I-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP 205 (480) Q Consensus 127 ~-~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP 205 (480) . ..+..++|..+.+..+......+.+ .|. ............|..+ - T Consensus 158 ~~~~l~~l~~~~v~~~~~~~~~~~~~y---~~~-~~~~~~~~~~~~~~~~-----------------------------d 204 (466) T protein:vir:81 158 VVVEERMVRGGRGELGGGQLGWRKVGY---LYT-EGGRQSGNESVGFLAE-----------------------------D 204 (466) T ss_pred ceeEEEEecCcceEEEEcCCCceEEEE---EEE-ecCcccccceeeeccc-----------------------------c Confidence 2 2456677777666665332211111 110 0000000000011111 2 Q ss_pred eEEeeccc-ccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcC-CCccccccccccchhhhh------h Q lcl|NC_021297. 206 VVPLTNDP-RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISG-VTTDELTNDGENTTLDIY------Y 277 (480) Q Consensus 206 vv~~~n~~-~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g-~~~~~~~~~~~~~~~~~~------~ 277 (480) |+||++.. ...+.+|.|.+.- ....++.......-....+...+.|-.++.- ........+.-...|+-. . 
T Consensus 205 viHir~~~~~~d~~~G~s~i~~-~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~ 283 (466) T protein:vir:81 205 VVHFAPIPDPLASYRGMSWLTP-ILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNA 283 (466) T ss_pred EEEEcCCCCcccccccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCcccc Confidence 45665432 2345678776532 3333322211211122222222334444431 111111111111112111 1 Q ss_pred hhhcccCCCCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccc-cchHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_021297. 278 GRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMA-ERKGRIF 354 (480) Q Consensus 278 ~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~~l~~k~-~~~~~~~ 354 (480) ++++.++ ++.++.++.... -..|++..+..+.+|+...++|++.+|.... +.+++..+......+...+ .-....+ T Consensus 284 g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~i 362 (466) T protein:vir:81 284 WKNLNLY-PGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNL 362 (466) T ss_pred ccceEcC-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 2344444 346777765432 2347888889999999999999999985422 1122222332222222111 1111222 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEe--cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHH Q lcl|NC_021297. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVW--RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDW 432 (480) Q Consensus 355 ~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f--~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~ 432 (480) ...|.+ .+...... ..+.+.| ..-+-.+....+++...... 
.-.+++ .-|+++++...+.+ T Consensus 363 e~~l~~------~L~~~~~~---~~~~~~f~~~~llr~d~~~r~~~~~~~~~------~~~~~~-~~g~t~nE~r~~~~- 425 (466) T protein:vir:81 363 SGCIGH------VMPDMGPD---VRLWYDADDVPFLREDEKDAADIQKVRAE------TINTLI-TAGYEPESVVAAVN- 425 (466) T ss_pred HHHHHh------hcCCcccC---cceEEEecchhhhccCHHHHHHHHHHHHH------HHHHHH-HcCCChhhcccccc- Confidence 222221 11111111 1233444 23333455555543211100 001111 12344443322110 Q ss_pred HHHHHHHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 433 DKQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) .-+. ..+........+..+.+.....+..+...-.|.+++. T Consensus 426 ----~gd~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 426 ----SGDL-RLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred ----CCcc-ccccCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 0000 0000000000000000000000000000000111111 No 258 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=93.33 E-value=0.0079 Score=32.10 Aligned_cols=365 Identities=11% Similarity=0.036 Sum_probs=128.9 Q ss_pred HHHHHHHhcccCc-chh--cCcccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCCchhHHHHHHHHHh--c-- Q lcl|NC_021297. 22 LLEAEAYRNGTRR-LKT--IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISEDSEGLEELWNWWQA--N-- 91 (480) Q Consensus 22 ~~~~~~YY~g~~~-i~~--~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~~~~~~l~~i~~~--n-- 91 (480) +-.+...-.++.. +.. .+...........+.......+|+..++.+-.-++. ...+......+..++.. 
| T Consensus 1 MGlf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~~~~lL~~~PN~~ 80 (395) T protein:vir:98 1 MGILDFFSFKKSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKDWLYWINTKANPN 80 (395) T ss_pred CcchhhhcCCCcccccccccchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccchHHHHHhhcCCCC Confidence 1111111111100 000 000000000010111122233444444433223332 22222223345555542 3 Q ss_pred -CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEE Q lcl|NC_021297. 92 -DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRAT 170 (480) Q Consensus 92 -~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (480) .-..+...+....+.+|.||+++-.+. ... + |.....++. .... ..++...+.. .+.. T Consensus 81 ~t~~~f~~~~~~~lll~Gnayi~~~~~~--------~~~---~-~~~~~~~~~--~~~~----~~~~~~~~~~---~~~~ 139 (395) T protein:vir:98 81 QSASQFWVEVIQKLLVDGETLIFVIPGK--------GIY---V-ADSFTQDKK--ISGS----QFKVSRVQGQ---TYEK 139 (395) T ss_pred CCHHHHHHHHHHHHhhcCceEEEEEeCC--------cee---c-CCccccccc--ccCc----ccceeeecCc---eeee Confidence 234566778888999999998765321 111 1 111111110 0000 0000000000 0011 Q ss_pred EEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH-HHHHHHHHHH-HHHHHH Q lcl|NC_021297. 171 LYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT-DAASRTLMNL-QSASQI 248 (480) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li-D~~~~~~s~~-~~~~~~ 248 (480) .|.+++ |+||++.......++.+-+ .....++ ..++..+... ...... T Consensus 140 ~~~~~e-----------------------------vih~k~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (395) T protein:vir:98 140 TFTFDQ-----------------------------VIYLKNDNSDLMSKVESLW-EEYGELLGHVINNQKIANQIRFTMI 189 (395) T ss_pred EecCcc-----------------------------EEEecCCCCCccccccchh-hhHHHHHHHHHHHHHHHHHHHHhhc Confidence 122222 3444432222222222211 1111111 1111111000 000011 Q ss_pred hcchhhhhcCCCccccccccc---cchhh-------hhhhhhcccCCCCceEEEeccc-------cHHHHHHHHHHHHHH Q lcl|NC_021297. 
249 LGTPLRVISGVTTDELTNDGE---NTTLD-------IYYGRILTLASEAAKISEFKAA-------ELRNFAEEMEVFRKE 311 (480) Q Consensus 249 ~~~~~~~i~g~~~~~~~~~~~---~~~~~-------~~~~~~~~~~g~~~~~~~~~~~-------~~~~~~~~l~~~~~~ 311 (480) +..+...+.+ ......+... ...++ .....++.++ .+.++.+++.. ....+++..+..+.+ T Consensus 190 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~-~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~ 267 (395) T protein:vir:98 190 PPKDKVRERA-QENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVT-ANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAE 267 (395) T ss_pred cccccccccc-cccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecC-CCceeEecccccccccChhHHHHHHHHHHHHHH Confidence 1111111111 1111111000 00111 1222233333 34566555422 233577777788899 Q ss_pred HHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCC Q lcl|NC_021297. 312 AASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT 391 (480) Q Consensus 312 i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~ 391 (480) |+...++|++.++++..| .+...+.+....|.-.+...+..+... +....... ....+.|......+ T Consensus 268 Ia~~fgVP~~~l~~~~sn-~e~~~~~f~~~tl~P~~~~ie~~l~~k----------ll~~~~~~--~g~~f~~~~l~~~d 334 (395) T protein:vir:98 268 FAEMLGIPISLLHGDIAD-NQKNYELLLEGPIESLITNIVDGLEYA----------IFDKSETL--QGSFIKVTGLKNYD 334 (395) T ss_pred HHHHhCCCHHHhcCCccc-HHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCChhhhc--CcceeeehhhhccC Confidence 999999999999754332 222223333333333332222221111 11111111 12345666777788 Q ss_pred HHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCccc Q lcl|NC_021297. 392 VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKTET 467 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ..+.++++.+++.+| +++...+++.+|+.+-+-....+. .+....... + . 
.++..+++.++ T Consensus 335 ~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~Pi~~~~gD~~-------~~~~n~~~~----~-~-~gge~~~~~~~ 395 (395) T protein:vir:98 335 LFSISNQADKLISSG--FVFIDEVREEIGLPELPDGLGKVL-------YMTKNYESV----L-E-RGGEVDEEVET 395 (395) T ss_pred HHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCcee-------eecccceec----c-c-ccCCCCCCCCC Confidence 899999999998876 667777788877754311000000 000000000 0 0 00001111111 No 259 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=92.83 E-value=0.0098 Score=31.59 Aligned_cols=375 Identities=10% Similarity=0.016 Sum_probs=132.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCchhHHH Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~ 83 (480) .-+++.+...+......... ......... ........+...-...+|+..+.-+-..++.+- .++..... T Consensus 1 Mg~~~~~~~~~~~~~~~~~~--------~~~~~~~~~-~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~~~~~ 71 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNL--------TDTVWCSIP-SEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEEVRKK 71 (395) T ss_pred CchHHHHHhhhccccccccc--------ccchhhccc-cccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCccccch Confidence 22333333333221111000 000000000 000011111111223334433333333334332 22233334 Q ss_pred HHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 84 LWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 84 l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) +..++.. | ....+...+..+.+.+|.||+++..+. .. + |..+.+... ...-..++. 
T Consensus 72 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~---------~~---~-~~~~~~~~~------~~~~~~~~~ 132 (395) T protein:vir:40 72 NWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY---------IY---V-ADSFTKNDK------SLYENTYTE 132 (395) T ss_pred HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc---------ee---e-cCCcccccc------ccccceeee Confidence 4454442 3 224556678888999999998764321 11 1 111100000 000000000 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) ...++ ..+...|.+++ |+||+.+......++ ..+...+... T Consensus 133 v~~~~--~~~~~~~~~~e-----------------------------vih~r~~~~~~~~~~--------~~l~~~~~~~ 173 (395) T protein:vir:40 133 VTLKD--LTLKKEFKESE-----------------------------VLHLTLNNESIKSII--------DGFYLLYGDL 173 (395) T ss_pred eeecC--ceeeeeecccc-----------------------------EEEeecCCCCccccc--------hhHHHHHHHH Confidence 00000 00001122222 333332221111221 1223333333 Q ss_pred HHHHHHHHHHhcch--hhhhcCC-Cccccccccccchhhh-------hhhhhcccCCCCceEEEecccc-HHHHHHHH-- Q lcl|NC_021297. 239 LMNLQSASQILGTP--LRVISGV-TTDELTNDGENTTLDI-------YYGRILTLASEAAKISEFKAAE-LRNFAEEM-- 305 (480) Q Consensus 239 ~s~~~~~~~~~~~~--~~~i~g~-~~~~~~~~~~~~~~~~-------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l-- 305 (480) ++...+...+...+ .+++... .......+.....|+. ..+.++.++ ++.++.++.... ...+++.- T Consensus 174 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~ 252 (395) T protein:vir:40 174 LTAAVNKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVE-DGMEIDELAGDSKIAESRDIKKM 252 (395) T ss_pred HHHHHHHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecC-CCceEEeccCChhhhhHHHHHHH Confidence 44443333332222 2222111 1111111111111211 122233333 446777765432 12344322 Q ss_pred -HHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021297. 
306 -EVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 306 -~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f 384 (480) +..+.+|+...++|+.-+++...| .+...+.+....|.-.+...+..+.. .+...........+++.+ T Consensus 253 ~~~~~~~Ia~~fgVPp~~l~~~~sn-~e~~~~~f~~~~L~P~~~~ie~~l~~----------kLl~~~~~~~g~~i~fd~ 321 (395) T protein:vir:40 253 IDDVFEMVANSFNIPLGLAKGDTVG-LSEQVNSFLMFSINPIAEMFTDEGNR----------KFYGRDSVLERTYMKLDT 321 (395) T ss_pred HHHHHHHHHHHhCCCHHHhcCCCcC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hcCChhhhcCCceEEEec Confidence 334678999999999999865433 12222222222222222222111111 111211111223456666 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCC Q lcl|NC_021297. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) ..-...+..+.++++.+++.+| +++.-.+++.+|+.+-+-....+. .+...........+....++..+ T Consensus 322 ~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~~~~gD~~-------~~~~n~~~~~~~~~~~kgge~~~-- 390 (395) T protein:vir:40 322 TRIKVQDIQEIASSMDVLFHIG--VNTIDDNLRMIGREPVMSPETQER-------FVTKNYAPLGENEEDLKGGDINE-- 390 (395) T ss_pred hhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCCCCcee-------eeccccccccccccccCCCCCCC-- Confidence 6667778899999999988765 667777788887754310000000 00000000000000001111111 Q ss_pred cccccCCCCCC Q lcl|NC_021297. 465 TETQTSPSGFN 475 (480) Q Consensus 465 ~~~~~~p~~~~ 475 (480) +. 
+.+ T Consensus 391 ---~~---~~~ 395 (395) T protein:vir:40 391 ---NK---GDS 395 (395) T ss_pred ---Cc---CCC Confidence 10 001 No 260 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=92.26 E-value=0.012 Score=31.09 Aligned_cols=419 Identities=11% Similarity=-0.009 Sum_probs=151.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCc----ccchhhhhhccccchHHHHHHHHHhhhc----cC-- Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGI----GAPPELAYLDVQPGWVATYLRTLSDRLD----IE-- 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~----~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~~-- 70 (480) |-. -...|..+. .|.+-...++++++ +.+++... .....-+..+..-+-....++++++.|. +- T Consensus 1 m~~---~~~~l~~k~-~R~~~e~~w~e~a~--~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 74 (514) T protein:vir:80 1 MRQ---QASAMWAEY-RDSTAIRKAEDFAK--FTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGR 74 (514) T ss_pred Ccc---chHHHHHHh-hcchHHHHHHHHHH--HhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCC Confidence 322 222222221 12222333344442 12222211 1111101111222334555666665442 21 Q ss_pred cc-ccC-CCc-------------hhH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee Q lcl|NC_021297. 71 GF-RIS-EDS-------------EGL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 71 ~~-~~~-~d~-------------~~~-------~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~ 128 (480) +| +.. +|+ +.. 
..+...+..++|.....++.++...+|.+-+++..+ .- T Consensus 75 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~---------~~ 145 (514) T protein:vir:80 75 PSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPG---------TG 145 (514) T ss_pred cccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecC---------CC Confidence 22 221 111 111 124445667899999999999999999987766321 12 Q ss_pred EEEEEccceeEEEEeCCCCCeeEEEEEEEEEec---------------CCceEEEEEEEe-----CCc----EEEEEecC Q lcl|NC_021297. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD---------------DVAVPDRATLYL-----PDE----TVPLRRNG 184 (480) Q Consensus 129 ~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~-----~~~----~~~~~~~~ 184 (480) .++.++-.+.++.-|. .++....+.+...... +.+....+++|+ +++ +.+|.... T Consensus 146 ~~~~~pl~~y~v~~d~-~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~ 224 (514) T protein:vir:80 146 KMLVWTMQSYTVRRTS-HGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELE 224 (514) T ss_pred cEEEEEcCeEEEeeCC-CcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEecc Confidence 3566665665444443 3333322222221110 001111223332 111 11111111 Q ss_pred CcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCcc Q lcl|NC_021297. 185 GLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTD 262 (480) Q Consensus 185 ~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~~ 262 (480) +. ........++..+|++++..+...++.||+|-..+ ..+-+..+|.+.-...........|.+.+. |.... 
T Consensus 225 g~-----~i~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~-al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~ 298 (514) T protein:vir:80 225 GK-----RVGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEE-YSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAV 298 (514) T ss_pred ce-----eecccCccccccCCeeeeeeEecCCCCcccchHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccch Confidence 10 00111111234589999988888899999997654 456566677665555555555555543331 11110 Q ss_pred ccccccccchhhhhh-hhhcccCCCCceEEEec-cccHHH---HHHHHHHHHHHHHhccCCChHHhccccccchHHHHHH Q lcl|NC_021297. 263 ELTNDGENTTLDIYY-GRILTLASEAAKISEFK-AAELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII 337 (480) Q Consensus 263 ~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~-~~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~ 337 (480) ..+..+. +.+..-..++....++. ..+++. .++.++.-|+........ . .+..+. ++.-++ T Consensus 299 --------~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~~----~-rd~~rv-TAtEV~ 364 (514) T protein:vir:80 299 --------DDYRDAETGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTGQ----V-RDAERV-TVEEIR 364 (514) T ss_pred --------hhhcccCCceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhcc----C-CCCCCC-CHHHHH Confidence 1111111 22211111233344443 234443 344444444432222110 1 122222 333333 Q ss_pred HHHHHHHH----HHHHHHHHHHH-HHHHHHHHHHHHhCCC---CcccceeeeEEecCCC-CCCHHHHHHHHHHH---Hhc Q lcl|NC_021297. 338 ATDSRIVK----MAERKGRIFGG-AWERAMRIAMQIMGRE---VTEEYTRLETVWRDPS-TPTVAAKADAVSKL---YAN 405 (480) Q Consensus 338 ~~~~~l~~----k~~~~~~~~~~-~l~~~~~~~~~~~~~~---~~~~~~~i~v~f~~~~-~~~~~~~ad~~~kl---~~~ 405 (480) .....+.. -..++...|-. -+.+.+.++.+...+. .+.+. +++.+.-++ .-...+.++.+... ++. 
T Consensus 365 ~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l--~~~~~vs~la~l~r~~~~~~l~~~~~~i~~ 442 (514) T protein:vir:80 365 TVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGV--YRPSIITGIPALTRNIETANILRATQEASA 442 (514) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchh--hcceeeecHHHHHHHHHHHHHHHHHHHHHH Confidence 22222111 12222222222 2334444433222122 22222 333332221 11111222222221 111 Q ss_pred cc-------cCCCHHHHH----HhCCCCh-------hHHHHHHHHHHHHHHHHHHH--HhcccccCcccccC Q lcl|NC_021297. 406 GQ-------GPIPKEQAR----IDLGYTA-------TQREQMRDWDKQETEDMIDT--LYSTTKAQADATPK 457 (480) Q Consensus 406 ~~-------~~~s~et~~----~~lg~~~-------~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 457 (480) .. ..+....++ ..+|.+. ++.+..+++.+++.++.+.. .......+..-.++ T Consensus 443 l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 443 IVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred HhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 10 112222222 3355542 22222222222222221111 11111111111111 No 261 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=91.94 E-value=0.014 Score=30.82 Aligned_cols=404 Identities=11% Similarity=-0.001 Sum_probs=139.3 Q ss_pred CCCH----HHHHHHHHHHHHHHHHHHHHHHHH-----hcccC-----cchhcCcccchhhhhhccccchHHHHHHHHHhh Q lcl|NC_021297. 1 MTTY----HEHVERLQGLLARDLPNLLEAEAY-----RNGTR-----RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~~~~r~~~~~~Y-----Y~g~~-----~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 66 (480) |..= ++.+..-........+++...... |.|-- +|...... -.-+..+.. .....-++++-... 
T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~-~~ly~~m~~-D~hi~s~l~~Rk~a 78 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDG-LLVYHKMLS-DGTVKNALNYIFGR 78 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccc-hHHHHHHhh-ChHHHHHHHHHHHH Confidence 1100 000000000000000000000000 00000 00000000 001111111 12222222222222 Q ss_pred hccCccccC--CCc----hhHHHHHHHHHh-------cCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeEEEE Q lcl|NC_021297. 67 LDIEGFRIS--EDS----EGLEELWNWWQA-------NDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 67 l~~~~~~~~--~d~----~~~~~l~~i~~~-------n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~i~~ 132 (480) +....+++. +++ +..+.+.+++.. ..|..++..+ .+|..||.+ ++++|.. ..+|...+.. T Consensus 79 v~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~-----~~dg~~~~~~ 152 (448) T protein:vir:77 79 IRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTL-----GADGKLILDK 152 (448) T ss_pred HhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEee-----cCCCceeecc Confidence 333344431 222 222334444432 2566666665 689999987 5667742 1244443322 Q ss_pred EccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecc Q lcl|NC_021297. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~ 212 (480) +.+. .+.. ++++.-..+++ ..+....+.........+..+-+++.+ +++.+ T Consensus 153 l~~r------~~~~-------~~~f~~~~~~~-------------l~~~~~~~~~~~~~~~~~~~~lP~~~~--i~~~~- 203 (448) T protein:vir:77 153 IVPI------HPFN-------IDEVLYDEEGG-------------PKALKLSGEVKGGSQFVNGLEIPIWKT--VVFLH- 203 (448) T ss_pred cccc------CCCc-------cceeeeecCCc-------------eEEEecCCcccccccCCCccccccceE--EEEec- Confidence 2211 0000 00110000110 111111111000011111222334443 33443 Q ss_pred cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhh------hhhhhhccc-CC Q lcl|NC_021297. 
213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLD------IYYGRILTL-AS 285 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~------~~~~~~~~~-~g 285 (480) ...+.++|.+.+..-.-..+ -=+..+.+++.-++.|+.|.++.+--......++......+ .+......+ .| T Consensus 204 ~~~g~p~g~gLlr~~~w~~~-fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g 282 (448) T protein:vir:77 204 NDDGSFTGQSALRAAVPHWL-AKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDD 282 (448) T ss_pred CCcCCcccchHHHHHHHHHH-HHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCC Confidence 34567888887754222121 12556778888899999998776521111111111111111 112121222 34 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_021297. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE-RAMRI 364 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~~ 364 (480) ...++.+..... ..+.+.++..-.+|+...-- +.+....++.+++.+......-.......-.+.+...+. ++++- T Consensus 283 ~~ie~~ea~~~~-~~~~~~i~~~d~~Isk~iLG--qtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~ 359 (448) T protein:vir:77 283 WKFDTVDLKSAM-PDAIPYLTYHDAGIARALGI--DFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPK 359 (448) T ss_pred ceEEEEecCCCc-cCHHHHHHHHHHHHHHHHhc--cccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455566544322 22334454444555444310 001101111112222222111111111222233444453 45555 Q ss_pred HHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 365 AMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTL 444 (480) Q Consensus 365 ~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~ 444 (480) ++.++.+... ....+.|....+.++.+.++.+.+++. ...+.+|+..... .. 
T Consensus 360 l~~lNfg~~~---~~P~~~f~~~e~eDl~~~a~~~~~l~~---------~~~~~~~ip~~~~----------------~~ 411 (448) T protein:vir:77 360 LVLPNWPGAT---RFPRLTFEMEERNDFSAAANLMGMLIN---------AVKDSEDIPTELK----------------AL 411 (448) T ss_pred HHHhcCCCCC---CCCEEEecCCChhhHHHHHHHhHHHHH---------HHHHHhcCCccCC----------------cC Confidence 5555533221 124677887888888888887776642 1233333321100 00 Q ss_pred hcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 445 YSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 445 ~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) ....+ ....+.+.. ..+..++...+..|| T Consensus 412 ~~~~~--~~~~~~~~~-----~~~~~~~~~~~~~~~ 440 (448) T protein:vir:77 412 IDALP--SKMRRALGV-----VDEVREAVRQPADSR 440 (448) T ss_pred CCCCc--hhcccccCC-----CCCCCchhhcchhhH Confidence 00000 011111111 122233344555555 No 262 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=90.33 E-value=0.021 Score=29.74 Aligned_cols=296 Identities=15% Similarity=0.120 Sum_probs=102.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHH----HHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEA----EAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~----~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) +.||+.++..- ..-.|... .+|| ..++. ...+.+.--...++.-++.....++.. 
-+..|+ T Consensus 26 ~~~p~~~~~~~------~~~~~~~~~~~~~~~~--~pp~~------~~~la~l~~~~~~h~~~i~~k~n~l~~-l~~~Pn 90 (346) T protein:vir:10 26 FGDPIPVLDRA------DILNYLECSAMYEKWY--NPPMS------FDGLAKSLRSSTHHESAIITKANILLS-TCEVDS 90 (346) T ss_pred cCCcceecCch------hHHHHHHHhhcCCceE--ecCCC------HHHHHHHHHhhhhcchhhhhhhhhHHH-HHhCCC Confidence 44444433210 00000000 1111 11111 111211100111111111111111100 011111 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 77 d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) .- +. ...+.+++.+.+.+|.||+.+..+ ..|++ .+..++|..+.+..+.. .. T Consensus 91 ~~---------~t----~~~f~~~~~d~ll~Gnay~~i~r~------~~G~~~~L~pl~~~~v~~~~~~~---~~----- 143 (346) T protein:vir:10 91 RY---------LS----RRDLSSFVKDYLVFGNAYFEVVRN------RLGQVQRIESPLAKYVRKGLEAG---QF----- 143 (346) T ss_pred CC---------CC----HHHHHHHHHHHHhcCCeEEEEEEc------CCCcEEEEEEecCCceEEEEcCC---eE----- Confidence 10 00 112345667888999999987653 23443 56677777665433321 01 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) +|..... .+..+.|.. + -|+||++..-..+.+|.|.+.. ....+ T Consensus 144 ~~~~~~~-----------~g~~~~~~~-------------------~--dIih~r~~~~~~~~~G~~~~~~----a~~si 187 (346) T protein:vir:10 144 YYVPQRF-----------DHQEHEFAK-------------------G--SIYHLLEPDINQDIYGLPQYLS----ALQSA 187 (346) T ss_pred EEEEEcc-----------CCeEEEEec-------------------c--cEEEecCCCCCCCeeeccHHHH----HHHHH Confidence 0100001 111111110 0 1455554333355678876543 22333 Q ss_pred HHHHHHHHHHHHHhc---chhh--hhcCCCccccccccccchhhhhh-----hhhccc-CCC---CceEEEeccccH-HH Q lcl|NC_021297. 
236 SRTLMNLQSASQILG---TPLR--VISGVTTDELTNDGENTTLDIYY-----GRILTL-ASE---AAKISEFKAAEL-RN 300 (480) Q Consensus 236 ~~~~s~~~~~~~~~~---~~~~--~i~g~~~~~~~~~~~~~~~~~~~-----~~~~~~-~g~---~~~~~~~~~~~~-~~ 300 (480) ....+-..-...+|. .|-. ++++...+....+.-...|+... +.++.+ +|. +.++..+..... .. T Consensus 188 ~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~q 267 (346) T protein:vir:10 188 WLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDE 267 (346) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHH Confidence 222211111222332 2332 23343222211111011121111 122222 222 234554443322 24 Q ss_pred HHHHHHHHHHHHHhccCCChHHhccccccchHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc Q lcl|NC_021297. 301 FAEEMEVFRKEAASITGLPPQYLSSSSENPASA-----EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTE 375 (480) Q Consensus 301 ~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg-----~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 375 (480) |++..+....+|+..-++|+..+|....+.++. .++.+....|.=.+++ +++++ ...+.+ T Consensus 268 f~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~--------iee~n----~~L~~e--- 332 (346) T protein:vir:10 268 FFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFFITEIEPLQER--------LKEFN----QWLGQE--- 332 (346) T ss_pred HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHH--------HHHHH----hhcccc--- Confidence 777777888999999999999988543332211 1112211111111111 11111 111211 Q ss_pred cceeeeEEecCCCCCCHHH Q lcl|NC_021297. 
376 EYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 376 ~~~~i~v~f~~~~~~~~~~ 394 (480) -+.|.+...-.-.+ T Consensus 333 -----~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 333 -----VIKFKPSKLLQRTQ 346 (346) T ss_pred -----eeeechhhhcccCC Confidence 13453221111111 No 263 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=89.60 E-value=0.025 Score=29.32 Aligned_cols=289 Identities=13% Similarity=0.080 Sum_probs=104.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---cchhcCcccchhhhhh---ccccchH-HHHHHHHHhhhccCccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYL---DVQPGWV-ATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~---~i~~~~~~~~~~~~~~---~~~~n~~-~~iVd~~~~~l~~~~~~ 73 (480) -.+++..+.. ..-++.+.-++.|+. ++.. ..+.+. ...|+-| ....+-++....++++. T Consensus 28 f~~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~~~------~~la~~~~a~~~h~~~i~~k~n~l~~~~~Pn~~l 93 (344) T protein:vir:20 28 FGEPVPVLDR--------RDILDYVECISNGRWYEPPVSF------TGLAKSLRAAVHHSSPIYVKRNILASTFIPHPWL 93 (344) T ss_pred cCCceEecCc--------chhhhhhhhhhcCceecCCCCH------HHHHHHHhhhhhhCccceehhhhHHHhccCCCCC Confidence 1111111110 000111122233321 1111 111111 0111111 00011111111222211 Q ss_pred cCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 74 ISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 74 ~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) . ...+..++.+.+.+|.||+.+-.+. .|++ .+..++|..+-+..+.. 
T Consensus 94 t-------------------~~~f~~~~~d~ll~Gnay~~i~rn~------~G~~~~L~pl~~~~vr~~~~~~------- 141 (344) T protein:vir:20 94 S-------------------QQDFSRFVLDFLVFGNAFLEKRYST------TGKVIRLETSPAKYTRRGVEED------- 141 (344) T ss_pred C-------------------HHHHHHHHHHHHhcCCeEEEEEECC------CCcEEEEEEcCCceeEeeecCC------- Confidence 0 0113456677788999999876532 3443 45566665443322110 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) ++|.... .+..+.|. .=-|+|+++..-..+.+|.|.+.- .+ T Consensus 142 --~~~~~~~------------~~~~~~~~---------------------~~eIiHir~~~~~~~~yGls~~~~----a~ 182 (344) T protein:vir:20 142 --VYWWVPS------------FNEPTAFA---------------------PGSVFHLLEPDINQELYGLPEYLS----AL 182 (344) T ss_pred --EEEEEcc------------CCeEEEEc---------------------CccEEEeCCCCCCCCcccccHHHH----HH Confidence 0111010 01111110 012566665433456788887643 33 Q ss_pred HHHHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhh----hhhh--hcccCC---CCceEEEeccccH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDI----YYGR--ILTLAS---EAAKISEFKAAEL 298 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~---~~~~~--i~g~~~~~~~~~~~~~~~~~----~~~~--~~~~~g---~~~~~~~~~~~~~ 298 (480) +.+....+-..-...+|. .|-.+ ++|....+...+.-...|+- +.++ ++..++ ++.++..+..... T Consensus 183 ~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~ 262 (344) T protein:vir:20 183 NSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVAT 262 (344) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChh Confidence 444333322222233332 23233 33432222111111111221 1112 223333 2455666654322 Q ss_pred H-HHHHHHHHHHHHHHhccCCChHHhccccccchH-H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_021297. 
299 R-NFAEEMEVFRKEAASITGLPPQYLSSSSENPAS-A----EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE 372 (480) Q Consensus 299 ~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~S-g----~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~ 372 (480) + .|++..+....+|+..-++|+.-+|....+.++ | .++.+....|.=.+.+ |+++ ..+.|.. T Consensus 263 d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~P~~~~--------~e~i----n~~lg~~ 330 (344) T protein:vir:20 263 KDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDR--------IREI----NGWLGQE 330 (344) T ss_pred HHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHHHH--------HHHH----HHhcCCc Confidence 2 478888888999999999999999853322211 1 1111111111111111 1111 2222321 Q ss_pred CcccceeeeEEecCCCCCCHHH Q lcl|NC_021297. 373 VTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 373 ~~~~~~~i~v~f~~~~~~~~~~ 394 (480) . +.|.++......+ T Consensus 331 ------~--i~F~~~~l~~~d~ 344 (344) T protein:vir:20 331 ------V--IRFKNYSLDTDND 344 (344) T ss_pred ------c--cccCccccccCCC Confidence 1 2333222211111 No 264 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=89.38 E-value=0.027 Score=29.21 Aligned_cols=295 Identities=15% Similarity=0.105 Sum_probs=109.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---cchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~---~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d 77 (480) ..|++..+.. ..-+..+.-+|.|+. ++. ...+.+..-...++.-++....+.+... |. ++. 
T Consensus 58 fg~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~~------~~~La~~~~~~~~h~s~l~~k~n~l~~~-~~-Pnp 121 (376) T protein:vir:10 58 FDDPTPVMNR--------AEILDYVECWSNGEWFEPPVS------FAGLAKSFRASTHHSSALFFKANVLAST-FR-PHR 121 (376) T ss_pred cCCceeccCc--------chhhhhhhhhhcCceecCCCC------HHHHHHHHhhhHHhhhhHHHHhHHHHhc-cC-CCC Confidence 2222222110 000111223344431 221 1122222111122222333333322111 11 111 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEE Q lcl|NC_021297. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 78 ~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) -- . ...+.+++.+.+.+|.||+.+-.+ ..|.+ .+..++|..+.+..|.. ++ T Consensus 122 ~l---------T----~~~f~~~v~d~ll~Gnay~~~~rn------~~G~~~~L~pl~~~~vr~~~d~~---------~~ 173 (376) T protein:vir:10 122 WL---------S----RHAFERWALDFLTFGNGYLERRRN------MVGGTLRLEPALAKYVRRKADFN---------GF 173 (376) T ss_pred CC---------C----HHHHHHHHHHHHhcCCeEEEEEEC------CCCCEEEEEEeCCcceEEEeeCC---------eE Confidence 00 0 111345667888899999987653 23443 57778887766555432 01 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHH Q lcl|NC_021297. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~ 236 (480) |.....+. ...|.++. |+||++-.-..+.+|.|.+.- .+..+. T Consensus 174 ~~~~~~~~----~~~~~~~e-----------------------------ViHir~~~~~~~~yGls~~~~----a~~si~ 216 (376) T protein:vir:10 174 VYVNGWQE----RHEFEPDS-----------------------------VFQLVRPDINQEVYGLPEYLS----SLHSAW 216 (376) T ss_pred EEEEcCCe----EEEEcccc-----------------------------EEEecCCCCCCCcccccHHHH----HHHHHH Confidence 11111110 01112222 455554323345678776543 333333 Q ss_pred HHHHHHHHHHHHhcc---hh--hhhcCCCccccccccccchhhhhh-----hhhcc-cCC---CCceEEEecccc-HHHH Q lcl|NC_021297. 
237 RTLMNLQSASQILGT---PL--RVISGVTTDELTNDGENTTLDIYY-----GRILT-LAS---EAAKISEFKAAE-LRNF 301 (480) Q Consensus 237 ~~~s~~~~~~~~~~~---~~--~~i~g~~~~~~~~~~~~~~~~~~~-----~~~~~-~~g---~~~~~~~~~~~~-~~~~ 301 (480) ...+-..-...+|.+ |- ++++|........+.-...|+-.. +.++. .++ ++.++..+.... -..| T Consensus 217 l~~aa~~f~~~~f~NGa~pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf 296 (376) T protein:vir:10 217 LNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEF 296 (376) T ss_pred HHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHH Confidence 222221112233332 22 233443322221111111121111 11222 222 345666665432 2248 Q ss_pred HHHHHHHHHHHHhccCCChHHhccccccchHH---H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASA---E--AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg---~--Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) ++..+....+|+..-++|+.-+|....+.++. . ++.+....|.=.+.+ |+++.. ..+.. T Consensus 297 ~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~--------ieeln~----~L~~~---- 360 (376) T protein:vir:10 297 FNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQAR--------FAELND----WLGEE---- 360 (376) T ss_pred HHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHH--------HHHHHh----hcccc---- Confidence 88888889999999999999888543322111 1 111111112111111 111111 11111 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHH Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKADAVS 400 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad~~~ 400 (480) . +.|.+.. 
...+|+-+ T Consensus 361 ~----~~F~~~~----Llr~d~ka 376 (376) T protein:vir:10 361 V----VRFDDYE----IPPAPVAA 376 (376) T ss_pred c----cccChhH----hhcccccC Confidence 0 3443211 11111111 No 265 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=89.10 E-value=0.028 Score=29.07 Aligned_cols=304 Identities=14% Similarity=0.071 Sum_probs=109.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..|++..+.. ..-++..+-+|.|+.--+-+ ....+.+.--...++.-++-..+..+... |. ++.--. T Consensus 33 ~~~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~---~~~~la~~~~~~~~h~~~l~~k~n~l~~~-~~-Pn~~~t 99 (351) T protein:vir:78 33 FDDPTPVMNR--------AEILDYVECWSNGEWFEPPV---SFAGLAKSFRASTHHSSALFFKANVLAST-FR-PHRWLS 99 (351) T ss_pred cCCceeecCc--------chhhhhhhhhccCceecCCC---CHHHHHHHHhhhHhhhhhhhhhhhHHhhc-cc-CCCCCC Confidence 2222222110 00122334456664211101 11122222111123333333333332221 11 111100 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce-eEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~-~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ...+.+++.+.+.+|.||+.+-.+. .|. ..+..++|..+.+..+.. ++|.. 
T Consensus 100 -------------~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~pl~~~~v~~~~~~~---------~~~~~ 151 (351) T protein:vir:78 100 -------------RHAFERWALDFLTFGNGYLERRRNM------VGGTLRLEPALAKYVRRKADFS---------GFVYV 151 (351) T ss_pred -------------HHHHHHHHHHHHhcCCeEEEEEECC------CCCEEEEEEecCcceEEeeeCC---------eEEEE Confidence 1123456678888999999876532 343 356677776655443321 01111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) ..++.. ..|.++ -|+||.+-.-..+.+|.|.+.-.+.++. ++... T Consensus 152 ~~~~~~----~~~~~~-----------------------------eVihir~~~~~~~~yGl~~~~~a~~si~--l~~~a 196 (351) T protein:vir:78 152 NGWQER----HEFAPD-----------------------------SVFQLVRPDINQEVYGLPEYLSSLHSAW--LNESS 196 (351) T ss_pred ecCCeE----EEEccc-----------------------------cEEEEcCCCCCCCcccccHHHHHHHHHH--HHHHH Confidence 111000 011111 1455553322355678876643322221 12222 Q ss_pred HHHHHH-HHHhcchh--hhhcCCCccccccccccchhhhhh-----hhhccc-CC---CCceEEEeccccH-HHHHHHHH Q lcl|NC_021297. 240 MNLQSA-SQILGTPL--RVISGVTTDELTNDGENTTLDIYY-----GRILTL-AS---EAAKISEFKAAEL-RNFAEEME 306 (480) Q Consensus 240 s~~~~~-~~~~~~~~--~~i~g~~~~~~~~~~~~~~~~~~~-----~~~~~~-~g---~~~~~~~~~~~~~-~~~~~~l~ 306 (480) ..+... ....+.|- ++++|....+...+.-...|+... +.++.. ++ ++.++..+..... ..|++..+ T Consensus 197 ~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~ 276 (351) T protein:vir:78 197 TLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKN 276 (351) T ss_pred HHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHH Confidence 222111 11122222 223343222211111111122111 112222 22 3446666654322 24778878 Q ss_pred HHHHHHHhccCCChHHhccccccchHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEec Q lcl|NC_021297. 
307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRI-VKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWR 385 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l-~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~ 385 (480) ....+|+..-++|+.-+|....+.++...++.....+ .....=..+. ++++. ...+.. -+.|. T Consensus 277 ~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~----iee~n----~~l~~~--------~~~F~ 340 (351) T protein:vir:78 277 VTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQAR----FAELN----DWLGDE--------VVRFD 340 (351) T ss_pred HhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHH----HHHHH----hhcCcc--------ceecC Confidence 8889999999999999885433322111111111111 1111111111 11111 111211 13443 Q ss_pred CCCCCCHHHHHHHHH Q lcl|NC_021297. 386 DPSTPTVAAKADAVS 400 (480) Q Consensus 386 ~~~~~~~~~~ad~~~ 400 (480) +... ..+|+-+ T Consensus 341 ~~~L----lr~d~ka 351 (351) T protein:vir:78 341 DYEI----PPAPVAA 351 (351) T ss_pred hhhh----ccccccC Confidence 2211 1111111 No 266 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=89.06 E-value=0.029 Score=29.05 Aligned_cols=400 Identities=11% Similarity=-0.011 Sum_probs=133.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHH-----hcccC-----cchhcCcccchhhhhhccccchHHHHHHHHHhhhccC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAY-----RNGTR-----RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~Y-----Y~g~~-----~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |--+++++..-...-...-+.+.....- |.|-- +|...... -.-+..+.. .....-++++-...+... 
T Consensus 5 ~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~-~~ly~~m~~-D~hi~s~l~~Rk~av~~~ 82 (448) T protein:vir:79 5 GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDG-LLVYHKMLS-DGTVKNALNYIFGRIRSA 82 (448) T ss_pred CCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccc-hHHHHHHhh-ChHHHHHHHHHHHHHhcC Confidence 1111111000000000000000000000 00000 00000000 001111111 122222233333333344 Q ss_pred ccccC--CCchh----HHHHHHHHHh-------cCHHHHHHHHHHHHhhcCeE-EEEEecCcccccCCCceeEEEEE--- Q lcl|NC_021297. 71 GFRIS--EDSEG----LEELWNWWQA-------NDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRVE--- 133 (480) Q Consensus 71 ~~~~~--~d~~~----~~~l~~i~~~-------n~~~~~~~~~~~~a~~~G~a-~~~v~~~~~~~~d~~~~~~i~~~--- 133 (480) .+.+. +++.. .+.+.+.+.. ..|..+..+ +.+|..||.+ ++++|.. ..+|+..+..+ T Consensus 83 ~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~-~lda~~~G~s~~Eivw~~-----~~~g~~~~~~l~~r 156 (448) T protein:vir:79 83 KWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTL-----GADGKLILDKIVPI 156 (448) T ss_pred CceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHH-HHHhhhhcceeEEEEeee-----cCCCceeccccccc Confidence 44441 22222 2233333322 235554443 5678899987 5566632 12344332222 Q ss_pred cccee-EEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecc Q lcl|NC_021297. 134 SPLYM-YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 134 ~p~~~-~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~ 212 (480) +|... .-.||+ ++ + . .+.+ .. +.........+.++-+++. ++++.+ T Consensus 157 ~~~~~~~f~~~~----------------d~-~-l---~~~~--------~~-~~~~~~~~~~~~~~lP~~~--~i~~~~- 203 (448) T protein:vir:79 157 HPFNIDEVLYDE----------------EG-G-P---KALK--------LS-GEVKGGSQFVSGLEIPIWK--TVVFLH- 203 (448) T ss_pred CCccccceeeec----------------CC-c-e---EEee--------cC-CcccccccCCCccccccce--EEEEec- Confidence 22110 001111 11 0 0 0111 11 1000001111112223333 344544 Q ss_pred cccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh------hhhhhccc-CC Q lcl|NC_021297. 
213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI------YYGRILTL-AS 285 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~------~~~~~~~~-~g 285 (480) ...+.++|.+.+..-.-. .=-=+..+.+++.-++.|+.|+++.+--......++......++ +......+ .| T Consensus 204 ~~~g~p~g~gLlr~~~w~-~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~ 282 (448) T protein:vir:79 204 NDDGSFTGQSALRAAVPH-WLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDD 282 (448) T ss_pred CccCCcccchhHHHHHHH-HHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCC Confidence 455678888877532221 11225678888889999999987665211111111111111111 12122222 34 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_021297. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE-RAMRI 364 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~~ 364 (480) ...++.+..... ..+.+.++..-.+|+...- -+.+....++.++..++.....-....+..-.+.+...+. +++.- T Consensus 283 ~~ie~~ea~~~~-~~~~~~i~~~d~~Isk~iL--GqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~ 359 (448) T protein:vir:79 283 WKFDTVDLKSAM-PDAIPYLTYHDAGIARALG--IDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPK 359 (448) T ss_pred ceEEEEecCCCc-ccHHHHHHHHHHHHHHHHh--hhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455565543322 2233445444455544431 0111111111111222221111111112222233445554 35555 Q ss_pred HHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 365 AMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTL 444 (480) Q Consensus 365 ~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~ 444 (480) ++.++-+... ....+.|....+.++.+.|+.+.+++..+ ..+ +.+.++. 
T Consensus 360 l~~lNfg~~~---~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~--~~~------------------~~~~~~~-------- 408 (448) T protein:vir:79 360 LVLPNWPSAT---RFPRLTFEMEERNDFSAAANLMGMLINAV--KDS------------------EDIPTEL-------- 408 (448) T ss_pred HHHhcCCCcC---CCcEEEecCCChHHHHHHHHHhhhhhccc--hhh------------------HHHHHHh-------- Confidence 5555532221 12467787777777777777776654321 011 1111111 Q ss_pred hcccccCcccccCCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021297. 445 YSTTKAQADATPKPTVTDTKTETQTSPSGFNRTKTR 480 (480) Q Consensus 445 ~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~~~~~ 480 (480) .+... +.+++.+..........+ .....+.|| T Consensus 409 ~~~p~--~~~~~~~~a~~~~~~~~~--~~~~~~~~~ 440 (448) T protein:vir:79 409 KALID--ALPSKMRRALGVVDEVRE--AVRQPADSR 440 (448) T ss_pred hcCCC--CCCCccccccCCCCcccc--cccCCcccc Confidence 00000 111111111111111111 123344555 No 267 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=88.24 E-value=0.034 Score=28.67 Aligned_cols=201 Identities=10% Similarity=0.054 Sum_probs=74.3 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) +|. ..++. .++. +.... + ...+. . .-+..==|+||++-....+.+|.|.|.- ... T Consensus 1 ~r~---~~dg~-~~y~--~~~~~---~--~~~g~--------~--~~~~~~eilH~r~~~~~~~~~Glspi~~----a~~ 55 (219) T protein:vir:98 1 MRV---CKDGN-YKYL--MKKSL---Y--DTKSE--------I--YEYNKNDVIFIKLYDPMQQVYGSPDYVG----GIT 55 (219) T ss_pred Cce---eecCe-EEEE--Eecce---e--cCCce--------e--EEeccccEEEecCCCCCCCcceecHHHH----HHH Confidence 111 11111 1110 00000 0 00000 0 0000001345554323345678886543 233 Q ss_pred HHHHHHHHHHHHHHHh---cchhhhh--cCCCccccccccccchhhhhh-----hhh-cccCC---CCceEEEeccc--c Q lcl|NC_021297. 
234 AASRTLMNLQSASQIL---GTPLRVI--SGVTTDELTNDGENTTLDIYY-----GRI-LTLAS---EAAKISEFKAA--E 297 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~---~~~~~~i--~g~~~~~~~~~~~~~~~~~~~-----~~~-~~~~g---~~~~~~~~~~~--~ 297 (480) ++....+-..-...+| +.|-.+| +|..+.....+.-...|+-.. ..+ +..+| ++.++..+... + T Consensus 56 ~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d 135 (219) T protein:vir:98 56 SALLNSDATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQK 135 (219) T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHH Confidence 3332222222222233 3343333 232222111111111121110 111 22222 24566655432 3 Q ss_pred HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhCC-CCcc Q lcl|NC_021297. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-QIMGR-EVTE 375 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~-~~~~~-~~~~ 375 (480) . .|++..+..+.+|+..-++|++.+|....+.+++..+..+...+.. ..|.-.+..+. .+... ... T Consensus 136 ~-qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~----------~tL~P~~~~ie~~ln~~~~~~- 203 (219) T protein:vir:98 136 D-EFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQA----------DEVLPLQEIIAESINSDYEIK- 203 (219) T ss_pred H-HHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHHHHH----------HHHHHHHHHHHHHhhhhhcCC- Confidence 3 4889888999999999999999998543222222222221111111 11111111111 11110 111 Q ss_pred cceeeeEEecCCCCCCHH Q lcl|NC_021297. 
376 EYTRLETVWRDPSTPTVA 393 (480) Q Consensus 376 ~~~~i~v~f~~~~~~~~~ 393 (480) ..+++.|..+...|+- T Consensus 204 --~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 204 --SALKVNFKQPEKRDKN 219 (219) T ss_pred --CccEEeecCcccccCC Confidence 2356788777766655 No 268 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=87.21 E-value=0.04 Score=28.23 Aligned_cols=352 Identities=13% Similarity=0.041 Sum_probs=127.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccC-CCch Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~ 79 (480) |- +..++... ......-+.+. ....+.. ...+.......+|+..++-+-.-++.+- .+.. T Consensus 1 Mg----~f~~l~~~-------~~~~~~~~~~~-----~~~~~~~---~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~ 61 (376) T protein:vir:78 1 MG----FFSELFKR-------NKEIEWMWDLD-----FLEDKTT---KVYLKKMALNTCVKHIARTIAKSDFRLKNGETS 61 (376) T ss_pred Cc----hhhhhhcc-------CCccccccchh-----hccccch---hhhhhhHHHHHHHHHHHHhhcccceeecccccc Confidence 22 22222111 00000000000 0000000 1011112233444444443333344332 1222 Q ss_pred hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 80 GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 80 ~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ....+..++.. | ........+..+.+.+|.+|+++..+. .|.+ . ..+|+..... ... 
T Consensus 62 ~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~------~~~~--~-----~~~~~~~~~~-----~~~ 123 (376) T protein:vir:78 62 VRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTD------DFLI--A-----DSYVRKEFAF-----FPD 123 (376) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCC------Ceee--c-----cceeecccce-----eee Confidence 23344455432 3 234566778888999999998875432 2221 1 1111111000 000 Q ss_pred EEEE-EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 155 RLYT-TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 155 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) .++. ...+.+ +...+..++++++ ..+... |. ..+..+.. T Consensus 124 ~~~~~~~~~~~---~~~~~~~~evih~-----------------------------~~~~~~----~~----~~~~~~~~ 163 (376) T protein:vir:78 124 VFEGVTVKDYR---YNRNFSMDDVIFL-----------------------------EYGNER----LS----AFTDGMFE 163 (376) T ss_pred eeeeeeeecce---eeeeeccccEEEe-----------------------------ccCCCC----ch----hhhhHHHH Confidence 0110 001100 0111223333332 211111 11 11222333 Q ss_pred HHHHHHHHHHHHHHHhcchhhh--hcCCC--ccccccccccchhhhh-------hhhhcccCCCCceEEEecccc----- Q lcl|NC_021297. 234 AASRTLMNLQSASQILGTPLRV--ISGVT--TDELTNDGENTTLDIY-------YGRILTLASEAAKISEFKAAE----- 297 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~--i~g~~--~~~~~~~~~~~~~~~~-------~~~~~~~~g~~~~~~~~~~~~----- 297 (480) .+..++........ +.+.... ++... .++...+.-...|+.. ...++.++ ++.++.+++.+. T Consensus 164 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~-~g~~~~~l~~~~~~~~~ 241 (376) T protein:vir:78 164 DYGELFGKMIRAQM-RNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQL-EGFNYEEFGTTSVNNSQ 241 (376) T ss_pred HHHHHHHHHHHHHH-hcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC-CCceEEeeccCccccch Confidence 34444333333322 2222111 11111 1111110001112111 11233333 345665554322 Q ss_pred -HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 
298 -LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 298 -~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) ...|++..+....+|+...++|+..+|++..| .+...+.+....+.-.+...+..+... +... . T Consensus 242 ~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~-~e~~~~~f~~~~l~P~~~~ie~~l~~k----------ll~~----~ 306 (376) T protein:vir:78 242 SFDEVKKLRKEMIDYVASILGIPSSLLHGDMAD-LSNNMKAYMEYCIDPLTKKLEDELNAK----------LFTF----S 306 (376) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHhh----------hCCc----c Confidence 12578888888999999999999999865433 222222222222322222222221111 1111 1 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCccccc Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ...+...+......|..+.++++.+++.+| +++...+++.+|+.+-+-... +...- +. +-.+ T Consensus 307 ~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~lg~~p~~~g~~------------d~~~~--~~--n~~~ 368 (376) T protein:vir:78 307 EFLAGEHIKIIHKKDIIENAEAVDKLVASG--SFNRNEVRELLGAERVDNPEL------------DKYLI--TK--NYQS 368 (376) T ss_pred cceecccchhhcccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCC------------ceeee--cc--Ccee Confidence 112222333334567788899998888876 556666666666543210000 00000 00 0000 Q ss_pred CCCCCCCCccccc Q lcl|NC_021297. 
457 KPTVTDTKTETQT 469 (480) Q Consensus 457 ~~~~~d~~~~~~~ 469 (480) -..+ +.+| T Consensus 369 ~~~~-----~e~g 376 (376) T protein:vir:78 369 ADEG-----GEDG 376 (376) T ss_pred hhcc-----ccCC Confidence 0000 1111 No 269 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=85.89 E-value=0.05 Score=27.74 Aligned_cols=293 Identities=14% Similarity=0.051 Sum_probs=105.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---cchhcCcccchhhhhh---ccccchH-HHHHHHHHhhhccCccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYL---DVQPGWV-ATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~---~i~~~~~~~~~~~~~~---~~~~n~~-~~iVd~~~~~l~~~~~~ 73 (480) -.|++..+.. ..-++.+.-++.|+. ++.. ..+.+. ...|+-| ...++.++....++++. T Consensus 28 ~~~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~~~------~~la~~~~a~~~h~s~i~~k~n~l~~~~~Pnp~~ 93 (344) T protein:vir:56 28 FGEPVPVLDR--------RDILDYVECISNGRWYEPPVSF------TGLAKSLRAAVHHSSPIYVKRNILASTFIPHPWL 93 (344) T ss_pred cCCceeecCc--------chhhhHHHhhhcCccccCCCCH------HHHHHHHhhhhhhCccceehhhhHHhhcCCCCCC Confidence 1222221110 000122223334432 1111 111111 0111100 00111112212223222 Q ss_pred cCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 74 ISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 74 ~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) . ...+..++.+.+.+|.||+.+-.+. .|++ .+..++|..+-+..+.. 
T Consensus 94 t-------------------~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~pl~~~~v~~~~~~~------- 141 (344) T protein:vir:56 94 S-------------------QQDFSRFVLDFLVFGNAFLEKRYST------TGKVIRLETSPAKYTRRGVEED------- 141 (344) T ss_pred C-------------------HHHHHHHHHHHHhcCCeEEEEEECC------CCcEEEEEEeCCceeEEeecCC------- Confidence 1 0112356677888999999875432 3443 46666665544322211 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) ++|....+ +..+.| ..=-|+|+++-.-..+.+|.|.+.- .. T Consensus 142 --~~~~~~~~------------g~~~~~---------------------~~~dIiHir~~~~~~~~~Gls~~~~----a~ 182 (344) T protein:vir:56 142 --VYWWVPSF------------NEPTAF---------------------APGSVFHLLEPDINQELYGLPEYLS----AL 182 (344) T ss_pred --EEEEEecC------------CeEEEE---------------------cCccEEEECCCCCCCCcccccHHHH----HH Confidence 01111111 111111 0112455554222345678886643 33 Q ss_pred HHHHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhh----hhhhh--cccCC---CCceEEEeccccH Q lcl|NC_021297. 233 DAASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDI----YYGRI--LTLAS---EAAKISEFKAAEL 298 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~---~~~~~--i~g~~~~~~~~~~~~~~~~~----~~~~~--~~~~g---~~~~~~~~~~~~~ 298 (480) +++....+-..-...+|. .|-.+ ++|....+...+.-...|+- +.++. +..++ ++.++..+..... T Consensus 183 ~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~ 262 (344) T protein:vir:56 183 NSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVAT 262 (344) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChH Confidence 333332222222223333 23333 33432222111111111211 11222 22232 3456666654322 Q ss_pred -HHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 
299 -RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIV-KMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 299 -~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~-~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) ..|++..+....+|+..-++|+..+|....+.++...+........ ....=.+ ..++++. .+.+.+. T Consensus 263 d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~----~~ie~~n----~~l~~~~--- 331 (344) T protein:vir:56 263 KDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQ----DRIREIN----GWIGQEV--- 331 (344) T ss_pred HHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHH----HHHHHHH----hhhcccc--- Confidence 2488888888899999999999999854332211111111111111 1111111 1112221 1222111 Q ss_pred ceeeeEEecCCCCCCHHH Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAA 394 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~ 394 (480) +.|.+.......+ T Consensus 332 -----~~F~~y~l~~~~~ 344 (344) T protein:vir:56 332 -----IRFKNYSLDTDNG 344 (344) T ss_pred -----ccCCCccccccCC Confidence 2333332222111 No 270 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=85.06 E-value=0.056 Score=27.46 Aligned_cols=352 Identities=12% Similarity=0.028 Sum_probs=124.1 Q ss_pred HHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CC--------Cchh Q lcl|NC_021297. 12 QGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SE--------DSEG 80 (480) Q Consensus 12 ~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~--------d~~~ 80 (480) ...| .+...+-.+.-. ........-..............+|+..++-+-.-++.+ .. .+.. 
T Consensus 1 Mg~f-------~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~ 72 (378) T protein:vir:16 1 MNLF-------GKVVSFSRGKLN-NDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMA 72 (378) T ss_pred Cccc-------hhhhhhhccccc-CCcceeeecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccc Confidence 1111 111111111000 000000000000000011122333443333332223221 10 1112 Q ss_pred HHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEE Q lcl|NC_021297. 81 LEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 81 ~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ...+..+++. | ....+...+..+.+.+|.||++...+ +..|.+. .+-|. T Consensus 73 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d-----~~~g~~~--~l~~~------------------- 126 (378) T protein:vir:16 73 GSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFD-----DNTGELL--DLLFA------------------- 126 (378) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-----cCCceEE--EEEec------------------- Confidence 3456666652 2 34566777888999999999864321 1112221 01000 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHH Q lcl|NC_021297. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~ 235 (480) +.+ ..|..+.++ ||++. .....|.| .+..+.+++ T Consensus 127 ------~~~-----~~~~~~dii-----------------------------h~r~~--~~~~~~~s----~l~~~~~~i 160 (378) T protein:vir:16 127 ------DDK-----KEYKPEELV-----------------------------RLTSP--FYINEDTS----ILDNALASI 160 (378) T ss_pred ------CCe-----eEecccceE-----------------------------EecCc--cCccchhH----HHHHHHHHH Confidence 000 011222333 33210 01111222 133334444 Q ss_pred HHHHHHHHHHHHHhcchhhhhcC-CCccccccccccchhhh---------hhhhhcccCCCCceEEEeccccHHHHHHHH Q lcl|NC_021297. 
236 SRTLMNLQSASQILGTPLRVISG-VTTDELTNDGENTTLDI---------YYGRILTLASEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 236 ~~~~s~~~~~~~~~~~~~~~i~g-~~~~~~~~~~~~~~~~~---------~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l 305 (480) +..++ .+.+..++.- ..............++. ..++++.++ ++.++.+++......-+..+ T Consensus 161 ~~~~~--------~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~~~~~~~~ 231 (378) T protein:vir:16 161 QTKLE--------QGKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNKDEI 231 (378) T ss_pred HHHHh--------cCccceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcC-CCceEEEccCChhhhhHHHH Confidence 32221 1222212211 11111111111111111 112344454 45677776643322223456 Q ss_pred HHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEec Q lcl|NC_021297. 306 EVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWR 385 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~ 385 (480) +.+..+|+...++|+..+++.. +....+.+....|.-.+...+..+...|-.--. +..+... ....++++.+. T Consensus 232 ~~~~~~Ia~~fgVPp~~l~g~~---~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e---~~~~~~~-~~~~~~~f~~~ 304 (378) T protein:vir:16 232 DLIKSELLTGYFMNENILLGTA---SQEQQIYFYNSTIIPLLIQLEKELTYKLISTNR---RRVVKGN-LYYERIIVDNQ 304 (378) T ss_pred HHHHHHHHHHhCCCHHHhcCCc---hHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhh---hhhhhhc-ccccceeeccc Confidence 7778899999999998886432 111122222222222222222222111100000 0001011 11123555556 Q ss_pred CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCCCc Q lcl|NC_021297. 386 DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDTKT 465 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) .-...|..+.++++.+++.+| +++...+++.+|+.+-+- -.+..--..-..++.. 
....+ + T Consensus 305 ~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--gD~~~~~~n~~~~~~~---~~~~~--~---------- 365 (378) T protein:vir:16 305 LFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVYIANLNAVAVKNL---SDLQG--S---------- 365 (378) T ss_pred hhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeEeeccccccccch---hhhcC--c---------- Confidence 666788889999999998875 667777788887754321 0000000000000000 00000 0 Q ss_pred ccccCCCCCCCCC Q lcl|NC_021297. 466 ETQTSPSGFNRTK 478 (480) Q Consensus 466 ~~~~~p~~~~~~~ 478 (480) ..++.|++.+... T Consensus 366 ~~~~~~~~e~~ne 378 (378) T protein:vir:16 366 RKDVTSTDETNNQ 378 (378) T ss_pred cCCCCCCCCCCCC Confidence 0011111111111 No 271 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=84.49 E-value=0.06 Score=27.28 Aligned_cols=302 Identities=14% Similarity=0.059 Sum_probs=110.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..|++..+.. ..-+...+-+|.|+.--+-+ ....+.+.--...++.-++-...+.+... |. ++..-. T Consensus 33 ~~~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~---~~~~la~~~~~~~~h~~~l~~k~n~l~~~-~~-Pnp~~t 99 (351) T protein:vir:79 33 FDDPTPVMNR--------AEILDYVECWSNGEWFEPPV---SFAGLAKSFRASTHHSSALFFKANVLAST-FR-PHRWLS 99 (351) T ss_pred cCCceeecCc--------chhhhhhhhhhcCceecCCC---CHHHHHHHHhhhHhhhhhhhhhhhHHhhc-cc-CCCCCC Confidence 2222222110 00122334456664211101 11122222111123333333333333211 11 111000 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCce-eEEEEEccceeEEEEeCCCCCeeEEEEEEEEE Q lcl|NC_021297. 
81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 81 ~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~-~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ...+.+++.+.+.+|.||+.+-.+. .|. ..+..++|..+.+..+.. ++|.. T Consensus 100 -------------~~~f~~~v~d~ll~Gnay~~~~r~~------~G~~~~L~~l~~~~v~~~~~~~---------~~~~~ 151 (351) T protein:vir:79 100 -------------RHAFERWALDFLTFGNGYLERRRNM------VGGTLRLEPALAKYVRRKADFS---------GFVYV 151 (351) T ss_pred -------------HHHHHHHHHHHHhcCCeEEEEEECC------CCCEEEEEEeCCcceeeeecCC---------eEEEE Confidence 1113456677888999999886532 343 356777776654433221 01111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHHH Q lcl|NC_021297. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~ 239 (480) ...+. ...|.++. |+|+.+-.-..+.+|.|.+.. ..+.+.... T Consensus 152 ~~~g~----~~~~~~~e-----------------------------Iihir~~~~~~~~yGl~~~~~----a~~si~l~~ 194 (351) T protein:vir:79 152 NGWQE----RHEFEPDS-----------------------------VFQLVRPDINQEVYGLPEYLS----SLHSAWLNE 194 (351) T ss_pred ecCce----EEEEcCcc-----------------------------EEEeCCCCCCCCcccccHHHH----HHHHHHHHH Confidence 11110 01111222 455554333355678776543 233332222 Q ss_pred HHHHHHHHHhc---chh--hhhcCCCccccccccccchhhhhh-----hhhcc-cCC---CCceEEEecccc-HHHHHHH Q lcl|NC_021297. 240 MNLQSASQILG---TPL--RVISGVTTDELTNDGENTTLDIYY-----GRILT-LAS---EAAKISEFKAAE-LRNFAEE 304 (480) Q Consensus 240 s~~~~~~~~~~---~~~--~~i~g~~~~~~~~~~~~~~~~~~~-----~~~~~-~~g---~~~~~~~~~~~~-~~~~~~~ 304 (480) +-..-...+|. .|- ++++|........+.-...|+-.. +.++. .++ ++.++..+.... -..|++. 
T Consensus 195 ~a~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~ 274 (351) T protein:vir:79 195 SSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNI 274 (351) T ss_pred HHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHH Confidence 21111222222 222 223343222211111111121111 11222 222 234566555432 2248888 Q ss_pred HHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021297. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVK-MAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~-k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~ 383 (480) .+....+|+..-++|+.-+|....+.+++..++.....+.. ...=.+. .|+++ ..+.+.+ -+. T Consensus 275 k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~----~ie~l----n~~lg~~--------~~~ 338 (351) T protein:vir:79 275 KNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQA----RFAEL----NDWLGDE--------VVT 338 (351) T ss_pred HHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHH----HHHHH----HhhcCcc--------eee Confidence 88889999999999999998533322211111111111111 1111111 11111 1222221 135 Q ss_pred ecCCCCCCHHHHHHHHHHH Q lcl|NC_021297. 384 WRDPSTPTVAAKADAVSKL 402 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl 402 (480) |.+.. .....+|. T Consensus 339 F~~~~------llr~d~~a 351 (351) T protein:vir:79 339 FDDYE------IPPAPVAA 351 (351) T ss_pred eChhh------hccccccC Confidence 53321 11111111 No 272 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=84.16 E-value=0.063 Score=27.18 Aligned_cols=288 Identities=14% Similarity=0.079 Sum_probs=105.2 Q ss_pred CC---------------------------CHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---cchhcCcccchhhhhh- Q lcl|NC_021297. 
1 MT---------------------------TYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYL- 49 (480) Q Consensus 1 ~~---------------------------~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~---~i~~~~~~~~~~~~~~- 49 (480) |+ +++..+.. ..-++.+.-+++|+. ++.. ..+++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~~~------~~la~~~ 66 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGEPVPVLDR--------RDILDYVECISNGRWYEPPISF------TGLAKSL 66 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCCceeecCC--------cchhHHHHhhhcCccccCCCCH------HHHHHHH Confidence 21 11111100 001122222334432 1111 111111 Q ss_pred ccccchHHHHHHHHHhhh----ccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCC Q lcl|NC_021297. 50 DVQPGWVATYLRTLSDRL----DIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPA 125 (480) Q Consensus 50 ~~~~n~~~~iVd~~~~~l----~~~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~ 125 (480) +. ..++.-++......+ .++++.. ...+..++.+.+.+|.||+.+-.+ .. T Consensus 67 ~a-~~~h~~~i~~k~n~l~~~~~Pn~~~t-------------------~~~f~~~~~d~ll~Gnay~~i~rn------~~ 120 (344) T protein:vir:60 67 RA-AVHHSSPIYVKRNILASTFIPHPWLS-------------------QQDFSRFVLDFLVFGNAFLEKRYS------TT 120 (344) T ss_pred Hh-hhhhccchhhhhhHHHhhccCCCCCC-------------------HHHHHHHHHHHHhcCCeEEEEEEC------CC Confidence 00 001111111111111 1222110 012345667788899999987653 23 Q ss_pred cee-EEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccccc Q lcl|NC_021297. 126 GIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVV 204 (480) Q Consensus 126 ~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 204 (480) |++ .+..++|..+.+..+.. ++|... ..+..+.|. 
.= T Consensus 121 G~~~~L~~l~~~~vr~~~~~~---------~~~~v~------------~~~~~~~~~---------------------~~ 158 (344) T protein:vir:60 121 GKVIRLETSPAKYTRRGVEED---------VYWWVP------------SFNEPTAFA---------------------PG 158 (344) T ss_pred CcEEEEEEcCcceEEEeecCC---------eEEEEc------------cCCeEEEEc---------------------Cc Confidence 443 46666666554433211 011111 011111110 11 Q ss_pred ceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcc---hhhh--hcCCCccccccccccchhhhh--- Q lcl|NC_021297. 205 PVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGT---PLRV--ISGVTTDELTNDGENTTLDIY--- 276 (480) Q Consensus 205 Pvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~---~~~~--i~g~~~~~~~~~~~~~~~~~~--- 276 (480) .|+|+++-.-..+.+|.|.+.- .++++....+-..-...+|.+ |-.+ ++|....+...+.-...|+-. T Consensus 159 eIiHir~~~~~~~~yGlsp~~~----a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~ik~~~~~~~g~ 234 (344) T protein:vir:60 159 SVFHLLEPDINQELYGLPEYLS----ALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGR 234 (344) T ss_pred cEEEEcCCCCCCCcccccHHHH----HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCC Confidence 2566664333456788887643 333333322222222233332 2222 334322221111111112211 Q ss_pred -hhh--hcccCC---CCceEEEeccccH-HHHHHHHHHHHHHHHhccCCChHHhccccccchH-H----HHHHHHHHHHH Q lcl|NC_021297. 277 -YGR--ILTLAS---EAAKISEFKAAEL-RNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS-A----EAIIATDSRIV 344 (480) Q Consensus 277 -~~~--~~~~~g---~~~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~S-g----~Al~~~~~~l~ 344 (480) .++ ++..++ ++.++..+..... ..|++..+....+|+..-++|+.-+|....+.++ | .++.+....|. T Consensus 235 ~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~L~ 314 (344) T protein:vir:60 235 NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELI 314 (344) T ss_pred CCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHH Confidence 112 222233 2345665543322 2488888888999999999999998853332211 1 11111112221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021297. 
345 KMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 345 ~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~ 394 (480) =.+. .|++ +..+.|... +.|.+......-. T Consensus 315 Pl~~--------~~e~----ln~~lg~~~--------i~F~~~~l~~~d~ 344 (344) T protein:vir:60 315 PLQD--------RIRE----INGWLGQEV--------IRFKNYSLDTDNG 344 (344) T ss_pred HHHH--------HHHH----HHHhcCCcc--------cccCccccCCCCC Confidence 1111 1121 122233211 1233222111111 No 273 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=83.93 E-value=0.064 Score=27.11 Aligned_cols=309 Identities=9% Similarity=-0.019 Sum_probs=105.4 Q ss_pred CC----------------------CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHH Q lcl|NC_021297. 1 MT----------------------TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVAT 58 (480) Q Consensus 1 ~~----------------------~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ 58 (480) |+ |++.++.. ..-++-+.=+|+|.-+.-.-+. ....+++.--...++.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~--------~~~~~~~~~~~~~~~~~~epp~-~~~~La~l~~~n~~h~~ 71 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTN--------SWMTRYCELFYNDFDDYWEPPI-SLKGLAEIANANGYHGS 71 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCc--------chHHHHHHHHhcCCCccccCCC-CHHHHHHHHhhhhhhhh Confidence 33 12211110 0001112222333111100000 01222222111122222 Q ss_pred HHHHHHhhhccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccce Q lcl|NC_021297. 59 YLRTLSDRLDIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLY 137 (480) Q Consensus 59 iVd~~~~~l~~~~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~ 137 (480) ++....+.+... |. ++.--. ...+.+++.+.+.+|.||+.+-.+. .|++ .+..++|.. 
T Consensus 72 ~i~~k~N~l~~~-~~-Pn~~~t-------------~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~~l~~~~ 130 (348) T protein:vir:26 72 LLKARANYVAGR-FM-NGGGLP-------------MYKMNSACWDYFGLGMSAFVKIRSY------LKNVIALEPLPMVH 130 (348) T ss_pred hHhhhhhHHhhc-cc-CCCCCC-------------HHHHHHHHHHHHhcCCeEEEEEEcC------CCcEEEEEEecCce Confidence 333333322211 11 111000 1123356677888999999876542 3443 466666655 Q ss_pred eEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCc Q lcl|NC_021297. 138 MYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGN 217 (480) Q Consensus 138 ~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~ 217 (480) +-+.-|. . +++... ++.. ..|.++. |+||++-.-..+ T Consensus 131 v~~~~d~----~-----~~~~~~-~g~~----~~f~~~d-----------------------------IiHir~~~~~~~ 167 (348) T protein:vir:26 131 MRKRKNG----D-----FVQLLR-NNEQ----KVFKAKD-----------------------------VIFIPQYDPQQQ 167 (348) T ss_pred eEeeecC----c-----EEEEEe-cCeE----EEEcCcc-----------------------------EEEEcCCCCCCC Confidence 4332110 0 011111 1100 0111222 344443222345 Q ss_pred cCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhhh--h---hhcc-cCC- Q lcl|NC_021297. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIYY--G---RILT-LAS- 285 (480) Q Consensus 218 ~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~--i~g~~~~~~~~~~~~~~~~~~~--~---~~~~-~~g- 285 (480) .+|.|.+.. .+.++....+-..-...+|. .|-.+ +++........+.-...|+-.. + .++. .++ T Consensus 168 ~~Gls~~~~----a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g 243 (348) T protein:vir:26 168 IYGLPDYLG----SIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNG 243 (348) T ss_pred cccccHHHH----HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCC Confidence 678776543 33333322221111222222 23222 2332222111111111121111 1 1222 222 Q ss_pred --CCceEEEeccccHH-HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 
286 --EAAKISEFKAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAI-IATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 286 --~~~~~~~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al-~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) ++.++..+.....+ .|++..+....+|+..-++|+.-+|....+.++...+ +....-......=....+..+|. T Consensus 244 ~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln-- 321 (348) T protein:vir:26 244 KEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVN-- 321 (348) T ss_pred CccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 23456666543322 4778777778899999999999988533322211111 11111111111111112111121 Q ss_pred HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHH Q lcl|NC_021297. 362 MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAV 399 (480) Q Consensus 362 ~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~ad~~ 399 (480) ...+ . .....+++.|.+.. +...+-++ T Consensus 322 -----~~l~--~-~~~~~~~fdl~~~~---e~~~~~a~ 348 (348) T protein:vir:26 322 -----NDPE--I-PDNLKLKFNLNPGV---ESANGSAV 348 (348) T ss_pred -----hhhC--C-CCccEEEEecCccc---ccchhhcC Confidence 1111 1 11123333332211 11111112 No 274 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=83.15 E-value=0.071 Score=26.89 Aligned_cols=350 Identities=12% Similarity=0.029 Sum_probs=122.0 Q ss_pred HHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhcc--ccchHHHHHHHHHhhhccCcccc---CC-C-------c Q lcl|NC_021297. 12 QGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDV--QPGWVATYLRTLSDRLDIEGFRI---SE-D-------S 78 (480) Q Consensus 12 ~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~--~~n~~~~iVd~~~~~l~~~~~~~---~~-d-------~ 78 (480) ...|. +...+-.+. ..........+....+ .......+|+..++-+-.-++.+ .. 
+ + T Consensus 1 Mg~f~-------~~~~f~~~~---~~~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:93 1 MNLFG-------KVVSFSRGK---LNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred Cccch-------hhhhhhccc---cCCCcceeeecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccc Confidence 11110 011110000 0000000000000000 01122233333333332223321 10 0 1 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 79 ~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) .....+..+++. | ....+...+..+.+.+|.||+++..+ +..|.+.. +| T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~-----~~~g~~~~----------l~----------- 124 (378) T protein:vir:93 71 MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFD-----DNTGELLD----------LL----------- 124 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-----cCCceEEE----------EE----------- Confidence 122456666652 3 23466677888999999999865321 11222211 11 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) ..+.+ ..|.++++++ |++- ....-|.|. +..+.+ T Consensus 125 ------~~~~~-----~~~~~~diih-----------------------------~r~~--~~~~~~~s~----l~~~~~ 158 (378) T protein:vir:93 125 ------FADDK-----KEYKTEELVR-----------------------------LTSP--FYINEDTSI----LDNALA 158 (378) T ss_pred ------ecCCe-----eEeccceeEE-----------------------------ecCc--cccchhhHH----HHHHHH Confidence 00000 0112222322 2210 011112221 223333 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcC-CCccccccccccchhhh---------hhhhhcccCCCCceEEEeccccHHHHHH Q lcl|NC_021297. 
234 AASRTLMNLQSASQILGTPLRVISG-VTTDELTNDGENTTLDI---------YYGRILTLASEAAKISEFKAAELRNFAE 303 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i~g-~~~~~~~~~~~~~~~~~---------~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 303 (480) +++..++ .+.+-.++.- ..............|+. ..+.++.++ ++.++.+++......-++ T Consensus 159 ~i~~~~~--------~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~g~~~~~l~~~~~~~~~~ 229 (378) T protein:vir:93 159 SIQTKLE--------QGKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNKD 229 (378) T ss_pred HHHHHHh--------cCcccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcC-CCceEEEccCChhhhhHH Confidence 3322211 1222222210 01111000111111111 112344444 456777776443333345 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~ 383 (480) .++....+|+...++|+.-+++.. +....+.+....|.-.+...+..+...|-.-- -+-.+... ....++++. T Consensus 230 ~~~~~~~~Ia~~fgVPp~~l~g~~---~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~---er~~~~~~-~~~~~~~fd 302 (378) T protein:vir:93 230 EIDLIKSELLTGYFMNENILLGTA---TQEQQIYFYNSTIIPLLIQLEKELTYKLISTN---RRRVVKGN-LYYERIIVD 302 (378) T ss_pred HHHHHHHHHHHHhCCCHHHhcCCc---HHHHHHHHHHHHHHHHHHHHHHHHHhhcCChh---Hhhhhhhc-ccccceeec Confidence 667788899999999998885431 11111111111122222222211111110000 00001000 111235555 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccccCCCCCCC Q lcl|NC_021297. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTDT 463 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) +..-...|..+.++++.+++.+| +++...+++.+|+.+-+- -.+.. -..+-........... 
T Consensus 303 ~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g--gD~~~--------~~~n~~~~~~~~~~~~------ 364 (378) T protein:vir:93 303 NQLFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVYI--------ANLNAVAVKNLSDLQG------ 364 (378) T ss_pred cchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeee--------eccccccccchhhhcC------ Confidence 56667788889999999998876 567777788888764321 00000 0000000000000000 Q ss_pred CcccccCCCCCCCCC Q lcl|NC_021297. 464 KTETQTSPSGFNRTK 478 (480) Q Consensus 464 ~~~~~~~p~~~~~~~ 478 (480) +.....|++.+... T Consensus 365 -~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 365 -SRKDVTSTDETNNQ 378 (378) T ss_pred -ccCCCCCCCCCCCC Confidence 00001111111111 No 275 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=83.07 E-value=0.072 Score=26.86 Aligned_cols=292 Identities=12% Similarity=0.078 Sum_probs=101.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CcchhcCcccchhhhhhccccchHHHHHHHHHhhh----ccCccc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGT---RRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~---~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l----~~~~~~ 73 (480) ..+++.++..- .-++.++-++.|+ +++.. ..++..--...++.-++......+ .++++. T Consensus 25 ~~~p~~~~~~~--------~~~~~~~~~~~~~~~~pp~~~------~~la~l~~a~~~h~s~i~~k~n~l~~~~~Pn~~l 90 (340) T protein:vir:98 25 FGEPVPVLDKR--------DILDYVECISNGKWYEPPVSF------SGLAKSLRSAVHHSSPIYVKRNVLASTYIPHPLL 90 (340) T ss_pred cCCceeecCcc--------hhhhhhhhhhcCceecCCCCH------HHHHHHHHhccccchhhhhhhhHHhhccCCCCCC Confidence 22222222100 0011112223332 22211 112221000111111121112211 222211 Q ss_pred cCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEE Q lcl|NC_021297. 
74 ISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 74 ~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~ 152 (480) . ...+.+++.+.+.+|.||+.+-.+. .|.+ .+..++|..+-...+.. T Consensus 91 t-------------------~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~pl~~~~vr~~~~~~------- 138 (340) T protein:vir:98 91 S-------------------RQDFSRFALDYLVFGNAFLEQRHSV------TGQLIKLLTSPAKYTRRGVDDS------- 138 (340) T ss_pred C-------------------HHHHHHHHHHHHhcCCeEEEEEECC------CCcEEEEEEeCCceEEEcccCc------- Confidence 0 1123456677888999999876532 3433 45566665443322110 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHH Q lcl|NC_021297. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~li 232 (480) ++|....++.. ..|.++. |+||.+-.-..+.+|.|.+... + T Consensus 139 --~~~~~~~~~~~----~~~~~~e-----------------------------ViHir~~~~~~~~~Gls~~~~a----~ 179 (340) T protein:vir:98 139 --VFWFVENFTQP----HEFAPDT-----------------------------VFHLLEPDINQEIYGLPEYLSA----L 179 (340) T ss_pred --EEEEEecCCeE----EEEcccc-----------------------------EEEEcCCCCCCCcccccHHHHH----H Confidence 01111111100 0111111 4555532223456788776432 2 Q ss_pred HHHHHHHHHHHHHHHHhcc---hh--hhhcCCCccccccccccchhhhh--h---hhhcc-cCC---CCceEEEecccc- Q lcl|NC_021297. 233 DAASRTLMNLQSASQILGT---PL--RVISGVTTDELTNDGENTTLDIY--Y---GRILT-LAS---EAAKISEFKAAE- 297 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~---~~--~~i~g~~~~~~~~~~~~~~~~~~--~---~~~~~-~~g---~~~~~~~~~~~~- 297 (480) .++....+-..-...+|.+ |- ++++|....+...+.-...|+-. . +.++. .++ ++.++..+.... 
T Consensus 180 ~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~ 259 (340) T protein:vir:98 180 NSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVAT 259 (340) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChh Confidence 3332221111111223322 32 33344333222111111112211 1 12222 222 234565555332 Q ss_pred HHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021297. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVK-MAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~-k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 376 (480) -..|++..+....+|+..-++|+.-+|....+.++...+......+.. ...=... .|+++. ...+.+. T Consensus 260 d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~----~iee~n----~~L~~e~--- 328 (340) T protein:vir:98 260 KDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVFVRNELSPLQD----RFREVN----DWLGMEV--- 328 (340) T ss_pred HHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHHHHHH----HHHHHH----hcccccc--- Confidence 224888888889999999999999988543322111111111111111 1111111 111111 1112111 Q ss_pred ceeeeEEecCCCCCCHHHHHH Q lcl|NC_021297. 377 YTRLETVWRDPSTPTVAAKAD 397 (480) Q Consensus 377 ~~~i~v~f~~~~~~~~~~~ad 397 (480) ++|.+....+ +| T Consensus 329 -----~rF~~~~l~~----~d 340 (340) T protein:vir:98 329 -----IRFKEYTLDN----PE 340 (340) T ss_pred -----cccCcccccc----CC Confidence 2332211111 11 No 276 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=80.04 E-value=0.099 Score=26.10 Aligned_cols=365 Identities=12% Similarity=0.066 Sum_probs=126.0 Q ss_pred HHHHHHHhcccC-cch--hcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---CCCchhHHHHHHHHHh--c-- Q lcl|NC_021297. 
22 LLEAEAYRNGTR-RLK--TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSEGLEELWNWWQA--N-- 91 (480) Q Consensus 22 ~~~~~~YY~g~~-~i~--~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~~~~~l~~i~~~--n-- 91 (480) +-.++.+..++. .+. ..+........+..+.+.....+|+..++.+-.-++.+ ..+......+..+++. | T Consensus 1 Mgl~d~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~~~PN~~ 80 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLYWINTKANPN 80 (395) T ss_pred CcchhhhcCCCCccccccccccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHHHHhhcCCCC Confidence 111111111111 000 00000111111111111223334444444433333332 2222223345555542 3 Q ss_pred -CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEE Q lcl|NC_021297. 92 -DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRAT 170 (480) Q Consensus 92 -~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (480) ....+...++.+.+.+|.||+++..+. ... + +. .+++.....+ .+++....+. ..+-. T Consensus 81 ~t~~~f~~~l~~~lll~Gna~~~~~~~~--------~~~---~-~~-~~~~~~~~~~------~~~~~v~~~~--~~~~~ 139 (395) T protein:vir:96 81 QSASQFWVEVVQKLLVDGETLIFVIPGK--------GIY---V-AD-AFTQDKKLSG------NKFKVSRVQG--QTYEK 139 (395) T ss_pred CCHHHHHHHHHHHHhhcCceEEEEEcCC--------cee---c-CC-cccccccccc------ceeeeeeecc--ceeee Confidence 334567778888999999998875431 110 1 11 1111000000 0010000000 00111 Q ss_pred EEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccch--hhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 171 LYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS--PELRKVTDAASRTLMNLQSASQI 248 (480) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~--~~v~~liD~~~~~~s~~~~~~~~ 248 (480) .+.++++ +||+++......++.+-+. ..++.+.-++...-....-.... 
T Consensus 140 ~~~~~dv-----------------------------ih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 190 (395) T protein:vir:96 140 IFTFDQV-----------------------------IYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTP 190 (395) T ss_pred EeccCce-----------------------------EEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhh Confidence 1223333 3333222212222222111 11222221111111111111111 Q ss_pred hcchhhhhcCCCc-ccccccc-ccchhh-------hhhhhhcccCCCCceEEEeccccHH-------HHHHHHHHHHHHH Q lcl|NC_021297. 249 LGTPLRVISGVTT-DELTNDG-ENTTLD-------IYYGRILTLASEAAKISEFKAAELR-------NFAEEMEVFRKEA 312 (480) Q Consensus 249 ~~~~~~~i~g~~~-~~~~~~~-~~~~~~-------~~~~~~~~~~g~~~~~~~~~~~~~~-------~~~~~l~~~~~~i 312 (480) +... ....|... ......+ ....++ ...+.++.++ ++.++.+++....+ .+.+.....+.+| T Consensus 191 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~-~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eI 268 (395) T protein:vir:96 191 PKDK-VRERAQENSDGGRQPKSDKDFFKRTIEKIRTESVVGIPVT-ANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEF 268 (395) T ss_pred cccc-cccceeeccCchhhHHHHHHHHHHHHHHhhcCCcceEEcc-CCceeEecccChhhhhhhhHHHHHHHHHHHHHHH Confidence 1211 11111111 0000000 001111 1122233333 33566555433221 3344445567889 Q ss_pred HhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCH Q lcl|NC_021297. 313 ASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTV 392 (480) Q Consensus 313 ~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~ 392 (480) +...++|++.++++..| .+...+.+....|.-.+...+. .+.. .+....... ..+.+.|......+. 
T Consensus 269 a~~fgVPp~~l~~~~sn-~e~~~~~f~~~~L~P~~~~ie~--------~l~~--~Ll~~~e~~--~~~~f~~~~l~~~d~ 335 (395) T protein:vir:96 269 AEMLGIPISLLHGDIAD-NQKNYELLLEGPIESLITNIVD--------GLEY--AIFDKSETL--EGSFIKVTGLKNYDL 335 (395) T ss_pred HHHhCCCHHHhcCCCcc-HHHHHHHHHHHHHHHHHHHHHH--------HHHh--hcCChhhhc--CceeEeecchhccCH Confidence 99999999999754332 1221222222222222211111 1110 111111111 123456666777888 Q ss_pred HHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccc-cCCCCCCCCccc Q lcl|NC_021297. 393 AAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT-PKPTVTDTKTET 467 (480) Q Consensus 393 ~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~~~~ 467 (480) .+.++++.+++.+| +++...+++.+|+.+-+-... +...- +.+-.+. ..++..+++.++ T Consensus 336 ~~~~~~~~~~~~~G--~~T~NE~R~~~gl~pi~~~~g------------D~~~~--~~N~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 336 FSISSQADKLISSG--FVFIDEVREEIGLPELPDGLG------------KVLYM--TKNYESVLERGGEVDEEVET 395 (395) T ss_pred HHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCC------------ceeee--cccceechhccCCCCCCCCC Confidence 99999999988775 566666777777654211000 00000 0000000 000001111111 No 277 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=75.36 E-value=0.15 Score=25.14 Aligned_cols=293 Identities=15% Similarity=0.107 Sum_probs=104.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccc-hhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) -.|++..+.. ..-++.++-||+|+. +...+. ..+++.--...++.-++......+... |. 
++..- T Consensus 36 ~~~p~~v~~~--------~~~~~y~~~~~~~~~----~~pp~~~~~la~~~~~~~~h~~~l~~k~n~l~~~-~~-Pn~~~ 101 (350) T protein:vir:11 36 FGDPMPVLDG--------RGILDYLECWPNGRW----YEPPLSMEGLAKSVGSSVYLQSGLKFKRNMLAKT-FI-PHRLL 101 (350) T ss_pred eCCceeecCc--------chhhHHHHHhhcCcc----ccCCCCHHHHHHHHhhhhhhccchhhhhhhhhhc-cc-CCCCC Confidence 1222221110 000122233455531 111111 111111000011111111111111110 11 11100 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) . ...+.+++.+.+.+|.||+.+..+. .|++ .+..++|..+.+.-+.. .+|. T Consensus 102 ---------t----~~~f~~~v~d~ll~Gnay~~~~rn~------~G~~~~L~~l~~~~vr~~~~~~---------~~~~ 153 (350) T protein:vir:11 102 ---------S----RATFEQFSLDWLTFGSAYLEQPRSR------LGTRMPLQAPLAKYMRRGTDLE---------TFYQ 153 (350) T ss_pred ---------C----HHHHHHHHHHHHhcCCeEEEEEEcC------CCCEEEEEEeCCceeEeeecCC---------eEEE Confidence 0 1113456678888999999886542 3443 56677776554322211 0111 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) ...++.. ..|.++. |+||.+-.-..+.+|.|.+.. .++++... T Consensus 154 ~~~~~~~----~~~~~~e-----------------------------Vihir~~~~~~~~yGls~~~~----a~~si~l~ 196 (350) T protein:vir:11 154 VRSWKDE----HEFEKGS-----------------------------VIQLREADINQEIYGVPEWFC----ALQSALLN 196 (350) T ss_pred EeeCCeE----EEECccc-----------------------------EEEeCCCCCCCCcccccHHHH----HHHHHHHH Confidence 1111100 0111111 445554322345678876543 33333322 Q ss_pred HHHHHHHHHHhc---chh--hhhcCCCccccccccccchhhhhh-----hhhcc-cCC---CCceEEEeccccH-HHHHH Q lcl|NC_021297. 
239 LMNLQSASQILG---TPL--RVISGVTTDELTNDGENTTLDIYY-----GRILT-LAS---EAAKISEFKAAEL-RNFAE 303 (480) Q Consensus 239 ~s~~~~~~~~~~---~~~--~~i~g~~~~~~~~~~~~~~~~~~~-----~~~~~-~~g---~~~~~~~~~~~~~-~~~~~ 303 (480) .+-..-...+|. .|- ++++|....+...+.-...|+... +.++. .++ ++.++..+..... ..|++ T Consensus 197 ~~a~~~~~~~f~NGa~~~gil~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e 276 (350) T protein:vir:11 197 ESATLFRRKYYNNGSHAGFILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGS 276 (350) T ss_pred HHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHH Confidence 221111122222 222 223343222211111111222111 11222 222 2345665554322 24888 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASA-----EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg-----~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 378 (480) ..+....+|+..-++|+..+|....+.++. .+..+....|.-.+ . .|+++. ...+.. T Consensus 277 ~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~----~----~ie~ln----~~l~~~------ 338 (350) T protein:vir:11 277 IKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWASLELAPMQ----T----RLQQVN----EMIGEE------ 338 (350) T ss_pred HHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHH----H----HHHHHH----hhcCcc------ Confidence 888888999999999999998533322111 11111111121111 1 111211 122211 Q ss_pred eeeEEecCCCCCCH Q lcl|NC_021297. 
379 RLETVWRDPSTPTV 392 (480) Q Consensus 379 ~i~v~f~~~~~~~~ 392 (480) .+.|.+.....+ T Consensus 339 --~~~F~~~~~~~l 350 (350) T protein:vir:11 339 --VVRFAQFDAPGL 350 (350) T ss_pred --ccccCcccccCC Confidence 022433222222 No 278 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=71.52 E-value=0.19 Score=24.48 Aligned_cols=345 Identities=12% Similarity=0.069 Sum_probs=122.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-----cccCcch-hcCcccchhhhhhccccchHHHHHHHHHhhhccCcccc---- Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYR-----NGTRRLK-TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---- 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY-----~g~~~i~-~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---- 74 (480) .-+.. ++..|. .|..... ..+..+. ........+|+..++-+-.-++.+ T Consensus 1 M~if~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~v~~~v~~Ia~~iA~lp~~~~~~~ 59 (378) T protein:vir:94 1 MNLFG--------------KVVSFSRGKLNNDTQRVTAWQNEAVE-------YTSAFVTNIHNKIANEITKVEFNHVKYK 59 (378) T ss_pred CchhH--------------HhHhhhhcccccCcceeeeeecchhh-------hhhHHHHHHHHHHHHhHhhCceeeeeec Confidence 11111 111221 1111110 0111110 011123334444433332222211 Q ss_pred CCC-------chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEE-EecCcccccCCCceeEEEEEccceeEEE Q lcl|NC_021297. 75 SED-------SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYIT-VSHPDVESGDPAGIPLIRVESPLYMYAE 141 (480) Q Consensus 75 ~~d-------~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~-v~~~~~~~~d~~~~~~i~~~~p~~~~~v 141 (480) ..+ +.....+..+++. | .-..+...+..+.+..|.||++ ++. +.+|.+.. 
T Consensus 60 ~~~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~------~~~g~~~~----------- 122 (378) T protein:vir:94 60 KSDVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFD------SETGELLD----------- 122 (378) T ss_pred ccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEee------CCCCcEEE----------- Confidence 111 1123445566643 3 2345666788889999999875 221 11122110 Q ss_pred EeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCc Q lcl|NC_021297. 142 LDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGR 221 (480) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~ 221 (480) . ++. .++ ..|.++.+ +|+.+ +++. T Consensus 123 ------------~-~~~---~~~-----~~~~~~dv-----------------------------ih~~~------~~~~ 146 (378) T protein:vir:94 123 ------------L-LFA---NDK-----KEYKPEEL-----------------------------VRLTS------PFYI 146 (378) T ss_pred ------------E-EEe---cCc-----EEechhce-----------------------------eeecC------cCCc Confidence 0 000 000 01122222 22221 1111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcC-CCccccccccccchhhh---------hhhhhcccCCCCceEE Q lcl|NC_021297. 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISG-VTTDELTNDGENTTLDI---------YYGRILTLASEAAKIS 291 (480) Q Consensus 222 s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g-~~~~~~~~~~~~~~~~~---------~~~~~~~~~g~~~~~~ 291 (480) +.+...+..+.++++..+. .+.+--++.- ..............++. ..+.++.++ ++.++. T Consensus 147 ~~~~~~~~~~~~~~~~~~~--------~~~~~g~l~~~~~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~-~g~~~~ 217 (378) T protein:vir:94 147 NEDTSILDNALASIQTKLE--------QGKLRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVD-NKTEIV 217 (378) T ss_pred ccchhHHHHHHHHHHHHHh--------hCCcccceeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecc-CCceEE Confidence 1111112222222222211 1111111111 01111000011111111 112344554 456777 Q ss_pred EeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_021297. 
292 EFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR 371 (480) Q Consensus 292 ~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~ 371 (480) +++......-++.++....+|+...++|+..+++... ....+.+....|.-.+...+..+...|-.--. ...+. T Consensus 218 ~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~l~g~~~---e~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e---~~~g~ 291 (378) T protein:vir:94 218 ELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTAT---QEQQIYFYNSTIIPLLIQLEKELTYKLISTNR---RRVVK 291 (378) T ss_pred EccCChHHhhHHHHHHHHHHHHHHhCCCHHHhcCCch---HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhH---hhhhh Confidence 7754332222366677888999999999988864321 11122222222322222222222211100000 00111 Q ss_pred CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccC Q lcl|NC_021297. 372 EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQ 451 (480) Q Consensus 372 ~~~~~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (480) . ......+.+.+..-...+..+.++++.++..+| +++.-.+++.+|+.+-+- -.+..- ... .. .+......+ T Consensus 292 ~-~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G--~~t~NE~R~~~g~~p~~g--gd~~~~-~~n-~~-~~~~~~~~~ 363 (378) T protein:vir:94 292 G-NLYYERIIVDNQLFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVYIA-NLN-AV-AVKNLSDLQ 363 (378) T ss_pred h-hcccceeEeecchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeeee-ccc-cc-chhcchhcc Confidence 1 111234555556666788889999999998876 566666777877654320 000000 000 00 000000000 Q ss_pred cccccCCCCCCCCcccccCCCCCCCCC Q lcl|NC_021297. 452 ADATPKPTVTDTKTETQTSPSGFNRTK 478 (480) Q Consensus 452 ~~~~~~~~~~d~~~~~~~~p~~~~~~~ 478 (480) . 
..+...+++.+.-+ T Consensus 364 --~----------~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 364 --G----------NRKDVTSTDETNNQ 378 (378) T ss_pred --c----------ccCCCCCCCCCCCC Confidence 0 00111111111111 No 279 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=67.69 E-value=0.25 Score=23.90 Aligned_cols=461 Identities=11% Similarity=-0.008 Sum_probs=173.5 Q ss_pred CCC---------HHHHHHHHHH-------HHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHH Q lcl|NC_021297. 1 MTT---------YHEHVERLQG-------LLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~---------~~~~i~~l~~-------~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~ 64 (480) |.+ ++-+.++-.. .+..-..+.....+-|.+...-.+ ....++.+-|.=...|.-+.- T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~------~~~~r~nl~~sni~~i~P~iY 74 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAH------DAETRWNLFSTNIQTQMASLY 74 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCC------ccccccchhhhhHHHHhhhhh Confidence 443 3322211111 111222333444444544321111 111122222222222222222 Q ss_pred hhhccCcc---cc-CCCchhH----HHHHHHH------HhcCHHHHHHHHHHHHhhcCeEEEEEecCcccc-c------- Q lcl|NC_021297. 65 DRLDIEGF---RI-SEDSEGL----EELWNWW------QANDLDEESVLGHDDSLTFGRAYITVSHPDVES-G------- 122 (480) Q Consensus 65 ~~l~~~~~---~~-~~d~~~~----~~l~~i~------~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~-~------- 122 (480) ++. +++. +. ..|.... +.+.+.+ ++++|+..+....++++.||+|-+.|.+-.... . 
T Consensus 75 ar~-P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~ 153 (663) T protein:vir:34 75 GQT-PKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDAIL 153 (663) T ss_pred cCC-CcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccccccC Confidence 211 1111 11 1122122 2333322 345688889999999999998866664311000 0 Q ss_pred C----C-------------CceeEEEEEccceeEEEEeCCCCC-eeEEEE-EEEEE------------------------ Q lcl|NC_021297. 123 D----P-------------AGIPLIRVESPLYMYAELDPRNTR-RVTRAV-RLYTT------------------------ 159 (480) Q Consensus 123 d----~-------------~~~~~i~~~~p~~~~~v~d~~~~~-~~~~~~-~~~~~------------------------ 159 (480) | + .-.++|-.+.=.++ ++||.... ++.+.. +.|-+ T Consensus 154 D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~df--l~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~~~~~ 231 (663) T protein:vir:34 154 DEATGAELAAAVPPTQRKAYECVETDYLHWQDV--LWSPARVWHEVRWLAFRNLLDMREFNARFDADGSRNLWASVPKVG 231 (663) T ss_pred CCccccchhcccccchhhcccceeeeeechhhc--ccchhhccccccceeeeccCCHHHHHHhhcCChhhhhhhhccCcC Confidence 0 0 00222322222222 12221000 111111 11100 Q ss_pred -ec---------CCceEEEEEEEeCCc-EEEEEecCCcccccccccc---cccccccccceEEeecccccCccCCcccch Q lcl|NC_021297. 160 -RD---------DVAVPDRATLYLPDE-TVPLRRNGGLNDQWVVDGD---VIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 160 -~~---------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~---~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .. ...+...-++|.... .++|...+.. .++...+ ..++.|+ ||...+++- ...+.++.++|. T Consensus 232 ~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~--~~L~~~~p~lgl~~ffP-cPrpl~~~~-~~ds~ipvpd~~ 307 (663) T protein:vir:34 232 KPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYS--AVLDTQPDPLGLESFFP-CPKPLLANW-TTDKVVPRPDFV 307 (663) T ss_pred CccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcc--eecccCCCCCCCCCCCC-Cccccccee-cCCCeecCCcHH Confidence 00 001234556676532 2222222221 1221111 0112233 565555543 234667888876 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCc---cccccccccchhhhhhhhhcccCCCCceE-EEecc---- Q lcl|NC_021297. 
226 PELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTT---DELTNDGENTTLDIYYGRILTLASEAAKI-SEFKA---- 295 (480) Q Consensus 226 ~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~--g~~~---~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~---- 295 (480) - -..+++.+|.+--+ .|.+...--+ .+|. |... ....+...+..+-+.....+...|+-.+. ..++- T Consensus 308 ~-y~~~~~E~n~~t~R-in~l~d~ikv-~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~ 384 (663) T protein:vir:34 308 L-AQDLYKEIDLVSTR-ITLLERAIRV-VGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPLEPVV 384 (663) T ss_pred H-HHHHHHHHHHHHHH-HHHHHhhhhh-ceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcCccchhhcccchhHH Confidence 3 57888888876433 3433321111 1221 1110 00001111111111111111111221121 12221 Q ss_pred ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------- Q lcl|NC_021297. 296 AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI------- 368 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~------- 368 (480) ..+.+....-..++..+..+||+.+-.=|. +..+-.+.|-..+-+.+-.++.+++.......+.+.++...+ T Consensus 385 ~aI~~l~~~r~qir~d~~qITGiaDi~Rga-~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~ 463 (663) T protein:vir:34 385 AALTSLRDYRRELVDALHQVTGMADIMRGA-SDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDV 463 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHhhcc-cCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCH Confidence 234444455566667777788876544332 222234556667777788888888888888888887765432 Q ss_pred ------hCCCCcc----------------cceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHH--HHHhCCCChh Q lcl|NC_021297. 369 ------MGREVTE----------------EYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQ--ARIDLGYTAT 424 (480) Q Consensus 369 ------~~~~~~~----------------~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et--~~~~lg~~~~ 424 (480) .|...+. ....+.|.=..+..+|..+.-..++++.++... +++.. ++.+.|..-. 
T Consensus 464 etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~-~~qq~~pl~~q~p~~~p 542 (663) T protein:vir:34 464 ASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIAS-FMQGVAPLAQQVPGSAP 542 (663) T ss_pred HHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhhHH Confidence 2332221 223344444455667776655566665544211 22222 2233443333 Q ss_pred HHHHHHHHH------HHHHHHHHHHHhcccccCcccccCCCCCCCCccccc----CC----------CCCCCCCCC Q lcl|NC_021297. 425 QREQMRDWD------KQETEDMIDTLYSTTKAQADATPKPTVTDTKTETQT----SP----------SGFNRTKTR 480 (480) Q Consensus 425 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~----~p----------~~~~~~~~~ 480 (480) -..++..+- ..+.+.+++++.......+ ..+.+..+.++..+.. +- .-.-+.++| T Consensus 543 ~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa-~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeAq~e~q~~~~~ 617 (663) T protein:vir:34 543 FLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQ-KQAAQQSPAPQQPDPKVVAQAMKGQQEMAKVQAEVQGDLLR 617 (663) T ss_pred HHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHh-hccCCCCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333333221 1233334444433222111 0001111111000000 00 000000000 No 280 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=67.09 E-value=0.26 Score=23.82 Aligned_cols=348 Identities=13% Similarity=0.053 Sum_probs=121.4 Q ss_pred HHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccc--cchHHHHHHHHHhhhccCcccc----CCC-------c Q lcl|NC_021297. 12 QGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQ--PGWVATYLRTLSDRLDIEGFRI----SED-------S 78 (480) Q Consensus 12 ~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~--~n~~~~iVd~~~~~l~~~~~~~----~~d-------~ 78 (480) ...| .++..+..+.-.- .. ... ..+....+. 
......+|+..++-+-.-++.+ ..+ + T Consensus 1 Mg~f-------~~~~~~~~~~~~~-~~-~~~-~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:94 1 MNLF-------GKVVSFSRGKLNN-DT-QRV-TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CCcc-------ccchhcccccccC-Cc-cee-eeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccc Confidence 0011 1111111110000 00 000 000000011 1123334444443332223221 111 1 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccceeEEEEeCCCCCeeEEE Q lcl|NC_021297. 79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 79 ~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) .....+.++++. | ....+...+..+.+.+|.||+++... +..|.++. +-| T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~-----~~~g~~~~--l~p------------------ 125 (378) T protein:vir:94 71 MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFD-----DNTGELLD--LLF------------------ 125 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-----CCCceEEE--EEe------------------ Confidence 112455666653 3 23466777888999999999864221 11222211 100 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHH Q lcl|NC_021297. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD 233 (480) .+.+ . .|.++.+ +||++- .....|.|- +..+.. T Consensus 126 -------~~~~-~----~~~~~di-----------------------------iH~~~~--~~~~~g~s~----l~~~~~ 158 (378) T protein:vir:94 126 -------ADDK-K----EYKPEEL-----------------------------VRLTSP--FYINEDTSI----LDNALA 158 (378) T ss_pred -------cCCe-e----Eeeeeee-----------------------------EEecCc--CCccchhHH----HHHHHH Confidence 0000 0 0112222 333321 011123332 223333 Q ss_pred HHHHHHHHHHHHHHHhcchhhhh--cCCCccccccccccchhhh---------hhhhhcccCCCCceEEEeccccHHHHH Q lcl|NC_021297. 
234 AASRTLMNLQSASQILGTPLRVI--SGVTTDELTNDGENTTLDI---------YYGRILTLASEAAKISEFKAAELRNFA 302 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~~~~~i--~g~~~~~~~~~~~~~~~~~---------~~~~~~~~~g~~~~~~~~~~~~~~~~~ 302 (480) +++..++. +.+-.++ .+. ............|+- ..+.++.++ ++.++.+++......-+ T Consensus 159 ~i~~~~~~--------~~~~gil~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~~~~~ 228 (378) T protein:vir:94 159 SIQTKLEQ--------GKLRGLLKINAF-LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNK 228 (378) T ss_pred HHHHHHhc--------ccccceeeeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceecC-CCceEEEccCChhhhhH Confidence 33322211 1221122 111 111101111111110 122344444 44677776644322223 Q ss_pred HHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeE Q lcl|NC_021297. 303 EEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLET 382 (480) Q Consensus 303 ~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~v 382 (480) +.++....+|+...++|+.-+++.. +....+.+....|.-.+...+..+...|-.-- -+-.+... .-...+++ T Consensus 229 ~~~~~~~~~Ia~~fgVP~~~l~~~~---se~~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~---er~~g~~~-~~~~~~~f 301 (378) T protein:vir:94 229 DEIDLIKSELLTGYFMNENILLGTA---SQEQQIYFYNSTIIPLLIQLEKELTYKLISTN---RRRVVKGN-LYYERIIV 301 (378) T ss_pred HHHHHHHHHHHHHhCCCHHHhcCCh---HHHHHHHHHHHHHHHHHHHHHHHHHhhcCChh---Hhhhhhhc-ccccceee Confidence 5567778899999999998885432 11111111111122222111111111110000 00001000 01123445 Q ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHH-HHHHHHHHHHHhcccccCcccccCCCCC Q lcl|NC_021297. 383 VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWD-KQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 383 ~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+..-...|..+.++++.+++.+| +++.-.+++.+|+.+-+- -.+.. ..... .++.. ... 
+.+.....+ T Consensus 302 ~~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--GD~~~~~~n~~-~~~~~---~~~--~~~~~~~~~ 371 (378) T protein:vir:94 302 DNQLFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVYIANLNAV-AVKNL---SDL--QGSRKDVTS 371 (378) T ss_pred cchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeeeeccccc-ccccc---hhh--cCCcCCCCC Confidence 555666778889999999998875 566766777777654321 00000 00000 00000 000 000000000 Q ss_pred CCCcccc Q lcl|NC_021297. 462 DTKTETQ 468 (480) Q Consensus 462 d~~~~~~ 468 (480) ++++..+ T Consensus 372 ~~e~~n~ 378 (378) T protein:vir:94 372 TDETNNQ 378 (378) T ss_pred CCCCCCC Confidence 1111111 No 281 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=63.56 E-value=0.32 Score=23.33 Aligned_cols=297 Identities=10% Similarity=0.018 Sum_probs=105.9 Q ss_pred CCCHHHHHHHHHHHHHHHHH-HHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLP-NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~-r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) ..+++.++..- ..+ +-.. .+....+|| ++++.. ..+.+.--...++..++....+.+.. .|... T Consensus 22 ~~~p~~~~~~~-~~~-~~~~~~~~~~~~~~--~pP~~~------~~La~l~~~~~~h~~~L~~k~N~~~~-~f~~~---- 86 (337) T protein:vir:78 22 FSMPEAIDPTA-WMT-DYTGVFYNPYGEYY--QPPIDR------KGLAKVARANAHHGAILMARRNMVAG-RFTNQ---- 86 (337) T ss_pred ecCcccccCcc-hhH-hhhhhhhccCccee--cCCCCH------HHHHHHhhcchhhhhHHHhhhccccc-cCcCc---- Confidence 22222222110 000 0000 000011222 222211 11222211122333333332221111 11110 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEEEEEE Q lcl|NC_021297. 
80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 80 ~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ...+..++.+.+.+|.||+.+-.+ ..|++ .+..++|..+-..-|. . .+|. T Consensus 87 --------------~~~~~~~~~d~ll~GNay~~~~rn------~~G~~~~L~pl~~~~v~~~~d~----~-----~~~~ 137 (337) T protein:vir:78 87 --------------RATITAFVHNYLQFGDGGLLKLRN------SFGQVVGLHPLSSVYLRRREDG----C-----FVYL 137 (337) T ss_pred --------------HHHHHHHHHHHHhhCCeEEEEEEC------CCCcEEEEEEeCCceeEeeeCC----e-----EEEE Confidence 123456777888999999987653 23443 4555665543222110 0 0111 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHHHHHH Q lcl|NC_021297. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~ 238 (480) .. .+..+.| ..=-|+|+++-.-..+.+|.|.+.- .+.++... T Consensus 138 ~~-------------~~~~~~~---------------------~~~eIiHik~~~~~~~~~Gls~~~~----a~~si~l~ 179 (337) T protein:vir:78 138 QQ-------------GKPNLIY---------------------RPDDVIWLAQYDPEQQVYGMPDYLG----GLQSALLN 179 (337) T ss_pred Ec-------------CCceEEE---------------------CCccEEEECCCCCCCCcccccHHHH----HHHHHHHH Confidence 00 0111111 1112456664333456788886543 33343333 Q ss_pred HHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhhh-----hhhc-ccCC---CCceEEEecccc-HHHHHH Q lcl|NC_021297. 239 LMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIYY-----GRIL-TLAS---EAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 239 ~s~~~~~~~~~~---~~~~~--i~g~~~~~~~~~~~~~~~~~~~-----~~~~-~~~g---~~~~~~~~~~~~-~~~~~~ 303 (480) .+-..-...+|. .|-.+ ++|...+....+.-...|+-.. +.++ ..+| ++.++..+.... 
-..|++ T Consensus 180 ~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle 259 (337) T protein:vir:78 180 QDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAA 259 (337) T ss_pred HHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHH Confidence 222222223333 23223 2333222211111011121111 1122 2222 234566665432 224778 Q ss_pred HHHHHHHHHHhccCCChHHhccccccchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021297. 304 EMEVFRKEAASITGLPPQYLSSSSENPASA--EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg--~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~ 381 (480) ..+....+|+..-++|++.+|....+..+| .+-+....-......=..+.+..++.+ .+ ... ... T Consensus 260 ~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~--------~l--l~~---~~~ 326 (337) T protein:vir:78 260 IKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAINS--------AG--LPR---ALW 326 (337) T ss_pred HHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhh--------hc--CCh---hhc Confidence 778888899999999999998644432222 111111111111111111111111111 01 111 011 Q ss_pred EEecCCCCCCH Q lcl|NC_021297. 382 TVWRDPSTPTV 392 (480) Q Consensus 382 v~f~~~~~~~~ 392 (480) +.|..+...-+ T Consensus 327 ~~f~~~~~~~~ 337 (337) T protein:vir:78 327 VTFRETIGAAV 337 (337) T ss_pred eeccccccccC Confidence 23332222221 No 282 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=61.52 E-value=0.35 Score=23.07 Aligned_cols=302 Identities=10% Similarity=0.069 Sum_probs=105.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhh--c----cCcccc Q lcl|NC_021297. 
1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL--D----IEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l--~----~~~~~~ 74 (480) |.+-...- . +..... .+ ..-.+..|...+--...+|+ . ++-|.. T Consensus 1 ~~~~~~~~------------------~-----~~~~~~---~~----~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~ep 50 (345) T protein:vir:37 1 MKTNVKTD------------------N-----KKGIVI---AP----INDRTFSLNEISASPALDYVGIGFDENYNCYLP 50 (345) T ss_pred CCCCcccc------------------c-----hhhccc---Cc----ceeEEeecCCcccccchhhhhhhhcCCccccCC Confidence 11100000 0 000000 00 00001111110000111111 0 000111 Q ss_pred CCCchhH------------------HHHHHHHHhcC-H-HHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEE Q lcl|NC_021297. 75 SEDSEGL------------------EELWNWWQAND-L-DEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVE 133 (480) Q Consensus 75 ~~d~~~~------------------~~l~~i~~~n~-~-~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~ 133 (480) +-+-+.. ..+...+.-|. + ...+.+++.+.+.+|.||+.+-.+. .|.+ .+..+ T Consensus 51 p~~~~~la~l~~~~~~h~~~i~~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~pl 124 (345) T protein:vir:37 51 PVNRHALAKLPHQNAQHGGILHSRANMVSSLYEGGKALSRMDMRALCLNLIQFGDVGLLKVRNG------FGQVVRLVPL 124 (345) T ss_pred CCCHHHHHHHhhcccccccceeeechHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcC------CCcEEEEEEE Confidence 1000000 01111111221 1 1234567888999999999876542 3443 56777 Q ss_pred ccceeEEEEeCCCCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeeccc Q lcl|NC_021297. 134 SPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP 213 (480) Q Consensus 134 ~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~ 213 (480) +|..+.+..|... .+.++++... ..+.. ..|.++ -|+|+++-. 
T Consensus 125 ~~~~vr~~~d~~~----~~~~~~~~~~-~~g~~---~~~~~~-----------------------------dVihir~~~ 167 (345) T protein:vir:37 125 SSLYLRVRKDGGY----SYLMKKSLYD-TAQEI---YRYDAK-----------------------------DIIFIKLYD 167 (345) T ss_pred cCceeEEEEeCCe----eEEEEEeEec-CCceE---EEEccc-----------------------------cEEEecCCC Confidence 7776655443211 1111111100 00000 001111 145555433 Q ss_pred ccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhhh-----hhhcc- Q lcl|NC_021297. 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIYY-----GRILT- 282 (480) Q Consensus 214 ~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~---~~~~~--i~g~~~~~~~~~~~~~~~~~~~-----~~~~~- 282 (480) -..+.+|.|.+...+.+ +. ++....... ..+|. .|-.+ +++........+.-...|+-.. ..++. T Consensus 168 ~~~~~~Gls~~~~a~~s-i~-l~~~a~~~~--~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~ 243 (345) T protein:vir:37 168 PMQQVYGSPDYVGGIQS-AL-LNSDATVFR--RRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVN 243 (345) T ss_pred CCCCcccccHHHHHHHH-HH-HHHHHHHHH--HHHHhccCCcceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceEEE Confidence 33456788776432222 11 222222222 22222 23333 2332222111111011122111 11222 Q ss_pred cCC---CCceEEEecccc-HHHHHHHHHHHHHHHHhccCCChHHhccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 283 LAS---EAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS-AEAIIATDSRIVKMAERKGRIFGGA 357 (480) Q Consensus 283 ~~g---~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~S-g~Al~~~~~~l~~k~~~~~~~~~~~ 357 (480) .++ ++.++..+.... -..|++..+....+|+..-++|+..+|....+.++ +.+-+....-......=....+..+ T Consensus 244 ~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ 323 (345) T protein:vir:37 244 IANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAET 323 (345) T ss_pred cCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 345566655432 22477877888999999999999999854333221 1111211111111112222222222 Q ss_pred HHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021297. 
358 WERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 358 l~~~~~~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~ 394 (480) +.++ ..+ . ....+.|.+. ++++ T Consensus 324 ln~~----~~~-----~---~~~~i~F~~~---~L~~ 345 (345) T protein:vir:37 324 INQD----PEI-----K---NLLKIKFREQ---NFAK 345 (345) T ss_pred hhhh----ccC-----C---CcceEEecch---hhcC Confidence 2211 111 1 1234566432 2222 No 283 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=51.19 E-value=0.59 Score=21.85 Aligned_cols=399 Identities=9% Similarity=-0.006 Sum_probs=119.9 Q ss_pred CCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCcchhcCcccchhhhhhccccchHHHHHHHHHhhhc--cCccccCCC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLA-RDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD--IEGFRISED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~-~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~--~~~~~~~~d 77 (480) ++..+++....+..+. .+..+..++.++++--+.-.. +... ..+....-..+++..+..+++ ..+|..+.. T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~----i~~~--~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~ 84 (452) T protein:vir:36 11 FSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMA----IDDE--PAKDSWKPDNRLAVNFTKYIVDTFTGYFNGIP 84 (452) T ss_pred cCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccc----cccC--ccccccCccceeecchHHHHHHHHhhhhcccC Confidence 6666777666666654 455666566555544333211 1111 111111111234444555443 335555554 Q ss_pred ch---hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEE-EEccc--eeEEEEeCCCCCeeE Q lcl|NC_021297. 78 SE---GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR-VESPL--YMYAELDPRNTRRVT 151 (480) Q Consensus 78 ~~---~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~-~~~p~--~~~~v~d~~~~~~~~ 151 (480) .. ..+...+.+++-.-+..+......+......| |...+. ..+.. -.+-++++.. . 
T Consensus 85 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~--------------G~~~~~v~~d~~g~~~i~~~~p~~---~- 146 (452) T protein:vir:36 85 VKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIY--------------GRAFEFLYQDEDTQTNVVYNSPEN---M- 146 (452) T ss_pred ceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhc--------------CeEEEEEEecCCCeeEEEEEcccc---e- Confidence 33 23444444432111112222333333332222 222121 12111 1122233211 1 Q ss_pred EEEEEEEEecCCceEEEEEEEeCC-cEEEEEecCCcccccccccccccccccccceEEeecccccCccCCc--------- Q lcl|NC_021297. 152 RAVRLYTTRDDVAVPDRATLYLPD-ETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGR--------- 221 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~--------- 221 (480) +-.|...........+.+|+.. .+.++..-+.... .........+.. +...+| .||. T Consensus 147 --~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i---~~~~~~~~~~~~--~~~~~~------~~g~iPvv~~~n~ 213 (452) T protein:vir:36 147 --FMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLET---IKISGENDEISF--GEGTYN------PYPDLPVVEFYFN 213 (452) T ss_pred --EEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeE---EEEEEcCCceEE--ecceec------cCCcccEEEecCC Confidence 1223222222233344555432 2222111111000 000000111110 001111 1222 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCCCceEEEeccc----- Q lcl|NC_021297. 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAA----- 296 (480) Q Consensus 222 s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~----- 296 (480) ..-..++..+++-++..-.-+++. ++.+...... ...-.+.. ........ .....++.+... 
T Consensus 214 ~~g~sd~e~v~~liDa~d~~~s~~----~~~~~~~~~p---~~~~~g~~----~~~~~~~~--~~~~~~~~~~~~~~~~~ 280 (452) T protein:vir:36 214 EERMSIFESVISLVNAFNKAISEK----ANDVDYFSDQ---YLTFLGAA----VEEEDLKN--IRSNRVINYYADGEGKN 280 (452) T ss_pred CCCCcchHHHHHHHHHHHHHHHHH----HHHHHHhcCc---eeEeecCC----cCchhhhh--hhhcceEEecCCCCccC Confidence 111123444444444333322222 2222122110 00000000 00000000 001122333221 Q ss_pred -c-------H--HHHHHHHHHHHHHHHhccCCChHHhccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 297 -E-------L--RNFAEEMEVFRKEAASITGLPPQYLSSS--SENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) Q Consensus 297 -~-------~--~~~~~~l~~~~~~i~~~~~~p~~~~g~~--~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 364 (480) . . +.+...++.+...| ..++.. ......|.+=...+.........+.......+...++- T Consensus 281 ~~~~~l~~~~~~~~~~~~~~~l~~~I--------~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 352 (452) T protein:vir:36 281 VDVKFLEKPDSDSQTENLLDRLTKLI--------FQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNS 352 (452) T ss_pred CcceeEeecCCHHHHHHHHHHHHHHH--------HHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1 12233333333322 223222 11112232222334445555556666666666666665 Q ss_pred HHHHhCCCCcccceeeeEEecCC---CCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH-HHHHHHH---HHHHH Q lcl|NC_021297. 365 AMQIMGREVTEEYTRLETVWRDP---STPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ-REQMRDW---DKQET 437 (480) Q Consensus 365 ~~~~~~~~~~~~~~~i~v~f~~~---~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~-~~~~~~~---~~~~~ 437 (480) ++++.-.-.. .......|.+. ..++......+.+ + ++....|.+..+ .-++.-. -+++. T Consensus 353 ~~~li~~~~~--~~~~~~~~~~i~i~f~~~~p~d~~~~a---~---------~~~k~~g~iS~et~~~~~~~~~d~~~E~ 418 (452) T protein:vir:36 353 RYKLFCELST--NVSNKDSWKDIEYTFTRNEPKDIKEQA---E---------TANILMGITSQETALSVISVIPDVQAEM 418 (452) T ss_pred HHHHHHHHHh--ccCCccccccceEEeCCCCCcCHHHHH---H---------HHHHHhccCChHHHHHhCCCCCCHHHHH Confidence 5555322111 11223344332 2244333221111 1 112223543321 1111100 01222 Q ss_pred HHHHHHHhcccccCcccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 
438 EDMIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) +......... .........++. +.+.....+..+ T Consensus 419 ~ri~~E~~~~----~~~~~~~~~~~~-~~~~~~~~~~~e 452 (452) T protein:vir:36 419 EKIKKEEAST----AIFDKDKQPSEK-GTDTVVSETNEE 452 (452) T ss_pred HHHHHHHHHH----HHHHhhccCCCC-cccccCccccCC Confidence 2211111111 111112222221 112222122222 No 284 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=38.28 E-value=1.1 Score=20.41 Aligned_cols=348 Identities=12% Similarity=0.055 Sum_probs=116.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcC--cccchhhhhhccccchHHHHHHHHHhhhccCccc---cCCC-- Q lcl|NC_021297. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIG--IGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISED-- 77 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d-- 77 (480) .-++. ++..++..+-.....+ ..... +..........+|+..++-+-.-++. ...+ T Consensus 1 M~~f~--------------k~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~ 63 (378) T protein:vir:85 1 MNLFG--------------KVVSFSRGKLNNDTQRVTAWQNE---AVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDV 63 (378) T ss_pred Cchhh--------------hhhhhhhcccccCCcceeeeecc---chhhhhHHHHHHHHHHHHhHhhCceeEEEEecccc Confidence 11222 2222222111000000 00000 00011122233444433333222221 1110 Q ss_pred ------chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEE-ecCcccccCCCceeEEEEEccceeEEEEeCC Q lcl|NC_021297. 78 ------SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITV-SHPDVESGDPAGIPLIRVESPLYMYAELDPR 145 (480) Q Consensus 78 ------~~~~~~l~~i~~~--n---~~~~~~~~~~~~a~~~G~a~~~v-~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~ 145 (480) +.....+..+++. | .-..+...+..+.+.+|.||++. .. +.+|.+... 
T Consensus 64 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~------~~~g~~~~~-------------- 123 (378) T protein:vir:85 64 GSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFD------SETGELLDL-------------- 123 (378) T ss_pred ccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeec------CCCceEEEE-------------- Confidence 1123455666642 3 22455666788888999999752 21 112221110 Q ss_pred CCCeeEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccch Q lcl|NC_021297. 146 NTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) +| ..++ ..|.+++++++. +. +....+. T Consensus 124 ----------~~---~~~~-----~~~~~~dvih~~-----------------------------~~------~~~~~~~ 150 (378) T protein:vir:85 124 ----------LF---ANDK-----KEYKPEELVRLV-----------------------------SP------FYINEDT 150 (378) T ss_pred ----------Ee---cCCC-----EEEcccceEEEe-----------------------------cC------cCccchh Confidence 00 0111 012222222221 10 0000000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccccccccccchhhh---------hhhhhcccCCCCceEEEecc Q lcl|NC_021297. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDI---------YYGRILTLASEAAKISEFKA 295 (480) Q Consensus 226 ~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~-g~~~~~~~~~~~~~~~~~---------~~~~~~~~~g~~~~~~~~~~ 295 (480) ..+..+.++++..+ ..+.+--+++ .........+.....++. ..+.++.++ ++.++.+++. T Consensus 151 ~~~~~a~~~~~~~~--------~~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~ 221 (378) T protein:vir:85 151 SILDNALASIQTKL--------EQGKLRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVD-NKTEIVELKK 221 (378) T ss_pred hHHHHHHHHHHHHH--------hcCCcceEEEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecC-CCceEEeccC Confidence 01112222222111 1122211221 111111111111111110 112344444 3467777654 Q ss_pred ccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc Q lcl|NC_021297. 
296 AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTE 375 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 375 (480) .....-.+.++....+|+...++|+..++++.. ....+.+....|.-.+.+.+..+...|-.--. +..+... . T Consensus 222 ~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~~s~~---e~~~~~f~~~tL~P~~~~ie~~l~~kLl~~~e---r~~~~~~-~ 294 (378) T protein:vir:85 222 DYSVLNKDEIELIKSELLTGYFMNENILLGTAT---QEQQIYFYNSTIIPLLIQLEKELTYKLISTNR---RRVVKGN-L 294 (378) T ss_pred ChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCch---HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhh---hhhhhhc-c Confidence 321122355677778899999999998864321 11111122222222222222111111100000 0001000 0 Q ss_pred cceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhcccccCcccc Q lcl|NC_021297. 376 EYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT 455 (480) Q Consensus 376 ~~~~i~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (480) ...++.+.+..-...+..+.++.+.+++.+| +++.-.+++.+|+.+-+- -... .-..+- . T Consensus 295 ~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~lgl~p~~g--GD~~--------~~~~N~--------~ 354 (378) T protein:vir:85 295 YYERIIVDNQLFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDIY--------IANLNA--------V 354 (378) T ss_pred ccceeeecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeE--------eecccc--------c Confidence 1123444444555678888999999988875 566666778877754320 0000 000000 0 Q ss_pred cCCCCCC-CCcccccCCCCCCCCC Q lcl|NC_021297. 
456 PKPTVTD-TKTETQTSPSGFNRTK 478 (480) Q Consensus 456 ~~~~~~d-~~~~~~~~p~~~~~~~ 478 (480) +-...++ ..+.+...+++.+.-+ T Consensus 355 ~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 355 AVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred ccccchhhcCccCCCCCCCCCCCC Confidence 0000000 0000111111111111 No 285 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=32.81 E-value=1.4 Score=19.78 Aligned_cols=414 Identities=9% Similarity=-0.007 Sum_probs=144.3 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcchhcCcccchhhhhhccccch-H-HHHHHHHHhhhc--cCccccC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGW-V-ATYLRTLSDRLD--IEGFRIS 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~n~-~-~~iVd~~~~~l~--~~~~~~~ 75 (480) ++...+++..++.++.. +..++.++.+.++--+.-..+........+..+. ..+ + .+|+..+..+++ ..+|..+ T Consensus 38 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~-~~~~~~~ri~~n~~k~Ivd~~~~yl~g 116 (492) T protein:vir:97 38 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV-DPLKPDDRMITNFHANLVDQKVSYIVG 116 (492) T ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccc-cccccccccccchHHHHHHHHhhhhcc Confidence 67788888888888754 5555656666554332221111111110000000 000 0 134444544442 3356555 Q ss_pred CCch---hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCceeEEEEEccce---eEEEEeCCCCCe Q lcl|NC_021297. 76 EDSE---GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLY---MYAELDPRNTRR 149 (480) Q Consensus 76 ~d~~---~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~~i~~~~p~~---~~~v~d~~~~~~ 149 (480) .... .++...+.++.- +...+......+......| |...+.+....+ .+-+.+|.. 
T Consensus 117 ~p~~~~~~d~~~~~~l~~~-~~n~~~~~~~~~~~~~~~~--------------G~a~~~v~~d~dg~~~~~~~~p~~--- 178 (492) T protein:vir:97 117 KPIAFKHTDDEVVKRIDEV-LGNRFDDKLHSVLTGASNK--------------GIEWLHPYLDEEGEFKLFRVPAEQ--- 178 (492) T ss_pred cCceeccCchHHHHHHHHH-HhccHHHHHHHHHHHHhhc--------------CeEEEEEEecCCCceEEEEEcccc--- Confidence 4322 344555555542 2222223333333222111 222111111111 122223211 Q ss_pred eEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCccc-ccccccccc-c--ccccccceEEeecccccCccCCcccc- Q lcl|NC_021297. 150 VTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND-QWVVDGDVI-K--HGLGVVPVVPLTNDPRLGNRYGRSEI- 224 (480) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~--~~~~~iPvv~~~n~~~~~~~~G~s~i- 224 (480) .+-.|...........+.+|..+...++..-..... .+....... + ......+.++.. ...||.-.+ T Consensus 179 ---~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~g~vPvv 250 (492) T protein:vir:97 179 ---GIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFS-----TGSWGKIPFI 250 (492) T ss_pred ---eEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccc-----cCCCCCcceE Confidence 112232222222333444554433222111111100 111000000 0 000001111111 112332111 Q ss_pred --------hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhhhhhcccCCCCceEEEeccc Q lcl|NC_021297. 225 --------SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAA 296 (480) Q Consensus 225 --------~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 296 (480) ..++..+++-++..-.-+++... -+......-.-..... .. ... ...-. .....+..++.. 
T Consensus 251 ~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~----~~~~~~~~~l~~~g~~--~~--~~~-~~~~~--~~~~~~~~~~~~ 319 (492) T protein:vir:97 251 PFKNNDLEISDIFMYKTLIDAYNRRLSDLSN----TFKDSNELTYVLKNYD--DQ--ELP-EFKRL--LRYYGAIKVSDN 319 (492) T ss_pred EecCCCCCCCchHhHHHHHHHHHHHHHHHHH----HHHHhccceeeeecCC--cc--cch-hHHHH--HhhccceecCCC Confidence 12355555544433332222222 1112111000000000 00 000 00000 011122222211 Q ss_pred ----------cHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 297 ----------ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM 366 (480) Q Consensus 297 ----------~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~ 366 (480) +.+.+...++.+...|+..+++|+.+++ + .+|..=...+.........+.......+...++-++ T Consensus 320 ~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~----~-~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 394 (492) T protein:vir:97 320 GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD----K-FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELL 394 (492) T ss_pred CcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCcc----c-cccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4667889999999999999999974432 2 344322333445555566666666667777666666 Q ss_pred HHhCCCCcccceeeeEEecCC---CCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH-HHHHH---HHHHHHHHH Q lcl|NC_021297. 367 QIMGREVTEEYTRLETVWRDP---STPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ-REQMR---DWDKQETED 439 (480) Q Consensus 367 ~~~~~~~~~~~~~i~v~f~~~---~~~~~~~~ad~~~kl~~~~~~~~s~et~~~~lg~~~~~-~~~~~---~~~~~~~~~ 439 (480) ++.- ....+...|.+. ..++.+....+.+ +.+....|.+..+ .-++. .--+++.+. T Consensus 395 ~li~-----~~~~~~~~~~~i~v~f~~~~p~~~~e~a------------~~~~kl~G~iS~et~l~~l~~v~d~~~Eler 457 (492) T protein:vir:97 395 WFVF-----EHFDIKGEHKDVDISFNYNKVANTELQV------------QTAQQSMGIVSHETVLENHPFVEDLQAELER 457 (492) T ss_pred HHHH-----HHhcCCcccceeeEEecCCCCCCHHHHH------------HHHHHHhccCchHHHHHhCCCCCCHHHHHHH Confidence 5531 233444444332 2333333211111 1122223544332 11110 001122222 Q ss_pred HHHHHhcccccCcccccCCCCCCCCcccccCCCCCCC Q lcl|NC_021297. 
440 MIDTLYSTTKAQADATPKPTVTDTKTETQTSPSGFNR 476 (480) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~p~~~~~ 476 (480) ..+......+...+ -...+.+...+.+++.+..++ T Consensus 458 i~~E~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 458 IEQEQTEYNKQLPN--LDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHHHHHhhhc--cccCCCCCCcccccccccccC Confidence 22211111111111 111222222222333333333 No 286 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=30.77 E-value=1.5 Score=19.54 Aligned_cols=398 Identities=8% Similarity=-0.056 Sum_probs=116.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHH----------HHHhcccCc-chhcCcc-cchhhhhhccccchHHHHHHHHHhhhc Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEA----------EAYRNGTRR-LKTIGIG-APPELAYLDVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~----------~~YY~g~~~-i~~~~~~-~~~~~~~~~~~~n~~~~iVd~~~~~l~ 68 (480) -.-+.++|+..... ..++.++..+ .+.+.++.. .+..+.. +.-.+.. .++.-.+..++-..+.+.. T Consensus 28 ~~~i~~~i~~~~~~-~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~-~Ivd~~~~~l~g~p~~~~~ 105 (474) T protein:vir:96 28 EEMIIRLINDHKPK-IDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQ-NLVDQKVAYAVANPVTFSS 105 (474) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHH-HHHHhhhhhhcccCceeec Confidence 22224444443322 2222222222 122222211 1111100 0000000 0111111111111111100 Q ss_pred cC--------ccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhc--------CeEEEEEecCcccccCCCceeEEEE Q lcl|NC_021297. 69 IE--------GFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTF--------GRAYITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 69 ~~--------~~~~~~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~--------G~a~~~v~~~~~~~~d~~~~~~i~~ 132 (480) .+ -|. .+ .......++. ..+......+ |.-.+.+..+. .. +-. 
T Consensus 106 ~d~~~~~~l~~~~-~n--~~~~~~~~~~---------~~~~~~G~~~~~~y~d~~~~~~i~~~~p~------~~---~~v 164 (474) T protein:vir:96 106 DDDKSLKTIQEVL-NH--KWDDKLVDIL---------TAASNKGIEWLQPYIDENGEFKTFRVPAE------QA---IPI 164 (474) T ss_pred CchHHHHHHHHHH-hc--CHHHHHHHHH---------HHHHhcCeeEEEEEecCCCceEEEEEccc------ce---EEE Confidence 00 000 00 1111111111 1111111111 22111111111 11 111 Q ss_pred Ecc---ce-eEEEEeCCCCCeeEEEEEEEEEe------cCCceEEEEEEEeCCcEE--EEEecCCccccccccccccccc Q lcl|NC_021297. 133 ESP---LY-MYAELDPRNTRRVTRAVRLYTTR------DDVAVPDRATLYLPDETV--PLRRNGGLNDQWVVDGDVIKHG 200 (480) Q Consensus 133 ~~p---~~-~~~v~d~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 200 (480) .++ .+ .+.++-...+... .+.+|+.. ..++.......+.....- .+.....-..+.+....-..+. T Consensus 165 ~d~~~~~~~~~~vr~~~~~~~~--~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~ 242 (474) T protein:vir:96 165 WTNKERDTLKAFIRYYRLDGAE--RVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNP 242 (474) T ss_pred EcCCCCCceEEEEEEEeecCce--EEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccCC Confidence 111 01 1111111111111 11122110 000000000000000000 0000000000001000000111 Q ss_pred ccccceEEeecccccCccCCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc-cccchhhhh--h Q lcl|NC_021297. 201 LGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND-GENTTLDIY--Y 277 (480) Q Consensus 201 ~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~-~~~~~~~~~--~ 277 (480) +| .+.+. +.- . +++-+|.....+++.......--.-..-..|.+...+... .....+... . 
T Consensus 243 ~g-~sd~e-----------~v~---~-liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~ 306 (474) T protein:vir:96 243 QE-MSDLF-----------MYK---T-IIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDG 306 (474) T ss_pred CC-CCcHH-----------HHH---H-HHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCCC Confidence 11 11110 000 1 1112222211111111111110000011111111111110 011111111 0 Q ss_pred hh--hcccCCCCceEEEeccccHHHHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021297. 278 GR--ILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 278 ~~--~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) +. .+..+.+ .. ..+.++++++..++.++.++++.+..||++..+.+-...+.........+.......+. T Consensus 307 ~~~~~l~~~~~-------~~-~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 378 (474) T protein:vir:96 307 SGVDTIQIEVP-------VQ-SSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQ 378 (474) T ss_pred CceeEEeecCC-------hH-HHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1111111 12 25788999999999999999999999988776655555566666677777777777777 Q ss_pred HHHHHHHHHHHHHhC--CCCcccceeeeEEecCCCCCCHHHHHHHHHH--HHhccccCCCHHHHHHhCCCChhHHHHHHH Q lcl|NC_021297. 356 GAWERAMRIAMQIMG--REVTEEYTRLETVWRDPSTPTVAAKADAVSK--LYANGQGPIPKEQARIDLGYTATQREQMRD 431 (480) Q Consensus 356 ~~l~~~~~~~~~~~~--~~~~~~~~~i~v~f~~~~~~~~~~~ad~~~k--l~~~~~~~~s~et~~~~lg~~~~~~~~~~~ 431 (480) ..++-++.+. .... ..+...+....+. ......+....+-.+++ +.... +.+. -++.+++++++ T Consensus 379 ~~~~~i~~~~-~~~~~~~~i~i~f~~~~p~-~~~e~~~~~~~ag~iS~et~~~~~-~~v~---------d~~~E~~ri~~ 446 (474) T protein:vir:96 379 ELLQYIIDFY-KLNIKVQDVEITFNFNVMV-NELEQSQIGVQSQYLSKETVVTNH-PWVD---------DPVAELERIEQ 446 (474) T ss_pred HHHHHHHHHh-CCCcccceeeEEeccCCCc-CHHHHHHHHHhcCCCchHHHHHhC-CCCC---------CHHHHHHHHHH Confidence 7777666653 2221 1111111110000 00000111112221221 12211 2222 24567778877 Q ss_pred HHHHHHHHHHHHHhcccccCcccccCCC Q lcl|NC_021297. 
432 WDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) ++.+..+...........+..+...+.+ T Consensus 447 E~~e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 447 DNIDFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred HHHHHHhcccccccccccccCCCcccCC Confidence 7777666655544443333332222222 No 287 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=26.54 E-value=1.9 Score=19.01 Aligned_cols=302 Identities=12% Similarity=0.100 Sum_probs=102.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcCcc-----cchhhhhhccccchHHHHHHHHHhhhccCccccC Q lcl|NC_021297. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG-----APPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~-----~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~ 75 (480) -.++..++..- . . ..+...-...+||+ +++...+.. .+..-.-....+|+... . + .+ T Consensus 41 fg~p~~~~~~~-~-~-~~~~~~~~~~~~~~--~pi~~~~la~~~~~~~~h~~~~~~~~n~l~l---------~---~-~P 102 (368) T protein:vir:79 41 FGDPVEVLDRR-E-L-LDYVECMRMGQWYE--PPMPWDGLARSFRAAAHHSSAVYVKRNILVS---------T---F-IP 102 (368) T ss_pred cCCceeecchh-h-H-HHHHHHHhccchhc--cCcCHHHHHHHHhhccccchhhhhhcchhhh---------h---c-CC Confidence 12222222110 0 0 00000000112332 222211100 00000001111222111 0 1 11 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEecCcccccCCCcee-EEEEEccceeEEEEeCCCCCeeEEEE Q lcl|NC_021297. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 76 ~d~~~~~~l~~i~~~n~~~~~~~~~~~~a~~~G~a~~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) +... . ...+.+++.+.+.+|.||+.+..+ ..|++ .+..++|..+...-|. + . 
T Consensus 103 n~~~---------t----~~~f~~l~~d~ll~Gnay~~~~r~------~~G~~~~L~~l~~~~v~~~~~~--~------~ 155 (368) T protein:vir:79 103 HPLL---------S----RATFERLVLDWQVFGNAYLERREN------VLGGTIRLDTPLAKYVRRGLDL--N------T 155 (368) T ss_pred CcCC---------C----HHHHHHHHHHHhhcCCeEEEEEEc------CCCCEEEEEEeCcccceeeccC--C------E Confidence 1110 0 011235667888999999987653 23443 4566666654332221 0 0 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccceEEeecccccCccCCcccchhhHHHHHHH Q lcl|NC_021297. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~~~~~~G~s~i~~~v~~liD~ 234 (480) .++... .+..+.|.. + -|+||++-.-..+.+|.|.|.- .... T Consensus 156 ~~~~~~-------------~~~~~~~~~-------------------~--dIihir~~~~~~~~yGlsp~~~----a~~s 197 (368) T protein:vir:79 156 YFFVQN-------------WQQPYTFAA-------------------G--SVFHLQEPDINQEVYGLPEYLS----ALNA 197 (368) T ss_pred EEEEec-------------CCeEEEEcc-------------------c--cEEEecCCCCCCCcccccHHHH----HHHH Confidence 011110 111111110 0 1456664333456788887643 3333 Q ss_pred HHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhh-----hhhhccc-C---CCCceEEEecccc-HH Q lcl|NC_021297. 235 ASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIY-----YGRILTL-A---SEAAKISEFKAAE-LR 299 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~---~~~~~--i~g~~~~~~~~~~~~~~~~~~-----~~~~~~~-~---g~~~~~~~~~~~~-~~ 299 (480) +....+-..-...+|. .|-.+ ++|....+...+.-...|+-. .++++.+ + .++.++..+.... -. T Consensus 198 i~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~ 277 (368) T protein:vir:79 198 TWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKD 277 (368) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHH Confidence 3322221111222332 23322 334322221111111112211 1123332 2 1345666665432 22 Q ss_pred HHHHHHHHHHHHHHhccCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHhCCCCccc Q lcl|NC_021297. 
300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM---QIMGREVTEE 376 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~---~~~~~~~~~~ 376 (480) .|++..+....+|+..-++|+.-+|....+.+++..+......+.. ..|.-+++.+. .+.+.. T Consensus 278 qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~f~~----------~~l~Pl~~~ie~ln~~l~~e---- 343 (368) T protein:vir:79 278 EFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMVFAR----------NEVKPLQDRLLAINDWIGDE---- 343 (368) T ss_pred HHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHH----------HHHHHHHHHHHHHHhccCcc---- Confidence 4788888889999999999999998543332111111111111111 11111111111 111111 Q ss_pred ceeeeEEecCCC--CCCHHHHHHHHHHHHhccccCCCH Q lcl|NC_021297. 377 YTRLETVWRDPS--TPTVAAKADAVSKLYANGQGPIPK 412 (480) Q Consensus 377 ~~~i~v~f~~~~--~~~~~~~ad~~~kl~~~~~~~~s~ 412 (480) .+.|.+.. ..+.++.|+... + |. T Consensus 344 ----~~rF~~~~l~~~D~~a~a~~~~--r-------sa 368 (368) T protein:vir:79 344 ----VVRFAPYALGGHDQPAAAPGGQ--R-------SA 368 (368) T ss_pred ----eeeechhHhhcccccccCCccc--c-------cC Confidence 12342211 111111111000 0 10 Done!